Thales Blog

Vormetric Certified For HADOOP 5 (CDH5) Big Data Environment

April 24, 2014

Ashvin Kamaraju Ashvin Kamaraju | Vice President of Engineering, Strategy & Innovation More About This Author >

Cloudera recently announced the general availability of Cloudera HADOOP Version 5 (CDH 5), also dubbed as the “Enterprise Data Hub”.   Vormetric is proud and pleased that we have once again successfully certified Vormetric Data Security for Cloudera HADOOP. This further strengthens our partnership and demonstrates our commitment to mutual customers.

With a recent investment of one billion dollars in Cloudera and Hortonworks, the two leading HADOOP distribution companies, the market has validated the potential market size (rumored to be about $20billion by 2016) and the disruption that Big Data will create.

With the explosive growth of data in every aspect of our lives and in enterprises across the world, there is growing demand to derive value from this data and provide business intelligence. Enterprises depend on this intelligence so they can meet their customers’ needs in a timely manner and with greater precision. Along with traditional sources of data such as transactional systems and data warehouses, new sources of data such as those from the “Internet of Things”, click logs, social media interactions and sensors have emerged. Collectively, these vastly larger information volumes and new assets are known as Big Data. Enterprises are using technologies such as HADOOP with MapReduce to extract maximum value from data from all sources.

HADOOP has rapidly gained acceptance in the enterprise and is becoming a central repository for all the data in an enterprise. The data is extensively mined and analyzed by several different applications for several different objectives. Enterprise customers benefit from HADOOP 2.0, which serves as a data repository, not only for batch processing, but also for real time SQL queries. Customers can derive more value from all of their data, from a single system, at a much lower cost than a traditional data warehouse appliance.

HADOOP systems are distributed shared nothing clusters of compute and storage. They must have good system performance, as it is critical to ensure data queries complete as fast as possible and with very low latencies.  For economies of scale, these systems are typically built from off the shelf Intel X86 systems with high processor core densities and terabytes of storage capacity.

With HADOOP systems storing a vast majority of enterprise data, it is only logical that data processed by HADOOP systems must be secured in order to avoid data breaches as well as to comply with several regulations such as the Payment Card Industry Data Security Standard (PCI DSS), Sarbanes Oxley, HIPAA HITECH and many state and federal data privacy laws.

Vormetric Data Security products enable enterprise customers to comply with regulatory requirements by encrypting data that is stored and processed by HADOOP systems, centralizing key management and enforcing access control policies. Encrypting data incurs some performance overhead; however the overhead when using Vormetric Data Security under several different HADOOP workloads is minimal and exceeds typically acceptable norms of performance vs. security tradeoffs.