The release of Cloudera 5.4 continues to drive the vision of an enterprise data hub built with the most critical business needs in mind. Big data can be a daunting challenge but also a huge opportunity, with the ability to open up more data to more users to quickly derive more value. To truly leverage all of the benefits of an enterprise data hub, you need a production-ready platform, such as the newly released Cloudera 5.4.
A Focus on Open Standards
The core of Cloudera’s platform is built on open standards, which are critical to enterprises and end-users alike. Open standards guarantee sustained engineering quality and maturity to ensure these tools are ready for production and will provide lasting value, which in turn leads to broader adoption with users able to choose the best, most powerful tool for the job. Additionally, these are the tools that third-parties build integrations with for seamless compatibility, and e
nsuring the same tools and skills can be used. Finally, all of this leads to wide support across industry vendors, so not only can enterprises leverage the benefits of open source, they can do so without fear of lock-in with flexible portability. Carefully curated frameworks, including Impala, Apache Spark, and Apache Solr, provide leading performance and innovation. The Cloudera 5.4 release features key milestones for some of our most popular open standard components:
- Apache HBase, the open standard NoSQL datastore, recently achieved a major milestone with its 1.0 release and is included with CDH 5.4, along with some added performance improvements.
- Apache Kafka, the emerging open standard for data ingest, also has deeper platform integration with CDH 5.4 through Cloudera Manager custom service descriptors (CSDs).
- The latest version of the popular open standard, Apache Spark 1.3, is also included with CDH 5.4 and includes deeper integrations with other platform components and several other critical features.
For full details on what’s new with every open standard component, check out the Release Notes. To learn more about how Cloudera approaches open source and open standards with the platform, download “Cloudera’s Commitment to Open Source and Open Standards.”
The Leaders in Data Governance
With 5.4, Cloudera continues to lead the way in data governance for Hadoop. As described in a previous blog, data governance is an absolutely critical part of any enterprise application storing sensitive data. However, for Hadoop, it can be especially challenging to keep track of who’s accessing what data and what they’re doing with it. The only way to achieve the necessary governance required for compliance, or even just basic visibility, is with a native solution.
Cloudera Navigator is the only native, end-to-end governance solution for Apache Hadoop. It provides auditing, lineage, metadata management, and data lifecycle management – necessary for meeting even the most stringent compliance requirements. The release of 5.4 continues to add coverage completeness to Cloudera Navigator, with Impala integration into the data lineage functionality, auditing of Cloudera Navigator activity, and schema inference for Apache Avro and Apache Parquet. Additionally, the release also includes performance improvements and an updated metadata search interface to further simplify data governance.
For more details on how Cloudera Navigator has defined data governance for Hadoop, register for the webinar, “Big Data Meets Data Governance with Cloudera Navigator” for a full overview, live demo, and details on key partner integrations.
Deployment Flexibility in the Cloud
Deployment flexibility for Cloudera’s platform not only means vendor choice and portability (through the largest partner ecosystem), but also consistency of experience, regardless of whether the deployment is on-premise or with our cloud partners. Cloudera 5.4 continues to drive this flexibility and reliable experience with the addition of Openstack (available in beta) and extended Microsoft Azure support.
Deploying Hadoop in the cloud is becoming increasingly popular for our customers, including the recently announced The Financial Industry Regulatory Authority (FINRA), Tubular Labs, and Digital Globe.
In addition to the above partners, you can also download Cloudera Director, the self-service reliable way to get up and running in the cloud; quickly start building a proof of concept using the AWS QuickStart; or easily try Hadoop using Cloudera Live.
Download Cloudera 5.4 to get started now.