Cloudera Extends Reach into Cloud with Azure Data Lake

Categories: Cloud Corporate

It’s easy to understand why customers are moving to the cloud. Unlike the promises of marginal TCO savings brought on by incremental increases in performance or high availability by the old guard database vendors, the notion of utility computing and “pay as you go” benefits are just too good to ignore. Now that customers have accelerated their migration to the cloud, the question is not “should” we move to the cloud, but “how fast” can we move to the cloud.

Cloudera has been evangelizing the benefits of scale out, distributed computing since the birth of the company (hence the name Cloud-era). Since October 2014, Cloudera has enjoyed a robust engineering relationship with Microsoft Azure, ensuring we provide the most performant and reliable platform for new and existing Cloudera customers.

Towards the continuation of this elastic, cloud computing vision, Cloudera is proud to announce the fruits of a joint engineering effort to integrate Cloudera and Azure Data Lake Store (ADLS).  ADLS is a massively scalable distributed file system optimized for analytics. This means that Azure customers can store data of any type with no limits to performance and scalability – meeting the increasingly aggressive data needs of your business for volume, velocity and variety. Whether collecting real-time IoT sensor data from thousands of devices simultaneously for predictive maintenance, thwarting millions of attack vectors from insecure networks via cybersecurity, or determining precision medicine to pinpoint prescription doses to cancer patients, the combination of Cloudera with Azure Data Lake Store will bring the benefits of improved performance, ease of administration and lower cost to your business.

The April 18th, 2017 official release of Cloudera 5.11 includes integration for Azure Data Lake Store among other benefits to the award winning modern data platform. Cloudera has been collaborating heavily with Microsoft ADLS engineering leading up to the release, which can be seen in more detail in this blog from Paige Liu, Microsoft Azure Software Engineer. In summary, this important update to Microsoft Azure brings the following benefits to our customers’ business:

  1. Leverage cloud-native Azure infrastructure for transient workloads to lower TCO
  2. Run Spark (1.6 and 2.1) and Hive jobs directly against ADLS data, processing data more quickly
  3. Enhanced security with encryption at rest and flexible key management

The general availability of Cloudera with ADLS integration is a much awaited release from both Cloudera and Microsoft customers who are looking for the automatic scale out benefits of being in the public cloud while remaining cost conscious with no compromise in performance or availability. For more information on Cloudera with ADLS integration, please refer to the updated Cloudera documentation.


Leave a Reply