Author Archives: Cloudera

Open Source and Its Influence on the Apache Hadoop Ecosystem

Categories: Open Source Software

By any measure, the Apache Hadoop ecosystem, less than 10 years old, is astoundingly successful. Few other open source platforms have been adopted so rapidly and so widely. Historically, only Linux rivals it in sheer gravitational influence on users and vendors. But don’t take the reasons for Hadoop’s success for granted. They include: All…


A Look at Apache Solr as the Open Standard for Search

Categories: Open Source Software, Product

This blog post was written by the following Clouderans: Alex Gutow, Justin Kestelyn, and Eva Andreasson. Building an open and integrated enterprise data hub takes more than using arbitrary open source components. As described in “Compatibility and Innovation: Where One Ends, the Other Begins,” there needs to be a balance of stability and innovation, and building…


Cloudera to Release First Recursive Hadoop Stack

Categories: Enterprise Data Hub, Spark, YARN

Promises ease-of-use of MapReduce and speed of Hive! PALO ALTO, Calif., April 1, 2015: Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop™, today announced the pending release of Spark-on-Hive-on-MapReduce-on-Spark-on-Oozie-on-HBase-on-Hive-on-Spark-on-Flume-on-HDFS-on-Impala-on-Spark-on-Hive-on-MapReduce-on-Spark-on-Oozie-on-HBase-on-Hive-on-Spark-on-Flume-on-HDFS-on-Impala-on-Spark-on-Hive-on-MapReduce-on-Spark-on-Oozie-on-HBase-on-Hive-on-Spark-on-Flume-on-HDFS-on-Impala-on-…, the industry’s first truly recursive data platform. A recursive data platform leans on advanced concepts from…
