Tag Archives: batch processing

Faster Batch Processing with Hive-on-Spark

Categories: Product

Apache Spark has quickly emerged as a powerful data processing framework for Apache Hadoop, well poised to succeed MapReduce in the ecosystem. Cloudera’s One Platform Initiative is hastening this transition with focused development on the scale, security, management, and streaming aspects necessary for Spark to support a wide range of enterprise applications. Spark’s power and popularity…

A Year in Review for Apache Spark

Categories: Product

Though Apache Spark was first created nearly three years ago, the past year has seen tremendous growth and adoption of the project. Spark has now become the most popular Apache Software Foundation project, with 50 percent more activity than the core Apache Hadoop project itself and more than 750 contributors across hundreds of companies. As part of…
