Category Archives: General

MapReduce and Spark

Categories: General Product

About a week ago, I posted an article on Cloudera’s strategy on SQL in the Apache Hadoop ecosystem. In the article, I argued that a special-purpose distributed query processing engine will perform better than one that translates work into a general-purpose MapReduce framework, even if MapReduce is improved to trim latency and improve throughput. Notwithstanding…

Read More