MapReduce and Spark

Categories: General Product

About a week ago, I posted an article on Cloudera’s strategy on SQL in the Apache Hadoop ecosystem. In the article, I argued that a special-purpose distributed query processing engine will perform better than one that translates work into a general-purpose MapReduce framework, even if MapReduce is improved to trim latency and improve throughput. Notwithstanding…

Read More

Sebastian Thrun: Launching our Data Science & Big Data Track Built with Leading Industry Partners

Categories: Cloudera University Partners

I am excited to launch our new Data Science and Big Data Track. This is the first Data Science and Big Data curriculum built directly with industry pioneers. You can learn the latest techniques, tools, and concepts developed at the companies that made Big Data big. Our first class in this track, Introduction to Hadoop…

Read More