Author Archives: Sean Anderson

Sean Anderson

About Sean Anderson

Sean is a tenured infrastructure scaling and cloud strategy consultant with a strong focus on strategic partnerships and innovative hybrid technology. He has been a part of integral shifts in technology including the rise of cloud computing, open source standardization, and big data. Sean quickly became a go-to resource and speaker for data specific workloads focusing on technologies like Hadoop, MongoDB, Redis, Elasticsearch, SQL, and Data Warehousing. At Rackspace Hosting, Sean helped build and launch open-source cloud platforms around Hadoop, MongoDB, Elasticsearch and Redis. Sean is currently marketing manager for IT Solutions at Cloudera; the pioneers of Apache Hadoop.

Enhanced Streaming and Machine Learning with Apache Spark 2.0

Categories: Spark

Apache Spark has risen to be the taster’s choice of high-scale distributed computation and solidified itself as the de-facto processing engine in the Apache Hadoop ecosystem. In fact, recently Curt Monash of DBMS2 wrote, “The greatest use for Spark seems to be the same as the canonical first use for MapReduce: data transformation.” But the…

Read More

We’re excited about Wrangle 2016. Want to know why?

Categories: Data Science Events

This year on July 28th we will once again host the Wrangle Conference – the definitive single track conference by and for data scientists. Wrangle explores the principles, practice, and application of data science across many industries. This is an opportunity for you to hear directly from practitioners on how they worked to solve complex…

Read More