By my best estimation, this upcoming Strata+Hadoop World will be my tenth. That may sound like a ton, but given that the event is hosted twice a year in the US and three times outside of North America, that’s roughly four years of watching the industry and the event grow and transform. I have seen some key themes surface over the years, including the transformation of enterprise data management, the rise of cloud, and the emergence of Apache Spark.
This year I am starting to feel we will see a new focus area steal the limelight. Machine learning, deep learning, and artificial intelligence (AI) have in short order captured the hearts and minds of data systems and solutions providers everywhere. A likely explanation is that big data platform users are starting to see success with these technologies, which have become important initiatives that shape how organizations position themselves strategically in their respective landscapes.
Big data and modern analytics are maturing and converging into a recipe that is permeating the enterprise. It’s no longer a question of whether an enterprise should have a big data and modern analytics strategy; it is a requirement. Just as Apache Hadoop and Spark birthed vibrant ecosystems, advanced analytics is positioned to see much of the same high-velocity innovation.
Cloudera is a modern platform for machine learning and advanced analytics with common libraries delivered by Spark and tightly coupled integrations of the newest open source frameworks. Our customers like Experian, Wargaming, and ShopperMotion have built advanced capabilities with machine learning and big data that will continue to deliver strong value in the years ahead.
At this year’s Strata+Hadoop World in San Jose, March 13th-16th, you can take part in several discussions that will help you better understand the new challenges and solutions for modern data science.
- Making self-service data science a reality with Matt Brandwein and Tristan Zajonc of Cloudera
- Unraveling data with Spark using machine learning with Jayant Shekhar of Sparkflows Inc. and Vartika Singh and Jeffrey Shmain of Cloudera
- Big data as a force for good with Craig Hibbeler of MasterCard Advisors, David Goodman of Nethope and Mike Olson of Cloudera
- Scalable deep learning for the enterprise with DL4J with Dave Kale, Susan Eraly, Melanie Warrick and Josh Patterson of Skymind
- Sparklyr: An R interface for Apache Spark with Edgar Ruiz of RStudio
- Guerrilla guide to Python and Apache Hadoop with Juliet Hougland of Cloudera
- Modeling big data with R, sparklyr, and Apache Spark with John Mount of Win-Vector LLC
I will be front and center at many of these sessions, hoping to get a glimpse of what is next for Hadoop and the expanded analytics ecosystem. I will also be at the Cloudera booth and hosting an open meetup, if you are interested in hearing more about data science at Cloudera.