Tag Archives: Spark

What To Consider When You’re Considering Cloud

Categories: Analytic Database Cloud Data Engineering Data Science Operational Database Spark

In a blog posted earlier this week, my esteemed colleague Sean Anderson laid out a powerful argument for machine learning (ML) as a way to fuel recommendation engines, churn reduction engines, and IoT workflows. Leveraging components like Apache Spark, and its machine-learning libraries, data scientists are able to design and train complex models using troves…

Read More

Apache Spark Market Survey (Part 1 of 2)

Categories: Spark

As an IT industry analyst (and former technical product manager), I’m always fascinated with how enterprises large and small adopt new technologies. What does it take for a new solution to not only present a compelling opportunity, but also prove itself ready for prime time? What separates out the eventual market dominating solution from all…

Read More

Enhanced Streaming and Machine Learning with Apache Spark 2.0

Categories: Spark

Apache Spark has risen to be the taster’s choice of high-scale distributed computation and solidified itself as the de-facto processing engine in the Apache Hadoop ecosystem. In fact, recently Curt Monash of DBMS2 wrote, “The greatest use for Spark seems to be the same as the canonical first use for MapReduce: data transformation.” But the…

Read More

Big Data Governance: Bridging the Gap between Mainframe and Apache Hadoop

Categories: Partners

As Apache Hadoop celebrates its 10th birthday this year, it has become the central component of the next generation data architecture. Many of the world’s largest organizations have several production workloads running on Hadoop for new revenue generating applications, to stay competitive and relevant in their industry and to become more agile and efficient. As…

Read More