Category Archives: Spark

Apache Spark Market Survey (Part 1 of 2)

Categories: Spark

As an IT industry analyst (and former technical product manager), I’m always fascinated with how enterprises large and small adopt new technologies. What does it take for a new solution to not only present a compelling opportunity, but also prove itself ready for prime time? What separates out the eventual market dominating solution from all…

Read More

Enhanced Streaming and Machine Learning with Apache Spark 2.0

Categories: Spark

Apache Spark has risen to be the taster’s choice of high-scale distributed computation and solidified itself as the de-facto processing engine in the Apache Hadoop ecosystem. In fact, recently Curt Monash of DBMS2 wrote, “The greatest use for Spark seems to be the same as the canonical first use for MapReduce: data transformation.” But the…

Read More

Fanning the Flames with Apache Spark: Evolving Big Data Processing

Categories: Enterprise Data Hub General Partners Spark

Poca favilla gran fiamma seconda. “From a little spark follows a great flame.” – Dante At Cask, we are passionate about software development and developer productivity in the service of solving big customer challenges. We share our customers’ passion for becoming insight-driven organizations. Such enterprises are always on the lookout for new ways to leverage…

Read More

Ralph Kimball and Kaiser Permanente: Q&A Part II – Building the Landing Zone

Categories: Analytic Database Cloudera University Compliance Corporate Data Science Enterprise Data Hub General Partners Product Security Spark Success Stories

In a recent Cloudera webinar, “The Future of Data Warehousing: ETL Will Never be the Same”, Dr. Ralph Kimball, data warehousing / business intelligence thought leader and evangelist for dimensional modeling, and Manish Vipani, VP and Chief Architect of Enterprise Architecture at Kaiser Permanente, outlined the benefits of Hadoop for modernizing the ETL “back room” of…

Read More

Ralph Kimball and Kaiser Permanente: Q&A Part I – Hadoop and the Data Warehouse

Categories: Analytic Database Compliance Corporate Data Science Enterprise Data Hub General Product Security Spark Success Stories

In a recent Cloudera webinar, “The Future of Data Warehousing: ETL Will Never be the Same”, Dr. Ralph Kimball, data warehousing / business intelligence thought leader and evangelist for dimensional modeling, and Manish Vipani, VP and Chief Architect of Enterprise Architecture at Kaiser Permanente, outlined the benefits of Hadoop for modernizing the ETL “back room”…

Read More