Category Archives: Data Science

Open Data Science and Machine Learning for Business with Cloudera Data Science Workbench on HDP

Categories: Data Science

It’s official – Cloudera and Hortonworks have merged, and today I’m excited to announce the availability of Cloudera Data Science Workbench (CDSW) for Hortonworks Data Platform (HDP).  Trusted by large data science teams across hundreds of enterprises — Western Union and IQVIA to name just a couple — CDSW is now also ready to help…

Read More

Big Data Fabric Weaves Together Automation, Scalability, and Intelligence

Categories: Data Science

Today’s data landscape is characterized by exponentially increasing volumes of data, comprising a variety of structured, unstructured, and semi-structured data types originating from an expanding number of disparate data sources located on-premises, in the cloud, and at the edge. In conjunction with the evolving data ecosystem are demands by business for reliable, trustworthy, up-to-date data…

Read More

An introduction to Federated Learning

Categories: Data Science Machine Learning

We’re excited to release Federated Learning, the latest report and prototype from Cloudera Fast Forward Labs. Federated learning makes it possible to build machine learning systems without direct access to training data. The data remains in its original location, which helps to ensure privacy and reduces communication costs. This article is about the business case…

Read More

The Data Science Iron Triangle – Modern BI and Machine Learning

Categories: Data Engineering Data Science Data Warehouse Machine Learning

The New Iron Triangle It is cliché to discuss IT/business solutions as people, process, and technology. Some call it the “golden triangle,” but in this blog, we refer to it as the iron triangle. Since the 1960s, technology has disrupted business through the advent of computing and information management. These systems replaced highly manual operations…

Read More