Category Archives: Product

Continuous Ingest in the Face of Data Drift (Part 2)

Categories: Analytic Database, Data Science, Enterprise Data Hub, General, Partners, Product

In my previous post I discussed the causes and impacts of data drift, a natural consequence of Big Data that creates serious data quality and pipeline operations issues. Now I will describe the features of StreamSets Data Collector, explain how they address ingesting data in a “drifty” environment, and walk through some common use cases. StreamSets…

Continuous Ingest in the Face of Data Drift (Part 1)

Categories: Analytic Database, Data Science, Enterprise Data Hub, General, Partners, Product

Big data has come a long way, with adoption accelerating as CIOs recognize the business value of extracting insights from the troves of data collected by their companies and business partners. But, as is often the case with innovations, mainstream adoption of big data has exposed a new challenge: how to ingest data continuously from…

Data Driven Entertainment Marketing Gains Competitive Advantage

Categories: Enterprise Data Hub, Events, General, Partners, Product, Success Stories

This week the National Retail Federation (NRF) meets in New York City, where hospitality and retail organizations from around the globe gather to learn about the next big thing in their industries, share best practices, and identify ways to use technology to drive better business decisions, create business opportunities, and capture efficiencies as…

A Year in Review for Apache Spark

Categories: Product

Though Apache Spark was first created nearly three years ago, the past year has seen tremendous growth and adoption of the project. Spark has now become the most popular Apache Software Foundation project, with 50 percent more activity than the core Apache Hadoop project itself and over 750 contributors across hundreds of companies. As part of…