Author Archives: Arvind Prabhaker, Founder and CTO, StreamSets Inc

Continuous Ingest in the Face of Data Drift (Part 2)

Categories: Data Science, Data Warehouse, Enterprise Data Hub, General, Partners, Product

In my previous post, I discussed the causes and impacts of data drift, a natural consequence of Big Data that creates serious problems for data quality and data pipeline operations. In this post, I will describe the features of StreamSets Data Collector, explain how they address ingesting data in a “drifty” environment, and walk through some common use cases. StreamSets…
