New Advanced Analytics and Data Wrangling Tutorials on Cloudera Live

Categories: General

When it comes to learning Apache Hadoop and CDH (Cloudera’s open source platform including Hadoop), there is no better place to start than Cloudera Live.  With a quick, one-button deployment option, Cloudera Live launches a four-node Cloudera cluster that you can learn and experiment in free for two-weeks. To help plan and extend the capabilities of your cluster, we also offer various partner deployments. Building on the addition of interactive tutorials and Tableau and Zoomdata integration, we have added a new tutorial on Apache Spark and a new Trifacta partner deployment.

One of the most popular tools in the Hadoop ecosystem is Apache Spark. This easy-to-use, general-purpose framework is extensible across multiple use cases – including batch processing, iterative advanced analytics, and real-time stream processing. With support and development from multiple industry vendors and partner tools, Spark has quickly become a standard within Hadoop.

With the new tutorial, “Relationship Strength Analytics Using Spark,” it will walk you through the basics of Spark and how you can utilize the same, unified enterprise data hub to launch into advanced analytics. Using the example of product relationships, it will walk you through how to discover what products are commonly viewed together, how to optimize product campaigns together for better sales, and discover other insights about product relationships to help build advanced recommendations.

In addition to the Spark tutorial, we have also added another partner deployment. One of the key strengths of implementing an enterprise data hub is its ability to integrate with other popular tools that you may already be using or would like to implement. One such tool is Trifacta. Trifacta lets you easily transform raw, complex data into clean and structured formats for analysis, so you can get more value from your data faster. With the new Trifacta deployment on Cloudera Live, you get the full functionality of Cloudera’s platform, along with an integrated trial of the Trifacta Data Transformation Platform to help you wrangle a variety of complex data.

To get started with Cloudera Live, visit www.cloudera.com/live

To learn more about Hadoop, you can also check out our online resources or see what training options best suit your needs.

facebooktwittergoogle_pluslinkedinmail

2 responses on “New Advanced Analytics and Data Wrangling Tutorials on Cloudera Live

Leave a Reply