EncartaLabs

Talend Big Data - Spark Batch

( Duration: 1 Day )

The Talend Big Data - Spark Batch training course, covers Big Data batch Jobs that use the Spark framework.

By attending Talend Big Data - Spark Batch workshop, attendees will learn to:

  • Develop a Big Data batch Job using the Spark framework
  • Execute Spark Jobs in YARN client and cluster mode
  • Enable Spark history server event logging
  • Copy data from a local file to HDFS
  • Copy data from MySQL to HDFS
  • Create a Hive table and copy data from HDFS to it
  • Import tweets to HDFS
  • Join, sort, and aggregate data
  • Use caches for faster processing
  • Query data from a Hive table using Hive QL
  • Query data from Spark datasets using Spark SQL

  • Attend Talend Big Data - Essentials course or equivalent experience.

Anyone who wants to use Talend Studio to interact with Big Data systems

COURSE AGENDA

1

Spark in context

  • Concepts
2

Introduction to Spark

  • Developing and configuring a Big Data batch Job to use the Spark framework
  • Executing a Big Data Spark batch Job
  • Tracking a Big Data Spark batch Job execution
3

Sentiment analysis use case

  • Using the Twitter application programming interface (API) with Talend components
  • Loading tweets into HDFS
  • Processing tweets with a Big Data batch Job using the Spark framework
  • Enabling Spark history server event logging
  • Executing a Big Data Spark batch Job in YARN cluster mode
  • Deploying and scheduling Job execution from Talend Administration Center (TAC)
4

Download analysis use case

  • Retrieving RDBMS data from a Big Data Spark batch Job
  • Loading data into a Hive table and HDFS
  • Executing HiveQL queries from a Big Data Spark batch Job
  • Using caches for faster Spark batch Job processing
  • Performing download analysis with a Big Data Spark batch Job
  • Executing a Spark SQL query on data read from a NoSQL HBase table

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 4,000 Modules on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting http://encartalabs.com/course-catalogue-all.php

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.

Top