The Talend Big Data - Machine Learning training course, covers the implementation of machine learning algorithms in Big Data batch Jobs using the Spark framework.
By attending Talend Big Data - Machine Learning workshop, attendees will learn to:
- Connect to a Hadoop cluster from a Talend Job
- Use context variables and metadata
- Read and write files in HDFS in a Big Data batch Job
- Configure a Big Data batch Job to use the Spark framework
- Create and test recommendation models
- Create and test classification models
- Use a machine learning algorithm to deduplicate data
- Attend Talend Big Data - Essentials or Talend Data Quality - Essentials course or equivalent experience.
Anyone who wants to use Talend Studio to industrialize machine learning algorithms