EncartaLabs

Talend Data Integration

The Talend Data Integration training course helps you use Talend Studio for Data Integration as quickly as possible. It focuses on the basic capabilities of Studio and how you can use it to build reliable, maintainable data integration tasks that solve practical problems, including extracting data from common database and file formats, transforming it, and integrating it into targets. Attendees will also learn to use more advanced features of Talend Data Integration which provides an extensible, highly scalable set of tools to access, transform, and integrate data from any business system to meet both operational and analytical data integration needs.

By attending Talend Data Integration - Essentials workshop, attendees will learn to:

  • Create a project
  • Create and run a Job that reads, converts, and writes data
  • Merge data from several sources within a Job
  • Save a schema for repeated use
  • Create and use metadata and context variables within Jobs
  • Connect to, read from, and write to a database from a Job
  • Access a web service from a Job
  • Work with master Jobs and subJobs
  • Build, export, and test-run Jobs outside Studio
  • Implement basic error-handling techniques
  • Use best practices for Job and component naming, hints, and documentation

By attending Talend Data Integration - Advanced workshop, attendees will learn:

  • Start Talend Studio and connect it to a remote repository
  • Run a Job from Talend Studio on a remote JobServer
  • Use debugging tools that do not require deep Java development skills
  • Implement several methods of parallel execution in a Talend Job
  • Design a Job using the memorization logic
  • Create Joblets
  • Use custom Java codes in Talend Studio Jobs
  • Configure a database table to be monitored for changes in a separate change data capture (CDC) database

For Talend Data Integration - Essentials training course:
  • Basic knowledge of computing, including familiarity with Java or another programming language, SQL, and general database concepts
For Talend Data Integration - Advanced training course:
  • Attend Talend Data Integration - Essentials course and familiarity with Java or another programming language, SQL, and general database concepts
  • Anyone who wants to use Talend Data Integration to perform data integration and management tasks: software developers and development managers

COURSE AGENDA

Talend Data Integration - Essentials
(Duration : 2 Days)

1

Getting started with Talend Data Integration

  • Starting Talend Studio
  • Creating your first Job
  • Running a Job
  • Using the component help
  • Designing a Job using best practices
  • Documenting a Job
2

Working with files

  • Working with delimited files
  • Working with hierarchical files
3

Working with databases

  • Creating tables in MySQL databases
  • Reading data from MySQL database tables
  • Applying best practices
4

Using repository metadata

  • Using delimited file metadata
  • Using XML file metadata
  • Using database metadata
  • Using generic schemas
  • Updating metadata
5

Processing data

  • Mapping data using tMap
  • Joining data using tMap
  • Capturing join rejects
  • Filtering data and capturing filtering rejects
  • Using other data processing components
6

Using contexts and context variables

  • Creating a built-in context variable
  • Connecting to databases using context variables
  • Creating a context group in the repository
  • Loading context variables from a flow
7

Building executables and Docker images from data integration Jobs

  • Building a stand-alone Job
  • Building a new version of the Job
  • Building a Docker image
8

Controlling execution

  • Managing files
  • Processing files
  • Managing Job execution using a master Job
9

Handling errors

  • Detecting and handling basic errors
  • Raising a warning
10

Working with web services

  • Accessing a SOAP web service
11

Use case: Creating a master sales table from different data sources

  • Setting up a customer table
  • First challenge
  • Setting up a sales table
  • Joining data
  • Performing calculations
  • Second challenge
  • Creating a master Job
Talend Data Integration - Advanced
(Duration : 1 Day)

1

Using Git in Talend Studio

  • Using Git in Talend Studio
2

Remote Job execution

  • Creating and running a Job remotely
3

Basic debugging

  • Debugging Jobs using Debug Run
4

Parallel execution

  • Writing large files
  • Writing to databases
  • Automatic parallelization
  • Partitioning
5

Memorizing data

  • Preparing and memorizing a sequence of rows with the tMemorizeRows component
  • Reusing memorized rows in tMap
6

Joblets

  • Creating a Joblet from an existing Job
  • Creating a Joblet from scratch
  • Triggering Joblets
7

Using Custom Java Code

  • Using Java in Talend Studio
  • Executing simple instructions with tJava
  • Processing data with custom Java code using the tJavaRow and tJavaFlex components
  • Creating and using Java routines
8

Change data capture

  • Examining databases
  • Configuring the CDC database
  • Monitoring changes
  • Updating a warehouse
  • Resetting the database

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 4,000 Modules on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting http://encartalabs.com/course-catalogue-all.php

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.

Top