EncartaLabs

StreamSets DataOps Platform

( Duration: 2 Days )

This StreamSets DataOps Platform training course provides skills for building, managing, and monitoring data flow pipelines.

By attending StreamSets DataOps Platform workshop, delegates will learn to:

  • Use of the StreamSets Data Collector (SDC) engine to create complex pipelines that ingest data from a variety of sources, manipulate that data, and then export it to destinations including Apache Kafka, relational database management systems, and Apache Hadoop.
  • Configure and use the StreamSets Transformer engine to access the various environments, transfer and transform data, and run jobs, and monitor the performance of pipelines across all instances of StreamSets products running in the organization.
  • Manage users, design and share pipelines, use the Pipeline Repository, configure and run jobs, and monitor the performance of pipelines across the organization with StreamSets Control Hub.

  • General knowledge of operating systems, networking, programming concepts, and databases.

The StreamSets DataOps Platform class is ideal for:

  • Data Engineers who will be building, managing, and monitoring data flow pipelines.

COURSE AGENDA

1

Getting Started

  • Set Up a Deployment
  • Build a Pipeline
  • Run a Job
  • Monitor a Job
  • Schedule a Job
2

Build Pipelines with Data Collector

  • JDBC Pipeline
  • CDC Pipeline
  • Snowflake Pipeline
  • Databricks Pipeline
  • Kafka Pipeline
3

Build Pipelines with Transformer

  • Build Transformer Pipelines
  • Understand Apache Spark
  • Tuning Transformer Pipelines
  • Origins, Operators, Destinations
4

Managing Pipelines

  • Create a Topology
  • Set up Alerts and Subscriptions
  • Installing Packages, Libraries and Drivers
  • Collaboration and Version Control
  • Implementing Pipeline CI/CD
5

Extending the DataOps Platform

  • Working with the REST API
  • Advanced Processing with Script Evaluators
6

Logging and Troubleshooting

  • Logging and Troubleshooting with Data Collectors
  • Logging and Troubleshooting with Transformer

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 4,000 Modules on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting http://encartalabs.com/course-catalogue-all.php

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.

Top