EncartaLabs

Apache Airflow

( Duration: 2 Days )

Apache Airflow training course is designed to familiarize delegates with the use of Airflow schedule and maintain numerous Extract, Transform and Load (ETL) processes running on a large scale Enterprise Data Warehouse (EDW).

As Data Warehouses (DWs) increase in complexity it is important to have a dependable, scalable, intuitive, and simple scheduling and management program to monitor the flow of data and watch how transformations are completed. Apache Airflow, originally conceived of by AirBnb to help manage the complexities of their EDW, is being adopted by tech companies everywhere for its ease of management, scalability, and elegant design. Airflow is rapidly becoming the go-to technology for companies scaling out large data warehouses.

The course begins with an introduction to Airflow which includes a brief background and history of Airflow and covers the Airflow framework, database and User Interface (UI). Next, the course dives into Airflow development including operators and plugins, Directed Acyclic Graphs (DAGs), and scheduling. The course concludes with a session on deploying with Airflow and complex task dependency management.

By attending Apache Airflow workshop, delegates will learn to:

  • Assess how to organize and arrange scheduling.
  • Determine how to standardize Extract, Transform and Load (ETL) formats and processes.
  • Integrate Scheduling code into regular code flows.

  • DevOps engineers who want to monitor their enterprise data warehouses.

COURSE AGENDA

1

Introduction to Airflow

  • Background and history – Airbnb, DAGS, python
  • The Airflow framework and python
  • The Airflow database
  • The Airflow UI and Webserver
2

Development in Airflow

  • Introduction to DAGs
  • Airflow, Abstraction for developers
  • Operators and plugins
  • Hooks, connections and variables
  • Visualizing your DAGs
  • Scheduling- when and how to
  • RabbitMQ and multiple workers
3

Deploying with Airflow

  • Working with AIRFLOW management
  • Complex task dependency management
  • Airflow and Docker- setting up the environment
  • Setting up large scale environments with Airflow in git

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 4,000 Modules on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting http://encartalabs.com/course-catalogue-all.php

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.

Top