EncartaLabs

Data Analysis with Apache Drill

( Duration: 2 Days )

This Data Analysis with Apache Drill training course covers how to use Drill to explore structured or unstructured, known or unknown, data without writing code. You explore and run SQL queries on a variety of data types, including Parquet, JSON, and CSV files. You also join data from multiple data sources without having to do any transformation on the data. The course goes on to describe how a query is received and executed by Drill. You also learn the different services involved at each step, how Drill optimizes a query for distributed SQL xecution, and how to troubleshoot and tune Drill queries.

By attending Data Analysis with Apache Drill workshop, delegates will learn to:

  • Query structured table data
  • Query dynamic and complex data
  • Query data files
  • Perform complex queries
  • Work with tables, views, and temporary tables
  • Explore unknown data
  • Explore and visualize data with business intelligence tools
  • Define and query data with secondary indexes
  • Perform advanced query operations
  • Extend drill with custom functions
  • Drill architecture and query execution process
  • Monitor Drill activity and Resources
  • Use performance tuning
  • Use Drill with secured data
  • Examine Drill error messages
  • Configure log file settings
  • Troubleshoot Apache Drill

  • Basic Hadoop knowledge and beginning Linux knowledge
  • Beginner to intermediate knowledge of SQL

The Data Analysis with Apache Drill class is ideal for:

  • Data Analysts, Data Scientists and Developers

COURSE AGENDA

1

Welcome to Class

  • Course introduction
  • Prepare your lab environment
2

Interface with Apache Drill

  • Introduction to Apache Drill
  • Explore data that can be used with Drill
  • Interface with Apache Drill
3

SQL Analytics with Apache Drill

  • Query structured table data
  • Query dynamic and complex data
  • Query data files
  • Perform complex queries
4

Explore and Visualize Data with Apache Drill

  • Create and drop tables, views, and temporary tables
  • Explore unknown data
  • Explore and visualize data with BI tools
5

Advanced Apache Drill Operations

  • Define and query data with secondary indexes
  • Advanced query operations
  • Extend Drill with custom functions
6

Monitor and Tune Drill Performance

  • Drill architecture and query execution process
  • Monitor Drill activity and resources
  • Performance tuning
  • Use Drill with secured data
7

Troubleshoot and Debug Queries

  • Examine Drill error messages
  • Configure log file settings
  • Troubleshoot Apache Drill

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 4,000 Modules on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting http://encartalabs.com/course-catalogue-all.php

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.

Top