This Data Analysis with Apache Drill training course covers how to use Drill to explore structured or unstructured, known or unknown, data without writing code. You explore and run SQL queries on a variety of data types, including Parquet, JSON, and CSV files. You also join data from multiple data sources without having to do any transformation on the data. The course goes on to describe how a query is received and executed by Drill. You also learn the different services involved at each step, how Drill optimizes a query for distributed SQL xecution, and how to troubleshoot and tune Drill queries.
By attending Data Analysis with Apache Drill workshop, delegates will learn to:
- Query structured table data
- Query dynamic and complex data
- Query data files
- Perform complex queries
- Work with tables, views, and temporary tables
- Explore unknown data
- Explore and visualize data with business intelligence tools
- Define and query data with secondary indexes
- Perform advanced query operations
- Extend drill with custom functions
- Drill architecture and query execution process
- Monitor Drill activity and Resources
- Use performance tuning
- Use Drill with secured data
- Examine Drill error messages
- Configure log file settings
- Troubleshoot Apache Drill
- Basic Hadoop knowledge and beginning Linux knowledge
- Beginner to intermediate knowledge of SQL
The Data Analysis with Apache Drill class is ideal for:
- Data Analysts, Data Scientists and Developers