EncartaLabs

Apache Kudu

( Duration: 2 Days )

This Apache Kudu training course covers the basics of Apache Kudu, a data storage system for the Hadoop platform that is optimized for analytical queries. The course covers common Kudu use cases and Kudu architecture. You will learn how to create, manage, and query Kudu tables, and to develop Spark applications that use Kudu.

By attending Apache Kudu workshop, delegates will learn:

  • A high-level explanation of Kudu
  • How does it compares to other relevant storage systems and which use cases would be best implemented with Kudu
  • About Kudu’s architecture as well as how to design tables that will store data for optimum performance.
  • Data management techniques on how to insert, update, or delete records from Kudu tables using Impala, as well as bulk loading methods
  • Finally, develop Apache Spark applications with Apache Kudu

  • Knowledge of SQL.
  • Familiarity with Impala is preferred but not required.
  • Knowledge to develop Apache Spark applications using either Python or Scala.
  • Basic Linux experience is expected.

The Apache Kudu class is ideal for:

  • Software developers, data engineers, DBAs, data scientists, and data analysts.

COURSE AGENDA

1

Introduction

2

Overview and Architecture

  • What Is Kudu?
  • Why Use Kudu?
  • Kudu Use Cases
  • Architecture Overview
  • Kudu Tools
3

Apache Kudu Tables

  • Kudu Tables
  • Data Storage Options
  • Designing Schemas
  • Partitioning Tables for Best Performance
  • Using Kudu Tools with Tables
4

Using Apache Kudu with Apache Impala

  • Apache Impala Overview
  • Creating and Querying Tables
  • Deleting Tables
  • Loading and Modifying Data in Kudu Tables
  • Defining Partitioning Strategy
5

Developing Apache Spark Applications with Apache Kudu

  • Apache Spark and Apache Kudu
  • Kudu, Spark SQL, and DataFrames
  • Managing Kudu Table Data with Scala
  • Creating Kudu Tables with Scala

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 4,000 Modules on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting http://encartalabs.com/course-catalogue-all.php

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.

Top