EncartaLabs

Apache Hive

( Duration: 2 Days )

Apache Hive is a data warehouse system built on top of Hadoop to query Big Data. Hive originated at Facebook and was open sourced in August 2008. The challenge Facebook had to address is one faced by many companies since then. Eventually data growth in a company challenges the capabilities of deployed RDBMS or NoSQL systems. Reports and analytics start to take minutes, then hours, and eventually overlap with other queries and the whole system grinds to a halt. Another common scenario company’s start processing big data with Hadoop discovers the value of making the data accessible beyond the development team capable of writing complex map-reduce jobs.

  • Basic familiarity with SQL and/or a scripting language
  • No pre-existing knowledge of Hadoop is required

COURSE AGENDA

1

Introducing Hive

2

Getting Started with Hive

3

Data Types

4

HiveQL - Data Definition, Data Manipulation, Queries, Views, Indexes

5

Schema Design

6

Development with Hive

7

Tuning

8

MapReduce Scripts

9

Partitions and Buckets

10

Storage Formats

11

Joins

12

Hive & AWS

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 3,500 Modules on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 20,000 corporate candidates across india and abroad
  • All our trainings are conducted in workshop mode with more focus on hands On

View our other course offerings by visiting www.encartalabs.com/course-catalogue

Contact us for delivering this course as a public/open-house workshop for a group of 10+ candidates at our venue

Top