EncartaLabs

Apache Hbase

( Duration: 2 Days )

HBase is an open source, non-relational, distributed database modeled after Google’s BigTable and is written in Java. HBase features compression, in-memory operation, and Bloom filters on a per-column basis as outlined in the original BigTable paper. Tables in HBase can serve as the input and output for MapReduce jobs run in Hadoop, and may be accessed through the Java API but also through REST, Avro or Thrift gateway APIs.

By attending Apache HBase workshop, Participants will learn to:

  • Understand HBase Architecture and use HBase shell to directly manipulate HBase Tables
  • Design optimal HBase schemas for efficient data storage and recovery
  • Connect to HBase using the Java API
  • Understand how to bulk load data into HBase using MapReduce
  • Administer HBase cluster and Identify, resolve performance bottlenecks

  • Core Java knowledge

Java Professionals, Big data and Database Professionals, Professional planning to switch to Hadoop/Big data Technologies

COURSE AGENDA

1

Introduction to HBase

  • CAP Theorem and Eventual consistency
  • NoSQL
  • Creating Table - Shell and Programming
  • Column Families
  • Column Value & Key Pair
  • Index & Query
  • Learning Scan
  • MapReduce and HBase
  • Importing into HBase
2

HBase Programming - Learning API

  • CRUD Operations
  • Get
  • Delete
  • Scan Operations
  • Result Scanner
  • Batch and Caching
  • Filters
  • Counters
  • Co-Processors
  • Pooling
  • Admin Operations
  • Create Table
  • Column Families
  • Table Properties
  • Table Operations
  • Schema Operations
  • Cluster Operations
  • Status Operations
  • Map Reduce Integration
  • Static vs Dynamic provisioning
  • Data Sink
  • Data Source
  • Custom Processing
3

HBase Architecture

  • Seek vs Transfer
  • Storage
  • Write Path
  • File Type
  • Root level
  • Table level
  • Region level
  • Region split
  • Compaction
  • HFile Format
  • Key Value Format
  • Write Ahead Log
  • Storage
  • Write Path
  • File Types
  • Read Path
  • Regions
  • Region life cycle
  • Replication
  • Zoo Keeper
4

HBase DB Design

  • Designing Keys
  • Schema Design
  • Handling Index
  • Integration for search
  • Transaction
5

Implementing HBase CRUD Framework - Case Study

  • Building a CRUD framework
  • Building HBase Admin core
  • Building HBase master data manager
  • Building Table administration
  • Building filter framework
  • Building configuration framework

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 3,500 Modules on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 20,000 corporate candidates across india and abroad
  • All our trainings are conducted in workshop mode with more focus on hands On

View our other course offerings by visiting www.encartalabs.com/course-catalogue

Contact us for delivering this course as a public/open-house workshop for a group of 10+ candidates at our venue

Top