Apache Hbase

( Duration: 2 Days )

HBase is an open source, non-relational, distributed database modeled after Google’s BigTable and is written in Java. HBase features compression, in-memory operation, and Bloom filters on a per-column basis as outlined in the original BigTable paper. Tables in HBase can serve as the input and output for MapReduce jobs run in Hadoop, and may be accessed through the Java API but also through REST, Avro or Thrift gateway APIs.

By attending Apache HBase workshop, Participants will learn to:

  • Understand HBase Architecture and use HBase shell to directly manipulate HBase Tables
  • Design optimal HBase schemas for efficient data storage and recovery
  • Connect to HBase using the Java API
  • Understand how to bulk load data into HBase using MapReduce
  • Administer HBase cluster and Identify, resolve performance bottlenecks

  • Core Java knowledge

Java Professionals, Big data and Database Professionals, Professional planning to switch to Hadoop/Big data Technologies



Introduction to HBase

  • CAP Theorem and Eventual consistency
  • NoSQL
  • Creating Table - Shell and Programming
  • Column Families
  • Column Value & Key Pair
  • Index & Query
  • Learning Scan
  • MapReduce and HBase
  • Importing into HBase

HBase Programming - Learning API

  • CRUD Operations
  • Get
  • Delete
  • Scan Operations
  • Result Scanner
  • Batch and Caching
  • Filters
  • Counters
  • Co-Processors
  • Pooling
  • Admin Operations
  • Create Table
  • Column Families
  • Table Properties
  • Table Operations
  • Schema Operations
  • Cluster Operations
  • Status Operations
  • Map Reduce Integration
  • Static vs Dynamic provisioning
  • Data Sink
  • Data Source
  • Custom Processing

HBase Architecture

  • Seek vs Transfer
  • Storage
  • Write Path
  • File Type
  • Root level
  • Table level
  • Region level
  • Region split
  • Compaction
  • HFile Format
  • Key Value Format
  • Write Ahead Log
  • Storage
  • Write Path
  • File Types
  • Read Path
  • Regions
  • Region life cycle
  • Replication
  • Zoo Keeper

HBase DB Design

  • Designing Keys
  • Schema Design
  • Handling Index
  • Integration for search
  • Transaction

Implementing HBase CRUD Framework - Case Study

  • Building a CRUD framework
  • Building HBase Admin core
  • Building HBase master data manager
  • Building Table administration
  • Building filter framework
  • Building configuration framework

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 4,000 Modules on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting http://encartalabs.com/course-catalogue-all.php

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.