Hadoop Development – Essentials

( Duration: 2 Days )

Hadoop Development - Essentials training course gives awareness about the Hadoop framework which is the de facto platform for Big Data computation. Apache Hadoop is an open-source software framework that supports data-intensive distributed applications, licensed under the Apache v2 license. It supports the running of applications on large clusters of commodity hardware. The Hadoop framework transparently provides applications with both reliability and data motion. Hadoop implements a computational paradigm named map/reduce, where the application is divided into many small fragments of work, each of which may be executed or re-executed on any node in the cluster. In addition, it provides a distributed file system that stores data on the computer nodes, providing very high aggregate bandwidth across the cluster.

Participants will learn the internals of Hadoop framework and MapReduce paradigm, its need and types, other related projects on Hadoop like Hive, Pig, HBase, Impala etc. Business and financial benefits will be covered during the session. This would give fair understanding and awareness about the Hadoop ecosystem and its role in the technological ecosystem.

By attending Hadoop Development - Essentials workshop, Participants will learn:

  • Using the Hadoop & HDFS platform
  • Loading data into HDFS
  • Introduction to MapReduce
  • Writing and debugging MapReduce jobs
  • Implementing common algorithms on Hadoop
  • Using Mahout for advanced data mining
  • Benchmarking and optimizing performance

  • Project / Program / Technical managers
  • Technical / Team leads
  • Software analysts/ engineers
  • Pre-sales consultant
  • Business development managers



Hadoop and MapReduce: An Overview

  • Big Data and the questions
  • Hadoop and the answers
  • Hadoop Cluster Configuration

Hadoop Internals and MapReduce Design Patterns

  • Hadoop framework Internals
  • MapReduce Internals
  • MapReduce Design Patterns and Use-Cases

Hadoop sub-projects

  • Hive
  • Pig
  • HBase
  • Impala

Hadoop in Production

  • Best practices for Hadoop cluster
  • Best Practices for MapReduce
  • Hadoop in the cloud
  • Big Data and Social Media

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 3,500 Modules on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 20,000 corporate candidates across india and abroad
  • All our trainings are conducted in workshop mode with more focus on hands On

View our other course offerings by visiting www.encartalabs.com/course-catalogue

Contact us for delivering this course as a public/open-house workshop for a group of 10+ candidates at our venue