EncartaLabs

IBM BigInsights - Foundation

( Duration: 3 Days )

The IBM BigInsights Foundation training course is for those who want a foundation of IBM BigInsights. This course consists of two separate modules.

The first module is IBM BigInsights Overview and it will give you an overview of IBM's big data strategy as well as a why it is important to understand and use big data. It will cover IBM BigInsights as a platform for managing and gaining insights from big data. As such, you will see how the BigInsights have aligned their offerings to better suit your needs with the IBM Open Platform (IOP) along with the three specialized modules with value-add that sits on top of the IOP. Along with that, you will get an introduction to the BigInsights value-add including Big SQL, BigSheets, and Big R.

The second module is IBM Open Platform with Apache Hadoop. IBM Open Platform (IOP) with Apache Hadoop is the first premiere collaborative platform to enable Big Data solutions to be developed on the common set of Apache Hadoop technologies. The Open Data Platform initiative (ODP) is a shared industry effort focused on promoting and advancing the state of Apache Hadoop and Big Data technologies for the enterprise. This module provides an in-depth introduction to the main components of the ODP core --namely Apache Hadoop (inclusive of HDFS, YARN, and MapReduce) and Apache Ambari -- as well as providing a treatment of the main open-source components that are generally made available with the ODP core in a production Hadoop cluster.

  • There are no pre-requisites for this course but knowledge of Linux would be beneficial.

The IBM BigInsights Foundation workshop is ideal for:

  • Big data engineers
  • Data scientist
  • Developers or programmers
  • Administrators who are interested in learning about IBM's Open Platform with Apache Hadoop.

COURSE AGENDA

1

Introduction to Big Data

2

Introduction to IBM BigInsights

  • Getting started with IBM BigInsights
3

IBM BigInsights for Analysts

  • Working with Big SQL and BigSheets
4

IBM BigInsights for Data Scientist

  • Analyzing data with Big R, Jaql, and AQL
5

IBM BigInsights for Enterprise Management

6

IBM Open Platform with Apache Hadoop

  • Exploring the HDFS
7

Apache Ambari

  • Managing Hadoop clusters with Apache Ambari
8

Hadoop Distributed File System

  • File access & basic commands with HDFS
9

MapReduce and Yarn

  • Introduction to MapReduce based on MR1
  • Limitations of MR1
  • YARN and MR2
10

Apache Spark

  • Working with Spark's RDD to a Spark job
11

Coordination, management, and governance

12

Data Movement

  • Moving data into Hadoop with Flume and Sqoop
13

Storing and Accessing Data

  • Representing Data: CSV, XML, JSON, and YAML
  • Open Source Programming Languages: Pig, Hive, and Other [R, Python, etc]
  • NoSQL Concepts
  • Accessing Hadoop data using Hive
  • Querying Hadoop data using Hive
14

Advanced Topics

  • Controlling job workflows with Oozie
  • Search using Apache Solr

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 4,000 Modules on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting http://encartalabs.com/course-catalogue-all.php

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.

Top