Call : (+91) 99 8080 3767
Mail : info@EncartaLabs.com
EncartaLabs

Big Data on AWS

( Duration: 3 Days )

In Big Data on AWS training course, you will learn about cloud-based big data solutions like Amazon EMR, Amazon Redshift, Amazon Kinesis, and the rest of the AWS big data platform. Learn to use Amazon EMR to process data using the broad ecosystem of Hadoop tools like Hive and Hue, create big data environments, work with Amazon DynamoDB, Amazon Redshift, Amazon QuickSight, Amazon Athena and Amazon Kinesis, and design big data environments for security and cost-effectiveness.

By attending Big Data on AWS workshop, delegates will learn to:

  • Use Apache Hadoop with Amazon EMR
  • Launch and configure an Amazon EMR cluster
  • Use common programming frameworks for Amazon EMR, including Hive, Pig, and Streaming
  • Use Hue to improve the ease-of-use of Amazon EMR
  • Use in-memory analytics with Spark on Amazon EMR
  • Understand how services like AWS Glue, Amazon Kinesis, Amazon Redshift, Amazon Athena, and Amazon QuickSight can be used with big data workloads

  • Basic familiarity with big data technologies, including Apache Hadoop, HDFS, and SQL/NoSQL querying
  • Working knowledge of core AWS services and public cloud implementation
  • Basic understanding of data warehousing, relational database systems, and database design

The Big Data on AWS class is ideal for:

  • Individuals responsible for designing and implementing big data solutions, namely Solutions Architects and SysOps Administrators
  • Data Scientists and Data Analysts interested in learning about big data solutions on AWS

COURSE AGENDA

1

Overview of Big Data

  • What is big data
  • The big data pipeline
  • Big data architectural principals
2

Big Data ingestion and transfer

  • Overview: Data ingestion
  • Transferring data
3

Big data streaming and Amazon Kinesis

  • Stream processing of big data
  • Amazon Kinesis
  • Amazon Kinesis Data Firehose
  • Amazon Kinesis Video Streams
  • Amazon Kinesis Data Analytics
4

Big data storage solutions

  • AWS data storage options
  • Storage solutions concepts
  • Factors in choosing a data store
5

Big data processing and analytics

  • Big data processing and analytics
  • Amazon Athena
6

Apache Hadoop and Amazon EMR

  • Introduction to Amazon EMR and Apache Hadoop
  • Best practices for ingesting data
  • Amazon EMR
  • Amazon EMR architecture
7

Using Amazon EMR

  • Developing and running your application
  • Launching your cluster
  • Handling output from your completed jobs
8

Hadoop programming frameworks

  • Hadoop frameworks
  • Other frameworks for use on Amazon EMR
9

Web interfaces on Amazon EMR

  • Hue on Amazon EMR
  • Monitoring your cluster
10

Apache Spark on Amazon EMR

  • Apache Spark
  • Using Spark
11

Using AWS Glue to automate ETL workloads

  • What is AWS Glue?
  • AWS Glue: Job orchestration
12

Amazon Redshift and big data

  • Data warehouses vs. traditional databases
  • Amazon Redshift
  • Amazon Redshift architecture
13

Securing your Amazon deployments

  • Securing your Amazon deployments
  • Amazon EMR security overview
  • AWS Identity and Access Management (IAM) overview
  • Securing data
  • Amazon Kinesis security overview
  • Amazon DynamoDB security overview
  • Amazon Redshift security overview
14

Managing big data costs

  • Total cost considerations for Amazon EMR
  • Amazon EC2 pricing models
  • Amazon Kinesis pricing models
  • Cost considerations for Amazon DynamoDB
  • Cost considerations and pricing models for Amazon Redshift
  • Optimizing cost with AWS
15

Visualizing and orchestrating big data

  • Visualizing big data
  • Amazon QuickSight
  • Orchestrating a big data workflow
16

Big data design patterns

  • Common architectures

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 6,000 various courses on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting http://encartalabs.com/course-catalogue-all.php

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.

Top