EncartaLabs

Big Data on AWS

( Duration: 3 Days )

In Big Data on AWS training course, we show you how to use Amazon EMR to process data using the broad ecosystem of Hadoop tools like Hive and Hue. We also teach you how to create big data environments, work with Amazon DynamoDB, Amazon Redshift, Amazon Quicksight, Amazon Athena and Amazon Kinesis, and leverage best practices to design big data environments for security and cost-effectiveness.

By attending Big Data on AWS workshop, delegates will learn to:

  • Fit AWS solutions inside of a big data ecosystem
  • Leverage Apache Hadoop in the context of Amazon EMR
  • Identify the components of an Amazon EMR cluster
  • Launch and configure an Amazon EMR cluster
  • Leverage common programming frameworks available for Amazon EMR including Hive, Pig, and Streaming
  • Leverage Hue to improve the ease-of-use of Amazon EMR
  • Use in-memory analytics with Spark on Amazon EMR
  • Choose appropriate AWS data storage options
  • Identify the benefits of using Amazon Kinesis for near real-time big data processing
  • Leverage Amazon Redshift to efficiently store and analyze data
  • Comprehend and manage costs and security for a big data solution
  • Secure a Big Data solution
  • Identify options for ingesting, transferring, and compressing data
  • Leverage Amazon Athena for ad hoc query analytics
  • Use visualization software to depict data and queries using Amazon QuickSight
  • Orchestrate big data workflows using AWS Data Pipeline

  • Basic familiarity with big data technologies, including Apache Hadoop, MapReduce, HDFS, and SQL/NoSQL querying.
  • Working knowledge of core AWS services and public cloud implementation
  • Basic understanding of data warehousing, relational database systems, and database design

COURSE AGENDA

1

Day 1

  • Overview of Big Data
  • Big Data Ingestion and Transfer
  • Big Data Streaming and Amazon Kinesis
  • Big Data Storage Solutions
  • Big Data Processing and Analytics
2

Day 2

  • Apache Hadoop and Amazon EMR
  • Using Amazon EMR
  • Hadoop Programming Frameworks
  • Web Interfaces on Amazon EMR
  • Apache Spark on Amazon EMR
3

Day 3

  • Using AWS Glue to automate ETL workloads
  • Amazon Redshift and Big Data
  • Visualizing and Orchestrating Big Data
  • Managing Big Data Costs
  • Securing Your Amazon Deployments
  • Big Data Design Patterns

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 4,000 Modules on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting http://encartalabs.com/course-catalogue-all.php

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.

Top