EncartaLabs

Apache Mahout

Apache Mahout is a machine learning project that combines the advantages of having a permissive open source license supporting almost any business use-case you can think of; a very active community responding to user requests and helping analyze your specific data problems; a production ready implementation of algorithms covering most of the sophisticated data analysis jobs you would want to run on your data while still being open and easy to adjust to your specific needs.

This Apache Mahout training course provides the fundamentals of apache mahout core concepts, machine learning, engine in apache mahout, clustering, classification, apache mahout and Amazon EMR, etc.

By attending Apache Mahout workshop, delegates will learn:

  • About big data and how it is changing the way business is done
  • How machine learning helps in predictions, analysis and rapid processing
  • About Apache Mahout and the algorithms it uses for machine learning

  • Basic knowledge of Java programming
  • Familiarity with Hadoop terms and concepts would be beneficial

COURSE AGENDA

Apache Mahout - Essentials
(Duration : 2 Days)

1

Introduction to Apache Mahout

2

Recommendations using Apache Mahout

3

User based recommendation

4

Item based recommendation

5

Implementing a recommender using map reduce

6

Clustering

7

Clustering algorithms

8

Implementing clustering in Hadoop

9

Classification

10

Evaluating a classifier

11

Developing a classifier


Apache Mahout - Advanced
(Duration : 2 Days)

1

Recommendation Engine

2

Recommendation systems

3

Content Based

4

Collaborative filtering

5

User based

6

Threshold

7

Item based

8

Mahout Optimizations

9

Recommendation platform

  • Similarity measures
  • Manhattan distance
  • Euclidean distance
  • Cosine Similarity
  • Pearson's Correlation Similarity
  • Loglikihood Similarity
10

Tanimoto

11

Evaluating Recommendation engines

  • Online
  • Offline
12

Clustering

  • Common Clustering Algorithms
  • K-means
  • Fuzzy K-means, Mean Shift etc
  • Representing data
  • Feature Selection
  • Vectorization
  • Representing Vectors
13

Classification

14

Common Algorithms

  • Mahout on Hadoop
  • Apache Mahout & Myrrix
15

Mahout on Amazon EMR

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 4,000 Modules on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting http://encartalabs.com/course-catalogue-all.php

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.

Top