EncartaLabs

Orange

Data Mining is the process of finding anomalies, patterns and correlations within large data sets to predict outcomes. Using a broad range of techniques such as statistics and machine learning , you can use this information to increase revenues, cut costs, improve customer relationships, reduce risks and more. Orange is a platform built for data mining, predictive analytics on a GUI based workflow. This signifies that you do not have to know how to code to be able to work using Orange and mine data, crunch numbers and derive insights. Orange comes with many machine learning and predictive analytics toolbox such as supervised learning, unsupervised learning, model evaluation, image analytics, text mining that make it a great tool for predictive analytics and machine learning. This Data Mining and Machine Learning with Orange training course will teach you how to apply data mining and machine learning techniques using Orange.

Text Mining is the process of exploring and analysing large amounts of unstructured text data aided by software that can identify concepts, patterns, topics, keywords and other attributes in the data. Text mining helps organisations find potentially valuable business insights in corporate documents, customer emails, call center logs, verbatim survey comments, social network posts, medical records and other sources of text-based data. Orange is a powerful free open-source data analytics and visualisation tool for text mining.

Orange is a platform built for data mining, predictive analytics on a GUI based workflow. This signifies that you do not have to know how to code to be able to work using Orange and mine data, crunch numbers and derive insights. You can perform tasks ranging from basic visuals to data manipulations, transformations, and data mining. It consolidates all the functions of the entire process into a single workflow.

By attending Data Mining and Machine Learning with Orange workshop, delegates will learn:

  • Apply data mining and machine learning principles to assess business insights
  • Integrate information from datasets
  • Apply predictive data modelling techniques to identify underlying trends in data
  • Apply machine learning classification techniques to gain new insights from data
  • Apply clustering techniques to discover data pattern and make decision
  • Develop prototype algorithms with dimension reduction techniques
  • Construct association rules to Identify patterns across multiple data sets to derive insights

By attending Text Mining with Orange workshop, delegates will learn:

  • Text Preprocessing
  • Text Clustering
  • Text Classification
  • Topic Modeling

By attending Predictive Analytics with Orange workshop, delegates will learn:

  • Overview of Orange
  • Classification and Predictive Modeling
  • Regression Analysis
  • Clustering
  • Image Analytics
  • Dimension Reduction

  • Data Analysts
  • Data Scientists
  • Text Analysts
  • Marketeers
  • Engineers

COURSE AGENDA

Data Mining and Machine Learning with Orange
(Duration : 2 Days)

1

Overview of Data Mining and Machine Learning

  • Data Mining Process
  • Overview of Machine Learning
  • Impact of Data Mining and ML to Access Business Insights
2

Data Preparation

  • Import/Export Data
  • Filter Data
  • Join Data
  • Clean Data
3

Regression

  • What is Regression
  • Linear Regression
  • Underfitting and Overfitting
  • Regularization Techniques
4

Classification

  • What is Classification
  • Classification Algorithms
  • K-Fold Cross Validation
  • Model Evaluation Metrics
  • Confusion Matrix
5

Clustering

  • What is Clustering
  • K-Means Clustering
  • Silhouette Analysis
  • Hierarchical Clustering
6

Dimension Reduction

  • Principal Component Analysis (PCA)
  • Feature Ranking
7

Association Analysis

  • Association Rules
  • Constructing Rules
Text Mining with Orange
(Duration : 1 Day)

1

Overview of Text Mining

  • What is Text Mining
  • Upload Corpus
  • Upload Document
2

Preprocessing Text

  • Pre-process Text
  • Stopwords
  • Bag of Words (BoW)
  • Similarity Hashing
3

Online API Search

  • Wikipedia
  • Twitter
4

Text Clustering

  • Cosine Distance
  • Hierarchical Clustering & Dendrogram
  • Text Clustering Workflow
  • Similarity Clustering with Hashing
5

Text Classification

  • What is Classification
  • Classification Algorithms
  • Text Classification and Prediction
  • K-Fold Cross Validation
  • Similarity Hashing
6

Topic Modeling

  • Latent Semantic Indexing (LSP)
  • Latent Dirichlet Allocation (LDA)
Predictive Analytics with Orange
(Duration : 1 Day)

1

Overview of Predictive Analytics and Orange

  • Data Mining Process
  • Introduction to Machine Learning
  • Supervised vs UnSupervised Learnings
  • Overview of Orange
2

Data Preparation

  • Load Data to Orange
  • Interactive Visualization
  • Filter Data
  • Merge and Concat Data
  • Preprocess Data
  • Feature Statistics
  • Save Data
3

Regression

  • What is Regression
  • Linear Regression
  • Model Evaluation Metrics for Regression
  • Regularization
4

Classification

  • What is Classification
  • Classification Algorithms
  • K-Fold Cross Validation
  • Model Evaluation Metrics for Classification
  • Confusion Matrix
  • ROC Analysis for Binary Classification
5

Clustering

  • What is Clustering
  • K-Means Clustering
  • Silhouette Analysis
  • Hierarchical Clustering
6

Dimension Reduction

  • What is Dimension Reduction
  • Principal Component Analysis (PCA)
  • Feature Ranking
  • t-SNE and MDS
7

Association Analysis

  • What is Association Analysis
  • Apriori Algorithm
  • Association Analysis with Orange

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 4,000 Modules on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting http://encartalabs.com/course-catalogue-all.php

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.

Top