EncartaLabs

IBM SPSS Modeler - Data Preparation

( Duration: 1 Day )

The Data Preparation Using IBM SPSS Modeler training course covers advanced topics to aid in the preparation of data for a successful data science project. You will learn how to use functions, deal with missing values, use advanced field operations, handle sequence data, apply advanced sampling methods, and improve efficiency.

By attending Data Preparation Using IBM SPSS Modeler workshop, attendees will learn:

  • Using functions to cleanse and enrich data
  • Using additional field transformations
  • Working with sequence data
  • Sampling, partitioning and balancing data
  • Improving efficiency

  • Experience using IBM SPSS Modeler including familiarity with the Modeler environment, creating streams, reading data files, exploring data, setting the unit of analysis, combining datasets, deriving and reclassifying fields, and basic knowledge of modeling.

The Data Preparation Using IBM SPSS Modeler class is ideal for:

  • Anyone who wants to become familiar with the full range of techniques available in IBM SPSS Modeler for data preparation.

COURSE AGENDA

1

Using functions to cleanse and enrich data

  • Use date functions
  • Use conversion functions
  • Use string functions
  • Use statistical functions
  • Use missing value functions
2

Using additional field transformations

  • Replace values with the Filler node
  • Recode continuous fields with the Binning node
  • Change a field's distribution with the Transform node
3

Working with sequence data

  • Use sequence functions
  • Count an event across records
  • Expand a continuous field into a series of continuous fields with the Restructure node
  • Use geospatial and time data with the Space-Time-Boxes node
4

Sampling, partitioning and balancing data

  • Draw simple and complex samples with the Sample node
  • Create a training set and testing set with the Partition node
  • Reduce or boost the number of records with the Balance node
5

Improving efficiency

  • Use database scalability by SQL pushback
  • Process outliers and missing values with the Data Audit node
  • Use the Set Globals node
  • Use parameters
  • Use looping and conditional execution

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 4,000 Modules on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting http://encartalabs.com/course-catalogue-all.php

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.

Top