EncartaLabs

Sun Grid Engine Advanced Administration

( Duration: 4 Days )

The Sun Grid Engine Advanced Administration training course provides delegates with the skills needed to install, configure, use, and troubleshoot N1 Grid Engine (N1GE). The course covers installation and advanced administration of the product. Delegates learn how to use the command-line interface to submit interactive and batch jobs to the grid. They also perform basic configuration tasks, such as displaying, adding, modifying, and deleting queue, host, and global configurations and complexes. The course also describes resource management and approaches for troubleshooting failures in the grid. Finally, delegates learn about the special scheduling features available in N1GE, including the share tree, functional, deadline, and override scheduling policies.

By attending the Sun Grid Engine Advanced Administration workshop, participants will learn to:

  • Perform installation tasks and run basic user commands, including: describing the features and architecture of the N1GE software; describing distributed resource management (DRM); performing a Network File System (NFS) installation of N1GE; and submitting batch and interactive jobs to the grid
  • Configure N1GE, including performing basic queue, host, and cluster configuration; administering complexes; and integrating applications in the grid
  • Perform advanced tasks, including administering and troubleshooting N1GE and configuring scheduling policies

To succeed fully in this course, delegates should be able to perform routine system administration tasks on the Solaris(TM) Operating System (Solaris OS).

Delegates who can benefit from this course are those responsible for installing, configuring, using, and troubleshooting N1GE.

COURSE AGENDA

1. Introducing the Grid Software

  • Define grid computing and DRM
  • Describe the types of grids
  • Define the architecture of N1GE
  • Describe jobs, queues, user types, and host types
  • Schedule jobs in the grid
  • Describe the flow of information in N1GE
  • Define High Performance Computing (HPC) environments
  • Describe the Grid Engine project
2. Installing the Grid Software

  • Describe the various types of N1GE installations
  • Describe the various spooling types
  • Describe the default scheduler profiles
  • Perform an NFS installation of N1GE
  • Describe the contents of the primary N1GE directories
3. Submitting Jobs to the Grid

  • Describe N1GE commands and job types
  • Submit batch jobs using the qsub command
  • Submit interactive jobs using the qsh, qrsh, qtcsh, and qlogin commands
  • Obtain status information for submitted jobs
  • Administer submitted jobs
4. Modifying Configuration Parameters

  • Describe the types of N1GE parameters you can configure
  • Configure the N1GE cluster parameters
  • Configure the N1GE host parameters
  • Configure the N1GE queue parameters
  • Configure the N1GE scheduler
  • Configure N1GE users
5. Configuring Resource Management and Load Parameters

  • Describe items that affect resource management: job requirements; resources; global, queue, host, and user-defined resource attributes; and inheritance rules
  • Administer the system complex list, including global, host, user-defined, and queue-related resource attributes
  • Configure the default load parameters and define custom load sensors
6. Controlling the Event Chain and Integrating Applications

  • Define the N1GE event chain and application integration
  • Describe the execution methods
  • Integrate custom applications into N1GE
  • Integrate HPC environments
7. Administering and Troubleshooting N1GE

  • Perform routine administration of N1GE queues
  • Examine N1GE log files to troubleshoot failures
  • Use command debugging to resolve failed job submissions
  • Obtain and apply patches for N1GE
  • Back up the grid engine system configuration
  • Diagnose problems in N1GE
8. Resource Allocation and Scheduling Policies

  • Describe resource allocation and scheduling
  • Describe and configure N1GE scheduling parameters
  • Describe and configure the share tree scheduling policy
  • Describe and configure the functional scheduling policy
  • Describe and configure the override scheduling policy
9. Usage Accounting and Reporting

  • Understand the various methods of gathering accounting and reporting statistics
  • Install and configure the N1GE reporting database module
  • Install and use the N1GE web-based Accounting and Reporting Console (ARCo)

Encarta Labs Advantage

  • One-stop corporate training solution provider offering over 4,000 modules on a variety of subjects
  • All courses are delivered by industry veterans
  • Go from newbie to production-ready in a matter of a few days
  • More than 50,000 corporate executives trained across the globe
  • All our training is conducted in workshop mode, with a focus on hands-on sessions

View our other course offerings by visiting http://encartalabs.com/course-catalogue-all.php

Contact us to have this course delivered as a public or open-house workshop, or as online training, for a group of 10+ candidates.
