HP XC System Software: User's Guide
Version 3.0

HP Part Number: 5991-4847

Published: January 2006

Abstract

This document provides information about the HP XC user and programming environment.


Table of Contents

About This Document
Intended Audience
Document Organization
HP XC Information
For More Information
Supplementary Information
Manpages
Related Information
Typographic Conventions
HP Encourages Your Comments
1 Overview of the User Environment
System Architecture
HP XC System Software
Operating System
Node Platforms
Node Specialization
Storage and I/O
File System
System Interconnect Network
Network Address Translation (NAT)
Determining System Configuration Information
User Environment
LVS
Modules
Commands
Application Development Environment
Parallel Applications
Serial Applications
Run-Time Environment
SLURM
Load Sharing Facility (LSF-HPC)
Standard LSF
How LSF-HPC and SLURM Interact
HP-MPI
Components, Tools, Compilers, Libraries, and Debuggers
2 Using the System
Logging In to the System
LVS Login Routing
Using the Secure Shell to Log In
Overview of Launching and Managing Jobs
Introduction
Getting Information About Queues
Getting Information About Resources
Getting Information About System Partitions
Launching Jobs
Getting Information About Your Jobs
Stopping and Suspending Jobs
Resuming Suspended Jobs
Performing Other Common User Tasks
Determining the LSF Cluster Name and the LSF Execution Host
Getting System Help and Information
3 Configuring Your Environment with Modulefiles
Overview of Modules
Supplied Modulefiles
Modulefiles Automatically Loaded on the System
Viewing Available Modulefiles
Viewing Loaded Modulefiles
Loading a Modulefile
Loading a Modulefile for the Current Session
Automatically Loading a Modulefile at Login
Unloading a Modulefile
Modulefile Conflicts
Creating a Modulefile
Viewing Modulefile-Specific Help
4 Developing Applications
Application Development Environment Overview
Compilers
MPI Compiler
Examining Nodes and Partitions Before Running Jobs
Interrupting a Job
Setting Debugging Options
Developing Serial Applications
Serial Application Build Environment
Building Serial Applications
Developing Parallel Applications
Parallel Application Build Environment
Building Parallel Applications
Developing Libraries
Designing Libraries for the CP4000 Platform
5 Submitting Jobs
Overview of Job Submission
Submitting a Serial Job Using Standard LSF
Submitting a Serial Job Using LSF-HPC
Submitting a Serial Job with the LSF bsub Command
Submitting a Serial Job Through SLURM only
Submitting a Non-MPI Parallel Job
Submitting a Parallel Job That Uses the HP-MPI Message Passing Interface
Submitting a Batch Job or Job Script
Running Preexecution Programs
6 Debugging Applications
Debugging Serial Applications
Debugging Parallel Applications
Debugging with TotalView
7 Tuning Applications
Using the Intel Trace Collector and Intel Trace Analyzer
Building a Program – Intel Trace Collector and HP-MPI
Running a Program – Intel Trace Collector and HP-MPI
Visualizing Data – Intel Trace Analyzer and HP-MPI
8 Using SLURM
Introduction to SLURM
SLURM Utilities
Launching Jobs with the srun Command
The srun Roles and Modes
Using the srun Command with HP-MPI
Using the srun Command with LSF-HPC
Monitoring Jobs with the squeue Command
Terminating Jobs with the scancel Command
Getting System Information with the sinfo Command
Job Accounting
Fault Tolerance
Security
9 Using LSF
Using Standard LSF on an HP XC System
Using LSF-HPC
Introduction to LSF-HPC in the HP XC Environment
Determining the LSF Execution Host
Determining Available LSF-HPC System Resources
Submitting Jobs
Getting Information About Jobs
Translating SLURM and LSF-HPC JOBIDs
Working Interactively Within an LSF-HPC Allocation
LSF-HPC Equivalents of SLURM srun Options
10 Advanced Topics
Enabling Remote Execution with OpenSSH
Running an X Terminal Session from a Remote Node
Using the GNU Parallel Make Capability
Example Procedure 1
Example Procedure 2
Example Procedure 3
Local Disks on Compute Nodes
I/O Performance Considerations
Shared File View
Private File View
Communication Between Nodes
A Examples
Building and Running a Serial Application
Launching a Serial Interactive Shell Through LSF-HPC
Running LSF-HPC Jobs with a SLURM Allocation Request
Example 1. Two Cores on Any Two Nodes
Example 2. Four Cores on Two Specific Nodes
Launching a Parallel Interactive Shell Through LSF-HPC
Submitting a Simple Job Script with LSF-HPC
Submitting an Interactive Job with LSF-HPC
Submitting an HP-MPI Job with LSF-HPC
Using a Resource Requirements String in an LSF-HPC Command
Glossary
Index

List of Examples

4-1 Directory Structure
4-2 Recommended Directory Structure
5-1 Submitting a Serial Job Using Standard LSF
5-2 Submitting a Serial Job Using LSF-HPC
5-3 Submitting an Interactive Serial Job Using LSF-HPC only
5-4 Submitting an Interactive Serial Job Using LSF-HPC and the LSF-SLURM External Scheduler
5-5 Submitting a Non-MPI Parallel Job
5-6 Submitting a Non-MPI Parallel Job to Run One Task per Node
5-7 Submitting an MPI Job
5-8 Submitting an MPI Job with the LSF-SLURM External Scheduler Option
5-9 Submitting a Job Script
5-10 Submitting a Batch Script with the LSF-SLURM External Scheduler Option
5-11 Submitting a Batch Job Script That Uses a Subset of the Allocation
5-12 Submitting a Batch Job Script That Uses the srun --overcommit Option
5-13 Environment Variables Available in a Batch Job Script
7-1 The vtjacobic Example Program
7-2 C Example – Running the vtjacobic Example Program
8-1 Simple Launch of a Serial Program
8-2 Displaying Queued Jobs by Their JobIDs
8-3 Reporting on Failed Jobs in the Queue
8-4 Killing a Job by Its JobID
8-5 Cancelling All Pending Jobs
8-6 Sending a Signal to a Job
8-7 Using the sinfo Command (No Options)
8-8 Reporting Reasons for Downed, Drained, and Draining Nodes
9-1 Using the External Scheduler to Submit a Job to Run on Specific Nodes
9-2 Using the External Scheduler to Submit a Job to Run One Task per Node
9-3 Using the External Scheduler to Submit a Job That Excludes One or More Nodes
9-4 Using the bjobs Command (Short Output)
9-5 Using the bjobs Command (Long Output)
9-6 Using the bhist Command (Short Output)
9-7 Using the bhist Command (Long Output)
9-8 Launching an Interactive MPI Job
9-9 Launching an Interactive MPI Job on All Cores in the Allocation
© 2003 Hewlett-Packard Development Company, L.P.