Jump to content United States-English
HP.com Home Products and Services Support and Drivers Solutions How to Buy
» Contact HP
More options
HP.com home
HP XC System Software: User's Guide > Chapter 10 Using LSF-HPC

Determining Available System Resources

» 

Technical documentation

Complete book in PDF
» Feedback
Content starts here

 » Table of Contents

 » Glossary

 » Index

For the best use of system resources when launching an application, it is useful to know the system resources that are available for your use. This section describes how to obtain information about system resources such as the number of cores available, LSF execution host node information, and LSF-HPC system queues.

Examining System Core Status

The bhosts command displays LSF-HPC resource usage information. This command is useful to examine the status of the system cores. The bhosts command provides a summary of the jobs on the system and information about the current state of LSF-HPC. For example, it can be used to determine if LSF-HPC is ready to start accepting batch jobs.

LSF-HPC daemons run on only one node in the HP XC system, so the bhosts command will list one host, which represents all the resources of the HP XC system. The total number of cores for that host should be equal to the total number of cores assigned to the SLURM lsf partition.

By default, this command returns the host name, host status, and job state statistics.

The following example shows the output from the bhosts command:

$ bhosts
HOST_NAME           STATUS JL/U MAX  NJOBS RUN SSUSP USUSP RSV
lsfhost.localdomain   ok     -  16    0     0     0     0    0

Of note in the bhosts output:

  • The HOST_NAME column displays the name of the LSF execution host.

  • The MAX column displays the total core count (usable cores) of all available computer nodes in the lsf partition.

  • The STATUS column shows the state of LSF-HPC and displays a status of either ok or closed.

  • The NJOBS column displays the number of jobs. Note that in LSF terminology, a parallel job with 10 tasks counts as 10 jobs.

Getting Information About the LSF Execution Host Node

The lshosts command displays resource information about the LSF-HPC cluster. This command is useful for verifying machine-specific information.

LSF-HPC daemons run on only one node in the HP XC system, so the lshosts command will list one host — which represents all the resources assigned to it by the HP XC system. The total number of cores for that host should be equal to the total number of cores assigned to the SLURM lsf partition.

By default, lshosts returns the following information: host name, host type, host model, core factor, number of cores, total memory, total swap space, server information, and static resources.

The following example shows the output from the lshosts command:

$ lshosts
HOST_NAME   type    model    cpuf ncpus maxmem maxswp server RESOURCES
lsfhost.loc SLINUX6 Itanium2 16.0    12  3456M      -    Yes (slurm)
n7          UNKNOWN UNKNOWN_  1.0     -      -      -     No ()
n8          UNKNOWN UNKNOWN_  1.0     -      -      -     No ()
n2          UNKNOWN UNKNOWN_  1.0     -      -      -     No ()

Of note in the lshosts output:

  • The HOST_NAME column displays the name of the LSF execution host, lsfhost.localdomain and any other HP XC nodes that have been granted a floating client license because LSF commands were executed on them. LSF-HPC does not know about these floating client hosts, so they are listed as UNKNOWN types and models.

  • The type column displays the type of resource. This value is SLINUX64 for all HP XC systems.

  • The ncpus column displays the total core count (usable cores) of all available computer nodes in the lsf partition.

  • The maxmem column displays minimum maxmem over all available computer nodes in the lsf partition.

  • The maxtmp column (not shown) displays minimum maxtmp over all available computer nodes in the lsf partition. Use the lshosts -l command to display this column.

Getting Host Load Information

The LSF lsload command displays load information for LSF execution hosts.

$ lsload
HOST_NAME    status  r15s  r1m  r15m  ut  pg  ls  it  tmp  swp  mem
lsfhost.loc   ok        -    -     -   -   -   4   -    -    -    -

In the previous example output, the LSF execution host (lsfhost.localdomain) is listed under the HOST_NAME column. The status is listed as ok, indicating that it can accept remote jobs. The ls column shows the number of current login users on this host.

See the OUTPUT section of the lsload manpage for further information about the output of this example. In addition, see the Platform Computing Corporation LSF documentation and the lsload(1) manpage for more information about the features of this command.

For individual compute node load information, see the discussion on metrics in shownode(1).

Examining System Queues

All jobs on the HP XC system that are submitted to LSF-HPC are placed into an LSF-HPC job queue.HP recommends that you examine the status and availability of LSF-HPC system queues before launching a job so that you can select the most appropriate queue for your job.

You can easily verify the status, limits, and configurations of LSF-HPC queues with the bqueues command. This command is fully described in Platform Computing Corporation's LSF documentation and manpages.

See bsub(1) for more information on submitting jobs to specific queues.

For more information on the bqueues command, see bqueues(1).

Getting Information About the lsf Partition

Information about the SLURM lsf compute node partition can be viewed with the SLURM sinfo command. A partition is one or more compute nodes that have been grouped together. A SLURM lsf partition is created when the HP XC system is installed. This partition contains the resources that will be managed by LSF-HPC and available for jobs submitted to LSF-HPC.

The sinfo command reports the state of the lsf partition and all other partitions on the system. The sinfo command displays a summary of available partition and node information such as partition names, nodes/partition, and cores/node). It has a wide variety of filtering, sorting, and formatting options.

The following example shows the use of the sinfo command to obtain lsf partition information:

$ sinfo -p lsf
PARTITION AVAIL TIMELIMIT NODES  STATE NODELIST
lsf          up  infinite   128   idle n[1-128]

Use the following command to obtain more information on the nodes in the lsf partition:

$ sinfo -p lsf -lNe
NODELIST    NODES PARTITION STATE CPUS MEMORY TMP_DISK WEIGHT FEATURES REASON
n[1-128]      128       lsf  idle    2   3456        1      1   (null) none

See “Getting System Information with the sinfo Command” and the sinfo(1) manpage and for further information about using the sinfo command.

Printable version
Privacy statement Using this site means you accept its terms Feedback to webmaster
© 2003 Hewlett-Packard Development Company, L.P.