 |
» |
|
|
 |
Proper maintenance of the file system is crucial
to the operation of the HP XC system. This section describes
key directories and offers guidelines for the maintenance of the file
system. The basic file system layout is the same as that
of a standard Linux file system. Additions to the layout for the HP XC system
are described in this section. Figure 1-1 illustrates the top of the file system hierarchy for a node in
the HP XC system.  |  |  |  |  | IMPORTANT: The HP XC system relies on key files.
Interfering with these files can cause the system to fail. The best
way to avoid this situation is to respect the placement of directories
and files, especially when installing software packages. The file
system layout is structured to isolate the files specific to the HP XC System Software from
base operating system files. This isolation enables flexibility and
ease of potential upgrades of the system software and keeps software
from conflicting with user-installed software. You can avoid many
pitfalls by strictly maintaining separation between the HP XC System Software and
other software. |  |  |  |  |
The following sections describe the base operating
system files, directories specific to the HP XC System Software, system software
configuration files, and log files. Key Operating System Directories |  |
Following are the key directories: - /bin
Holds the binary executable
files that any user can invoke. - /sbin
Holds the binary executable
files that are specifically for the system administrator or designated
privileged users. - /etc
Contains files for the configuration
of the system and components of the system, including networking information,
printers, and so on. - /hptc_cluster
Reserved for the exclusive
use of the HP XC System Software. This directory is
a file system that is mounted on all the nodes in the HP XC system.
This systemwide directory contains key directories and files for global
system use. See “Systemwide Directory, /hptc_cluster” for more information. - /opt
Holds software packages
from HP and other vendors. - /opt/hp
Is reserved for optional HP applications
and utilities that apply to the HP XC system. HP-MPI
is an example of such a package. - /opt/hptc
Reserved for the exclusive
use of the HP XC System Software. HP XC specific software and associated
software packages are maintained under this directory. For more information,
see “HP XC System Software Directory, /opt/hptc”. This directory also contains subdirectories
for HP XC service configuration files. For more information
on those directories, see “HP XC Service Configuration Files”. - /usr
Maintains a hierarchy of
standard commands and files. - /usr/bin
Holds the binary executable
files that any user can invoke. - /usr/sbin
Holds the binary executable
files that are specifically for the system administrator or designated
privileged users. - /usr/local
Provides an area to store
executable programs that were produced locally. - /var
Holds various log, spooling,
and temporary storage directories. - /tmp
Provides temporary file
storage.
New third-party software packages may break the
system. Be sure to assign individual areas when using relocatable
software packages, that is, do not intermingle other vendors'
software with the HP XC system files directories. Systemwide Directory, /hptc_clusterThe /hptc_cluster directory
is the global file system on an HP XC system. This file system
is shared and mounted by all nodes. This directory contains configuration
and log file information that is applicable across the system; various
services rely on the files in this directory. These log files are
in the /hptc_cluster/adm/logs directory. Use the following guidelines for the /hptc_cluster directory: Keep this directory small. Ensure that the minimum
amount of data is maintained in this directory. Ensure that the /hptc_cluster file system utilization is less than 90
percent. You can use the df command to determine
the current value. Also, Nagios reports on high utilization based
on thresholds in the nagios_vars.ini file; for
more information, see Chapter 8 Minimize the number of
nodes that access this storage. Do not install install
software in this directory unless you are instructed to do so. Do not use the /hptc_cluster directory for parallel job storage and synchronization.
Instead, create another file system and mount it internally. See Chapter 18 for more information. Do not use this directory
for home directories or user accounts.
During system installation, you are given an option
to locate the /hptc_cluster file system somewhere
other than the installation on the head node. For example, you can
place it on a partition on the head node's disk, on a networked
disk, on an HP SFS server, or on a SAN. This directory is mounted on the head node. You
must ensure the persistence of this file system mount. HP XC System Software Directory, /opt/hptcThe HP XC System Software maintains the /opt/hptc directory for its exclusive use. Its software is installed in that
directory. Do not install or replace any other software in
this directory unless it is an officially supported patch. Software
packages, such as the SLURM resource manager, are installed in directories
under the /opt/hptc directory under their own
names. Software packages under the /opt/hptc directory have their own subdirectories, usually bin, etc, sbin and presumably
others). There are symbolic links to files in these subdirectories
(for example, /opt/hptc/package/bin) to the corresponding directory (for example, /opt/hptc/bin). The SLURM resource manager is such a package;
its structure in the /opt/hptc directory structure
is illustrated in Figure 1-2; the SLURM directory and its subdirectories are shown as dashed
line boxes. The SLURM package has its own bin, etc, lib, and sbin, subdirectories. Each executable in the /opt/hptc/slurm/bin directory has a symbolic link to the /opt/hptc/bin directory. Likewise, there are symbolic
links in the other SLURM subdirectories. HP XC Service Configuration FilesThe /opt/hptc/etc/ directory
includes several subdirectories containing scripts used to configure
services on nodes at installation time. The /opt/hptc/etc/sconfig.d directory contains scripts for system configuration. The /opt/hptc/etc/gconfig.d directory contains scripts used to gather information needed to
configure a service on the HP XC system. The /opt/hptc/etc/nconfig.d directory contains scripts associated with specific services, which
configure nodes that run these services . The /opt/hptc/etc/cconfig.d directory contains scripts associated with client-specific services. Log Files |  |
The software components that comprise the HP XC system
generate their own log files. As discussed in Chapter 7: Monitoring the System, some
information is aggregated from local nodes to regional nodes and from
regional nodes to the global node. Table 1-1 lists the components and the pathname of their log files. Table 1-1 Log Files | Component | pathname of Log File |
|---|
Console Management Facility
(CMF) | /hptc_cluster/adm/logs/cmf.dated/* | collectl utility | /var/log/collectl | LSF | /opt/hptc/lsf/top/log (linked to /var/lsf/log) | Myrinet® gm_drain_test | /var/log/diag/myrinet/gm_drain_test/ | Myrinet gm_prodmode_mon diagnostic tool | /var/log/diag/myrinet/gm_prodmode_mon/links.log /hptc_cluster/adm/logs/aggregator_nodename.log (alerts) | ovp | /hptc_cluster/adm/logs/ovp/ovp_nodename_mmddyy[rnn] /hptc_cluster/adm/logs/ovp/current_ovp_log (a symbolic link to the most recent log file) | powerd | /var/log/powerd/powerd.log | Quadrics qselantestp | /var/log/diag/quadrics/qselantestp/ (default directory) | Quadrics qsnet2_drain_test utility | /var/log/diag/quadrics/qsnet2_drain_test | SLURM | /var/slurm/log/slurmctld.log and /var/slurm/log/slurmd.log | syslog-ng | /hptc_cluster/adm/logs/aggregator_nodename.log (one log file for each aggregator) /hptc_cluster/adm/logs/consolidated.log | startsys/stopsys | /hptc_cluster/adm/logs/startsys.log | systemimager/flamethrower | /hptc_cluster/adm/logs/imaging.log | cluster_config | /var/log/nconfig.log (contains output from the cluster_config utility) | Nagios | /opt/hptc/nagios_log /hptc_cluster/adm/logs/sel/sel-nxxx (System Event log) | transfer_to_avail/ transfer_from_avail | /var/log/transfer_avail.log | Fresh installation | /var/log/postinstall.log (shows the RPMs that were installed) | Software upgrade | /var/log/upgrade (if this directory exists, a software upgrade was done) |
|