| United States-English |
|
|
|
![]() |
Managing Serviceguard Twelfth Edition > Chapter 8 Troubleshooting
Your ClusterTroubleshooting Approaches |
|
The following sections offer a few suggestions for troubleshooting by reviewing the state of the running system and by examining cluster status data, log files, and configuration files. Topics include:
The netstat -in command can be used to examine the LAN configuration. The command, if executed on ftsys9 after the halting of node ftsys10, shows that the package IP addresses are assigned to lan0 on ftsys9 along with the primary LANIP address.
Messages from the Cluster Manager and Package Manager are written to the system log file. The default location of the log file is /var/adm/syslog/syslog.log. Also, package-related messages are logged into the package log file. The package log file is located in the package directory, by default. You can use a text editor, such as vi, or the more command to view the log file for historical information on your cluster. It is always a good idea to review the syslog.log file on each of the nodes in the cluster when troubleshooting cluster problems. This log provides information on the following:
The following entries from the file /var/adm/syslog/syslog.log show a package that failed to run due to a problem in the pkg5_run script. You would look at the pkg5_run.log for details.
The following is an example of a successful package starting:
The Serviceguard Object Manager daemon cmomd logs messages to the file /var/opt/cmom/cmomd.log. You can review these messages using the cmreadlog command, as follows: # cmreadlog /var/opt/cmom/cmomd.log Messages from cmomd include information about the processes that request data from the Object Manager, including type of data, timestamp, etc. An example of a client that requests data from Object Manager is Serviceguard Manager. Serviceguard Manager maintains a log file of user activity. This file is stored in the HP-UX directory /var/opt/sgmgr or the Windows directory X:\Program Files\Hewlett-Packard\Serviceguard Manager\log (where X refers to the drive on which you have installed Serviceguard Manager). You can review these messages using the cmreadlog command, as in the following HP-UX example: # cmreadlog /var/opt/sgmgr/929917sgmgr.log Messages from Serviceguard Manager include information about the login date and time, Object Manager server system, timestamp, etc. If you have VERITAS Cluster Volume Manager (CVM), and problems starting the cluster, check the log file for the system multi-node package. For CVM 3.5, the file is VxVM-CVM-pkg.log. For CVM 4.1, the file is SG-CFS-pkg.log. Review the following ASCII configuration files:
Ensure that the files are complete and correct according to your configuration planning worksheets. Ensure that the package control script is found on all nodes where the package can run and that the file is identical on all nodes. Ensure that the script is executable on all nodes. Ensure that the name of the control script appears in the package configuration file, and ensure that all services named in the package configuration file also appear in the package control script. Information about the starting and halting of each package is found in the package’s control script log. This log provides the history of the operation of the package control script. By default, it is found at /etc/cmcluster/package_name/control_script.log; but another location may have been specified in the package configuration file’s SCRIPT_LOG_FILE parameter. This log documents all package run and halt activities. If you have written a separate run and halt script for a package, each script will have its own log. In addition, cmcheckconf can be used to troubleshoot your cluster just as it was used to verify the configuration. The following example shows the commands used to verify the existing cluster configuration on ftsys9 and ftsys10:
The cmcheckconf command checks:
It doesn’t check:
The command cmscancl displays information about all the nodes in a cluster in a structured report that allows you to compare such items as IP addresses or subnets, physical volume names for disks, and other node-specific items for all nodes in the cluster. cmscancl actually runs several different HP-UX commands on all nodes and gathers the output into a report on the node where you run the command. To run the cmscancl command, the root user on the cluster nodes must have the .rhosts file configured to allow the command to complete successfully. Without that, the command can only collect information on the local node, rather than all cluster nodes. The following are the types of configuration data that cmscancl displays for each node: Table 8-1 Data Displayed by the cmscancl Command
cmviewconf allows you to examine the binary cluster configuration file, even when the cluster is not running. The command displays the content of this file on the node where you run the command. The following networking commands can be used to diagnose problems:
Use these commands on all nodes. |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|||||||||||||||