These release notes cover the December 2002 release of Support Plus for HP-UX 11.00 running on S800/S700 systems.
NOTE: As of the September 1999 release, the name of the Diagnostic/IPR Media has been changed to Support Plus. In addition, the format has changed so that there is a separate CD-ROM for each version of the operating system (HP-UX 10.20 and HP-UX 11.0).
The Support Tools Manager (STM) provides a complete set of online support tools for HP-UX systems, enabling you to verify and troubleshoot PA-RISC system hardware, and to examine system logs.
STM offers several tool types, including information tools, verifiers, exercisers, expert tools, firmware update tools, diagnostics and utilities.
Installed with STM (as of IPR 9902) are the EMS Hardware Monitors, an important tool for maintaining system availability. The EMS hardware monitors allow you to monitor the operation of a wide variety of hardware products and be alerted immediately if any failure or other unusual event occurs. For more information, see /usr/sbin/stm/Rel_NOTES.HWE.
For the latest and most complete information on STM and EMS Hardware Event Monitors, see the Web page "Diagnostics":
http://docs.hp.com/hpux/diag/
At this site, you will find Overviews, Tutorials, Quick Reference Cards, Frequently Asked Questions (FAQs), and much other material.
The online Support Tools Manager (STM) was enhanced and updated for the current release.
Changes to User Interface and Platform
Removed variable address of type int, as it cannot hold value of type unsigned long long; hence, cause for address truncation has been removed.
... PDC Version (core cell)....:??.??
This was labeled incorrectly. It wasn't the PDC version, but it was the PDC Firmware datecode. This revision of the system info tool replaces the incorrect label, which now is displayed in the following manner:
... PDC Firmware Date Code.....: 4228 (yyww 1960+yy=year;ww=Week of year)
SAL
Thu May 9 16:18:27 2002: Attempt to rename the system map file from
/var/stm/data/uut_status_tmp to
/var/stm/data/uut_status failed with errno 2.
EACCES (13), EBUSY (16), EDQUOT (69), EEXIST (17),
EFAULT(14) errnos returned from a rename system call
indicate that one of the file paths was not a valid
path to a file.
Possible Causes/Recommended Action:
Correct the permission or other indicated file
system problem.
Thu May 9 16:18:27 2002: Re-map hardware configuration process with process
identifier (5772), initiated by user request,
completed.
UIAL
Fri May 10 09:06:04 2002: User Name: root, UI Process ID: 6704
The UUT status file
(/var/tmp/stm6704/hpdst325/data/uut_status)
representing the new device map from the Unit Under
Test (UUT) could not be successfully loaded into
memory.
Fri May 10 09:06:04 2002: User Name: root, UI Process ID: 6704
The most recent device map for the Unit Under Test
(UUT) could not be built successfully. This means
operations apparently available, based on this old
map, may not be, and might fail.
Please refer to the Map Log and/or the System
Activity Log on that system for more details.
Changed code to correctly display the software id of the system, as opposed to the negative software id that was displayed earlier.
Example of suspended message:
.... hpdst268.cup.hp.com : 15.244.81.93 ....
-- Information Tool Log for PCI SCSI Interface on path 0/10/0/0 --
Log creation time: Wed Aug 28 15:51:15 2002
Hardware path: 0/10/0/0
The pci path (0/10/0/0) is currently suspended and no info data can be
retrieved until it is resumed.
Example of non-suspended message:
.... hpdst268.cup.hp.com : 15.244.81.93 ....
-- Information Tool Log for PCI SCSI Interface on path 0/10/0/0 --
Log creation time: Wed Aug 28 15:52:12 2002
Hardware path: 0/10/0/0
Product ID: PCI SCSI Interface
Device ID: 0x000f
Revision ID 0x0001
Vendor ID: 0x1000 ( Symbios Logic Inc.)
Class Code: 0x010000
Base Class: 0x01 ( Mass Storage Controller. )
Sub-Class/Interface: 00/00 ( SCSI bus controller )
Device Status: 0x0200
Bit 9-10: DEVSEL timing 01 - medium
Memlogd on non-Superdome-class systems is reporting the page status of "marked for deallocation" pages as having a page status of "Deallocated: page is no longer in use", instead of a page status of "Pending: page could not be obtained", when displaying the memlog file via Logtool.
For example, there may be a solid sbe that has been entered into the Page Deallocation Table (PDT) by memlogd which continues to occur. Although memlogd has requested the OS to set the page containing this sbe to bad to prevent further access to this page, the page can only be marked for deallocation, but is still active. Hence, if the page status of this page shows "Deallocated: page is no longer in use", and errors on this page continue to occur, this page status is truly incorrect. This often then leads to confusion for the user, who asks: "The page is in the PDT and the page status shows that it is deallocated, so why am I still getting memory errors on this page???".
In this case, the following is an example of the behavior that will be seen with this problem:
*** Display of the memlog file via Logtool: a memory entry with a count of 2 and a page status of "Deallocated: page is no longer in use."
Memory Controller in Slot EXT0 ========================================================== Slot: 0a Error Type: Single/hard: solid, repeatable single-bit error. Page Status: Deallocated: page is no longer in use. Bit Num / Bank: 27 / 0 Logged By: Memlogd First Detected: Tue Sep 10 23:33:39 2002 Last Detected: Tue Sep 10 23:36:41 2002 Error Count: 2 Error Addr: 0x2188ded0 ==========================================================
*** Display of the memlog file via Logtool: sometime later, the count of the memory error goes up, even though the page status of this memory error was previously "Deallocated: page is no longer in use":
Memory Controller in Slot EXT0 ========================================================== Slot: 0a Error Type: Single/hard: solid, repeatable single-bit error. Page Status: Deallocated: page is no longer in use. Bit Num / Bank: 27 / 0 Logged By: Memlogd First Detected: Tue Sep 10 23:33:39 2002 Last Detected: Tue Sep 10 23:37:42 2002 Error Count: 3 Error Addr: 0x2188ded0 ==========================================================
Fix to JAGad96824:
To fix this, modified memlogd to set the page status of "marked for deallocation" pages as having a page status of "pending: page could not be obtained" in the memlog file.
When many A5236As (Transformers) are connected, to use this tool follow these steps:
The multiple update tool will download the firmware on all the controllers.
: Monitoring Changes for disc30, sdisk and disk array devices
As of IPR 9902 (Feb 99 release), there has been a change to the way that monitoring is done for disc30, sdisk and the HA Disk Array Models 10, 20, and 30FC.
Formerly, the "diaglogd exec" programs (pdisc30_exec and psdisk_exec) handled driver error entries for these devices.
As of IPR 9902, these programs have been deleted and their functionality is now provided by the EMS Hardware Monitors.
If you had customized the configuration files for the diaglogd exec programs (disk30_exec.cfg and sdisk_exec.cfg) you may wish to re-configure the EMS Hardware Monitors to achieve the same results.
Use CHART to report defects in STM. The project name is diag.stm.tools.hpux for individual tools, and diag.stm.ui.hpux for the user interface. If you don't have access to CHART, contact an HP representative to enter a defect for you.
The product number for STM is B4708AA.
SD PRODUCT: Sup-Tool-Mgr
Description: On-line Diagnostic System (Series 800/700)
SD SUB-PRODUCT: Manuals
Description: Support Tools Manager Manual Pages
FILESET: STM-MAN
Description: S800/S700 STM Manual Pages
FILESET: STM-SHLIBS
Description: S800/S700 STM Shared Libraries
FILESET: STM-UI-RUN Corequisite Filesets: STM-SHLIBS
Description: S800/S700 STM User Interface
FILESET: STM-UUT-RUN Corequisite Filesets: STM-SHLIBS
Description: S800/700 STM Unit Under Test Runtime