Announcement
System Fault Management (SFM) is a collection of tools used to monitor the health of HP servers and receive information about hardware such as memory, CPU, power supplies, and cooling devices. SFM operates in the Web-Based Enterprise Management (WBEM) environment.
SFM includes the following tools:
- SFM Providers
- EVWEB
- EMT
This document contains the following sections:
- SFM Providers
- EVWEB
- EMT
- System Requirements
- Supported Browsers
- Limitations and Workarounds
- Product Documentation
- Software and Documentation Availability in Native Languages
- Product Structure
- Reporting Defects
SFM Providers
SFM providers are tools that gather information related to various hardware devices and report it to the Common Information Model Object Manager (CIMOM).
Table 1 lists the SFM providers and their respective functions:
Table 1: Providers and their Functions
- CPU Instance Provider: Retrieves information about processor inventory and consolidated health of the processor subsystem.
- Memory Instance Provider: Gathers information about memory inventory and consolidated health of the memory subsystem.
- EMS Wrapper Provider: Converts events generated by the EMS Hardware Monitors into indications and reports those indications to the CIMOM.
- Filter Metadata (FMD) Provider: Provides the facility to predefine the important filter in a repository. FMD also ensures that all important or chosen indications are logged to the local event archive. FMD creates HP-advised subscriptions when SFM is installed.
- Environmental Providers: Retrieve information about cooling devices (fans) and power supply (bulk power supply and AC input lines) on HP servers. They also retrieve consolidated health of cooling, power, system temperature, and system voltage subsystems on HP servers.
- Event Manager Common Information Model (EVM CIM) Provider: Converts EVM events into indications and reports those indications to the CIMOM.
- SFMIndicationProvider: Generates WBEM indications when an abnormal activity is detected on the monitored devices and reports these WBEM indications to the CIMOM.
- Firmware Revision Instance Provider: Retrieves information about the firmware revision of system hardware components, such as system firmware version and Management Processor (MP) firmware version.
- Disk Instance Provider: Retrieves information about the consolidated health status and inventory information of direct attached disk drives, such as SCSI drives.
- MP Instance Provider: Retrieves information about the management processor of the system.
- Enclosure Instance Provider: Retrieves information about the Onboard Administrator (OA), such as OA description, OA IP address, OA MAC address, and the URL to launch the OA.
- Record Log Provider: Enables event analysis tools such as Web-Based Enterprise Services (WEBES) to access details of indications generated by the SFMIndicationProvider that are available in the SFM database, for event analysis. The Record Log Provider also enables event analysis tools to access MCA logs for event analysis.
- Temperature Provider: Retrieves properties such as sensor number, current temperature reading, and temperature sensor status.
- ComputerSystem Chassis Provider: Retrieves properties such as the serial number, product ID, and virtual Universally Unique ID (UUID).
- MCA Indication Provider: Generates WBEM indications when MCA logs are present.
- The MCA Indication Provider is introduced. This provider generates an indication when Machine Check Abort (MCA) logs are present due to an MCA.
- The Record Log Provider is enhanced to support MCA logs. Event analysis tools can access MCA log details for event analysis.
- The Disk Provider is enhanced to display Agile information, such as LegacyHardwarePath and AgileHardwarePath, of the disk drives.
- The sfmconfig command is enhanced to support device-specific throttling to filter indications based on event category, provider name and event ID.
- The ComputerSystem Chassis Provider is introduced. It provides the following details related to the physical system:
  - SerialNumber
  - ProductId
  - Model
The ComputerSystem Chassis Provider provides the following details related to the logical server:
  - VirtualSerialNumber
  - VirtualUUID
These values are retained when an OS instance is moved to another server.
- The Temperature Sensor Provider is enhanced to provide details such as the processor temperature and the memory board temperature.
Starting with the HP-UX 11i v3 March 2008 release, SFM is the default monitoring mode. In the SFM mode, the SFMIndicationProvider generates WBEM indications related to the devices it monitors. However, the SFMIndicationProvider is represented by different provider names in the WBEM indication details. These provider names reflect the resource name to which the indication is related. For example, on an HP Integrity® system, the provider name displayed in a memory indication is MemoryIndicationProviderIA.
Table 2 lists the errors monitored by the SFMIndicationProvider and the corresponding provider names in the WBEM indication details:
Table 2: Nature of Error and the Representation of SFMIndicationProvider in WBEM Indications
Nature of error: Provider Name in WBEM Indication
- Corrected machine check or CPU errors: CMC_IndicationProviderIA
- Corrected platform errors or CPE errors: CPE_IndicationProviderIA
- Memory errors: MemoryIndicationProviderIA
- Core hardware errors: CoreHardwareIndicationProviderIA
- Forward progress logs (IPMI Events): FPL_IndicationProvider
In addition to the SFMIndicationProvider, the EMSWrapperProvider converts events received from the EMS Hardware Monitors into WBEM indications.
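The Table 2 mapping can be expressed as a simple lookup, for example when sorting logged indications by error type. The following is a minimal sketch only: the function name sfm_error_nature is hypothetical and is not part of SFM, and the provider names are copied from Table 2.

```shell
#!/bin/sh
# Hypothetical helper (not an SFM command): print the nature of error
# that Table 2 associates with a provider name found in a WBEM indication.
sfm_error_nature() {
    case "$1" in
        CMC_IndicationProviderIA)         echo "Corrected machine check or CPU errors" ;;
        CPE_IndicationProviderIA)         echo "Corrected platform errors or CPE errors" ;;
        MemoryIndicationProviderIA)       echo "Memory errors" ;;
        CoreHardwareIndicationProviderIA) echo "Core hardware errors" ;;
        FPL_IndicationProvider)           echo "Forward progress logs (IPMI Events)" ;;
        *)                                echo "unknown"; return 1 ;;
    esac
}
```

For instance, sfm_error_nature MemoryIndicationProviderIA prints "Memory errors", and a name that is not listed in Table 2 yields "unknown" with a non-zero exit status.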
Table 3 lists the errors monitored by the EMS Hardware Monitors and the corresponding provider names in the WBEM indication details:
Table 3: Nature of Error and the Representation of EMSWrapperProvider in WBEM Indications
Nature of error: Provider Name in WBEM Indication
- Corrected machine check or CPU errors on HP 9000 systems: LPMC_IndicationProviderPA
- Direct-attached disk drive errors on either HP 9000 or HP Integrity systems: DiskIndicationProvider
- Memory errors on HP 9000 systems: MemoryIndicationProviderPA
- Core hardware errors on HP 9000 systems: CoreHardwareIndicationProviderPA
- Chassis errors on HP 9000 systems: ChassisIndicationProviderPA
For information on how EMS Hardware Monitors and the corresponding providers are mapped, see Representation of Monitors in the SFM Administrator’s and User’s Guide, at:
http://www.docs.hp.com/en/diag.html
For a list of WBEM indications and their details, see the EMS Event Descriptions at:
http://docs.hp.com/en/diag/ems/eme_summ.htm
Note: You can switch to the OnlineDiag monitoring mode. For information on how to switch to the OnlineDiag mode, see Configuring the SFMIndicationProvider in the SFM Administrator’s and User’s Guide at:
http://www.docs.hp.com/en/diag.html
Defect Fixes
- QXCR1000752090
Problem: Memory consumption of the SFM provider process increases sometimes.
Cause: This problem occurs due to a memory leak in the FPL Indication Provider.
Resolution: The memory leak is fixed.
- QXCR1000776917
Problem: When a large volume of events is logged, the SFMDB event repository sometimes consumes large amounts of space in the /var filesystem.
Resolution: This problem is fixed. Old logs are cleared at regular intervals to avoid this condition.
- QXCR1000795115
Problem: The WBEM Wrapper Monitor version and the SFM version on the system do not match.
Resolution: This problem is fixed. The two versions now correspond.
- QXCR1000808748
Problem: SFM does not install properly in a Dynamic Root Disk (DRD) environment.
Resolution: This problem is fixed. DRD checks are enabled to support SFM installation in a DRD environment.
EVWEB
This section describes EVWEB.
EVWEB is a tool that can be used to view and administer WBEM indications generated on the HP-UX 11i v3 system.
The EVWEB tool includes the following components:
- Event Subscription Administrator
Event Subscription Administrator enables users to subscribe to an indication and view it. In addition, users with administrative privileges can modify and delete subscriptions. By subscribing to an indication, users can obtain detailed information about various WBEM indications. Users can also view indications generated by the High Availability Monitors. Indications generated by High Availability Monitors are called HP threshold indications.
As part of event subscription, users must specify event subscription criteria. Users must also select one or more destinations to receive information about indications.
Users can select one or more destinations from the following list:
  - Event Archive: The path to Event Archive is /var/opt/sfmdb/pgsql. Event Archive is the default destination.
  - syslog: The path to syslog is /var/adm/syslog/syslog.log.
  - Email: Event notifications are emailed to the specified email address. Users can specify multiple email addresses.
- Event Viewer
The Event Viewer enables users to view the indications stored in the Event Archive. In addition, users with administrative privileges can delete these indications. By default, HP-advised subscriptions are stored in the Event Archive. The Event Viewer also enables users to search for an indication logged in the Event Archive.
- Log Viewer
The Log Viewer enables users to view and search the low-level logs stored in the log database.
Benefits
EVWEB offers the following benefits:
- Enables users to manage all WBEM indications that are supported by SFM.
- Provides an option to customize the indication destination to receive information about HP-advised subscriptions.
- Enables users to view the command-line equivalent of an action performed using the GUI, which helps users learn the usage of various commands.
Features
EVWEB offers the following features:
- Provides quick search and advanced search mechanisms to view events from the Event Archive
- Generates a list of events in a printer-friendly format (GUI only)
- Enables users with administrative privileges to delete indications
- Enables users with administrative privileges to manage subscriptions by creating, modifying, and deleting them
- Enables users to view subscriptions created using EVWEB
- Enables users to view externally created subscriptions.
Subscriptions created by using tools other than EVWEB are termed as externally created event subscriptions.
- Enables users to view HP-advised subscriptions. HP-advised subscriptions are provided by default by HP.
Note: EVWEB supports these features on browser-based GUI and the CLI.
Defect Fixes
- QXCR1000800209
The -o option is added to the evweb subscribe command. This option enables the user to specify the provider name as a filter option.
Limitations
- When an HP-advised subscription is copied to create or modify another subscription, the subscription criteria are not copied. Only the destinations are copied to the new subscription.
- Event details displayed in the EVWEB Event Viewer and embedded in the EVWEB email notification may not match the readability or formatting of the EMS event notification. This issue does not apply to HP_DeviceIndication class indications.
EMT
This section describes EMT.
Error Management Technology (EMT) is a component of SFM. EMT includes Common Error Repository (CER), which is an online, searchable, and updateable error repository. The CER contains error metadata such as error description, error number, error type, severity, cause of the error, and corrective actions for errors generated on the HP-UX 11i v3 system.
Benefits
EMT offers the following benefits:
- Enables users to view most errors that can occur on the HP-UX 11i v3 system.
- Provides an option to the administrators to add, modify, and delete custom solutions.
- Enables users to view the command-line equivalent of an action performed using the GUI, which helps users learn the usage of various commands.
Features
EMT offers the following features:
- Provides both quick search and advanced search mechanisms to view error metadata from CER
- Generates a list of errors in a printer-friendly format (GUI only)
- Enables users with administrative privileges to create, modify, and delete custom solutions
Note: EMT supports these features on browser-based GUI and the CLI.
Following is a limitation of EMT:
- A generic query to the CER retrieves a very large amount of data, which may affect the performance of EMT.
System Requirements
SFM is supported on the following systems running the HP-UX 11i v3 operating system:
- HP 9000 servers
  - rp3410
  - rp3440
  - rp4410
  - rp4440
  - rp7405
  - rp7410
  - rp7420
  - rp8400
  - rp8420
  - SD16, SD32, SD64
  - SD16A, SD32A, SD64A
  - SD16B, SD32B, SD64B
- HP Integrity servers
  - cx2600
  - cx2620
  - rx1600
  - rx1620
  - rx2600
  - rx2620
  - rx2660
  - rx3600
  - rx4640
  - rx5670
  - rx6600
  - rx7620
  - rx7640
  - rx8620
  - rx8640
  - SD16A, SD32A, SD64A
  - SD16B, SD32B, SD64B
  - BL860c HP Server Blade
  - BL870c HP Server Blade
SFM supports the following systems based on the Dual-Core Intel® Itanium® Processor 9100 series and running the HP-UX 11i v3 operating system:
- rx7640
- rx8640
- SD16B
- SD32B
- SD64B
The software requirements for using SFM are as follows:
- HP-UX 11i v3 February 2007
- OpenSSL Version A.00.09.07e.013 or later
- WBEM Services Version A.02.07 or later
- EVM-EventMgr B.11.31
- SysMgmtBase B.00.02.03
- SysMgmtWeb version A.2.2.4 (HP-UX Web Based System Management User Interface)
- HP Systems Insight Manager (HP SIM) version 5.0.01
- Online Diagnostics B.11.31.03.yy
Notes:
- SysMgmtWeb is optional. However, you will not be able to access EVWEB GUI if SysMgmtWeb is not installed on the system. SysMgmtWeb, WBEMServices, and Online Diagnostics are available on the Operating Environment (OE) media.
- HP SIM is required only for remote administration of indications and instances. HP SIM version 5.0.01 is the minimum requirement. However, HP recommends you install HP SIM version C.05.02.01.xx.yy.
- The mentioned versions of the software are minimum requirements. All future versions support SFM by default.
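When checking installed software against the minimum versions listed above, dotted version strings such as A.02.07 can be compared field by field. The sketch below is illustrative only: version_ge is a hypothetical helper, not an HP-UX or SFM command, and it assumes that fields compare numerically when both are numbers and as plain strings otherwise. The installed version would be read by hand from swlist output; the sketch does not call swlist itself.

```shell
#!/bin/sh
# Hypothetical helper: version_ge INSTALLED MINIMUM
# Succeeds (exit status 0) when INSTALLED is equal to or later than MINIMUM.
# Fields are split on "." and compared from left to right. A missing
# trailing field is treated as 0.
version_ge() {
    awk -v a="$1" -v b="$2" 'BEGIN {
        na = split(a, fa, ".")
        nb = split(b, fb, ".")
        n = (na > nb) ? na : nb
        for (i = 1; i <= n; i++) {
            x = (i <= na) ? fa[i] : "0"
            y = (i <= nb) ? fb[i] : "0"
            if (x ~ /^[0-9]+$/ && y ~ /^[0-9]+$/) {
                # Both fields are numbers: compare numerically.
                if (x + 0 > y + 0) exit 0
                if (x + 0 < y + 0) exit 1
            } else {
                # At least one field contains letters: compare as strings.
                if ((x "") > (y "")) exit 0
                if ((x "") < (y "")) exit 1
            }
        }
        exit 0   # all fields equal
    }'
}

# Example with literal values: is an installed A.02.09 at least A.02.07?
if version_ge "A.02.09" "A.02.07"; then
    echo "minimum version met"
fi
```

Comparing field by field avoids the pitfall of a plain string comparison, which would rank a field such as 10 before 9.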
Supported Browsers
SFM supports the following browsers:
- Internet Explorer version 6.0 and above
- Mozilla version 1.5 and above
Limitations and Workarounds
- After the system is rebooted or the CIMOM is restarted, the first request to SFM hardware inventory providers, such as the CPU Instance Provider, Memory Provider, and the Environmental Providers, may fail with the CIM_ERR_FAILED status code. The client system also displays the message "Inventory information is being built currently. Please try after some time". On subsequent requests, the SFM hardware inventory providers return the requested information immediately.
- Hardware inventory providers are not supported on HP Virtual Machines.
- After an OE update, the disk inventory must be refreshed. To refresh the inventory, enter the following command at the HP-UX prompt:
/opt/sfm/bin/sfmconfig -r -c Disk
- QXCR1000827347
During installation of the SysFaultMgmt bundle, HW monitoring mode is switched from EMS to SFM. In the process of switching from EMS to SFM monitoring mode, sometimes the following error may be generated and logged in the syslog file:
Error occurred while starting SFM Monitor while switching from EMS to SFM Mode. Hardware monitoring is in inconsistent state. For details, see the /var/opt/sfm/log/install.log file.
Workaround: After a cold install or upgrade of the SysFaultMgmt bundle, if the error message is observed in the syslog file, enter the following command at the command prompt to clear transient states:
# /opt/sfm/bin/sfmconfig -w -s
This command enables the hardware monitors in SFM.
- QXCR1000900238
SFMDB does not start when the time zone is GMT0.
Problem: On IA systems, when the time zone is set to GMT0, the postmaster stops and logs messages in sfmdb.log, because GMT0 is not a valid time zone recognized by PostgreSQL. As a result, the SFM database does not come up.
Solution: When a large volume of messages is logged, SFMDB consumes a large amount of space in the /var filesystem. To reduce this, delete the sfmdb.log file by executing the command rm -f /var/opt/sfmdb/pgsql/sfmdb.log, or move the file to a location outside the /var filesystem.
Install the PHSS_39073 patch. Change the time zone from GMT0 to GMT+0 and restart PostgreSQL by executing the following commands:
/sbin/init.d/sfmdb stop
/sbin/init.d/sfmdb start
Product Documentation
For more information on SFM, see the following documents at:
http://docs.hp.com/en/diag.html
- SFM Frequently Asked Questions (FAQs)
- System Fault Management Administrator's Guide
- SFM Provider Data Sheets
- SFM Tables of Versions
- SFM Patch Descriptions
Software and Documentation Availability in Native Languages
SFM software and documents are available only in the English language.
Product Structure
The SFM product, consisting of SFM providers and EVWEB, is installed as part of the SysFaultMgmt bundle.
Use the following commands to obtain the bundle, product, sub-product, and fileset information for the SysFaultMgmt depot:
- Bundle
$ swlist -s <SysFaultMgmt Depot Location>
  SysFaultMgmt C.04.00.xx.yy HPUX System Fault Management
- Product(s)
$ swlist -l product -s <SysFaultMgmt Depot Location>
  SFM-CORE C.04.00.xx HPUX System Fault Management
  SFMDB C.04.00.xx HP System Management Database (SFMDB)
- Sub-product(s)
$ swlist -l subproduct -s <SysFaultMgmt Depot Location>
  SFM-CORE.HS-PROVIDER HS-PROVIDER
  SFM-CORE.ERROR-MGMT Error Management Technology
  SFM-CORE.EVWEB
  SFM-CORE.FMD-PROVIDER FMD-PROVIDER
  SFM-CORE.GS GS
  SFM-CORE.SFM-HAS SFM-HAS
  SFM-CORE.SFM-PROVIDER SFM-PROVIDER
  SFMDB C.04.00.xx HP System Management Database (SFMDB)
- Fileset(s)
$ swlist -l fileset -s <SysFaultMgmt Depot Location>
  # SFM-CORE C.04.00.xx HPUX System Fault Management
  SFM-CORE.HS_PRO_COREIA C.04.00.xx HealthState Instance Provider Platform Specific Fileset
  SFM-CORE.HS_PRO_COREPA C.04.00.xx HealthState Instance Provider Platform Specific Fileset
  SFM-CORE.CTR_PRO_COMM C.04.00.xx Control Provider Common Fileset
  SFM-CORE.CTR_PRO_COREIA C.04.00.xx Control Provider Platform Specific Fileset
  SFM-CORE.CTR_PRO_COREPA C.04.00.xx Control Provider Platform Specific Fileset
  SFM-CORE.EMT_COMM C.04.00.xx EMT Common components
  SFM-CORE.EMT_COREIA C.04.00.xx EMT core platform specific fileset
  SFM-CORE.EMT_COREPA C.04.00.xx EMT core platform specific fileset
  SFM-CORE.EVWEB_COMM C.04.00.xx Event Manager (EvWEB) Common components
  SFM-CORE.EVWEB_COREIA C.04.00.xx EvWEB core platform specific fileset
  SFM-CORE.EVWEB_COREPA C.04.00.xx EvWEB core platform specific fileset
  SFM-CORE.EVWEB_DOC C.04.00.xx EvWEB Online help fileset
  SFM-CORE.EVWEB_GUI_COMM C.04.00.xx EvWEB GUI common fileset
  SFM-CORE.EVWEB_GUI_IA C.04.00.xx EvWEB GUI platform specific fileset
  SFM-CORE.EVWEB_GUI_PA C.04.00.xx EvWEB GUI platform specific fileset
  SFM-CORE.EVWEB_MAN C.04.00.xx EVWEB Man pages fileset
  SFM-CORE.FMD_PRO_COMM C.04.00.xx Filter Metadata Instance Provider Common Fileset
  SFM-CORE.FMD_PRO_COREIA C.04.00.xx Filter Metadata Instance Provider Platform Specific Fileset
  SFM-CORE.FMD_PRO_COREPA C.04.00.xx Filter Metadata Instance Provider Platform Specific Fileset
  SFM-CORE.GS_COMM C.04.00.xx General Services Common Fileset
  SFM-CORE.GS_COREIA C.04.00.xx General Services Platform Specific Fileset
  SFM-CORE.GS_COREPA C.04.00.xx General Services Platform Specific Fileset
  SFM-CORE.HAS-IA C.04.00.xx Hardware Access Services IA
  SFM-CORE.HAS-PA C.04.00.xx Hardware Access Services PA
  SFM-CORE.MISC_COMM C.04.00.xx MISC Common Fileset
  SFM-CORE.MISC_COREIA C.04.00.xx MISC Platform Specific Fileset
  SFM-CORE.MISC_COREPA C.04.00.xx MISC Platform Specific Fileset
  SFM-CORE.SFM_PRO_COMM C.04.00.xx SysFaultMgmt Provider Module COMMON
  SFM-CORE.SFM_PRO_IA C.04.00.xx SysFaultMgmt Provider Module IA
  SFM-CORE.SFM_PRO_PA C.04.00.xx SysFaultMgmt Provider Module PA
  # SFMDB C.04.00.xx HP System Management Database (SFMDB)
  SFMDB.SMPGSQL-DOC C.04.00.xx PostgreSQL (SFMDB) Documentation Files
  SFMDB.SMPGSQL-INC C.04.00.xx PostgreSQL (SFMDB) Header Files
  SFMDB.SMPGSQL-LIB C.04.00.xx PostgreSQL (SFMDB) Library Files (Architecture dependent)
  SFMDB.SMPGSQL-LIB C.04.00.xx PostgreSQL (SFMDB) Library Files (Architecture dependent)
  SFMDB.SMPGSQL-MAN C.04.00.xx PostgreSQL (SFMDB) Manual Pages
  SFMDB.SMPGSQL-RUN C.04.00.xx PostgreSQL (SFMDB) Executable Files (Architecture dependent)
  SFMDB.SMPGSQL-RUN C.04.00.xx PostgreSQL (SFMDB) Executable Files (Architecture dependent)
  SFMDB.SMPGSQL-SHA C.04.00.xx PostgreSQL (SFMDB) Share File
  SFMDB.SMPGSQL-SRC C.04.00.xx PostgreSQL (SFMDB) Source Files
Reporting Defects
You can report defects related to SFM by filing a request on QuIX. The name of the project is SysFaultMgmt. If you do not have access to QuIX, contact your local HP representative to file a defect on your behalf.