© Copyright Hewlett-Packard Company 1997-1998,2006
All Rights Reserved.
The CPU Expert Tool provides
sophisticated troubleshooting for problems associated with a specific
Processor in a Multiprocessor environment. The ability to deallocate,
reallocate, and assign a process to a processor allows on-line control
of processor functionality.
Online help for the CPU Expert Tool running under
the command line user interface (cstm).
Help for the CPU Expert Tool
The CPU Expert Tool is
fully interactive, with its own set of menu commands.
The CPU Expert Tool contains commands which will cause servere performance
degradation as a result of processor deallocation, it is assumed that users
are aware of the dangers of its utilization.
To use the CPU Expert Tool safely,
you should be trained and knowledgeable about CPU modules.
Command Summary
File Commands
Exit (EX) - Exits tool and returns to main STM prompt
Options Commands
Activate (AC) - Re-Activate the specified SPU(s) (same as old Allocate command)
Assign (AS) - Assign a process (pid) to a SPU
Deactivate (DA) - Deactivate the specified SPU(s) (same as old Deallocate command)
Display (DS) - Display the SPU status
Mark Deconfig (MD) - Mark SPU for deconfiguration on next reboot
Mark Reconfig (MR) - Mark SPU for reconfiguration on next reboot
Exercise Commands
runexercise (REX) - Run exerciser for a specified SPU
termexercise (TEX) - Terminate the running exerciser
Help Commands
Generalhelp (HE) - Displays online help for the CPU Expert Tool
Version (VERS)- Displays version and copyright information
Control-C - Interrupt or abort execution of tool; escape to the user interface
Introduction
The CPU Expert Tool allows on-line control of processor functionality
in a multi-processor environment by issuing processor specific commands.
This product provides individual operations that enable the user
to identify a malfunctioning processor. The goal of the Expert Tool is to
provide the user with an interactive tool that allows processor deallocating
or reallocating of individual processor modules without requiring reset of
the SPU or restart of the operating system. The ability to assign a specific
process to an individual processor is an another capability
of this product.
This tool should allow the multiprocessor
system under test to be reduced in scope via programmatic elimination
of one or more modules.
Thus, problems caused by a CPU module or specific
process can be isolated without off-line testing.
The CPU Expert Tool requires that you install a license at the main
STM window. Consult a Mesa Adminstrator for the password.
If you experience problems, consult the Step-by-Step Instructions
Running the CPU Expert Tool
To run the CPU Expert Tool on a device:
Troubleshooting a Suspect processor
To troubleshoot a suspect processor:
* Try running a specific process (identified by a PID Number)
on a Processor to identify Process specific Problems.
* Deallocate all processors that may not have any problems, and assign
a PID to allocated processor that is suspected to be malfunctioning.
* Continue Deallocating and Reallocating Processors untill a
malfunctioning processor is identified.
* If you get confused on which processors are diabled/enabled run the
"Display" command, which will indicate the status of all Processors.
Common Task
If you run into problems, see the trouble help topic.Troubleshooting Tool-Related Problems
Consult the Failure Log (for failures)
and the Test Activity Log, accessible from the STM prompt.
The CPU Expert Tool requires a license to operate. Install the license
at the main cstm prompt and try to run the tool again.
Cpu Exercise
The function !!Exercise!! applied on a specified cpu to run stress tests
on data cache, data tlb and float point unit in turn.
Data cache exerciser -- exercises the data cache by forcing misses,
replacements and flushes.
Data TLB exerciser -- exercises the data TLB by forcing page faults
which forces TLB replacements.
Floating point exerciser -- exercises the floating point unit
by providing a series of known inputs and checks for corresponding outputs.
The benchmark algorithm used here is "tomcatv".
Execution Time Limit
The parameter !!Execution Time Limit!! can be specified by user
as option for limit the cpu exercise execution time in minutes.
The default value is 10 minutes.
Error Allowed Number
The parameter !!Error Allowed Number!! can be specified by user
as option for limit the allowed number of errors
before the cpu exercise test termination.
The default value is 10.
Commands
The CPU Expert Tool offers the following
commands and function keys:
Exit (EX)
Use the "Exit" command to leave the CPU Expert Tool and
return to the main STM prompt.
Activate (AC)
The "Activate" command will only return a processor that has been previously
DEACTIVATED to normal system use.
Will return error message if aotivation routine
does not return SUCCESSFUL or wrong pdev # was selected.
The "Activate" command
will display in the main window the SPU, Physical Device, and HPA number along
with the current number of allocated processors after execution.
Assign (AS)
The "Assign" command Forces (and locks) a process to
a specific processor module identified by the
physical device number. PID and pdev number arguments are required. If a wrong
physical device number in entered, an error message will display. The user is
required to find an active PID numbers with a different application (top, ps ef
).
Deactivate (DA)
The "Deactivate" command Deactivates a processor from system use.
A pdev number argument is
required. A deactivated processor may not be reactivated if the processor is marked for deconfiguration until is marked for reconfiguration. The Expert Tool will display a warning message if there
is only one processor currently activated or on any attempt to dea
ctivate the Monarch. The !!Deactivate!! command will display in the main w
indow the SPU, Physical Device, and HPA number along with the current nu
mber of allocated processors after execution.
Display (DS)
The "Display" command will display a listing of the current processor status
and identify information for all system processors:
  SPU Number:
  Physical Device Number:
  HPA: (hex)
  Status: (Enabled/Disabled)
  Monarch: (Yes/No)
Mark Deconfig (MD)
The "Mark Deconfig" command is used to mark a SPU for deconfiguration on next reboot. The Expert Tool will display a warning message if there is only one processor configured or on any attempt to mark for deconfiguration the Monarch. The !!Mark Deconfig!! command will display in the main window the SPU, Physical Device, and HPA number along with the current number of marked for deconfiguration processors after execution.
Mark Reconfig (MR)
- Mark SPU for reconfiguration on next reboot
The "Mark Reconfig" command is used to mark a SPU for reconfiguration on next reboot. Processors may be marked for reconfiguration if the processor is currently deconfigured (not the same as marked for deconfiguration) or if the processor is marked for deconfiguration !!and!! deactivated. The Expert Tool will display a warning message if there are no processors deconfigured, there are no processors marked for deconfiguration !!and!! deactivated, or an attempt to mark for reconfiguration the Monarch. The !!Mark Reconfig!! command will display in the main window the SPU, Physical Device, and HPA number along with the current number of marked for reconfiguration processors after execution.
Runexercise (REX)
The "Runexercise" command will launch
To monitor the running processes, the user may use the Top utility to obtain
execution information on the running process.
The running record can be found on cpu expert activity log.
Termexercise (TEX)
The "Termexercise" command terminates running the exerciser
as user desires to stop the exercise earlier than the "cpu exercise".
The exercise termination message will be indicated on main window of
this expert tool.
Generalhelp (HE)
The "Generalhelp" command displays online help for the
CPU Expert Tool (the help system you are currently viewing).
For online help on STM as a whole, use the "Help" command at the STM prompt.
Version (VERS)
The "Version" command displays version and copyright information
for the CPU Expert Tool.
Control-C (Abort or Interrupt)
While a test is executing, you can press Control-C to
abort or interrupt the text.
When "Control-C"
is detected, a list of choices will be displayed on the screen.
The list includes "Abort", "Interrupt", "Escape" and "Continue".
Choosing "Abort" or "Escape"
halts the tool's execution and returns to the STM main window.
The tool will be requested to abort and the state of the tool
execution in the
system map will be set to ABORT_PENDING. When the tool actually exits,
its state will be set to ABORTED.
"Continue" causes the tool to continue with what it was doing
at the time of Control-C.
Choosing "Interrupt"
causes the tool
to prompt you if you want to continue with the operation. If the reply is
"Continue", the tool resumes what it was doing at the time of the interrupt.
If the reply is "Abort", it terminates the operation and returns
to the tool's prompt.