The CPU Expert Tool provides sophisticated troubleshooting for problems associated with a specific processor in a multi-processor environment. Functionality includes the on-line ability to deactivate, reactivate, and assign a process to a processor.
The CPU Expert Tool is fully interactive, with its own set of menu commands.
CAUTION: The CPU Expert Tool contains commands which will cause severe performance degradation as a result of processor deactivation. It is assumed that users are aware of the dangers of its use.
To use the CPU Expert Tool safely, you should be trained and knowledgeable about CPU modules.
File Command
Exit (EX) - Exits tool and returns to main STM prompt
Options Commands
Activate (AC) - Re-Activate the specified SPU(s) (same as old Allocate command)
Assign (AS) - Assign a process (pid) to a SPU
Deactivate (DA) - Deactivate the specified SPU(s) (same as old Deallocate command)
Display (DS) - Display the SPU status
Exercise Commands
runexercise (REX) - Run exerciser for a specified SPU
termexercise (TEX) - Terminate the running exerciser
Help Commands
Generalhelp (HE) - Displays online help for the CPU Expert Tool
Version (VERS)- Displays version and copyright information
Control-C - Interrupt or abort execution of tool; escape to the user interface
The CPU Expert Tool allows on-line control of processor functionality in a multi-processor environment by issuing processor-specific commands.
This product provides individual operations that enable the user to identify a malfunctioning processor. The goal of the Expert Tool is to provide the user with an interactive tool that allows deactivation or reactivation of individual processor modules without requiring reset of the SPU or restart of the operating system. The ability to assign a specific process to an individual processor is another capability of this product.
This tool should allow the multi-processor system under test to be reduced in scope via programmatic elimination of one or more modules. Thus, problems caused by a CPU module or specific process can be isolated without off-line testing.
The CPU Expert Tool requires that you enter a license at the main STM window. Consult a Mesa Administrator for the password.
If you experience problems, consult Troubleshooting Tool-Related Problems.
To run the CPU Expert Tool on a device:
To troubleshoot a suspect processor:
If you run into problems, see the Troubleshooting Tool-Related Problems help topic.
Consult the Failure Log (for failures) and the Test Activity Log, accessible from the STM prompt.
The CPU Expert Tool requires a license to operate. Enter the license at the main cstm prompt and try to run the tool again.
The function Exercise applied on a specified CPU to run stress tests on data cache, data Translation Lookaside Buffer (TLB) and floating point unit in turn.
Data cache exerciser -- exercises the data cache by forcing misses, replacements and flushes.
Data TLB exerciser -- exercises the data TLB by forcing page faults which forces TLB replacements.
Floating point exerciser -- exercises the floating point unit by providing a series of known inputs and checks for corresponding outputs. The benchmark algorithm used here is "tomcatv".
The parameter Execution Time Limit can be specified by the user as an option for limiting the CPU exercise execution time in minutes. The default value is 10 minutes.
The parameter Error Allowed Number can be specified by the user as an option for limiting the allowed number of errors before CPU exercise test termination. The default value is 10.
The CPU Expert Tool offers the following commands and function keys:
Exit (EX)
Use the "Exit" command to leave the CPU Expert Tool and return to the main STM prompt.
Activate (AC)
The "Activate" command will only return a processor that has been previously DEACTIVATED to normal system use. Will return an error message if the activation routine does not return SUCCESSFUL or if the wrong device was selected. The "Activate" command will display in the main window the SPU, Slot Number, and LID along with the current number of activated processors after execution.
Assign (AS)
The "Assign" command Forces (and locks) a process to a specific processor module. If a wrong device is entered, an error message will display. The user is required to find an active PID number from a different application (top, ps -ef ).
Deactivate (DA)
The "Deactivate" command removes a processor from system use. The Expert Tool will display a warning message if there is only one processor currently activated or on any attempt to deactivate the Monarch. The Deactivate command will display in the main window the SPU, Slot Number, and LID number along with the current number of activated processors after execution.
Display (DS)
The "Display" command will display a listing of current processor status and information for all system processors:
SPU Number: Slot Number: LID: Status: (Active/Inactive) Monarch: (Yes/No)
Runexercise (REX)
The "Runexercise" command will launch the CPU Exerciser for a selected CPU. On the popup dialog box, the user can modify parameters of the exerciser such as Execution Time Limit and Error Allowed Number. By design, only one CPU exerciser can run at any time.
To monitor the running processes, the user may use the Top utility to obtain execution information on the running process.
The running record can be found on CPU expert activity log.
Termexercise (TEX)
The "Termexercise" command terminates running the exerciser if the user desires to stop the exercise earlier than the "CPU exercise". The exercise termination message will be indicated on the main window of the expert tool.
Generalhelp (HE)
The "Generalhelp" command displays online help for the CPU Expert Tool (the help system you are currently viewing).
For online help on STM as a whole, use the "Help" command at the STM prompt.
Version (VERS)
The "Version" command displays version and copyright information for the CPU Expert Tool.
Control-C (Abort or Interrupt)
While a test is executing, you can press Control-C to abort or interrupt the text. When "Control-C" is detected, a list of choices will be displayed on the screen. The list includes "Abort", "Interrupt", "Escape" and "Continue".
Choosing "Abort" or "Escape" halts the tool's execution and returns to the STM main window.
The tool will be requested to abort and the state of the tool execution in the system map will be set to ABORT_PENDING. When the tool actually exits, its state will be set to ABORTED.
"Continue" causes the tool to continue with what it was doing at the time of Control-C.
Choosing "Interrupt" causes the tool to prompt you if you want to continue with the operation. If the reply is "Continue", the tool resumes what it was doing at the time of the interrupt. If the reply is "Abort", it terminates the operation and returns to the tool's prompt.