© Copyright Hewlett-Packard Company 1997-1998,2006
All Rights Reserved.
Online help for the CPU Expert Tool running under the graphical user interface (xstm).
Help for the CPU Expert Tool

The CPU Expert Tool provides sophisticated troubleshooting for problems associated with a specific processor in a multiprocessor environment. It allows on-line control of processor functionality: you can deactivate or reactivate a processor, assign a process to a processor, and mark a processor for deconfiguration or reconfiguration.
The CPU Expert Tool is fully interactive, with its own set of menu commands.
The CPU Expert Tool contains commands that cause severe performance degradation as a result of processor deactivation or processor deconfiguration. It is assumed that users are aware of the dangers of using it.
To use the CPU Expert Tool safely, you should be trained and knowledgeable about CPU modules.


Introduction


The CPU Expert Tool allows on-line control of processor functionality in a multiprocessor environment by issuing processor-specific commands.
This product provides individual operations that enable the user to identify a malfunctioning processor. The goal of the Expert Tool is to provide an interactive tool that allows individual processor modules to be deactivated or reactivated without requiring a reset of the SPU or a restart of the operating system. The ability to assign a specific process to an individual processor is another capability of this product. Moreover, processors may be marked for deconfiguration or reconfiguration upon the next boot. Take care NOT to leave a CPU deconfigured or marked for deconfiguration, as system performance will be adversely affected. In addition, on iCOD machines, take care not to leave more CPUs active and/or marked for configuration than the customer is licensed for.
This tool allows the multiprocessor system under test to be reduced in scope via programmatic elimination of one or more modules. Thus, problems caused by a CPU module or a specific process can be isolated without off-line testing.

The CPU Expert Tool requires that you install an HP-Only license at the main STM window. Consult a Mesa Administrator for the password.
If you experience problems, consult the Troubleshooting Tool-Related Problems help topic.

Step-by-Step Instructions


    Running the CPU Expert Tool

    To run the CPU Expert Tool on a device:
    * At the main STM window, select a CPU device. The CPU Expert Tool will access all CPU devices on the Unit Under Test.
    * If desired, change the options for the CPU Expert Tool at the main STM window. Options include the level of logging for the tool. For more information, see online help for STM as a whole (use the Help menu in the STM main window).
    * Run the CPU Expert Tool.
    * To select the desired commands, use the pull-down menus in the main window of the tool. For more information, see:
      - Troubleshooting a Suspect Processor
      - Common Tasks
      - Commands and Buttons
    * To run any option commands, pull down the Options Menu and make a selection.
    * To run exercise commands, pull down the CPU_exercise Menu and make a selection.
    * If you run into problems, see the Troubleshooting Tool-Related Problems help topic.
    * Exit the tool by selecting the Exit command from the File Menu.

    Troubleshooting a Suspect Processor

    The following procedure is one possible method for troubleshooting processors. The idea is to use the process of elimination: first find the easy problems, then perform more time-consuming operations to find less obvious problems.

    To troubleshoot a suspect processor:
    * Try running a specific process (identified by a PID number) on a processor to identify process-specific problems.
    * Deactivate all processors that are not suspected of having problems, and assign a PID to the activated processor that is suspected to be malfunctioning.
    * Continue deactivating and reactivating processors until a malfunctioning processor is identified.
    * If you lose track of which processors are disabled or enabled, run the Display command, which indicates the status of all processors.
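
    The elimination procedure above can be sketched in code. This is only an illustrative model, not part of the tool: the function names are hypothetical, and fails_when_bound_to stands in for the manual step of assigning the PID to one processor and watching whether the process misbehaves. The final classification (failure everywhere suggests the process, failure on some processors suggests those processors) is an assumption drawn from the procedure, not a rule stated by the tool.

```python
from typing import Callable, Iterable, List


def isolate_suspects(pdevs: Iterable[int],
                     fails_when_bound_to: Callable[[int], bool]) -> List[int]:
    """Return the pdev numbers on which the test process misbehaves.

    fails_when_bound_to(pdev) is a hypothetical stand-in for the manual
    step: assign the PID to that processor and report True on failure.
    """
    suspects = []
    for pdev in pdevs:
        if fails_when_bound_to(pdev):
            suspects.append(pdev)
    return suspects


def classify(suspects: List[int], total_processors: int) -> str:
    """Interpret the result (assumed heuristic, not tool behavior)."""
    if not suspects:
        return "no failure reproduced"
    if len(suspects) == total_processors:
        # The process fails wherever it runs, so the process is suspect.
        return "process-specific problem"
    # The process fails only on some processors, so those are suspect.
    return "processor-specific problem"
```

    For example, if the process fails only when bound to pdev 2 out of four processors, isolate_suspects returns [2] and classify reports a processor-specific problem.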

    Common Tasks

    If you run into problems, see the Troubleshooting Tool-Related Problems help topic.

    Troubleshooting Tool-Related Problems

    * !!A failure or other problem with the CPU Expert Tool:!!
      Consult the Failure Log (for failures) and the Test Activity Log, accessible from the STM main window.
    * !!Problem with the device:!!
      Make sure the device is connected, powered on, and on-line. You could try running the verifier on the device to see why the device is not responding.
      You can stop the executing function by pressing the !!Interrupt!! button in the window of the expert tool. To stop the expert tool entirely and return to the xstm Main Window, press the !!Abort!! button.
    * !!You start a CPU Expert Tool command but nothing seems to happen:!!
      The CPU Expert Tool may be waiting for you to respond to a query message it is displaying. Look underneath the windows on your workstation for a query message. Also, de-iconize any windows related to stm or the expert tool.
    * !!When you try to run the CPU Expert Tool from the main STM interface, you do not see it listed:!!
      The CPU Expert Tool requires a license to operate. Install the license at the main STM window and try to run the tool again.

    Cpu Exercise


    The !!Exercise!! function is applied to a specified CPU and runs stress tests on the data cache, the data TLB, and the floating point unit in turn.
      Data cache exerciser -- exercises the data cache by forcing misses, replacements, and flushes.

      Data TLB exerciser -- exercises the data TLB by forcing page faults, which force TLB replacements.

      Floating point exerciser -- exercises the floating point unit by providing a series of known inputs and checking for the corresponding outputs. The benchmark algorithm used here is "tomcatv".

      Execution Time Limit -- the !!Execution Time Limit!! parameter can be specified by the user as an option to limit the CPU exercise execution time, in minutes. The default value is 10 minutes.

      Error Allowed Number -- the !!Error Allowed Number!! parameter can be specified by the user as an option to limit the number of errors allowed before the CPU exercise test terminates. The default value is 10.
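
      The way the two exerciser limits interact can be illustrated with a small sketch. It is not the tool's implementation: the function and constant names are hypothetical, and it assumes the exercise stops as soon as either limit is reached (the help text does not say whether the error count must be reached or exceeded).

```python
# Defaults documented above: 10 minutes and 10 errors.
DEFAULT_TIME_LIMIT_MIN = 10
DEFAULT_ERRORS_ALLOWED = 10


def exercise_should_stop(elapsed_min: float,
                         error_count: int,
                         time_limit_min: float = DEFAULT_TIME_LIMIT_MIN,
                         errors_allowed: int = DEFAULT_ERRORS_ALLOWED) -> bool:
    """Return True when the exercise would end under the assumed rule.

    Assumption: reaching either limit ends the run. The real tool may
    instead stop only after the error count is exceeded.
    """
    time_expired = elapsed_min >= time_limit_min
    too_many_errors = error_count >= errors_allowed
    return time_expired or too_many_errors
```

      With the defaults, a run at 5 minutes with 3 errors continues, while a run at 10 minutes, or one that has logged 10 errors, stops.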

    Commands and Buttons


    The CPU Expert Tool offers the following commands and buttons:
    * File Menu
    * Options Menu
    * CPU-Exercise Menu
    * Help Menu
    * Buttons


      Exit


      Use the !!Exit!! command to terminate the execution of the CPU Expert Tool and return to the main STM window. If the user tries to exit without reactivating deactivated processor(s), a warning message will appear. The message will indicate that not all processors have been reactivated, and that all processors will be reactivated upon exiting.

      Activate


      The !!Activate!! command returns a previously DEACTIVATED processor to normal system use. It returns an error message if the activation routine does not return SUCCESSFUL or if the wrong pdev number was selected. After execution, the !!Activate!! command displays in the main window the SPU, Physical Device, and HPA number along with the current number of activated processors.

      Assign


      Forces (and locks) a process to a specific processor module identified by its physical device number. PID and pdev number arguments are required. If a wrong physical device number is entered, an error message is displayed. The user must find an active PID number using a different application (top, ps -ef).
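
      Since the PID has to be found outside the tool, a short script can help pick it out of ps -ef output. The sketch below is a convenience example, not part of STM. It assumes the usual ps -ef layout in which the PID is the second whitespace-separated column; the function names are hypothetical.

```python
import subprocess
from typing import List, Tuple


def parse_ps_ef(output: str, name_filter: str = "") -> List[Tuple[int, str]]:
    """Extract (PID, full line) pairs from `ps -ef` text.

    Assumes the PID is the second column. The filter is a plain substring
    match against the whole line, so it can also match the UID column.
    """
    matches = []
    for line in output.splitlines()[1:]:  # skip the header line
        fields = line.split()
        if len(fields) < 2 or not fields[1].isdigit():
            continue
        if name_filter in line:
            matches.append((int(fields[1]), line.strip()))
    return matches


def list_pids(name_filter: str = "") -> List[Tuple[int, str]]:
    """Run `ps -ef` on the local system and parse its output."""
    result = subprocess.run(["ps", "-ef"], capture_output=True,
                            text=True, check=True)
    return parse_ps_ef(result.stdout, name_filter)
```

      Calling list_pids("my_test") would list the candidate PIDs whose ps -ef line contains "my_test"; the chosen PID is then entered in the !!Assign!! dialog together with the pdev number.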

      Deactivate


      Deactivates a processor from system use. A pdev number argument is required. The Expert Tool will display a warning message if there is only one processor currently activated or on any attempt to deactivate the Monarch. The !!Deactivate!! command will display in the main window the SPU, Physical Device, and HPA number along with the current number of activated processors after execution.
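
      The two warning conditions described above can be written as a small check. This is an illustrative model only: the function name and message strings are hypothetical and do not reproduce the tool's actual messages.

```python
from typing import Optional


def deactivate_warning(active_count: int,
                       target_is_monarch: bool) -> Optional[str]:
    """Return a warning text if a Deactivate request should be questioned.

    Models the two documented cases: the target is the Monarch, or only
    one processor is currently activated. Returns None otherwise.
    """
    if target_is_monarch:
        return "attempt to deactivate the Monarch"
    if active_count <= 1:
        return "only one processor is currently activated"
    return None
```

      For instance, deactivate_warning(4, False) returns None, while deactivate_warning(1, False) and deactivate_warning(4, True) each return a warning string.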

      Display


      The !!Display!! command displays a listing of the current processor status and identifying information for all system processors:

      SPU Number:
      Physical Device Number:
      HPA: (hex)
      Status: (Enabled/Disabled)
      Monarch: (Yes/No)
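
      If the displayed fields need to be recorded, for example in a troubleshooting script, they map naturally onto a small record type. The sketch below is a hypothetical representation, not the tool's internal format; the class name, field names, and the sample HPA value are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class ProcessorStatus:
    """One entry of the Display listing (hypothetical representation)."""
    spu: int
    pdev: int
    hpa: int        # hard physical address, shown in hex
    enabled: bool
    monarch: bool


def format_status(p: ProcessorStatus) -> str:
    """Render one record using the field labels from the Display listing."""
    return "\n".join([
        f"SPU Number: {p.spu}",
        f"Physical Device Number: {p.pdev}",
        f"HPA: {p.hpa:#x}",
        f"Status: {'Enabled' if p.enabled else 'Disabled'}",
        f"Monarch: {'Yes' if p.monarch else 'No'}",
    ])
```

      A record such as ProcessorStatus(0, 1, 0xFFFA0000, True, False) is rendered as five lines, one per field, with the HPA in hexadecimal.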

      Mark Deconfig


      The !!Mark Deconfig!! command is used to mark a processor for deconfiguration on the next reboot. The Expert Tool will display a warning message if there is only one processor configured or on any attempt to mark the Monarch for deconfiguration. After execution, the !!Mark Deconfig!! command displays in the main window the SPU, Physical Device, and HPA number along with the current number of processors marked for deconfiguration.

      Mark Reconfig


      The !!Mark Reconfig!! command is used to mark a processor for reconfiguration on the next reboot. A processor may be marked for reconfiguration if it is currently deconfigured (not the same as marked for deconfiguration) or if it is marked for deconfiguration !!and!! deactivated. The Expert Tool will display a warning message if no processors are deconfigured, if no processors are marked for deconfiguration !!and!! deactivated, or on any attempt to mark the Monarch for reconfiguration. After execution, the !!Mark Reconfig!! command displays in the main window the SPU, Physical Device, and HPA number along with the current number of processors marked for reconfiguration.
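
      The eligibility rule for reconfiguration combines several processor states, so it may be easier to read as a predicate. The following is a sketch of the rule as described above, with hypothetical parameter names; it is not taken from the tool's source.

```python
def can_mark_reconfig(deconfigured: bool,
                      marked_for_deconfig: bool,
                      deactivated: bool,
                      is_monarch: bool) -> bool:
    """Return True if a processor qualifies for Mark Reconfig.

    A processor qualifies when it is currently deconfigured, or when it
    is both marked for deconfiguration and deactivated. An attempt on the
    Monarch produces a warning, so it is treated as not eligible here.
    """
    if is_monarch:
        return False
    return deconfigured or (marked_for_deconfig and deactivated)
```

      Note that a processor that is only marked for deconfiguration, but still activated, does not qualify under this rule.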

      Run


      The !!Run!! command launches the CPU exerciser for a selected CPU. In the popup dialog box, the user can modify parameters of the exerciser such as !!Execution Time Limit!! and !!Error Allowed Number!!. By design, only one CPU exerciser can run at any time.
      To monitor the running processes, the user may use the top utility to obtain execution information on the running process.
      The run record can be found in the CPU Expert activity log.

      Termination


      The !!Termination!! command terminates the running exerciser when the user wants to stop the exercise before the !!Execution Time Limit!! is reached. The exercise termination message is displayed in the main window of the expert tool.

      General Help


      The !!General Help!! command displays online help for the CPU Expert Tool (the help system you are currently viewing).
      For information on using the help system, select the "Using Help..." command in the Help menu in this Help window.
      For online help on STM as a whole, use the Help menu in the STM main window.


      Version


      The !!Version!! command displays version and copyright information for the CPU Expert Tool.

      Abort Button


      The !!Abort!! button allows you to abort the CPU Expert Tool, halting its execution and returning to the STM main window.
      The tool will be requested to abort and the state of the tool execution in the system map will be set to ABORT_PENDING. When the tool actually exits, its state will be set to ABORTED.
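
      The abort sequence is a two-step state change, which the following sketch models. Only ABORT_PENDING and ABORTED are state names from this help text; RUNNING and the function names are hypothetical placeholders for illustration.

```python
from enum import Enum


class ToolState(Enum):
    RUNNING = "RUNNING"              # hypothetical name for the active state
    ABORT_PENDING = "ABORT_PENDING"  # abort requested, tool still exiting
    ABORTED = "ABORTED"              # tool has actually exited


def request_abort(state: ToolState) -> ToolState:
    """Pressing Abort moves a running tool to ABORT_PENDING."""
    if state is ToolState.RUNNING:
        return ToolState.ABORT_PENDING
    return state


def tool_exited(state: ToolState) -> ToolState:
    """When the tool actually exits, ABORT_PENDING becomes ABORTED."""
    if state is ToolState.ABORT_PENDING:
        return ToolState.ABORTED
    return state
```

      In this model the ABORTED state is reached only through ABORT_PENDING, which matches the order described above.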

      Interrupt Button


      The !!Interrupt!! button allows you to interrupt the tool. When the tool receives the interrupt, it stops whatever it is doing and asks whether you want to continue with the current operation. If the choice is "Continue", the tool continues with the current operation. If the choice is "Abort", the tool aborts the current operation and returns to the window for the CPU Expert Tool.