Jump to content United States-English
HP.com Home Products and Services Support and Drivers Solutions How to Buy
» Contact HP
More options
HP.com home
HP 9000 V-Class Server: Architecture > Chapter 9 System utilities

EMUC and Power-on

» 

Technical documentation

Complete book in PDF
» Feedback
Content starts here

 » Table of Contents

 » Glossary

The EMUC performs all environmental monitoring on the ECUB. It attaches to the core logic bus so that processors can monitor the system by accessing these CSRs.

The EMUC works in conjunction with a hardware section on the ECUB known as the power-on circuit. This circuit controls powering up the entire system. It operates when the rest of the system is powered off or in some indeterminate state. It drives the environment LED display which is a basic (minimal hardware, no software) indication of what environmental error caused the ECUB to power down the system.

The teststation can also read the environmental LED display.

Environmental monitoring functions

The EMUC and the power-on circuit monitor the following environmental conditions:

  • ASIC installation error sensing

  • FPGA configuration and status

  • Thermal sensing

  • Fan Sensing

  • Power failure sensing

  • 48V failure

  • 48V maintenance

  • Ambient air temperature sensing

  • Power-on

    Table 9-1 Environmental conditions monitored by the EMUC and power-on circuit

    ConditionTypeAction
    ASIC Not Installed OKEnvironmental errorPower not turned on, LED indication
    FPGA not OKEnvironmental errorPower not turned on, LED indication
    48V failEnvironmental errorPower turned off, LED indication
    Midplane power failEnvironmental errorPower turned off, LED indication
    Board over tempEnvironmental errorPower off in one second, LED indication, Interrupt
    Fan not turningEnvironmental errorPower off in one second, LED indication, Interrupt
    Ambient air hotEnvironmental errorPower off in one second, LED indication, interrupt
    Other power failEnvironmental errorPower off in one second, LED indication, interrupt
    Ambient air warmEnvironmental warningLED indication, interrupt
    48V maintenanceEnvironmental warningLED indication, interrupt
    Hard errorHard errorLED indication, interrupt

     

Environmental conditions detected by power-on function

The power-on function detects environmental errors (such as ASIC Install or FPGA Not OK) immediately and does not turn on power to the system until the conditions are corrected. It also detects environmental errors such as 48V Fail while the system is powering up and Midplane Power Fail after the system has powered up. If a failure is detected in these two cases, the power-on circuit turns off power to the system.

Environmental warnings such as 48V maintenance are also detected by the power-on circuit. It applies these to the EMUC, which then sends an environmental warning interrupt to the system processors.

In all cases, the power-on circuit lights an environmental LED display code. The environmental LED display code is prioritized so that it only displays the highest priority error or warning.

Environmental conditions detected by EMUC

The EMUC detects most of the environmental conditions. It samples error conditions during a time period derived from a local 10-Hz clock that drives the power-on circuit. It registers all the environmental error conditions twice and then ORs them together. If the conditions persist for 200 milliseconds, the environmental error bit is set, and an environmental error interrupt is sent to the EPUC, which sends it on to the processors. The EMUC then waits 1.2 seconds and commands the power-on circuit to power down the system.

This same procedure exists for an environmental warning except that an environmental warning interrupt is sent and the circuit does not power down the system.

The environmental error interrupt and the 1.2 second delay provide the system adequate time to read CSRs to determine the cause of the error, log the condition in NVRAM, and display the condition on the LCD.

After the system is powered down, the ECUB is still powered up, but all outputs are disconnected from the system.

Environmental LED display

Second-level registers in the EMUC drive the 6-bit display. The EMUC prioritizes the environmental errors and warnings and passes the information to the power-on circuit. This circuit prioritizes the 6-bit field with its environmental conditions and produces a 7-bit field plus an attention bit (ATTN) that drives the Display. ATTN is on if there is an environmental warning.

In general, the power-on-detected errors are a higher priority than EMUC-detected errors, the lower the error code number, the higher its priority. Environmental warnings are lower priority than the environmental errors. Table 9-2 “Environmental LED display” shows the LED display error codes.

Table 9-2 Environmental LED display

ATTN bitLED Display Description
100ECUB 3.3V error (highest priority)
101ASIC Install 0 (ENRB)
102ASIC Install 1 (MEM)
103FPGA not OK
104-07DC OK error (UL, UR, LL, LR)
108-1148V error, NPSUL fail, PWRUP=0-9
112-1B48V error, NPSUR failure, PWRUP=0-9
11C-2548V error, NPSLL failure, PWRUP=0-9
126-2F48V error, NPSLR failure, PWRUP=0-9
130-3948V error, no supply failure, PWRUP=0-9
13A48V 7yo-yo error
13BENRB power failure (ENRBPB)
13CClock failure
13D-3FNot used (3)
140-47MB0-MB7 power failure
148-4FPB0L, PB1R, PB2L, PB3R, PB4L, PB5R, PB6L, PB7R power failure
150-57PB0R, PB1L, PB2R, PB3L, PB4R, PB5L, PB6R, PB7L power failure (possibly switch R and L)
158-5BIOB (LR,LF,RF,RR) power failure
15C-61Fan failure (UR,UM,UL,LR,LM,LL)
162Ambient hot
163Overtemp ENRB
164-67Overtemp quadrant (RL, RU, LL, LU)
168Hard error
169Ambient warm
16A-6FNot used (6)
170-73DC supply maintenance (UL,UR,LL,LR)
174-7FNot used (12)
000-09PWRUP state (00=System all powered up), attention LED off

 

The top of the table is the highest priority, the bottom the lowest. If a higher condition occurs, that one is displayed.

Monitored environmental conditions

This section describes each environmental condition that is monitored by the power-on circuit and the EMUC.

ECUB 3.3V error

This error indicates that the ECUB 3.3V power supply has failed, but the 5V supply has not.

ASIC installation error

Each ASIC has install lines to prevent power-up if an ASIC is installed incorrectly (such as an EPAC installed in an ERACs position). If an ASIC is improperly installed, the ECUB does not power up the system. This condition is not monitored after power up.

DC OK error

When this error is displayed, the power-on circuit did not power up the system, because one or more 48V power supplies reported an error. In systems with redundant 48V power supplies, this error means that two or more 48V supplies reported an error.

48V error

If the 48V supply has dropped below 42 volts for any reason other than normally turning off the system or an ac failure, then this error is displayed by the power-on circuit. Also, the 48V supply that reported the error and the power-up state of the system at the time of the error is displayed.

48V yo-yo error

This error indicates that a 48V error occurred and the ECUB lost and then later regained power without the machine being turned off. The power-on circuit will display this error and not power on the system, because the 48V supply is likely at fault.

Clock failure

If the system clock fails, then the EMUC will be unable to monitor environmental errors that could possibly damage the system. If the power-on circuit receives no response from the EMUC, it powers down the system and displays this error.

FPGA configuration and status

The EMUC is programmed by a serial data transfer from EEPROM upon utility board power-up. If the transfer does not complete properly, the EMUC cannot configure itself and many environmental conditions cannot be monitored. The power-on circuit monitors both the EMUC and EPUC and does not power up the system, if they are not configured correctly.

Board over-temperature

There is one temperature sensor per board that detects board overheating. The sensors are bussed together into four system quadrants plus the ENRB and applied to the EMUC.

Fan sensing

Sensors in the six fans determine if the fans are running properly. The EMUC waits 12.8 seconds for the fans to spin up after power-up before monitoring them.

Power failure

Because a power failure on a board could cause damage to other boards, a mechanism is in place to detect 3.3V failures on each board. Power failures are considered environmental errors, and the system is powered down after they are detected.

ENRB power failure

If the ENRB power fails, the power-on circuit powers down the entire system. The ECUB is still active, but the power-on circuit displays the power failure condition and disables all ECUB outputs that drive the system. This condition persists until power is cycled on the ECUB.

48V maintenance

There are up to four 48V power supplies. Each sends a signal to the power-on circuit. If any supply fails at any time, the circuit asserts the 48V maintenance line to the EMUC, which reports the environmental warning to the processors. The power-on circuit displays the highest priority 48V supply that failed.

Ambient air sensors

The ambient air sensors detect a too warm or too hot condition in the input air stream. Ambient air too warm is an environmental warning; ambient air too hot is an environmental error that powers down the system.

The temperature set points are set by the teststation. The digital temperature sensor has nonvolatile storage for the temperature set points. Power-on reset starts the digital temperature sensor without the core logic microprocessor intervening.

Environmental control

Described in the following sections are functions the ECUB performs to control the system environment.

Power-on

When the power switch is turned on, the outputs of the 48V power supplies become active. Several hundred milliseconds after the ECUB 5V supply reaches an acceptable level, the power-on circuit starts powering up the other dc-to-dc converters of the system in succession.

The power-on circuit does not power up the system if an ASIC is installed incorrectly (see the section “ASIC installation error”) or if an FPGA is not configured (see the section “FPGA configuration and status”). It keeps the system powered up unless an environmental condition occurs that warrants a power-down.

Voltage margining

Voltage margin is divided into four groups to minimize control, but allows all boards that communicate with each other to be margined separately for nominal, upper, and lower voltage.

EMUC CSRs

This section describes some of the EMUC CSRs.

EMUC Processor Report register

The Processor Report register indicates the processors that are working in the system. Each processor reports by writing to this register and setting the bit corresponding to the processor number.

Figure 9-4 Processor Report register definition

Processor Report register definition

P0-P15 comprise a fully readable and writable field. The bits are cleared on reset. Once a bit is written to a one value, it remains set until cleared by reset. Writes of a zero value do nothing. The bit, Px, set to a one value, indicates that processor x has reported in working.

EMUC Processor Semaphore register

The Processor Semaphore register provides a signaling function for processor synchronization. This is an atomic read-and-increment register.

Figure 9-5 Processor Semaphore register definition

Processor Semaphore register definition

Count is cleared on reset. Writes load any value. Reads return the value of Count and then increment Count atomically.

EMUC ERAC Data register

The ERAC data register holds the data to be written to the destination ERAC CSR or the data that has been read from the ERAC CSR.

Figure 9-6 ERAC Data register definition

ERAC Data register definition

ERAC Data bits comprise a fully readable and writable field. After an ERAC read operation, the ERAC Data register holds the data. After the ERAC write operation, the data is stored in the ERAC register, and ERAC Data is undefined.

EMUC ERAC Configuration Control register

The ERAC Configuration Control register selects the target ERAC, the address of the CSR within that ERAC, and the type of CSR access (read or write). It controls the ERAC CSR operation and then returns status of the operation.

Figure 9-7 ERAC Configuration Control register definition

ERAC Configuration Control register definition

The fields and bits of the ERAC configuration Control registers are defined as follows:

  • Select field (bits 6:7)—Selects the target ERAC. This field is write only.

  • Address field (bits 10:11)—Selects the address of the CSR within that ERAC.

  • SB Start/Busy bit (bit 14)—Starts the ERAC operation by writing a one value. Reading the bit returns the status of the operation (0=idle, 1=busy). SB and SEL must be written together.

  • RW bit (bit 15)—Selects the type of operation: Read (RW=1), or Write (RW=0).

EMUC Reset register

The EMUC Reset register initiates a reset or displays the type of the last reset. This CSR also contains the revision status.

Figure 9-8 EMUC Reset register definition

EMUC Reset register definition

The bits and field of the Reset register are defined as follows:

  • SR (Soft Reset) bit (bit 6)—Initiates a soft reset.

  • HR (Hard Reset) bit (bit 7)—Initiates a hard reset.

    The combination of SR and HR bits in the read mode indicate the resets shown in Table 9-3 “Reset register read codes”. The combination of SR and HR bits in the write mode indicate the resets given in Table 9-4 “Reset register write codes”.

    Table 9-3 Reset register read codes

    SR HR - ReadLast reset was
    0 0Power-on reset
    0 1Hard reset
    1 0Soft reset

     

    Resets are initiated by writing to this register. Reset is asserted according to the codes in Table 9-4 “Reset register write codes”. The only difference between a hard and soft reset is the action taken by the software upon reading the codes.

    Table 9-4 Reset register write codes

    SR HR - WriteAction taken
    X 1Hard reset
    1 0Soft reset

     

  • Revision field (bits 8:15) indicates the revision of the EMUC FPGA. This field is read only.

Printable version
Privacy statement Using this site means you accept its terms Feedback to webmaster
© Hewlett-Packard Development Company, L.P.