over non-recoverable threshold.
Explanation
The service processor on the specified blade server has detected that the specified component has reached or exceeded the specified nonrecoverable threshold (such as over temperature or over voltage).
Severity
Error
Alert Category
Blades (Critical)
-
mmTrapBladeC
Log Source
Blade_##
Automatically notify service
No
Recoverable
Yes
Example Message
- Memory bank 2 (BANK2 TEMP) temperature over nonrecoverable threshold. Reading: 6.10, Threshold: 7.00.
- System board (Inlet Temp) temperature over non-recoverable threshold. Reading: 6.10, Threshold: 7.00.
Alarm Panel LED (BC T and BC HT)
Critical
User response
Refer to the Problem Determination and Service Guide for the specified blade server to determine the device-specific actions to resolve this event. The
Problem Determination and Service Guide is available on the Web.
If you do not have access to the Problem Determination and Service Guide, perform these steps:
- If the event is related to an over temperature condition:
- Check the room ambient temperature to ensure that it is within the operating specifications for the chassis.
- If an air filter is installed, make sure that it is cleaned or replaced.
- Make sure that all fan/blower modules are running. Replace fan modules if necessary.
- Make sure that a device or filler is installed in each bay in the front and rear of the chassis, and make sure that there is nothing covering the bays.
Any missing components can cause a major reduction in airflow for the blade server.
- If the event is related to an over voltage condition:
- If the over voltage problem is occurring on all blade servers, look for other events in the log related to power and resolve those events.
If the over voltage problem is occurring on all blade servers in the same power domain, the problem might be in one of the power modules that
power that domain (log in to the advanced management module to see the power modules that are associated with each power domain).
Replace the power modules, one at a time, to see if the problem is resolved.
- If no other blade server has this same problem, the issue is specific to the blade as a hardware or firmware problem.
If the blade is still functioning the log can be ignored, but the blade should be monitored to see if the voltages get worse.
- Check the IBM Support Web page for any service bulletins that might be related to this problem.