BIOS stability improvements in early detection and error handling related to memory DIMMs

Products Affected:
NX-G6/G7 Platforms with BIOS versions prior to PB42.300, PU42.300 or PW42.300 or later


Description:

If we go in IPMI we can see critical error like

Correctable ECC
IPMI Error.


Nutanix has further improved the ability to detect problematic DIMMs and prevent unnecessary service
disruption as a result of uncorrectable memory errors (UECC).


BIOS improvements introduced in PB41.002,PU41.002 and PW42.000 or later include:
• Improved proactive detection of memory errors during Patrol Scrub alerts will generate in NCC
• Reduction in correctable memory error threshold to generate an alert
• Enabling of Adaptive Data Correction (ADC)


BIOS improvements introduced in PB42.300,PU42.300 and PW42.300 include:


• Enabling of Post Package Repair (PPR)
• Patrol Scrub correctable errors integrated into the CECC error handling and operate as a part of the RAS workflow
• Fixed an issue where a watchdog “three-strike” error could cause the host to stop or restart
unexpectedly


Resolution:

Latest Stable BIOS Version

Nutanix has released latest stable BIOS version with stability and lots of improvements / fixes in early detection and error handling related to memory DIMMs.

Nutanix NX – BIOS latest and stable version are that can be repair DIMM after host reboot.:

  1. PB42.300 or later
  2. PU42.300 or later
  3. PW42.300 or later

Leave a Reply