Overview
Commands
- lshw
- lshw -short
- dmidecode
- sensors
- mcelog
Status Monitoring
Error Detection And Correction (EDAC) Devices
- /sys/devices/system/edac/mc/mc*/csrow*/ch*_ce_count
- /sys/devices/system/edac/mc/mc*/csrow*/ue_count
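The counters listed above are plain text files holding one integer each, so they can be polled with a short script. The following is a minimal sketch, assuming the EDAC driver for the platform is loaded and exposes the csrow layout shown above; the function name `read_edac_counts` and the `root` parameter are illustrative, not part of any existing tool.

```python
from pathlib import Path

# Default sysfs location of the EDAC memory controller entries (from the paths above).
EDAC_ROOT = Path("/sys/devices/system/edac/mc")


def read_edac_counts(root=EDAC_ROOT):
    """Return {relative counter path: count} for the CE/UE counters under root.

    `root` is a hypothetical parameter added so the function can be pointed
    at a test directory instead of the real sysfs tree.
    """
    root = Path(root)
    counts = {}
    # Same glob patterns as the two sysfs paths listed in this section.
    for pattern in ("mc*/csrow*/ch*_ce_count", "mc*/csrow*/ue_count"):
        for path in sorted(root.glob(pattern)):
            try:
                value = int(path.read_text().strip())
            except (OSError, ValueError):
                # Skip counters that are unreadable or not a plain integer.
                continue
            counts[path.relative_to(root).as_posix()] = value
    return counts


if __name__ == "__main__":
    for name, value in read_edac_counts().items():
        print(f"{name}: {value}")
```

On a machine without EDAC support the function simply returns an empty dictionary. A non-zero `ue_count`, or a `ch*_ce_count` that keeps growing between polls, is usually the signal worth investigating.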
Reliability, Availability and Serviceability
- mcelog
Terminology
FRU | Field Replaceable Unit |
DIMM | Dual Inline Memory Module |
CE | Correctable Error |
UE | Uncorrectable Error |