综述¶
命令¶
- lshw
- lshw -short
- dmidecode
- sensors
- mcelog
状态监测¶
Error Detection And Correction (EDAC) Devices
- /sys/devices/system/edac/mc/mc*/csrow*/ch*_ce_count
- /sys/devices/system/edac/mc/mc*/csrow*/ue_count
Reliability, Availability and Serviceability
- mcelog
术语¶
| FRU | Field Replaceable Unit |
| DIMM | Dual Inline Memory Module |
| CE | Correctable Error |
| UE | Uncorrected Error |