Look's good, what is your hardware? Server model & NVM'es?
k
On 19 Feb 2021, at 13:22, zxcs
<zhuxiongcs(a)163.com> wrote:
BTW, actually i have two nodes has same issues, and another error node's nvme output
as below
Smart Log for NVME device:nvme0n1 namespace-id:ffffffff
critical_warning : 0
temperature : 29 C
available_spare : 100%
available_spare_threshold : 10%
percentage_used : 1%
data_units_read : 592,340,175
data_units_written : 26,443,352
host_read_commands : 5,341,278,662
host_write_commands : 515,730,885
controller_busy_time : 14,052
power_cycles : 8
power_on_hours : 4,294
unsafe_shutdowns : 6
media_errors : 0
num_err_log_entries : 0
Warning Temperature Time : 0
Critical Composite Temperature Time : 0
Temperature Sensor 1 : 29 C
Temperature Sensor 2 : 46 C
Temperature Sensor 3 : 0 C
Temperature Sensor 4 : 0 C
Temperature Sensor 5 : 0 C
Temperature Sensor 6 : 0 C
Temperature Sensor 7 : 0 C
Temperature Sensor 8 : 0 C
For compare, i get one healthy node’s nvme output as below:
mart Log for NVME device:nvme0n1 namespace-id:ffffffff
critical_warning : 0
temperature : 27 C
available_spare : 100%
available_spare_threshold : 10%
percentage_used : 1%
data_units_read : 579,829,652
data_units_written : 28,271,336
host_read_commands : 5,237,750,233
host_write_commands : 518,979,861
controller_busy_time : 14,166
power_cycles : 3
power_on_hours : 4,252
unsafe_shutdowns : 1
media_errors : 0
num_err_log_entries : 0
Warning Temperature Time : 0
Critical Composite Temperature Time : 0
Temperature Sensor 1 : 27 C
Temperature Sensor 2 : 39 C
Temperature Sensor 3 : 0 C
Temperature Sensor 4 : 0 C
Temperature Sensor 5 : 0 C
Temperature Sensor 6 : 0 C
Temperature Sensor 7 : 0 C
Temperature Sensor 8 : 0 C