All;
I turned on device health metrics in one of our Nautilus clusters. Unfortunately, it
doesn't seem to be collecting any information.
When I do "ceph device get-health-metrics <device>, I get the following;
{
"20200821-223626": {
"dev": "/dev/sdc",
"error": "smartctl failed",
"nvme_smart_health_information_add_log_error": "nvme returned an
error: sudo: exit status: 1",
"nvme_smart_health_information_add_log_error_code": -22,
"nvme_vendor": "samsung_ssd_860_evo_4tb",
"smartctl_error_code": -22,
"smartctl_output": "smartctl returned an error (1): stderr:\nsudo:
exit status: 1\nstdout:\n"
}
}
The cluster is Nautilus 14.2.16 (updated from 14.2.11 just after turning on health
metrics). Smartctl is release 7.0 dated 2018-12-30 at 14:47:55 UTC.
Thoughts?
Thank you,
Dominic L. Hilsbos, MBA
Director - Information Technology
Perform Air International Inc.
DHilsbos(a)PerformAir.com
www.PerformAir.com