On 10/7/19 9:15 AM, i.schmidt(a)langeoog.de wrote:
> Hi folks,
>
> We are using Ceph as the storage backend on our 6-node Proxmox VM cluster. To monitor our
> systems we use Zabbix, and I would like to get some Ceph data into Zabbix so that we get
> alarms when something goes wrong.
>
> Ceph mgr has a module, "zabbix", that uses "zabbix-sender" to actively
> send data, but I cannot get the module working. It always responds with "failed to
> send data".
>
> The network side seems to be fine:
>
> root@vm-2:~# traceroute 192.168.15.253
> traceroute to 192.168.15.253 (192.168.15.253), 30 hops max, 60 byte packets
> 1 192.168.15.253 (192.168.15.253) 0.411 ms 0.402 ms 0.393 ms
>
> root@vm-2:~# nmap -p 10051 192.168.15.253
> Starting Nmap 7.70 ( https://nmap.org ) at 2019-09-18 08:40 CEST
> Nmap scan report for 192.168.15.253
> Host is up (0.00026s latency).
> PORT STATE SERVICE
> 10051/tcp open zabbix-trapper
> MAC Address: BA:F5:30:EF:40:EF (Unknown)
> Nmap done: 1 IP address (1 host up) scanned in 0.61 seconds
>
> root@vm-2:~# ceph zabbix config-show
> {"zabbix_port": 10051, "zabbix_host": "192.168.15.253", "identifier": "VM-2", "zabbix_sender": "/usr/bin/zabbix_sender", "interval": 60}
> root@vm-2:~#
I recommend NOT setting the identifier to 'VM-2'.

I wrote the Zabbix module, so here is a bit of background.

If you don't set the identifier, the fsid (UUID) of the cluster will be used.

You have now set 'VM-2', but it's not guaranteed that the ceph-mgr will run
on that host. If the Mgr fails over to a different host, it will no longer be
'vm-2' sending the data.

Usually I just leave the identifier empty and use the fsid of the
cluster as the hostname in Zabbix.
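
If the identifier has already been set, one way to get the same effect is to set it explicitly to the fsid. This is only a sketch: `set_identifier_to_fsid` is a hypothetical helper name, and it assumes it runs on a node where the `ceph` CLI has admin access. It uses `ceph fsid` and `ceph zabbix config-set`, nothing else.

```shell
# Hypothetical helper: pin the Zabbix identifier to the cluster fsid, so it
# stays the same no matter which host the active ceph-mgr runs on.
set_identifier_to_fsid() {
    # Ask the cluster for its fsid (UUID).
    fsid=$(ceph fsid) || return 1
    # Store it as the identifier the zabbix module reports under.
    ceph zabbix config-set identifier "$fsid" >/dev/null || return 1
    # Print the value; a host with exactly this name must exist in Zabbix.
    printf '%s\n' "$fsid"
}

# Usage (on a node with an admin keyring):
#   set_identifier_to_fsid
#   ceph zabbix config-show
```

The host name configured in Zabbix then has to match the printed value exactly.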
> But if I try "ceph zabbix send" I get "failed to send data to zabbix", and this shows up in the system's journal:
> Sep 18 08:41:13 vm-2 ceph-mgr[54445]: 2019-09-18 08:41:13.272 7fe360fe4700 -1 mgr.server reply reply (1) Operation not permitted
>
> The log of ceph-mgr on that machine states:
> 2019-09-18 08:42:18.188 7fe359fd6700 0 mgr[zabbix] Exception when sending: /usr/bin/zabbix_sender exited non-zero: zabbix_sender [3253392]: DEBUG: answer [{"response":"success","info":"processed: 0; failed: 44; total: 44; seconds spent: 0.000179"}]
> 2019-09-18 08:43:18.217 7fe359fd6700 0 mgr[zabbix] Exception when sending: /usr/bin/zabbix_sender exited non-zero: zabbix_sender [3253629]: DEBUG: answer [{"response":"success","info":"processed: 0; failed: 44; total: 44; seconds spent: 0.000321"}]
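
Note what the zabbix_sender answer in that log says: the response is "success", but all 44 items are counted as failed and none as processed. zabbix_sender exits non-zero when the server rejects values, which is why the module logs an exception. That would usually point at the Zabbix side (for example, no host whose name matches the identifier, or no matching trapper items on that host) rather than at user rights, although that is an assumption about this particular setup. A small sketch for pulling the counters out of the info string; `parse_sender_info` is a hypothetical helper, not part of Ceph or Zabbix:

```shell
# Hypothetical helper: print one counter from a zabbix_sender "info" string.
# $1 = info string, $2 = counter name (processed, failed or total)
parse_sender_info() {
    printf '%s\n' "$1" | sed -n "s/.*$2: \([0-9][0-9]*\).*/\1/p"
}

# Info string copied from the ceph-mgr log above.
info='processed: 0; failed: 44; total: 44; seconds spent: 0.000179'

failed=$(parse_sender_info "$info" failed)
total=$(parse_sender_info "$info" total)

# Any failed item means the server accepted the connection but not the data.
if [ "$failed" -gt 0 ]; then
    echo "Zabbix rejected $failed of $total items"
fi
```

If every item is rejected, as here, the first thing to compare is the identifier shown by "ceph zabbix config-show" against the host name configured in Zabbix.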
>
> I'm guessing this could have something to do with user rights, but I have no idea where to start tracking this down.
>
> Maybe someone here has a hint?
> If more information is needed, I will gladly provide it.
>
> greetings
> Ingo