Hi Rodrigo,
The fact that you're getting logs from mon and the function name
set_mon_vals suggests that you made use of mon provided centralized config,
like `ceph config set osd cluster_network x.x.x.x` but it seems
cluster_network and public_network cannot be centralized and should be set
in ceph.conf. (Just an idea not really an expert).
If that's the case you should be able to `ceph config rm osd
cluster_network` to get your cluster healthy again. Then you could try
setting in config file directly.
HTH
On Fri, Nov 29, 2019 at 4:55 PM Rodrigo Severo - Fábrica <
rodrigo(a)fabricadeideias.com> wrote:
Em qui., 28 de nov. de 2019 às 18:32, Rodrigo Severo -
Fábrica
<rodrigo(a)fabricadeideias.com> escreveu:
Em qui., 28 de nov. de 2019 às 13:39, Wido den Hollander
<wido(a)42on.com> escreveu:
>
> On 11/28/19 5:23 PM, Rodrigo Severo - Fábrica wrote:
> > Em qui., 28 de nov. de 2019 às 00:34, Konstantin Shalygin
> > <k0ste(a)k0ste.ru> escreveu:
> >>
> >>> My servers have 2 network boards each. I would like to use the
current
> >>> local one to talk to Cephs
clients (both CephFS and Object Storage)
> >>> and use the second one to all Cephs processes to talk one to the
> >>> other.
> >>>
> >>
> >> Ceph support `cluster network` and `public network` options. Only
OSD
> >> work with cluster network. Any
other is a OSD clients - public
network.
> >
> > Great. How do I migrate from my current single network board to a
dual
one?
> >
> > Can I migrate servers one by one to the dual network setup or do I
> > have to stop the whole ceph cluster and restart it all already on the
> > dual setup?
>
> Set the cluster_network in the ceph.conf and restart the OSDs one by
one.
Just tried that. The first OSD that I'm trying to restart won't came
up again.
Does anybody has any suggestion on how to get my ceph fs back to healthy
status?
I'm even considering stop it all and restarting but I'm afraid it
won't come back up with the new config.
Ideias? Suggestions?
Rodrigo
It presents the following messages which aren't that useful
to me:
Nov 28 18:26:33 a2-df systemd[1]: ceph-osd(a)1.service: Start request
repeated too quickly.
Nov 28 18:26:33 a2-df systemd[1]: ceph-osd(a)1.service: Failed with
result 'exit-code'.
Nov 28 18:26:33 a2-df systemd[1]: Failed to start Ceph object storage
daemon osd.1.
-- Subject: Unit ceph-osd(a)1.service has failed
-- Defined-By: systemd
-- Support:
http://www.ubuntu.com/support
--
-- Unit ceph-osd(a)1.service has failed.
--
-- The result is RESULT.
I also see the following error messages:
Nov 28 18:26:46 a2-df ceph-mon[2526]: 2019-11-28 18:26:46.230
7f487dee1700 -1 set_mon_vals failed to set cluster_network =
192.168.111.0/24: Configuration option 'cluster_network' may not be
modified at runtime
Nov 28 18:26:46 a2-df ceph-mon[2526]: 2019-11-28 18:26:46.230
7f487dee1700 -1 set_mon_vals failed to set public_network =
192.168.109.0/24: Configuration option 'public_network' may not be
modified at runtime
There are 2 things that I don't understand in these messages:
1. Why is it mentioning configuration option 'public_network' in these
error messages as I didn't change the public_network config, I only
added a cluster_network one?
2. Why are there messages from ceph-mon when I'm trying to restart
ceph-osd?
And the most important issue: how can I get my osd back online?
Regards,
Rodrigo
_______________________________________________
ceph-users mailing list -- ceph-users(a)ceph.io
To unsubscribe send an email to ceph-users-leave(a)ceph.io