[Pacemaker] Node name problems after upgrading to 1.1.9
Bernardo Cabezas Serra
bcabezas at apsl.net
Thu Jun 27 12:29:51 UTC 2013
Hello,
Oh, sorry: I have deleted node selavi and restarted, and now it works
OK and I can't reproduce the bug :(
On 27/06/13 12:32, Andrew Beekhof wrote:
> No, more likely a bug. Which is concerning since I thought I had this particular kind ironed out.
>
> Could you set PCMK_trace_functions=crm_get_peer on selavi and repeat the test?
>
> The exact way to do this will depend on your distro (which is?).
> On most rpm based distros it's done in /etc/sysconfig/pacemaker
The distro is Ubuntu 12.04 LTS (precise).
As I have installed everything under /opt/ha, the file is /opt/ha/etc/default/pacemaker.
The setting is only for debugging, isn't it? I can't see any more info, but now
it works.
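
For reference, a minimal sketch of the setting in that file. The variable name and value are the ones Andrew asked for; whether the line needs an `export` in front of it is an assumption that depends on how the init script reads the file, so treat this as an illustration rather than the exact required form.

```shell
# Sketch of the relevant line in /opt/ha/etc/default/pacemaker
# (on most rpm based distros the equivalent file is /etc/sysconfig/pacemaker).
# Assumption: the init script sources this file and passes the variable on
# to the Pacemaker daemons; if it does not, an "export" may be needed.

# Ask Pacemaker to trace calls to the crm_get_peer function.
PCMK_trace_functions=crm_get_peer
```

Pacemaker has to be restarted on the node for the change to be picked up.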
The strange thing is that I had previously deleted both nodes and cleaned
up the var/lib/pacemaker state on selavi, but it continued to fail.
The long explanation of what I have done:
- Reverted corosync on selavi back to 2.3.0 (turifel still on the 2.3.0.66 git
version)
- On starting, the issue persisted
- On turifel, crm node status showed two nodes with the same uname (selavi)
but different IDs.
- From the crm shell on turifel, deleted selavi twice (with some warnings).
- Put the cluster in unmanaged mode
- Rebooted the turifel node (because of a dlm lock failure caused by an
incorrect corosync stop)
- Set PCMK_trace_functions=crm_get_peer on both nodes.
- On start, everything worked :/
- Upgraded corosync on selavi back to 2.3.0.66 git
- Still working.
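
The symptom in the steps above (one uname listed under two different IDs) could be checked for with something like the sketch below. It assumes a node list with one "<id> <uname>" pair per line; the helper name `find_duplicate_unames` and the sample IDs are hypothetical, not output from this cluster.

```shell
# Hypothetical helper: reads "<id> <uname>" lines on stdin and prints every
# uname that appears with more than one distinct id.
find_duplicate_unames() {
    awk '
        NF >= 2 {
            key = $2 SUBSEP $1
            # Count each (uname, id) pair only once.
            if (!(key in seen)) {
                seen[key] = 1
                count[$2]++
            }
        }
        END {
            for (name in count)
                if (count[name] > 1)
                    print name
        }
    ' | sort
}

# Made-up sample data for illustration only.
sample='1 turifel
2 selavi
3 selavi'

duplicates=$(printf '%s\n' "$sample" | find_duplicate_unames)
echo "$duplicates"
```

On a live cluster the input would come from whatever command prints the node list in that two-column shape, which is an assumption to verify against the installed version.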
I will keep trying to reproduce the issue.
PS: the PCMK_trace_functions environment variable is only for debugging purposes,
isn't it? (I mean: it can't have resolved the issue.)
Thanks so much for your help!
Regards,
Bernardo
--
APSL
*Bernardo Cabezas Serra*
*Systems Manager*
Camí Vell de Bunyola 37, esc. A, local 7
07009 Polígono de Son Castelló, Palma
Mail: bcabezas at apsl.net
Skype: bernat.cabezas
Tel: 971439771