[Pacemaker] some questions about STONITH

Andrew Beekhof andrew at beekhof.net
Tue Jan 7 21:07:42 EST 2014


On 26 Nov 2013, at 12:39 am, Andrey Groshev <greenx at yandex.ru> wrote:

>> ...snip...
>>>  Make next test:
>>>  #stonith_admin --reboot=dev-cluster2-node2
>>>  Node reboot, but resource don't start.
>>>  In crm_mon status - Node dev-cluster2-node2 (172793105): pending.
>>>  And it will be hung.
>> 
>> That is *probably* a race - the node reboots too fast, or still
>> communicates for a bit after the fence has supposedly completed (if it's
>> not a reboot -nf, but a mere reboot). We have had problems here in the
>> past.
>> 
>> You may want to file a proper bug report with crm_report included, and
>> preferably corosync/pacemaker debugging enabled.
> 
> It was found that he hangs not forever.
> Triggered timeout - in 20 minutes.
> crm_report archive - http://send2me.ru/pen2.tar.bz2
> Of course in the logs many type entries:
> 
> pgsql:1: Breaking dependency loop at msPostgresql
> 
> But where does this relationship after a timeout, I do not understand.

Can you rephrase your question?
I'm not 100% sure I understand what you're asking.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 841 bytes
Desc: Message signed with OpenPGP using GPGMail
URL: <http://lists.clusterlabs.org/pipermail/pacemaker/attachments/20140108/53f4bdb1/attachment-0002.sig>


More information about the Pacemaker mailing list