[Pacemaker] Cluster crash

Wed Feb 8 08:29:12 EST 2012

Dear community,

I am currently running different corosync / drbd cluster using VM running
on vmware esxi host.
Guest Os are Debian Squeeze.

the active member of the cluster just freeze the VM was unreachable.
But the resources didn't achieved to move to the other node.

My cluster has the following ressources :

Resource Group: grp
     fs-data    (ocf::heartbeat:Filesystem):
     nagios-ip  (ocf::heartbeat:IPaddr2):
     apache2    (ocf::heartbeat:apache):
     nagios     (lsb:nagios3):
     pnp        (lsb:npcd):

I am currently troubleshooting this issue. I don't really know where to
look. Of course I had a look at the logs, but it is pretty hard for me to
understand what happen.
I noticed that the VM crash at 12:09 and that the cluster only try to move
the ressources at  12:58, this does not make sens for me. Or maybe the host
wasn't totaly down ?

Do you have any idea how I can troubleshoot ?

Last thing, I notice that If I start apache2 on the slave server, corosync
didn't detect that the resource is started, could that be an issue ?

Regards,
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.clusterlabs.org/pipermail/pacemaker/attachments/20120208/c0b34d6b/attachment-0002.html>