<html><head><style type='text/css'>p { margin: 0; }</style></head><body><div style='font-family: Times New Roman; font-size: 12pt; color: #000000'><font size="3">Hello,</font><div style="color: rgb(0, 0, 0); font-size: 12pt; font-family: 'Times New Roman'; "><br></div><div id="" style="color: rgb(0, 0, 0); font-size: 12pt; font-family: 'Times New Roman'; ">Recently I finished configuring a two-node cluster with pacemaker 1.1.6 and heartbeat 3.0.5 on nodes running Ubuntu 10.04. This cluster includes the following resources:</div><div id="" style="color: rgb(0, 0, 0); font-size: 12pt; font-family: 'Times New Roman'; ">- primitives for DRBD storage devices</div><div id="" style="color: rgb(0, 0, 0); font-size: 12pt; font-family: 'Times New Roman'; ">- primitives for mounting the filesystem on the DRBD storage</div><div id="" style="color: rgb(0, 0, 0); font-size: 12pt; font-family: 'Times New Roman'; ">- primitives for some mount binds</div><div id="" style="color: rgb(0, 0, 0); font-size: 12pt; font-family: 'Times New Roman'; ">- primitive for starting apache</div><div id="" style="color: rgb(0, 0, 0); font-size: 12pt; font-family: 'Times New Roman'; ">- primitives for starting samba and nfs servers (following instructions <a href="http://www.linbit.com/fileadmin/tech-guides/ha-nfs.pdf">here</a>)</div><div id="" style="color: rgb(0, 0, 0); font-size: 12pt; font-family: 'Times New Roman'; ">- primitives for exporting nfs shares (ocf:heartbeat:exportfs)</div><div id="" style="color: rgb(0, 0, 0); font-size: 12pt; font-family: 'Times New Roman'; "><br></div><div id="" style="color: rgb(0, 0, 0); font-size: 12pt; font-family: 'Times New Roman'; ">Perhaps this is best described through the output of crm_mon:</div><div id=""><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2">Online: [ node1 node2 ]</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"><br></font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> Master/Slave Set: ms_drbd_mount1 [p_drbd_mount1] (unmanaged)</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> p_drbd_mount1:0 (ocf::linbit:drbd): Started node2 (unmanaged)</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> p_drbd_mount1:1 (ocf::linbit:drbd): Started node1 (unmanaged) FAILED</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> Master/Slave Set: ms_drbd_mount2 [p_drbd_mount2]</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" id="" size="2"> p_drbd_mount2:0 (ocf::linbit:drbd): Master node1 (unmanaged) FAILED</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> Slaves: [ node2 ]</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> Resource Group: g_core</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2" id=""> p_fs_mount1 (ocf::heartbeat:Filesystem): Started node1</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> p_fs_mount2 (ocf::heartbeat:Filesystem): Started node1</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> p_ip_nfs (ocf::heartbeat:IPaddr2): Started node1</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> Resource Group: g_apache</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> p_fs_mountbind1 (ocf::heartbeat:Filesystem): Started node1</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> p_fs_mountbind2 (ocf::heartbeat:Filesystem): Started node1</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> p_fs_mountbind3 (ocf::heartbeat:Filesystem): Started node1</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> p_fs_varwww (ocf::heartbeat:Filesystem): Started node1</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> p_apache (ocf::heartbeat:apache): Started node1</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> Resource Group: g_fileservers</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> p_lsb_smb (lsb:smbd): Started node1</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> p_lsb_nmb (lsb:nmbd): Started node1</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> p_lsb_nfsserver (lsb:nfs-kernel-server): Started node1</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> p_exportfs_mount1 (ocf::heartbeat:exportfs): Started node1</font></div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2"> p_exportfs_mount2 (ocf::heartbeat:exportfs): Started node1</font></div></div><div id="" style="color: rgb(0, 0, 0); font-size: 12pt; font-family: 'Times New Roman'; "><br></div><div id="" style="color: rgb(0, 0, 0); font-size: 12pt; ">I have read through the <a href="http://www.clusterlabs.org/doc/en-US/Pacemaker/1.1/html-single/Pacemaker_Explained" style="font-family: 'Times New Roman'; ">Pacemaker Explained</a> documentation, however could not find a way to further debug these problems. First, <span style="font-size: 12pt; " id="">I put node1 into standby mode to attempt failover to the other node (node2). Node2 appeared to start the transition to master, however it failed to promote the DRBD resources to master (the first step). I have attached a copy of this session in commands.log and additional excerpts from /var/log/syslog during important steps. </span>I have attempted everything I can think of to try and start the DRBD resource (e.g. start/stop/promote/manage/cleanup under <font face="'courier new', courier, monaco, monospace, sans-serif">crm resource</font>, restarting heartbeat) but cannot bring it out of the slave state. However, if I set it to unmanaged and then run <font face="'courier new', courier, monaco, monospace, sans-serif">drbdadm primary all</font> in the terminal, pacemaker is satisfied and continues starting the rest of the resources. It then failed when attempting to mount the filesystem for mount2, the p_fs_mount2 resource. I attempted to mount the filesystem myself and was successful. I then unmounted it and ran cleanup on p_fs_mount2 and then it mounted. The rest of the resources started as expected until the p_exportfs_mount2 resource, which failed as follows:</div><div id=""><font face="'courier new', courier, monaco, monospace, sans-serif" size="2">p_exportfs_mount2 (ocf::heartbeat:exportfs): started node2 (unmanaged) FAILED</font></div><div id=""><br></div><div id="">I ran cleanup on this and it started, however when running this test earlier today no command could successfully start this exportfs resource. </div><div id=""><br></div><div id="aeaoofnhgocdbnbeljkmbjdmhbcokfdb-mousedown">How can I configure pacemaker to better resolve these problems and be able to bring the node up successfully on its own? What can I check to determine why these failures are occuring? /var/log/syslog did not seem to contain very much useful information regarding why the failures occurred.</div><div id="aeaoofnhgocdbnbeljkmbjdmhbcokfdb-mousedown"><br></div><div id="aeaoofnhgocdbnbeljkmbjdmhbcokfdb-mousedown">Thanks,</div><div id="aeaoofnhgocdbnbeljkmbjdmhbcokfdb-mousedown"><br></div><div id="aeaoofnhgocdbnbeljkmbjdmhbcokfdb-mousedown">Andrew</div><div id="aeaoofnhgocdbnbeljkmbjdmhbcokfdb-mousedown" style="color: rgb(0, 0, 0); font-size: 12pt; "><br></div><div id="" style="color: rgb(0, 0, 0); font-size: 12pt; font-family: 'Times New Roman'; "><br></div></div></body></html>