[Pacemaker] ERROR: Device "drbd0": another node is heartbeating in our slot!

Hunny Bunny tmphb at yahoo.com
Tue Jan 26 17:52:43 EST 2010


Hello folkz,

I'm very puzzled by the error messages in  my /var/log/warn shown below.
I'm not using any Pacemaker or Hearbeat CRM at this point.

I have a plain configuration of DRBD which replicates /dev/sda4 as a block device /dev/drbd0 between two nodes node1 and node2

On a top of /dev/drbd0 block I have OCFS2 partition which is mounted on node1 and node2
as /data. 

Also /dev/drbd0 is available as iscsi-target from the node2 to the node3

On the node3 using iscsiadm (open-iscsi) I can login into iscsi-target on the node2 which then becomes /dev/sdb on the node3.

Then by executing /etc/init.d/o2cb.init start and  /etc/init.d/ocfs2.init start I can also mount 
/dev/sdb as /data on the node3.

It works and I can access all the files in /data which is shared between these three nodes.
However, this "...another node is heartbeating in our slot!" warning on the node2 just drives me crazy.

Could somebody please help me to either turn it off or explain what is wrong.

Many thanks in advance,

Alex

messages in /var/log/warn
<------------ snipped -------------->

Jan 26 13:50:31 node2 kernel: [ 2740.959848] OCFS2 Node Manager 1.5.0
Jan 26 13:50:31 node2 kernel: [ 2740.961868] OCFS2 DLM 1.5.0
Jan 26 13:50:31 node2 kernel: [ 2740.962625] ocfs2: Registered cluster interface o2cb
Jan 26 13:50:31 node2 kernel: [ 2740.974260] OCFS2 DLMFS 1.5.0
Jan 26 13:50:31 node2 kernel: [ 2740.974331] OCFS2 User DLM kernel interface loaded
Jan 26 13:50:44 node2 kernel: [ 2753.748066] OCFS2 1.5.0
Jan 26 13:50:44 node2 kernel: [ 2753.757040] ocfs2_dlm: Nodes in domain ("8B9FFC5A4F12408EA5FC14B7CD1B3E97"): 2
Jan 26 13:50:44 node2 kernel: [ 2753.821386] ocfs2: Mounting device (147,0) on (node 2, slot 0) with ordered data mode.
Jan 26 13:55:01 node2 kernel: [ 3011.010186] (3449,0):o2hb_do_disk_heartbeat:768 ERROR: Device "drbd0": another node is heartbeating in our slot!
Jan 26 13:55:03 node2 kernel: [ 3013.020160] (3449,0):o2hb_do_disk_heartbeat:768 ERROR: Device "drbd0": another node is heartbeating in our slot!
Jan 26 13:55:05 node2 kernel: [ 3015.030169] (3449,0):o2hb_do_disk_heartbeat:768 ERROR: Device "drbd0": another node is heartbeating in our slot!
Jan 26 13:55:07 node2 kernel: [ 3017.040183] (3449,0):o2hb_do_disk_heartbeat:768 ERROR: Device "drbd0": another node is heartbeating in our slot!
Jan 26 13:55:09 node2 kernel: [ 3019.050178] (3449,1):o2hb_do_disk_heartbeat:768 ERROR: Device "drbd0": another node is heartbeating in our slot!
Jan 26 13:55:31 node2 kernel: [ 3041.010023] (3431,0):o2net_connect_expired:1656 ERROR: no connection established with node 3 after 30.0 seconds, giving up and returning errors.

<------------ snipped -------------->



      
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.clusterlabs.org/pipermail/pacemaker/attachments/20100126/fed487a2/attachment.html>


More information about the Pacemaker mailing list