Hello guys,<br><br>We are running <br><br>corosync-1.0.0<br>heartbeat-2.99.1<br>pacemaker-1.0.4<br><br>the corosync.conf under /etc/corosync/ is <br><br>============<br># Please read the corosync.conf.5 manual page<br>compatibility: whitetank<br>
<br>aisexec {<br> user: root<br> group: root<br>}<br>totem {<br> version: 2<br> secauth: off<br> threads: 0<br> interface {<br> ringnumber: 0<br> bindnetaddr: 172.30.0.0<br>
mcastaddr:226.94.1.1<br> mcastport: 5406<br> }<br>}<br><br>logging {<br> fileline: off<br> to_stderr: yes<br> to_logfile: yes<br> to_syslog: yes<br> logfile: /tmp/corosync.log<br>
debug: on<br> timestamp: on<br> logger_subsys {<br> subsys: pacemaker<br> debug: on<br> tags: enter|leave|trace1|trace2| trace3|trace4|trace6<br> }<br>
}<br><br><br>service {<br> name: pacemaker<br> ver: 0<br> # use_mgmtd: yes<br> # use_logd:yes<br>}<br><br><br>corosync {<br> user: root<br> group: root<br>}<br><br><br>amf {<br> mode: disabled<br>
}<br>============<br><br><br>#service corosync start <br><br>starts the messaging but fails to load pacemaker,<br><br>/tmp/corosync.log --- <br><br>==================<br><br>Oct 21 11:05:43 corosync [MAIN ] Corosync Cluster Engine ('trunk'): started and ready to provide service.<br>
Oct 21 11:05:43 corosync [MAIN ] Successfully read main configuration file '/etc/corosync/corosync.conf'.<br>Oct 21 11:05:43 corosync [TOTEM ] Token Timeout (1000 ms) retransmit timeout (238 ms)<br>Oct 21 11:05:43 corosync [TOTEM ] token hold (180 ms) retransmits before loss (4 retrans)<br>
Oct 21 11:05:43 corosync [TOTEM ] join (50 ms) send_join (0 ms) consensus (800 ms) merge (200 ms)<br>Oct 21 11:05:43 corosync [TOTEM ] downcheck (1000 ms) fail to recv const (50 msgs)<br>Oct 21 11:05:43 corosync [TOTEM ] seqno unchanged const (30 rotations) Maximum network MTU 1500<br>
Oct 21 11:05:43 corosync [TOTEM ] window size per rotation (50 messages) maximum messages per rotation (17 messages)<br>Oct 21 11:05:43 corosync [TOTEM ] send threads (0 threads)<br>Oct 21 11:05:43 corosync [TOTEM ] RRP token expired timeout (238 ms)<br>
Oct 21 11:05:43 corosync [TOTEM ] RRP token problem counter (2000 ms)<br>Oct 21 11:05:43 corosync [TOTEM ] RRP threshold (10 problem count)<br>Oct 21 11:05:43 corosync [TOTEM ] RRP mode set to none.<br>Oct 21 11:05:43 corosync [TOTEM ] heartbeat_failures_allowed (0)<br>
Oct 21 11:05:43 corosync [TOTEM ] max_network_delay (50 ms)<br>Oct 21 11:05:43 corosync [TOTEM ] HeartBeat is Disabled. To enable set heartbeat_failures_allowed > 0<br>Oct 21 11:05:43 corosync [TOTEM ] Initializing transmit/receive security: libtomcrypt SOBER128/SHA1HMAC (mode 0).<br>
Oct 21 11:05:43 corosync [TOTEM ] Receive multicast socket recv buffer size (262142 bytes).<br>Oct 21 11:05:43 corosync [TOTEM ] Transmit multicast socket send buffer size (262142 bytes).<br>Oct 21 11:05:43 corosync [TOTEM ] The network interface [172.30.0.145] is now up.<br>
Oct 21 11:05:43 corosync [TOTEM ] Created or loaded sequence id 184.172.30.0.145 for this ring.<br>Oct 21 11:05:43 corosync [TOTEM ] entering GATHER state from 15.<br>Oct 21 11:05:43 corosync [SERV ] <font class="Apple-style-span" color="#FF0000"><b>Service failed to load 'pacemaker'.</b></font><br>
Oct 21 11:05:43 corosync [SERV ] Service initialized 'corosync extended virtual synchrony service'<br>Oct 21 11:05:43 corosync [SERV ] Service initialized 'corosync configuration service'<br>Oct 21 11:05:43 corosync [SERV ] Service initialized 'corosync cluster closed process group service v1.01'<br>
Oct 21 11:05:43 corosync [SERV ] Service initialized 'corosync cluster config database access v1.01'<br>Oct 21 11:05:43 corosync [SERV ] Service initialized 'corosync profile loading service'<br>Oct 21 11:05:43 corosync [MAIN ] Compatibility mode set to whitetank. Using V1 and V2 of the synchronization engine.<br>
Oct 21 11:05:43 corosync [TOTEM ] Creating commit token because I am the rep.<br>Oct 21 11:05:43 corosync [TOTEM ] Saving state aru 0 high seq received 0<br>Oct 21 11:05:43 corosync [TOTEM ] Storing new sequence id for ring bc<br>
Oct 21 11:05:43 corosync [TOTEM ] entering COMMIT state.<br>Oct 21 11:05:43 corosync [TOTEM ] got commit token<br>Oct 21 11:05:43 corosync [TOTEM ] entering RECOVERY state.<br>Oct 21 11:05:43 corosync [TOTEM ] position [0] member <a href="http://172.30.0.145">172.30.0.145</a>:<br>
Oct 21 11:05:43 corosync [TOTEM ] previous ring seq 184 rep 172.30.0.145<br>Oct 21 11:05:43 corosync [TOTEM ] aru 0 high delivered 0 received flag 1<br>Oct 21 11:05:43 corosync [TOTEM ] Did not need to originate any messages in recovery.<br>
Oct 21 11:05:43 corosync [TOTEM ] got commit token<br>Oct 21 11:05:43 corosync [TOTEM ] Sending initial ORF token<br>Oct 21 11:05:43 corosync [TOTEM ] token retrans flag is 0 my set retrans flag0 retrans queue empty 1 count 0, aru 0<br>
Oct 21 11:05:43 corosync [TOTEM ] install seq 0 aru 0 high seq received 0<br>Oct 21 11:05:43 corosync [TOTEM ] token retrans flag is 0 my set retrans flag0 retrans queue empty 1 count 1, aru 0<br>Oct 21 11:05:43 corosync [TOTEM ] install seq 0 aru 0 high seq received 0<br>
Oct 21 11:05:43 corosync [TOTEM ] token retrans flag is 0 my set retrans flag0 retrans queue empty 1 count 2, aru 0<br>Oct 21 11:05:43 corosync [TOTEM ] install seq 0 aru 0 high seq received 0<br>Oct 21 11:05:43 corosync [TOTEM ] token retrans flag is 0 my set retrans flag0 retrans queue empty 1 count 3, aru 0<br>
Oct 21 11:05:43 corosync [TOTEM ] install seq 0 aru 0 high seq received 0<br>Oct 21 11:05:43 corosync [TOTEM ] retrans flag count 4 token aru 0 install seq 0 aru 0 0<br>Oct 21 11:05:43 corosync [TOTEM ] recovery to regular 1-0<br>
Oct 21 11:05:43 corosync [TOTEM ] Delivering to app 1 to 0<br>Oct 21 11:05:43 corosync [SYNC ] This node is within the primary component and will provide service.<br>Oct 21 11:05:43 corosync [TOTEM ] entering OPERATIONAL state.<br>
Oct 21 11:05:43 corosync [TOTEM ] A processor joined or left the membership and a new membership was formed.<br>Oct 21 11:05:43 corosync [TOTEM ] mcasted message added to pending queue<br>Oct 21 11:05:43 corosync [TOTEM ] Delivering 0 to 1<br>
Oct 21 11:05:43 corosync [TOTEM ] Delivering MCAST message with seq 1 to pending delivery queue<br>Oct 21 11:05:43 corosync [SYNC ] confchg entries 1<br>Oct 21 11:05:43 corosync [SYNC ] Barrier Start Received From -1862263124<br>
Oct 21 11:05:43 corosync [SYNC ] Barrier completion status for nodeid -1862263124 = 1.<br>==================<br><div><br></div><div><br></div><div>I'm curious to know how actually corosync/openais loads pacemaker, the config directive seems to have done the magic but apparently not in my case.</div>
<div>What should I be looking for, as the log message hardly gives any information.</div><div><br></div><div>Pacemaker comprises bunch of daemons like crmd, stonithd and stuff, I ran them individually to see any permission problems</div>
<div>like /var/lib/heartbeat and /var/run/heartbeat which should be chown hacluster:haclient.</div><div><br></div><div><br></div><div>Even after doing those it fails to load.</div><div><br></div><div><br></div><div>Please advise me what should I do.</div>
<div><br></div><div><br></div><div>Thanks</div><div>Shravan</div><div><br></div><div><br></div><div><br></div>