[Pacemaker] The problem with which queue between cib and stonith-ng overflows

Andrew Beekhof andrew at beekhof.net
Mon Jun 2 07:31:29 EDT 2014


On 2 Jun 2014, at 3:05 pm, Yusuke Iida <yusk.iida at gmail.com> wrote:

> Hi, Andrew
> 
> I am using the latest 1.1 branch and testing with eight nodes.
> 
> Although the problem was resolved once, the queue between cib and
> stonithd is now overflowing again.
> 
> As an example, here is the log from the DC node.
> The problem occurs on all nodes.
> 
> Jun  2 11:34:02 vm04 cib[3940]:    error: crm_ipcs_flush_events:
> Evicting slow client 0x250afe0[3941]: event queue reached 638 entries
> Jun  2 11:34:02 vm04 stonith-ng[3941]:    error: crm_ipc_read:
> Connection to cib_rw failed
> Jun  2 11:34:02 vm04 stonith-ng[3941]:    error:
> mainloop_gio_callback: Connection to cib_rw[0x662510] closed (I/O
> condition=17)
> Jun  2 11:34:02 vm04 stonith-ng[3941]:   notice:
> cib_connection_destroy: Connection to the CIB terminated. Shutting
> down.
> Jun  2 11:34:02 vm04 stonith-ng[3941]:     info: stonith_shutdown:
> Terminating with  2 clients
> Jun  2 11:34:02 vm04 stonith-ng[3941]:     info: qb_ipcs_us_withdraw:
> withdrawing server sockets
> 
> After a resource configuration is loaded, stonithd takes a long time
> to build its device information: about 15 seconds.

15 seconds!! Yikes. I'll investigate tomorrow.

> It seems that cib diff messages accumulate between them.
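
To make the accumulation described above concrete, here is a minimal Python sketch of a bounded per-client event queue that evicts a client once its backlog grows past a limit. It is only an illustration, not Pacemaker's actual code: the names ClientQueue, seconds_until_eviction and max_backlog, and all of the numbers, are hypothetical. The only detail taken from the log is that cib evicted the client once its event queue had grown to several hundred entries.

```python
from collections import deque


class ClientQueue:
    """Hypothetical per-client event queue with a backlog limit."""

    def __init__(self, max_backlog):
        self.max_backlog = max_backlog  # assumed limit, not Pacemaker's real value
        self.events = deque()
        self.evicted = False

    def push(self, event):
        """Queue an event; evict the client if the backlog exceeds the limit."""
        if self.evicted:
            return
        self.events.append(event)
        if len(self.events) > self.max_backlog:
            self.events.clear()
            self.evicted = True

    def drain(self, count):
        """Remove up to `count` events, as a client reading its socket would."""
        for _ in range(min(count, len(self.events))):
            self.events.popleft()


def seconds_until_eviction(diffs_per_second, busy_seconds, max_backlog,
                           drained_per_second=0):
    """Return the second in which the client is evicted, or None if it survives.

    drained_per_second=0 models a client that reads nothing while it is busy.
    """
    queue = ClientQueue(max_backlog)
    for second in range(1, busy_seconds + 1):
        for n in range(diffs_per_second):
            queue.push((second, n))
        if queue.evicted:
            return second
        queue.drain(drained_per_second)
    return None
```

With these made-up numbers, a client that reads nothing for 15 seconds while 50 diffs arrive per second is evicted in second 11 when the limit is 500, whereas a client that keeps up with the same rate is never evicted.
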
> 
> Are there any plans to improve on this issue?
> 
> I have attached a report taken when the problem occurred.
> https://drive.google.com/file/d/0BwMFJItoO-fVUEFEN1NlelNWRjg/edit?usp=sharing
> 
> Regards,
> Yusuke
> -- 
> ----------------------------------------
> METRO SYSTEMS CO., LTD
> 
> Yusuke Iida
> Mail: yusk.iida at gmail.com
> ----------------------------------------
