<html><header></header><body><div style="font-family: Tahoma; font-size: 14px; color: #000000;">Sorry, I was using wrong hostnames for that networks, using debug log I found it was not finding "this node" in conf file.</div>
<div style="font-family: Tahoma; font-size: 14px; color: #000000;"> </div>
<div style="font-family: Tahoma; font-size: 14px; color: #000000;">Gabriele<br /><br />
<div id="wt-mailcard">
<div> </div>
<div> </div>
<div><span style="font-size: 14px; font-family: Helvetica;"><strong>Sonicle S.r.l. </strong>: <a href="http://www.sonicle.com/" target="_new">http://www.sonicle.com</a></span></div>
<div><span style="font-size: 14px; font-family: Helvetica;"><strong>Music: </strong><a href="http://www.gabrielebulfon.com/" target="_new">http://www.gabrielebulfon.com</a></span></div>
<div><span style="font-size: 14px; font-family: Helvetica;"><strong>Quantum Mechanics : </strong><a href="http://www.cdbaby.com/cd/gabrielebulfon" target="_new">http://www.cdbaby.com/cd/gabrielebulfon</a></span></div>
</div>
<br /><hr /><br /><br /><span style="font-family: Arial, Helvetica, sans-serif; font-size: small;"><strong>Da:</strong> Gabriele Bulfon <gbulfon@sonicle.com><br /><strong>A:</strong> Cluster Labs - All topics related to open-source clustering welcomed <users@clusterlabs.org><br /><strong>Data:</strong> 26 luglio 2020 11.23.53 CEST<br /><strong>Oggetto:</strong> Re: [ClusterLabs] pacemaker startup problem<br /></span><br /><br />
<blockquote style="border-left: #000080 2px solid; margin-left: 5px; padding-left: 5px;">
<div> </div>
<div style="font-family: Tahoma; font-size: 14px; color: #000000;">Thanks, I ran it manually so I got those errors, running from service script it correctly set PCMK_ipc_type to socket.</div>
<div style="font-family: Tahoma; font-size: 14px; color: #000000;"> </div>
<div style="font-family: Tahoma; font-size: 14px; color: #000000;">But now I see these now:<br /><br />Jul 26 11:08:16 [4039] pacemakerd: info: crm_log_init: Changed active directory to /sonicle/var/cluster/lib/pacemaker/cores<br />Jul 26 11:08:16 [4039] pacemakerd: info: mcp_read_config: cmap connection setup failed: CS_ERR_LIBRARY. Retrying in 1s<br />Jul 26 11:08:17 [4039] pacemakerd: info: mcp_read_config: cmap connection setup failed: CS_ERR_LIBRARY. Retrying in 2s<br />Jul 26 11:08:19 [4039] pacemakerd: info: mcp_read_config: cmap connection setup failed: CS_ERR_LIBRARY. Retrying in 3s<br />Jul 26 11:08:22 [4039] pacemakerd: info: mcp_read_config: cmap connection setup failed: CS_ERR_LIBRARY. Retrying in 4s<br />Jul 26 11:08:26 [4039] pacemakerd: info: mcp_read_config: cmap connection setup failed: CS_ERR_LIBRARY. Retrying in 5s<br />Jul 26 11:08:31 [4039] pacemakerd: warning: mcp_read_config: Could not connect to Cluster Configuration Database API, error 2<br />Jul 26 11:08:31 [4039] pacemakerd: notice: main: Could not obtain corosync config data, exiting<br />Jul 26 11:08:31 [4039] pacemakerd: info: crm_xml_cleanup: Cleaning up memory from libxml2<br /><br /></div>
<div style="font-family: Tahoma; font-size: 14px; color: #000000;"> </div>
<div style="font-family: Tahoma; font-size: 14px; color: #000000;">So I think I need to start corosync first (right?) but it dies with this:</div>
<div style="font-family: Tahoma; font-size: 14px; color: #000000;"> </div>
<div style="font-family: Tahoma; font-size: 14px; color: #000000;">Jul 26 11:07:06 [4027] xstorage1 corosync notice [MAIN ] Corosync Cluster Engine ('2.4.1'): started and ready to provide service.<br />Jul 26 11:07:06 [4027] xstorage1 corosync info [MAIN ] Corosync built-in features: bindnow<br />Jul 26 11:07:06 [4027] xstorage1 corosync notice [TOTEM ] Initializing transport (UDP/IP Multicast).<br />Jul 26 11:07:06 [4027] xstorage1 corosync notice [TOTEM ] Initializing transmit/receive security (NSS) crypto: none hash: none<br />Jul 26 11:07:06 [4027] xstorage1 corosync notice [TOTEM ] The network interface [10.100.100.1] is now up.<br />Jul 26 11:07:06 [4027] xstorage1 corosync notice [SERV ] Service engine loaded: corosync configuration map access [0]<br />Jul 26 11:07:06 [4027] xstorage1 corosync notice [YKD ] Service engine loaded: corosync configuration service [1]<br />Jul 26 11:07:06 [4027] xstorage1 corosync notice [YKD ] Service engine loaded: corosync cluster closed process group service v1.01 [2]<br />Jul 26 11:07:06 [4027] xstorage1 corosync notice [YKD ] Service engine loaded: corosync profile loading service [4]<br />Jul 26 11:07:06 [4027] xstorage1 corosync notice [QUORUM] Using quorum provider corosync_votequorum<br />Jul 26 11:07:06 [4027] xstorage1 corosync crit [QUORUM] Quorum provider: corosync_votequorum failed to initialize.<br />Jul 26 11:07:06 [4027] xstorage1 corosync error [SERV ] Service engine 'corosync_quorum' failed to load for reason 'configuration error: nodelist or quorum.expected_votes must be configured!'<br />Jul 26 11:07:06 [4027] xstorage1 corosync error [MAIN ] Corosync Cluster Engine exiting with status 20 at /data/sources/sonicle/xstream-storage-gate/components/cluster/corosync/corosync-2.4.1/exec/service.c:356.</div>
<div style="font-family: Tahoma; font-size: 14px; color: #000000;"><br />My corosync conf has nodelist configured! Here it is:</div>
<div style="font-family: Tahoma; font-size: 14px; color: #000000;"> </div>
<div style="font-family: Tahoma; font-size: 14px; color: #000000;">
<pre>service {
ver: 1
name: pacemaker
use_mgmtd: no
use_logd: no
}
totem {
version: 2
crypto_cipher: none
crypto_hash: none
interface {
ringnumber: 0
bindnetaddr: 10.100.100.0
mcastaddr: 239.255.1.1
mcastport: 5405
ttl: 1
}
}
nodelist {
node {
ring0_addr: xstorage1
nodeid: 1
}
node {
ring0_addr: xstorage2
nodeid: 2
}
}
quorum {
provider: corosync_votequorum
two_node: 1
}
logging {
fileline: off
to_stderr: no
to_logfile: yes
logfile: /sonicle/var/log/cluster/corosync.log
to_syslog: no
debug: off
timestamp: on
logger_subsys {
subsys: QUORUM
debug: off
}
}
</pre>
</div>
<div style="font-family: Tahoma; font-size: 14px; color: #000000;"> </div>
<div style="font-family: Tahoma; font-size: 14px; color: #000000;"> </div>
<div style="font-family: Tahoma; font-size: 14px; color: #000000;">
<div id="wt-mailcard">
<div> </div>
<div> </div>
<div><span style="font-size: 14px; font-family: Helvetica;"><strong>Sonicle S.r.l. </strong>: <a href="http://www.sonicle.com/" target="_new">http://www.sonicle.com</a></span></div>
<div><span style="font-size: 14px; font-family: Helvetica;"><strong>Music: </strong><a href="http://www.gabrielebulfon.com/" target="_new">http://www.gabrielebulfon.com</a></span></div>
<div><span style="font-size: 14px; font-family: Helvetica;"><strong>Quantum Mechanics : </strong><a href="http://www.cdbaby.com/cd/gabrielebulfon" target="_new">http://www.cdbaby.com/cd/gabrielebulfon</a></span></div>
</div>
<tt><br /><br /><br />----------------------------------------------------------------------------------<br /><br />Da: Ken Gaillot <kgaillot@redhat.com><br />A: Cluster Labs - All topics related to open-source clustering welcomed <users@clusterlabs.org> <br />Data: 25 luglio 2020 0.46.52 CEST<br />Oggetto: Re: [ClusterLabs] pacemaker startup problem<br /><br /></tt>
<blockquote style="border-left: #000080 2px solid; margin-left: 5px; padding-left: 5px;"><tt>On Fri, 2020-07-24 at 18:34 +0200, Gabriele Bulfon wrote:<br />> Hello,<br />> <br />> after a long time I'm back to run heartbeat/pacemaker/corosync on our<br />> XStreamOS/illumos distro.<br />> I rebuilt the original components I did in 2016 on our latest release<br />> (probably a bit outdated, but I want to start from where I left).<br />> Looks like pacemaker is having trouble starting up showin this logs:<br />> <br />> Set r/w permissions for uid=401, gid=401 on /var/log/pacemaker.log<br />> Set r/w permissions for uid=401, gid=401 on /var/log/pacemaker.log<br />> Jul 24 18:21:32 [971] crmd: info: crm_log_init: Changed active<br />> directory to /sonicle/var/cluster/lib/pacemaker/cores<br />> Jul 24 18:21:32 [971] crmd: info: main: CRM Git Version: 1.1.15<br />> (e174ec8)<br />> Jul 24 18:21:32 [971] crmd: info: do_log: Input I_STARTUP received in<br />> state S_STARTING from crmd_init<br />> Jul 24 18:21:32 [969] lrmd: info: crm_log_init: Changed active<br />> directory to /sonicle/var/cluster/lib/pacemaker/cores<br />> Jul 24 18:21:32 [968] stonith-ng: info: crm_log_init: Changed active<br />> directory to /sonicle/var/cluster/lib/pacemaker/cores<br />> Jul 24 18:21:32 [968] stonith-ng: info: get_cluster_type: Verifying<br />> cluster type: 'heartbeat'<br />> Jul 24 18:21:32 [968] stonith-ng: info: get_cluster_type: Assuming an<br />> active 'heartbeat' cluster<br />> Jul 24 18:21:32 [968] stonith-ng: notice: crm_cluster_connect:<br />> Connecting to cluster infrastructure: heartbeat<br /><br /><br />> Jul 24 18:21:32 [969] lrmd: error: mainloop_add_ipc_server: Could not<br />> start lrmd IPC server: Operation not supported (-48)<br /><br />This is repeated for all the subdaemons ... the error is coming from<br />qb_ipcs_run(), which looks like the issue is an invalid PCMK_ipc_type<br />for illumos. If you set it to "socket" it should work.<br /><br /><br />> Jul 24 18:21:32 [969] lrmd: error: main: Failed to create IPC server:<br />> shutting down and inhibiting respawn<br />> Jul 24 18:21:32 [969] lrmd: info: crm_xml_cleanup: Cleaning up memory<br />> from libxml2<br />> Jul 24 18:21:32 [971] crmd: info: get_cluster_type: Verifying cluster<br />> type: 'heartbeat'<br />> Jul 24 18:21:32 [971] crmd: info: get_cluster_type: Assuming an<br />> active 'heartbeat' cluster<br />> Jul 24 18:21:32 [971] crmd: info: start_subsystem: Starting sub-<br />> system "pengine"<br />> Jul 24 18:21:32 [968] stonith-ng: info: crm_get_peer: Created entry<br />> 25bc5492-a49e-40d7-ae60-fd8f975a294a/80886f0 for node xstorage1/0 (1<br />> total)<br />> Jul 24 18:21:32 [968] stonith-ng: info: crm_get_peer: Node 0 has uuid<br />> d426a730-5229-6758-853a-99d4d491514a<br />> Jul 24 18:21:32 [968] stonith-ng: info: register_heartbeat_conn:<br />> Hostname: xstorage1<br />> Jul 24 18:21:32 [968] stonith-ng: info: register_heartbeat_conn:<br />> UUID: d426a730-5229-6758-853a-99d4d491514a<br />> Jul 24 18:21:32 [970] attrd: notice: crm_cluster_connect: Connecting<br />> to cluster infrastructure: heartbeat<br />> Jul 24 18:21:32 [970] attrd: error: mainloop_add_ipc_server: Could<br />> not start attrd IPC server: Operation not supported (-48)<br />> Jul 24 18:21:32 [970] attrd: error: attrd_ipc_server_init: Failed to<br />> create attrd servers: exiting and inhibiting respawn.<br />> Jul 24 18:21:32 [970] attrd: warning: attrd_ipc_server_init: Verify<br />> pacemaker and pacemaker_remote are not both enabled.<br />> Jul 24 18:21:32 [972] pengine: info: crm_log_init: Changed active<br />> directory to /sonicle/var/cluster/lib/pacemaker/cores<br />> Jul 24 18:21:32 [972] pengine: error: mainloop_add_ipc_server: Could<br />> not start pengine IPC server: Operation not supported (-48)<br />> Jul 24 18:21:32 [972] pengine: error: main: Failed to create IPC<br />> server: shutting down and inhibiting respawn<br />> Jul 24 18:21:32 [972] pengine: info: crm_xml_cleanup: Cleaning up<br />> memory from libxml2<br />> Jul 24 18:21:33 [971] crmd: info: do_cib_control: Could not connect<br />> to the CIB service: Transport endpoint is not connected<br />> Jul 24 18:21:33 [971] crmd: warning: do_cib_control: Couldn't<br />> complete CIB registration 1 times... pause and retry<br />> Jul 24 18:21:33 [971] crmd: error: crmd_child_exit: Child process<br />> pengine exited (pid=972, rc=100)<br />> Jul 24 18:21:35 [971] crmd: info: crm_timer_popped: Wait Timer<br />> (I_NULL) just popped (2000ms)<br />> Jul 24 18:21:36 [971] crmd: info: do_cib_control: Could not connect<br />> to the CIB service: Transport endpoint is not connected<br />> Jul 24 18:21:36 [971] crmd: warning: do_cib_control: Couldn't<br />> complete CIB registration 2 times... pause and retry<br />> Jul 24 18:21:38 [971] crmd: info: crm_timer_popped: Wait Timer<br />> (I_NULL) just popped (2000ms)<br />> Jul 24 18:21:39 [971] crmd: info: do_cib_control: Could not connect<br />> to the CIB service: Transport endpoint is not connected<br />> Jul 24 18:21:39 [971] crmd: warning: do_cib_control: Couldn't<br />> complete CIB registration 3 times... pause and retry<br />> Jul 24 18:21:41 [971] crmd: info: crm_timer_popped: Wait Timer<br />> (I_NULL) just popped (2000ms)<br />> Jul 24 18:21:42 [971] crmd: info: do_cib_control: Could not connect<br />> to the CIB service: Transport endpoint is not connected<br />> Jul 24 18:21:42 [971] crmd: warning: do_cib_control: Couldn't<br />> complete CIB registration 4 times... pause and retry<br />> Jul 24 18:21:42 [968] stonith-ng: error: setup_cib: Could not connect<br />> to the CIB service: Transport endpoint is not connected (-134)<br />> Jul 24 18:21:42 [968] stonith-ng: error: mainloop_add_ipc_server:<br />> Could not start stonith-ng IPC server: Operation not supported (-48)<br />> Jul 24 18:21:42 [968] stonith-ng: error: stonith_ipc_server_init:<br />> Failed to create stonith-ng servers: exiting and inhibiting respawn.<br />> Jul 24 18:21:42 [968] stonith-ng: warning: stonith_ipc_server_init:<br />> Verify pacemaker and pacemaker_remote are not both enabled.<br />> <br />> Any idea what's happening?<br />> Gabriele<br />> <br />> <br />> <br />> <br />> Sonicle S.r.l. : http://www.sonicle.com<br />> Music: http://www.gabrielebulfon.com<br />> Quantum Mechanics : http://www.cdbaby.com/cd/gabrielebulfon<br />> _______________________________________________<br />> Manage your subscription:<br />> https://lists.clusterlabs.org/mailman/listinfo/users<br />> <br />> ClusterLabs home: https://www.clusterlabs.org/<br />-- <br />Ken Gaillot <kgaillot@redhat.com><br /><br />_______________________________________________<br />Manage your subscription:<br />https://lists.clusterlabs.org/mailman/listinfo/users<br /><br />ClusterLabs home: https://www.clusterlabs.org/<br /><br /><br /></tt></blockquote>
</div>
<pre>_______________________________________________
Manage your subscription:
https://lists.clusterlabs.org/mailman/listinfo/users
ClusterLabs home: https://www.clusterlabs.org/
</pre>
</blockquote>
</div></body></html>