[ClusterLabs] getting "Totem is unable to form a cluster" error

Jan Friesse jfriesse at redhat.com
Fri Apr 8 04:01:12 EDT 2016


> pacemaker 1.1.12-11.12
> openais 1.1.4-5.24.5
> corosync 1.4.7-0.23.5
>
> Its a two node active/passive cluster and we just upgraded the SLES 11
> SP 3 to SLES 11 SP 4(nothing  else) but when we try to start the cluster
> service we get the following error:
>
> "Totem is unable to form a cluster because of an operating system or
> network fault."
>
> Firewall is stopped and disabled on both the nodes. Both nodes can
> ping/ssh/vnc each other.

Hard to help. First of all, I would recommend asking SUSE support,
because I don't have access to the source code of the corosync
1.4.7-0.23.5 package and so don't know which patches have been added.


>
> corosync.conf:
> aisexec {
>      group:    root
>      user:    root
> }
> service {
>      use_mgmtd:    yes
>      use_logd:    yes
>      ver:    0
>      name:    pacemaker
> }
> totem {
>      rrp_mode:    none
>      join:    60
>      max_messages:    20
>      vsftype:    none
>      token:    5000
>      consensus:    6000
>
>      interface {
>          bindnetaddr:    192.168.150.0
>
>          member {
>              memberaddr:     192.168.150.12
>          }
>          member {
>              memberaddr:      192.168.150.13
>          }
>          mcastport:    5405
>
>          ringnumber:    0
>
>      }
>      secauth:    off
>      version:    2
>      transport:    udpu
>      token_retransmits_before_loss_const:    10
>      clear_node_high_bit:    new
> }
> logging {
>      to_logfile:    no
>      to_syslog:    yes
>      debug:    off
>      timestamp:    off
>      to_stderr:    no
>      fileline:    off
>      syslog_facility:    daemon
> }
> amf {
>      mode:    disable
> }
>
> /var/log/messages:
> Apr  6 17:51:49 prd1 corosync[8672]:  [MAIN  ] Corosync Cluster Engine
> ('1.4.7'): started and ready to provide service.
> Apr  6 17:51:49 prd1 corosync[8672]:  [MAIN  ] Corosync built-in
> features: nss
> Apr  6 17:51:49 prd1 corosync[8672]:  [MAIN  ] Successfully configured
> openais services to load
> Apr  6 17:51:49 prd1 corosync[8672]:  [MAIN  ] Successfully read main
> configuration file '/etc/corosync/corosync.conf'.
> Apr  6 17:51:49 prd1 corosync[8672]:  [TOTEM ] Initializing transport
> (UDP/IP Unicast).
> Apr  6 17:51:49 prd1 corosync[8672]:  [TOTEM ] Initializing
> transmit/receive security: libtomcrypt SOBER128/SHA1HMAC (mode 0).
> Apr  6 17:51:49 prd1 corosync[8672]:  [TOTEM ] The network interface is
> down.

^^^ This is the important line. It means corosync was unable to find an
interface for bindnetaddr 192.168.150.0. Make sure an interface with this
network address exists.
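
One way to check this by hand is sketched below. It is only an
approximation of the matching corosync does, not its actual code: it
assumes an interface "matches" when bindnetaddr falls inside the IPv4
network configured on that interface. The function name
matching_interfaces and the sample text are made up for illustration;
the input is expected to look like the output of `ip -o -4 addr show`.

```python
import ipaddress


def matching_interfaces(ip_addr_output, bindnetaddr):
    """Return (ifname, cidr) pairs whose IPv4 network contains bindnetaddr.

    Hypothetical helper: approximates the bindnetaddr lookup, it is not
    taken from the corosync sources.
    """
    target = ipaddress.ip_address(bindnetaddr)
    matches = []
    for line in ip_addr_output.splitlines():
        fields = line.split()
        # Expected shape of a line:
        # "2: eth0    inet 192.168.150.12/24 brd 192.168.150.255 scope global eth0"
        if "inet" not in fields:
            continue
        cidr = fields[fields.index("inet") + 1]
        iface = ipaddress.ip_interface(cidr)
        if target in iface.network:
            matches.append((fields[1], cidr))
    return matches


# Sample input (invented for this sketch, using the addresses from the config).
sample = (
    "1: lo    inet 127.0.0.1/8 scope host lo\n"
    "2: eth0    inet 192.168.150.12/24 brd 192.168.150.255 scope global eth0\n"
)
print(matching_interfaces(sample, "192.168.150.0"))
```

An empty list for the real output on a node would be consistent with the
"network interface is down" message: no configured interface carries an
address in the 192.168.150.0 network.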

Regards,
   Honza




