[Pacemaker] CLVM & Pacemaker & Corosync on Ubuntu Omeiric Server
Andreas Kurz
andreas at hastexo.com
Thu Nov 24 14:23:30 UTC 2011
Hello,
On 11/23/2011 03:30 PM, Vadim Bulst wrote:
> Hi list,
>
> I try to bring up a 3-node cluster running Ubuntu Omeiric. The packages
> I used are all from the Ubuntu-repo: pacemaker 1.1.5, corosync 1.3.0,
> clvm 2.02.66.
>
> I'm using teamed and bridged interfaces for networking. Every node has
> only one address for cluster management.
> All nodes are connected to a FC-san and see the same volumes. Iptables
> --list shows no rules.
>
> My problem:
>
> clvm resource is not coming up and when I commit some changes on
> resources in most cases one node is dying.
there are some constraints missing in your config ... or better, use a
cloned group ... see below ...
And use Corosync 1.4.x with redundant rings and automatic ring recovery
feature enabled.
>
> crm_mon:
>
> ============
> Last updated: Wed Nov 23 14:47:58 2011
> Stack: openais
> Current DC: bbzclnode07 - partition with quorum
> Version: 1.1.5-01e86afaaa6d4a8c4836f68df80ababd6ca3902f
> 3 Nodes configured, 3 expected votes
> 2 Resources configured.
> ============
>
> Online: [ bbzclnode06 bbzclnode04 bbzclnode07 ]
>
> Clone Set: dlm-clone [dlm]
> Started: [ bbzclnode06 bbzclnode07 bbzclnode04 ]
>
> Failed actions:
> clvm:0_start_0 (node=bbzclnode06, call=31, rc=1, status=complete):
> unknown error
> clvm:1_start_0 (node=bbzclnode07, call=5, rc=1, status=complete):
> unknown error
> clvm:0_start_0 (node=bbzclnode04, call=29, rc=1, status=complete):
> unknown error
>
>
>
>
> I configured corosync like this:
>
> totem {
> version: 2
> token: 3000
> token_retransmits_before_loss_const: 10
> join: 60
> consensus: 3600
> vsftype: none
> max_messages: 20
> clear_node_high_bit: yes
> secauth: on
> threads: 8
> rrp_mode: none
> interface {
> ringnumber: 0
> bindnetaddr: 192.168.128.0
> mcastaddr: 226.94.1.1
> mcastport: 5405
> }
> }
> amf {
> mode: disabled
> }
> service {
> ver: 0
> name: pacemaker
> }
> aisexec {
> user: root
> group: root
> }
> logging {
> fileline: off
> to_stderr: yes
> to_logfile: no
> to_syslog: yes
> syslog_facility: daemon
> debug: off
> timestamp: on
> logger_subsys {
> subsys: AMF
> debug: off
> tags: enter|leave|trace1|trace2|trace3|trace4|trace6
> }
> }
>
> and my cib looks like this:
>
> node bbzclnode04
> node bbzclnode06
> node bbzclnode07
> primitive clvm ocf:lvm2:clvmd \
> params daemon_timeout="30" \
> meta target-role="started"
> primitive m ocf:pacemaker:controld \
> meta target-role="started"
> clone clvm-clone clvm \
> meta clone-max="3" clone-node-max="1"
> clone dlm-clone dlm \
> meta clone-max="3" clone-node-max="1"
ommit those two clones, use a cloned group:
group g_lock dlm clvm
clone g_lock \
meta interleave="true"
> property $id="cib-bootstrap-options" \
> dc-version="1.1.5-01e86afaaa6d4a8c4836f68df80ababd6ca3902f" \
> cluster-infrastructure="openais" \
> expected-quorum-votes="3" \
> no-quorum-policy="ignore" \
> stonith-enabled="false" \
> last-lrm-refresh="1322049979"
Don't forget to set up stonith in a productive system when using shared
storage.
Regards,
Andreas
--
Need help with Pacemaker?
http://www.hastexo.com/now
>
>
> Thanks for your help.
>
>
> Cheers,
>
> Vadim
>
>
>
> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 286 bytes
Desc: OpenPGP digital signature
URL: <https://lists.clusterlabs.org/pipermail/pacemaker/attachments/20111124/ee4a2b56/attachment-0004.sig>
More information about the Pacemaker
mailing list