|
Hello list
Im trying to setup a 3 nodes Cluster with 2 failover Domain for an HA
mail solution.
I want 1 run active for the Imap server in the Imap Failover domain , 1
node active for the Smtp in the Smtp Failover domain and the 3rd in the
2 failover domain as a backup node.
I run Centos 5.3
My fence device is a wti power switch
My cluster.conf is in attachement
My SMTP service is composed of:
1 IP
1 amavisd scritp
1 postfix script
2 NFS mount for postfix and amavis
If I manually kill the postfix master process (to simulate a crash), my
node is not fence and the logs said:
Jul 6 10:00:40 centos-smtp1 clurgmgrd: [4228]: <info Executing
/etc/init.d/postfix status
Jul 6 10:00:40 centos-smtp1 clurgmgrd: [4228]: <err script:postfix:
status of /etc/init.d/postfix failed (returned 3)
Jul 6 10:00:40 centos-smtp1 clurgmgrd[4228]: <notice status on script
"postfix" returned 1 (generic error)
Jul 6 10:00:40 centos-smtp1 clurgmgrd[4228]: <notice Stopping service
service:Postfix
Jul 6 10:00:40 centos-smtp1 clurgmgrd: [4228]: <info Executing
/etc/init.d/amavisd stop
Jul 6 10:00:40 centos-smtp1 kernel: do_vfs_lock: VFS is out of sync
with lock manager!
Jul 6 10:00:40 centos-smtp1 last message repeated 8 times
Jul 6 10:00:41 centos-smtp1 clurgmgrd: [4228]: <info Executing
/etc/init.d/postfix stop
Jul 6 10:00:41 centos-smtp1 clurgmgrd: [4228]: <err script:postfix:
stop of /etc/init.d/postfix failed (returned 1)
Jul 6 10:00:41 centos-smtp1 clurgmgrd[4228]: <notice stop on script
"postfix" returned 1 (generic error)
Jul 6 10:00:41 centos-smtp1 clurgmgrd: [4228]: <info Removing IPv4
address 195.83.126.201/24 from bond0
Jul 6 10:00:41 centos-smtp1 avahi-daemon[3552]: Withdrawing address
record for 195.83.126.201 on bond0.
Jul 6 10:00:51 centos-smtp1 clurgmgrd: [4228]: <info unmounting
/var/lib/amavis
Jul 6 10:00:51 centos-smtp1 clurgmgrd: [4228]: <info unmounting
/var/spool/postfix
Jul 6 10:00:51 centos-smtp1 clurgmgrd[4228]: <crit #12: RG
service:Postfix failed to stop; intervention required
Jul 6 10:00:51 centos-smtp1 clurgmgrd[4228]: <notice Service
service:Postfix is failed
Jul 6 10:00:52 centos-smtp1 ntpd[3322]: synchronized to 195.83.126.119,
stratum 1
Clustat said:
Cluster Status for cluster-test @ Mon Jul 6 10:02:39 2009
Member Status: Quorate
Member Name ID Status
------ ---- ---- ------
centos-imap1.ill.fr 1
Online, Local, rgmanager
centos-imap2.ill.fr 2
Online, rgmanager
centos-smtp1.ill.fr 3
Online, rgmanager
/dev/disk/by-id/scsi-360a98000567247514634507447594661-part1 0
Online, Quorum Disk
Service Name Owner
(Last) State
------- ---- -----
------ -----
service:Imap
centos-imap2.ill.fr started
service:Postfix
(centos-smtp1.ill.fr) failed
So I have to disable the Postfix servcie with:
clusvcadm -d Postfix
and re-enable
clusvcadm -e Postfix
Could you explain my why my original smtp node is not fenced and why my
service is not start on the 2nd node ???
Is there a way to force the fencing ???
--
ARMANET Stephane
Division Projet Technique
Service Informatique
Groupe Infrastructure
Institut Laue langevin
<?xml version="1.0"?
<cluster alias="cluster-test" config_version="57" name="cluster-test"
<fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/
<clusternodes
<clusternode name="centos-imap1.test.fr" nodeid="1" votes="1"
<fence
<method name="1"
<device name="psu2" option="off" port="1"/
<device name="psu2" option="off" port="5"/
</method
<method name="2"
<device name="centos_manual-fence" nodename="centos-imap1.test.fr"/
</method
</fence
</clusternode
<clusternode name="centos-imap2.test.fr" nodeid="2" votes="1"
<fence
<method name="1"
<device name="psu2" option="off" port="2"/
<device name="psu2" option="off" port="6"/
</method
<method name="2"
<device name="centos_manual-fence" nodename="centos-imap2.test.fr"/
</method
</fence
</clusternode
<clusternode name="centos-smtp1.test.fr" nodeid="3" votes="1"
<fence
<method name="1"
<device name="psu1" option="off" port="1"/
<device name="psu1" option="off" port="5"/
</method
<method name="2"
<device name="centos_manual-fence" nodename="centos-smtp1.test.fr"/
</method
</fence
</clusternode
</clusternodes
<cman/
<fencedevices
<fencedevice agent="fence_manual" name="centos_manual-fence"/
<fencedevice agent="fence_wti" ipaddr="192.168.0.215" name="psu1" passwd="passwd"/
<fencedevice agent="fence_wti" ipaddr="192.168.0.216" name="psu2" passwd="passwd"/
</fencedevices
<rm log_facility="local4" log_level="7"
<failoverdomains
<failoverdomain name="imap-FOD" nofailback="0" ordered="1" restricted="1"
<failoverdomainnode name="centos-imap1.test.fr" priority="1"/
<failoverdomainnode name="centos-imap2.test.fr" priority="2"/
</failoverdomain
<failoverdomain name="smtp-FOD" ordered="1" restricted="1"
<failoverdomainnode name="centos-smtp1.test.fr" priority="1"/
<failoverdomainnode name="centos-imap2.test.fr" priority="2"/
</failoverdomain
</failoverdomains
<resources
<netfs export="/vol/volSMTP/postfix" force_unmount="1" fstype="nfs" host="romulus.test.fr" mountpoint="/var/spool/postfix" name="NFS-postfix" options="rw,nolock"/
<fs device="/dev/mapper/vgMail-lvMailboxes" force_fsck="1" force_unmount="1" fsid="34650" fstype="ext3" mountpoint="/var/spool/imap" name="lvMailboxes" options="commit=1" self_fence="1"/
<fs device="/dev/mapper/vgMail-lvDBMail" force_fsck="1" force_unmount="1" fsid="4277" fstype="ext3" mountpoint="/var/lib/imap" name="lvDBMail" options="commit=1" self_fence="1"/
<netfs export="/vol/volSMTP/amavis" force_unmount="1" fstype="nfs" host="romulus.test.fr" mountpoint="/var/lib/amavis" name="NFS Amavis" options=""/
</resources
<service autostart="1" domain="imap-FOD" name="Imap" recovery="relocate"
<ip address="192.168.0.198/24" monitor_link="1"
<script file="/etc/init.d/cyrus-imapd" name="Cyrus-imapd"/
</ip
<fs ref="lvMailboxes"/
<fs ref="lvDBMail"/
</service
<service autostart="1" domain="smtp-FOD" name="Postfix" recovery="relocate"
<ip address="192.168.0.201" monitor_link="1"
<script file="/etc/init.d/postfix" name="postfix"/
<script file="/etc/init.d/amavisd" name="amavisd"/
</ip
<netfs ref="NFS-postfix"/
<netfs ref="NFS Amavis"/
</service
</rm
<quorumd interval="2" label="QDISK" min_score="1" tko="5" votes="2"
<heuristic interval="2" program="/bin/ping -c 1 -t 1 192.168.0.254" score="1"/
<heuristic interval="5" program="/bin/ping -c 3 -t 1 192.168.0.254" score="1"/
</quorumd
<totem consensus="4800" join="60" token="25000" token_retransmits_before_loss_const="20"/
</cluster
--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
rel="nofollow" www.redhat.com/mailman/listinfo/linux-cluster www.redhat.com/mailman/listinfo/linux-cluster
|