[LinuxFailSafe] failsafe do not failover ?

cqcai cqcai@163.com
Sat, 22 Mar 2003 16:34:48 +0800


Hi, Paddy:

        I checked the network interface configuration files on Red Hat 7.3
(shown below) and can't see anything wrong, but FailSafe still can't see
the addresses.  Do I need to modify the start script or something?


        Thanks a lot!

/etc/hosts:
192.168.0.143        qcai.wiregate.com
192.168.1.143        qcai.wiregate.com
192.168.0.21         UNIX.wiregate.com
192.168.1.21         UNIX.wiregate.com
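
The /etc/hosts entries above list each hostname against two addresses. Whether that plays any part in the FailSafe problem is not established anywhere in this thread. The following is only a minimal Python sketch for spotting such entries. The function name find_multi_address_names is made up for illustration, and the sample text is copied from the entries above.

```python
from collections import defaultdict

# Sample text copied from the /etc/hosts entries quoted in this message.
HOSTS_TEXT = """\
192.168.0.143        qcai.wiregate.com
192.168.1.143        qcai.wiregate.com
192.168.0.21        UNIX.wiregate.com
192.168.1.21        UNIX.wiregate.com
"""


def find_multi_address_names(hosts_text):
    """Return {hostname: [addresses]} for names listed with more than one address.

    Hypothetical helper for illustration; it is not part of FailSafe.
    """
    addresses = defaultdict(list)
    for line in hosts_text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        address, *names = line.split()
        for name in names:
            key = name.lower()  # hostnames compare case-insensitively
            if address not in addresses[key]:
                addresses[key].append(address)
    return {name: addrs for name, addrs in addresses.items() if len(addrs) > 1}


if __name__ == "__main__":
    for name, addrs in find_multi_address_names(HOSTS_TEXT).items():
        print(name, "->", ", ".join(addrs))
```

To check a live system instead of the sample, the contents of /etc/hosts can be read from disk and passed to the same function.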

 /etc/sysconfig/network*script*/eth0:0
DEVICE=eth0:0
BROADCAST=192.168.1.255
IPADDR=192.168.1.13
NETMASK=255.255.255.0
NETWORK=192.168.1.0
ONBOOT=no

 /etc/sysconfig/network*script*/eth0:10
DEVICE=eth0:10
BROADCAST=192.168.1.255
IPADDR=192.168.1.143
NETMASK=255.255.255.0
NETWORK=192.168.1.0
ONBOOT=no
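
One way to back up the "can't see anything wrong" reading of these files is to recompute NETWORK and BROADCAST from IPADDR and NETMASK. The sketch below does that with Python's standard ipaddress module. The helper names parse_ifcfg and check_ifcfg are invented for illustration, and the sample text is the eth0:10 file quoted above. It checks only the address arithmetic and says nothing about how FailSafe itself selects an interface.

```python
import ipaddress

# Sample text copied from the eth0:10 file quoted in this message.
IFCFG_ETH0_10 = """\
DEVICE=eth0:10
BROADCAST=192.168.1.255
IPADDR=192.168.1.143
NETMASK=255.255.255.0
NETWORK=192.168.1.0
ONBOOT=no
"""


def parse_ifcfg(text):
    """Parse KEY=VALUE lines into a dict (hypothetical helper)."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        settings[key.strip()] = value.strip().strip('"')
    return settings


def check_ifcfg(settings):
    """Return a list of mismatches between the stated and the computed values."""
    problems = []
    iface = ipaddress.ip_interface(
        "{}/{}".format(settings["IPADDR"], settings["NETMASK"])
    )
    expected = {
        "NETWORK": str(iface.network.network_address),
        "BROADCAST": str(iface.network.broadcast_address),
    }
    for key, value in expected.items():
        # Only compare keys that the file actually sets.
        if key in settings and settings[key] != value:
            problems.append(
                "{} is {} but {} was expected".format(key, settings[key], value)
            )
    return problems


if __name__ == "__main__":
    print(check_ifcfg(parse_ifcfg(IFCFG_ETH0_10)) or "values are consistent")
```

An empty result only means the four address fields agree with each other. It does not rule out a mismatch between these files and the interface name given in the FailSafe IP_address resource.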

That is all of the network interface configuration. After running
/etc/rc.d/init.d/network restart the interfaces all come up,
but FailSafe still can't see them:
/var/log/failsafe/script*:
Sat Mar 22 15:34:13.035 <N ha_cilog script 32449:0 Unknown:0> Sat Mar 22 
15:34:12 <N IP_address-exclusive script 32406:0> IP_address: exclusive - 
192.168.1.143
Sat Mar 22 15:34:13.180 <N ha_cilog script 32471:0 Unknown:0> Sat Mar 22 
15:34:13 <N IP_address-exclusive script 32406:0> 192.168.1.143 not 
configured on any interface
Sat Mar 22 15:34:13.281 <N ha_cilog script 32475:0 Unknown:0> Sat Mar 22 
15:34:13 <N IP_address-exclusive script 32406:0> 192.168.1.143 exclusive 
status: NOT RUNNING
Sat Mar 22 15:34:24.534 <N ha_cilog script 32525:0 Unknown:0> Sat Mar 22 
15:34:24 <N IP_address-start script 32482:0> IP_address: start - 
192.168.1.143
Sat Mar 22 15:34:24.673 <N ha_cilog script 32547:0 Unknown:0> Sat Mar 22 
15:34:24 <N IP_address-start script 32482:0> 192.168.1.143 not 
configured on any interface
Sat Mar 22 15:34:24.811 <N ha_cilog script 32561:0 Unknown:0> Sat Mar 22 
15:34:24 <N IP_address-start script 32482:0> ip address 192.168.1.143 
cannot be configured
Sat Mar 22 15:34:24.913 <N ha_cilog script 32564:0 Unknown:0> Sat Mar 22 
15:34:24 <N IP_address-start script 32482:0> Check ha_ifd logs on this 
node for more information
Sat Mar 22 15:34:36.166 <N ha_cilog script 32622:0 Unknown:0> Sat Mar 22 
15:34:36 <N IP_address-stop script 32579:0> IP_address: stop - 192.168.1.143
Sat Mar 22 15:34:36.305 <N ha_cilog script 32636:0 Unknown:0> Sat Mar 22 
15:34:36 <N IP_address-stop script 32579:0> ip address 192.168.1.143 was 
not configured
Sat Mar 22 15:34:36.405 <N ha_cilog script 32639:0 Unknown:0> Sat Mar 22 
15:34:36 <N IP_address-stop script 32579:0> Check ha_ifd logs on this 
node for more information

/var/log/failsafe/ifd*:
Sat Mar 22 15:31:53.207 <I0 ha_ifd ifd 32353:0 ifd_main.c:346> ha_ifd 
monitoring network interfaces
Sat Mar 22 15:34:24.694 <W ha_ifd ifd 32353:0 ifd_net.c:913> CI_FAILURE, 
found no interface for ip address 192.168.1.143
Sat Mar 22 15:34:24.694 <W ha_ifd ifd 32353:0 ifd_net.c:916> CI_FAILURE, 
Check IP_address 192.168.1.143 resource configuration. Check if 
interface name is valid.
Sat Mar 22 15:34:36.185 <I0 ha_ifd ifd 32353:0 ifd_main.c:751> 
CI_FAILURE, get information ipaddress 192.168.1.143 failed
Sat Mar 22 16:10:16.210 <W ha_ifd ifd 32353:0 ifd_net.c:536> interface 
eth0 has changed
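
Because the mail client wrapped the log lines, the CI_FAILURE messages are easy to miss. The sketch below, in Python, rejoins wrapped entries and keeps only the failures. The names join_wrapped_entries and ci_failures are invented for illustration, the timestamp pattern is inferred from the excerpt above and not from any FailSafe documentation, and the sample text is the ha_ifd excerpt itself.

```python
import re

# Sample text copied from the ha_ifd excerpt quoted in this message.
IFD_LOG = """\
Sat Mar 22 15:31:53.207 <I0 ha_ifd ifd 32353:0 ifd_main.c:346> ha_ifd
monitoring network interfaces
Sat Mar 22 15:34:24.694 <W ha_ifd ifd 32353:0 ifd_net.c:913> CI_FAILURE,
found no interface for ip address 192.168.1.143
Sat Mar 22 15:34:24.694 <W ha_ifd ifd 32353:0 ifd_net.c:916> CI_FAILURE,
Check IP_address 192.168.1.143 resource configuration. Check if
interface name is valid.
Sat Mar 22 15:34:36.185 <I0 ha_ifd ifd 32353:0 ifd_main.c:751>
CI_FAILURE, get information ipaddress 192.168.1.143 failed
Sat Mar 22 16:10:16.210 <W ha_ifd ifd 32353:0 ifd_net.c:536> interface
eth0 has changed
"""

# Assumed entry prefix, e.g. "Sat Mar 22 15:31:53.207 <".
ENTRY_START = re.compile(
    r"^[A-Z][a-z]{2} [A-Z][a-z]{2} [ \d]\d \d{2}:\d{2}:\d{2}\.\d{3} <"
)


def join_wrapped_entries(text):
    """Merge continuation lines back into the entry they belong to."""
    entries = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if ENTRY_START.match(line) or not entries:
            entries.append(line)  # a new log entry begins here
        else:
            entries[-1] += " " + line  # continuation of the previous entry
    return entries


def ci_failures(entries):
    """Keep only the entries that report CI_FAILURE."""
    return [entry for entry in entries if "CI_FAILURE" in entry]


if __name__ == "__main__":
    for entry in ci_failures(join_wrapped_entries(IFD_LOG)):
        print(entry)
```

On the excerpt above, every CI_FAILURE entry concerns 192.168.1.143, and one of them asks to check whether the interface name is valid.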

Padmanabhan Sreenivasan wrote:

>cqcai wrote:
>
>>Now I have started the ha_services in cmgr, and the processes seem OK,
>>but I have a new problem:
>>the IP_address resource script may be getting a wrong value.
>>srmd_qcai:
>>
>>Fri Mar 21 17:03:14.588 <E ha_srmd srm 5930:1 sa_process_tasks.c:638>
>>CI_FAILURE, ERROR: Action (start) for resource (192.168.1.13) of type
>>(IP_address): resource group (webserver-group) failed with status
>>(failed/running)
>>Fri Mar 21 17:03:14.588 <N ha_srmd srm 5930:1 sa_process_tasks.c:654>
>>Check script (/usr/lib/failsafe/resource_types/IP_address/start) logs on
>>this node for more error information
>>
>
>
>Check script logs (/var/log/failsafe/script*) for errors
>from the IP_address start script. This might be a configuration
>problem. Check if interface name for 192.168.1.13 is correct.
>
>Paddy
>
>
>>Fri Mar 21 17:03:14.593 <N ha_srmd srm 5860:0 sr_main.c:941> Allocation
>>error: Resource name webserver-group type __binding status failed/running
>>Fri Mar 21 17:03:14.594 <I0 ha_srmd srm 5860:0 sr_main.c:944> Allocate
>>error: Resource state 5 error 8
>>Fri Mar 21 17:03:20.339 <I0 ha_srmd srm 5931:2 sc_reply.c:133> Allocate
>>request reply done
>>
>>Resource name: webserver-group
>>resource type:    __binding
>>resource status:  failed/running
>>
>>Padmanabhan Sreenivasan wrote:
>>
>>>cqcai wrote:
>>>
>>>>Hi everybody, I am a FailSafe newbie and want to install and try
>>>>Linux FailSafe on Red Hat 7.3. Linux FailSafe has now been installed
>>>>successfully and configured, but the simple 2-node HA system does
>>>>not fail over at all.
>>>>
>>>>Can anyone help me with that?
>>>>
>>>Check ifd logs for IP address errors. Did you bring
>>>resource groups online?
>>>
>>>Paddy
>>>