[Linux-HA] problem with ipfail pingd

holgi hwoehle at arcor.de
Sat Dec 29 15:38:24 MST 2007


Hi,
i am testing a migration from heartbeat v1. to heartbeat v2.
In V2 ipfail e.g. pingd don't work as i expected.
Following is my ha.cf:

debugfile /var/log/ha-debug
logfile /var/log/ha-log
logfacility     local0
keepalive 2
deadtime 30
warntime 10
initdead 90
udpport 694
bcast   eth0 eth1     
auto_failback on
watchdog /dev/watchdog
node master
node slave
#apiauth        ping gid=root uid=root
#respawn root /usr/lib/heartbeat/pingd -m 100 -d 5s
respawn hacluster /usr/lib/heartbeat/ipfail
ping 192.168.1.1
crm     yes

since you can see, i tried it with both pingd and ipfail.

This is a cut-off the ha-debug logfile

startup:
heartbeat[21610]: 2007/12/29_18:21:44 info: glib: ping heartbeat started.
heartbeat[21610]: 2007/12/29_18:21:44 notice: Using watchdog device: 
/dev/watchdog
heartbeat[21610]: 2007/12/29_18:21:44 info: G_main_add_SignalHandler: 
Added signal handler for signal 17
heartbeat[21610]: 2007/12/29_18:21:45 info: Local status now set to: 'up'
heartbeat[21610]: 2007/12/29_18:21:46 info: Link master:eth0 up.
heartbeat[21610]: 2007/12/29_18:21:46 info: Link master:eth1 up.
heartbeat[21610]: 2007/12/29_18:21:46 info: Link 192.168.1.1:192.168.1.1 up.
heartbeat[21610]: 2007/12/29_18:21:46 info: Status update for node 
192.168.1.1: status ping
heartbeat[21610]: 2007/12/29_18:22:07 info: Link slave:eth0 up.
heartbeat[21610]: 2007/12/29_18:22:07 info: Status update for node 
slave: status up
heartbeat[21610]: 2007/12/29_18:22:07 info: Link slave:eth1 up.

cutting the cabel:
heartbeat[21610]: 2007/12/29_18:25:35 WARN: node 192.168.1.1: is dead
heartbeat[21610]: 2007/12/29_18:25:35 info: Link slave:eth0 dead.
heartbeat[21610]: 2007/12/29_18:25:35 info: Link 192.168.1.1:192.168.1.1 
dead.
crmd[21627]: 2007/12/29_18:25:35 notice: crmd_ha_status_callback: Status 
update: Node 192.168.1.1 now has status [dead]
crmd[21627]: 2007/12/29_18:25:36 WARN: get_uuid: Could not calculate 
UUID for 192.168.1.1
crmd[21627]: 2007/12/29_18:25:36 info: crmd_ha_status_callback: Ping 
node 192.168.1.1 is dead

but nothing else happens....

Under Version V1, node2 (slave) graps all resources as expected.

Please, can someone be so kind and point that out for me ?

with kind regards
holgi






More information about the Linux-HA mailing list