[Linux-HA] ipfail did not triggered failover
pavol_gono at yahoo.com
Thu Aug 4 03:53:16 MDT 2005
I have looked at logs once again. In second error situation it
is clear - our faulty resource script was blocked and this was
the reason why failover was waiting. You can see
"Aug 3 18:47:45 Hig50v3Ps heartbeat: debug: Starting
but there wasn't "...start_hiq30 start done"
But please look once again at first error situation (Aug 3
17:52:27 Hig50v3Pm). There is no previous unfinished resource
script, and I fear there is another bigger problem.
Unfortunately, I don't have state of processes from this
situation, at Aug 3 18:25:32 m-machine was restarted. What can
be reason for failover hanging?
--- Alan Robertson <alanr at unix.sh> wrote:
> Pavol Gono wrote:
> > After killing this ssh, failover was immediately in
> > Is it possible that blocking resource processes can block
> > whole failover?
> It is not merely possible. It is certain.
> Some things take a long time to stop. Heartbeat is patient.
> Alan Robertson <alanr at unix.sh>
Start your day with Yahoo! - make it your home page
More information about the Linux-HA