[Linux-HA] Reboot if network or service fail
nunotavares at hotmail.com
Mon Jun 7 01:39:17 MDT 2004
Greetings to all.
This is the situation: I use to manage systems remotely (as probably most
of us), but I'm a really distracted guy and sometimes I start playing
around with the network and (guess what) I loose my connection. Then I'll
have to wait until I have physical access to the machines.
I'd like to set heartbeat to detect when it has lost contact with the
outside network, instead of a specific host, and reboot if confirmed. So
if I mess with network parameters, it will reboot.
Moreover, another critical service that has to be monitored is SSH daemon.
So if I do a "service sshd restart" and it doesn't restart the daemon, I'd
like heartbeat to failback a standard configuration.
So, my questions are:
1) how to monitor+react network connectivity (instead of peer's)
2) how to monitor services (may involve creating a heartbeat script to
rollback to a service "safe-mode")
More information about the Linux-HA