[Linux-HA] Service not starting?
alanr at unix.sh
Sun Mar 14 16:24:38 MST 2004
John Hearns wrote:
> I have a bit of a peculiar problem.
> I have done lots of testing though, so am not crying
> Wolf straight away.
> I have a service that will not start.
> It is the rcsge script to start Sun Gridengine.
> On startup, the ha-log says:
> Acquiring resource group: hostname .... rcsge ....
> ie. the rcsge is in the list
> The services start up one after the other, but rcsge isn't there.
> When I put an 'exit 1' at the start of the rcsge script,
> it is logged, with ERROR: Return code 1 from /etc/init.d/rcsge
> which seems eminently sensible to me.
> When I start rcsge by hand, the service starts.
> I'm a bit puzzled. Is there a timeout set for services started
> by heartbeat? Am I just lost in space?
There is no timeout. The code for starting services is very straightforward.
In general, sometimes scripts which start fine by hand don't start when
started by programs. I've had this happen with cron jobs, and other things
Examples of things that your script might rely on:
a controlling tty
writable current directory
I would put this at the start of your start action
exec >/some/logging/path 2>&1
This looks a little odd, but it's correct (except, of course, for the
Or you can do it like this if it makes you happier:
start action stuff...
) >/some/logging/path 2>&1
This will probably be helpful in diagnosing your start script.
Alan Robertson <alanr at unix.sh>
"Openness is the foundation and preservative of friendship... Let me claim
from you at all times your undisguised opinions." - William Wilberforce
More information about the Linux-HA