AW: [LinuxFailSafe] startup of resource groups with one node down

Martin Bene martin.bene@icomedias.com
Tue, 7 May 2002 20:40:29 +0200


> Von: Padmanabhan Sreenivasan [mailto:paddy@sgi.com], 07. Mai 2002 =
01:20

> FailSafe notion of tie-breaker is different. Tiebreaker node=20
> gets the first chance to reset other node in a two node cluster=20
> in case of network partition.
> If only one node in the cluster is operational, start HA services only =
on that
> node. When the other node is available, you can start HA services on =
that node. It
> should rejoin the cluster.

Thanks for that hint. I didn't realize the results of starting ha =
services for the cluster or starting it for just one node are quite =
different if only one node is available at startup:

start ha services for cluster:=20
	* membership only comes up if tiebreaker node is available,=20
	* unavailable node gets reset
	* node status is 1x UP, 1x DOWN
	* bringing up resource groups fails.

start up ha services just for the available node:=20
	* membership comes up regardles of tiebreaker node
	* unavailable node doesn't get reset
	* node status is 1x UP, 1x inactive
	* bringing up resource groups works.

I feel much better now that I have known-to-work recipiece for bringing =
up services even from "both-nodes-down, one node kaputt" status.=20

That said, I still don't understand where the first case differs from =
starting up with both nodes present and then loosing one :-)

Thanks again, Martin