[Linux-HA] Don't get back resource again
Andrew Beekhof
beekhof at gmail.com
Thu Jul 19 04:30:25 MDT 2007
On 7/19/07, Adrian Chapela <achapela.rexistros at gmail.com> wrote:
> Andrew Beekhof wrote:
> > then you want:
> >
> > default_resource_failure_stickiness=-INFINITY (if it fails, move
> > immediately), and
> >
> > default_resource_stickiness=INFINITY (don't move it unless it fails or
> > the node is shutting down or similar)
> But if I assign default_resource_stickiness=INFINITY, won't the resource
> group run on the first heartbeat node available?
it has no relation to the initial placement (since it's not yet running)
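For reference, a minimal sketch of how the two defaults quoted above might
appear in the crm_config section of the CIB. It assumes the Heartbeat 2.x
nvpair layout; the id values are made up for illustration, and the exact
option spelling (underscores vs. dashes) may differ between versions, so
check it against your own cib.xml:

```xml
<crm_config>
  <cluster_property_set id="cib-bootstrap-options">
    <attributes>
      <!-- hypothetical ids; only the name/value pairs come from this thread -->
      <!-- once running, stay put unless forced to move -->
      <nvpair id="example-stickiness"
              name="default_resource_stickiness" value="INFINITY"/>
      <!-- a failure on a node pushes the resource off that node -->
      <nvpair id="example-failure-stickiness"
              name="default_resource_failure_stickiness" value="-INFINITY"/>
    </attributes>
  </cluster_property_set>
</crm_config>
```

Neither value decides where the resource starts for the first time; that
is still determined by the rsc_location scores.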
> I want to run a resource on one node and then, if the network fails or
> the resource fails, the resource group must be moved to the secondary node.
Right
>
> Currently I assign a score of 200 to a node with a rule. Could this rule
> be the problem? The rule is:
not really a problem, but it will of course have an effect
> <rsc_location id="my_resource:connected" rsc="MySQL_GROUP">
>   <rule id="my_resource:prefer:portatil" score="200">
>     <expression id="my_resource:prefer:portatil:expr"
>       attribute="#uname" operation="eq" value="portatil"/>
>   </rule>
>   <rule id="my_resource:connected:rule" score="-INFINITY"
>     boolean_op="or">
>     <expression id="my_resource:connected:expr:undefined"
>       attribute="pingd" operation="not_defined"/>
>     <expression id="my_resource:connected:expr:zero"
>       attribute="pingd" operation="lte" value="0"/>
>   </rule>
> </rsc_location>
>
> (I will test your theory now.)
> >
> >
> >> To do that I configured the following in the CIB:
> >>
> >> default_resource_failure_stickiness=-INFINITY -> with this, the resource
> >> doesn't run on any node.
> >
> > has it failed once on every node?
> No, I don't think so.
well the only time default_resource_failure_stickiness has any effect
on a node's score is if the resource has failed on that node.
unless it's the pingd rule that is preventing some of the nodes from
taking over the resource
> > you'll need to clear out the failcount before it will be allowed to
> > run there again
> Is this so the resource can run on a node which has a failcount > 0?
> How can I find out the failcount value for a node?
crm_failcount --help