[Linux-HA] Can't get the ping_group to work - problem with location constraints and DRBD

Tobias Appel tappel at eso.org
Fri Jan 30 05:12:09 MST 2009


On Fri, 2009-01-30 at 12:16 +0100, Dejan Muhamedagic wrote:
> Hi,
> 
> On Fri, Jan 30, 2009 at 11:35:54AM +0100, Tobias Appel wrote:
> > Hi,
> > 
> > I'm using Heartbeat 2.1.4 and I tried to set up a ping_group so that,
> > if I pull the Ethernet cables on one of the two nodes, a failover occurs.
> > 
> > I have configured a ping_group group 1 in ha.cf and created a clone
> > resource with pingd.
> > 
> > The problem is, I have a resource group plus a master/slave resource for
> > DRBD. The resource group only runs on the node which is currently
> > master. Now I added another constraint for pingd so that the failover
> > occurs, but it does not work: nothing happens. I think it has to
> > do with the DRBD resource. Here are my current constraints:
> > 
> >  <constraints>
> >    <rsc_order id="order_drbd_filesys" from="nagios" action="start"
> > to="ms_drbd" to_action="promote"/>
> >    <rsc_colocation id="colocation_drbd_filesys" to_role="Master"
> > from="nagios" to="ms_drbd" score="INFINITY"/>
> >    <rsc_location id="location_resources" rsc="ms_drbd">
> >      <rule id="prefered_location_resources" score_attribute="pingd">
> >        <expression attribute="pingd"
> > id="66155d1f-2210-45ad-9010-a14e48825ead" operation="defined"/>
> >      </rule>
> >    </rsc_location>
> >  </constraints>
> > 
> > I'm not quite sure how to change the location constraint to do the
> > following:
> > - if you are the active node and the ping group is not reachable, stop
> >   resource group "nagios"
> > - promote the other node to master for ms_drbd and start resource group
> >   "nagios" there
> 
> Add role=Master to the pingd location constraint. I think this
> was also described in the drbd howto.

I have read the DRBD howtos on linux-ha.org, but they only cover the
basic setup: DRBD with a resource group, not pingd combined with DRBD
and a resource group. I added role="Master" to the rule in the
constraint, but it still does not work:

<rsc_location id="location_resources" rsc="ms_drbd">
  <rule id="prefered_location_resources" score_attribute="pingd"
        role="Master">
    <expression attribute="pingd"
                id="66155d1f-2210-45ad-9010-a14e48825ead"
                operation="defined"/>
  </rule>
</rsc_location>
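
A pattern often shown for pingd is to forbid the role outright when
connectivity is lost, instead of adding the pingd value as a score. The
following is only a minimal sketch, untested against this 2.1.4
configuration. It assumes the attribute is named "pingd" (as in the
constraint above), and the ids are made up for illustration:

```xml
<!-- Sketch only: hypothetical ids; assumes pingd publishes an attribute named "pingd". -->
<rsc_location id="ms_drbd_master_connected" rsc="ms_drbd">
  <!-- Never run the Master role on a node where pingd is undefined or 0. -->
  <rule id="ms_drbd_master_connected_rule" role="Master"
        score="-INFINITY" boolean_op="or">
    <expression id="ms_drbd_master_connected_undefined"
                attribute="pingd" operation="not_defined"/>
    <expression id="ms_drbd_master_connected_zero"
                attribute="pingd" operation="lte" value="0"/>
  </rule>
</rsc_location>
```

With the existing colocation of "nagios" on the ms_drbd master, the
resource group would be expected to follow the master to the other node,
but that is an assumption and not something verified here.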

Btw, I did find some useful info here (even though it is for Pacemaker):
http://www.clusterlabs.org/wiki/DRBD_HowTo_1.0

But like I said, it still does not work. Heartbeat neither stops the
resource group nor promotes the other node to master. The logfile
still says:

Jan 30 12:07:07 nagios2 pengine: [3758]: info: master_color: Promoting resource_drbd:0 (Master nagios2)
Jan 30 12:07:07 nagios2 pengine: [3758]: info: master_color: ms_drbd: Promoted 1 instances of a possible 1 to master
Jan 30 12:07:07 nagios2 pengine: [3758]: info: master_color: ms_drbd: Promoted 1 instances of a possible 1 to master
Jan 30 12:07:07 nagios2 pengine: [3758]: notice: NoRoleChange: Leave resource resource_filesys  (Started nagios2)

And so on...

One would not believe how complicated the most basic check for network
connectivity can be. *sigh*

> 
> > The logfile looks like this (especially line 3 mentions the problem with
> > DRBD master):
> > 
> > Jan 30 10:24:12 nagios2 pengine: [3758]: WARN: text2task: Unsupported
> > action: status
> 
> You should use monitor, not status. I don't know what your
> configuration looks like.
> 
I know, but I don't even know what text2task is. I don't have any
resource by that name, and Google didn't help much: apparently many
people have the same error message in their logfiles, but I haven't
figured out what kind of software it is.


