[Linux-ha-dev] LRMd exits with no warning (+ logging preferences)

Sun Jiang Dong hasjd at cn.ibm.com
Mon Dec 13 02:22:22 MST 2004



Andrew Beekhof wrote:
> On Mon, 13 Dec 2004 15:19:09 +0800, Sun Jiang Dong <hasjd at cn.ibm.com> wrote:
> 
>>
>>
>>Andrew Beekhof wrote:
>>
>>>I'm still getting the same behavior as the other day...
>>>
>>>Dec 10 12:58:07 c001n09 heartbeat[3352]: info: Respawning client
>>>"/usr/lib/heartbeat/lrmd":
>>>Dec 10 12:58:07 c001n09 heartbeat[3352]: info: Starting child client
>>>"/usr/lib/heartbeat/lrmd" (0,0)
>>>Dec 10 12:58:08 c001n09 heartbeat[4006]: info: Starting
>>>"/usr/lib/heartbeat/lrmd" as uid 0  gid 0 (pid 4006)
>>>Dec 10 12:58:08 c001n09 heartbeat[3352]: WARN: Exiting
>>>/usr/lib/heartbeat/lrmd process 4006 returned rc 127.
>>>Dec 10 12:58:08 c001n09 heartbeat[3352]: ERROR: Client
>>>/usr/lib/heartbeat/lrmd "respawning too fast"
>>
>>I can't reproduce it. I need more information, such as the CVS
>>version/tag of Linux-HA.
> 
> 
> Version tag is "HEAD"
> 
> Did you try starting the LRM without the fencing daemon running?
> 
> For now I'm deleting the stonith plugin after install to avoid this.
Ok. On x86 it seems the lrm doesn't crash, although there are the
expected error messages. I'll check the stonith plugin related code.
BTW, do you encounter the crash on Mac OS?
> 
> 
>>>3 points:
>>>- (If you ask me) running the LRM without the stonithd running should be
>>>supported
>>>
>>
>>Did this only start happening recently, after the stonithd related
>>code was checked into the CVS repository?
> 
> 
> Yes.
> 
> 
>>>- If something is bad enough that the LRM needs to exit, it's bad
>>>enough that the logs should say why.
>>>
>>>- If this is a permanent error from which there is no hope of
>>>recovery, ever, then please consider exiting with rc = 100.  That way
>>>HA doesn't waste time re-spawning the LRMd over and over.
>>
>>I guess something unexpected happens in the lrmd, so no log is written,
>>and rc != 100.
> 
> 
> If it was segfaulting I'd expect to see a core somewhere and if
> someone's calling exit() then there absolutely needs to be an error log
> statement saying so.
> 
> 
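To make the two points above concrete, here is a minimal sketch of
"log the reason, then exit", assuming plain syslog() is acceptable for
the message. The names exit_code_for() and die_logged() are
hypothetical and are not existing lrmd or heartbeat functions; the
value 100 is the one suggested in this thread for "do not respawn".

```c
#include <stdarg.h>
#include <stdio.h>
#include <stdlib.h>
#include <syslog.h>

/* Exit code proposed in this thread for "permanent error, do not respawn". */
#define RC_NO_RESPAWN 100

/* Hypothetical helper: choose the exit code for a fatal error. */
int exit_code_for(int permanent)
{
    return permanent ? RC_NO_RESPAWN : EXIT_FAILURE;
}

/* Hypothetical helper: write the reason to syslog, then exit. */
void die_logged(int permanent, const char *fmt, ...)
{
    char msg[512];
    va_list ap;

    va_start(ap, fmt);
    vsnprintf(msg, sizeof msg, fmt, ap);   /* format the reason */
    va_end(ap);

    /* Always leave a trace in the logs before going away. */
    syslog(LOG_ERR, "lrmd exiting (rc=%d): %s",
           exit_code_for(permanent), msg);
    exit(exit_code_for(permanent));
}
```

A caller that hits an unrecoverable condition might then write
die_logged(1, "cannot load plugin %s", name), where "name" is again
only an illustration.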
>>>On a related note, there really needs to be a way for the LRMd to use
>>>the same logging preferences as everyone else does.  Currently this is a
>>>little hard given that it doesn't (and pretty much has no need to)
>>>connect to HA itself.
>>>
>>>Perhaps a common cl_ function for determining the logging preferences
>>>directly from ha.cf is required?  (Yes I can write it too, I first
>>>wanted to know what people think of the idea).
>>>
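As a rough illustration of the idea above, reading the logging
preferences directly from ha.cf could look something like the sketch
below. It only assumes the ha.cf directives logfacility, logfile and
debugfile; struct log_prefs, parse_log_line() and read_log_prefs() are
hypothetical names, not an existing cl_ API, and inline comments or
other ha.cf syntax are not handled.

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical container for the settings; not an existing heartbeat type. */
struct log_prefs {
    char facility[64];
    char logfile[256];
    char debugfile[256];
};

/* Copy src into a fixed-size field, always NUL-terminating. */
static void set_field(char *dst, size_t dstlen, const char *src)
{
    snprintf(dst, dstlen, "%s", src);
}

/* Parse one ha.cf line; returns 1 if it was a logging directive. */
int parse_log_line(const char *line, struct log_prefs *p)
{
    char key[32];
    char val[256];

    line += strspn(line, " \t");            /* skip leading whitespace */
    if (*line == '#')
        return 0;                           /* comment line */
    if (sscanf(line, "%31s %255s", key, val) != 2)
        return 0;                           /* blank line or no value */

    if (strcmp(key, "logfacility") == 0)
        set_field(p->facility, sizeof p->facility, val);
    else if (strcmp(key, "logfile") == 0)
        set_field(p->logfile, sizeof p->logfile, val);
    else if (strcmp(key, "debugfile") == 0)
        set_field(p->debugfile, sizeof p->debugfile, val);
    else
        return 0;                           /* some other directive */
    return 1;
}

/* Read all logging directives from path; returns 0 on success, -1 on error. */
int read_log_prefs(const char *path, struct log_prefs *p)
{
    char line[512];
    FILE *fp = fopen(path, "r");

    if (fp == NULL)
        return -1;
    memset(p, 0, sizeof *p);
    while (fgets(line, sizeof line, fp) != NULL)
        parse_log_line(line, p);
    fclose(fp);
    return 0;
}
```

The LRMd could call read_log_prefs() once at startup with the path to
ha.cf and fall back to its current defaults if the call returns -1.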
>>>Andrew
>>>
>>>--
>>>Andrew Beekhof
>>>
>>>"Would the last person to leave please turn out the enlightenment?" - TISM
>>>
>>>_______________________________________________________
>>>Linux-HA-Dev: Linux-HA-Dev at lists.linux-ha.org
>>>http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev
>>>Home Page: http://linux-ha.org/
>>>
>>
>>--
>>BRs,
>>
>>Sun Jiang Dong
>>
>>
>>
> 
> 

-- 
BRs,

Sun Jiang Dong

