[Linux-ha-dev] Re: Fw: [Bug 196] IPC error in starting of CRM
Andrew Beekhof
beekhof at gmail.com
Sun Dec 26 04:32:29 MST 2004
Though annoying for us, it's not really that odd. Unless you're
running the same number of processors at the same speed as me, with
the same amount of RAM and the same memory and CPU usage patterns from
other programs on the box... all these things can make a bug
more or less visible.
Though I hate to admit it... more than likely I'm doing something evil
with memory (de)allocations.
Can you try running with Alan's memory checking code turned on please?
You may need to have a look in cl_malloc.c to see what needs to be
defined - I forget now and don't have access to CVS.
Andrew
On Thu, 23 Dec 2004 17:29:08 +0800, Huang Zhen <zhenhltc at cn.ibm.com> wrote:
> Hi,
>
> It is strange that neither of you can reproduce it.
> I am testing on SLES9. What's your platform?
> I also checked out the whole tree and reconfigured it before testing.
>
> BTW, would you please send email to this email account?
>
> Zhen Huang wrote:
> > Best Regards,
> > Huang Zhen
> > LTC and pLinux Testing
> > IBM China Software Development Lab, Beijing
> > Telno: (8610)82782244-2845
> >
> >
> > ----- Forwarded by Zhen Huang/China/IBM on 2004-12-23 08:43 -----
> >
> > Guochun Shi <gshi at ncsa.uiuc.edu> wrote on 2004-12-23 04:27:44:
> >
> > > I cannot reproduce it on my machines.
> > >
> > > There are error messages in your message file about a failed
> > > connection to stonithd.
> > > Dec 23 00:55:36 hadev1 lrmd[10854]: ERROR: get_resource_list: begin.
> > > Dec 23 00:55:36 hadev1 lrmd[10854]: WARN: initiate_connection: connect failure: Connection refused
> > > Dec 23 00:55:36 hadev1 lrmd[10854]: ERROR: stonithd_signon: Can't initiate connection to stonithd
> > > Dec 23 00:55:36 hadev1 lrmd[10854]: ERROR: Can not signon to the stonithd.
> > > Dec 23 00:55:36 hadev1 crmd[10857]: debug: (cib_pluralSection [8]) Plural of node_state is status
> > >
> > > These error messages look suspicious to me, although Andrew said that
> > > they are fatal.
> > > I did not get those error messages in my test. My ha.cf file is
> > > similar to yours and I used
> > > cvs HEAD. There must be some difference in our testing. At least in
> > > my test lrmd did not try to connect to stonithd :)
> > > Please let me know anything I might have missed in reproducing the
> > > error. Thanks
> > >
> > > -Guochun
> > >
> > >
> > > At 03:55 AM 12/22/2004 -0800, you wrote:
> > > >http://www.osdl.org/developer_bugzilla/show_bug.cgi?id=196
> > > >
> > > >zhenhltc at cn.ibm.com changed:
> > > >
> > > >           What    |Removed                     |Added
> > > >----------------------------------------------------------------------------
> > > >  Attachment #56 is|0                           |1
> > > >           obsolete|                            |
> > > >
> > > >
> > > >
> > > >------- Additional Comments From zhenhltc at cn.ibm.com 2004-12-22 03:55 -------
> > > >Created an attachment (id=57)
> > > > --> (http://www.osdl.org/developer_bugzilla/attachment.cgi?id=57&action=view)
> > > >dump.tar.gz
> > > >
> > > >
> > > >
> > > >
> > > >------- You are receiving this mail because: -------
> > > >You are on the CC list for the bug, or are watching someone who is.
> > >
> >
>
> --
>
> Best Regards,
> Huang Zhen
> LTC and pLinux Testing
> IBM China Software Development Lab, Beijing
> Telno: (8610)82782244-2845
>
> _______________________________________________________
> Linux-HA-Dev: Linux-HA-Dev at lists.linux-ha.org
> http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev
> Home Page: http://linux-ha.org/
>