[Linux-HA] Late heartbeats with heartbeat 2.0.8

Matt Wilder grewaru at gmail.com
Thu Jul 5 10:00:04 MDT 2007


I enabled logd and am having the same problem.  Below is updated information
from syslog, ha_logd.cf and my ha.cf

ha.cf:
bcast em0
use_logd yes
keepalive 5
warntime 10
deadtime 20
initdead 40
auto_failback off
node sparky1.domainit.com
node sparky2.domainit.com
respawn hacluster /usr/local/lib/heartbeat/ipfail

ha_logd.cf:
debugfile /var/log/ha-debug
logfile /var/log/ha-log
logfacility     local7


Syslog:
Jul  5 07:57:43 sparky1 heartbeat: [1450]: WARN: Late heartbeat: Node
sparky1.domainit.com: interval 51000 ms
Jul  5 07:57:43 sparky1 heartbeat: [1450]: WARN: Gmain_timeout_dispatch:
Dispatch function for send local status took too long to execute: 45992 ms
(> 2510 ms) (GSource: 0x5e6018)
Jul  5 07:57:43 sparky1 heartbeat: [1450]: WARN: Gmain_timeout_dispatch:
Dispatch function for send local status was delayed 40992 ms (> 2510 ms)
before being called (GSource: 0x5e6018)
Jul  5 07:57:43 sparky1 heartbeat: [1450]: WARN: Gmain_timeout_dispatch:
Dispatch function for check for signals was delayed 46000 ms (> 2510 ms)
before being called (GSource: 0x5e6818)
Jul  5 07:57:43 sparky1 heartbeat: [1450]: WARN: Gmain_timeout_dispatch:
Dispatch function for update msgfree count was delayed 45968 ms (> 20000 ms)
before being called (GSource: 0x5e6a18)
Jul  5 07:57:43 sparky1 heartbeat: [1450]: WARN: Gmain_timeout_dispatch:
Dispatch function for client audit was delayed 44523 ms (> 5000 ms) before
being called (GSource: 0x5e6618)
Jul  5 07:57:46 sparky1 heartbeat: [1450]: WARN: Late heartbeat: Node
sparky1.domainit.com: interval 30062 ms
Jul  5 07:57:46 sparky1 heartbeat: [1450]: WARN: Gmain_timeout_dispatch:
Dispatch function for send local status took too long to execute: 25054 ms
(> 2510 ms) (GSource: 0x5e6018)
Jul  5 07:57:46 sparky1 heartbeat: [1450]: WARN: Gmain_timeout_dispatch:
Dispatch function for send local status was delayed 20054 ms (> 2510 ms)
before being called (GSource: 0x5e6018)
Jul  5 07:57:46 sparky1 heartbeat: [1450]: WARN: Gmain_timeout_dispatch:
Dispatch function for check for signals was delayed 25062 ms (> 2510 ms)
before being called (GSource: 0x5e6818)
Jul  5 07:57:46 sparky1 heartbeat: [1450]: WARN: Gmain_timeout_dispatch:
Dispatch function for update msgfree count was delayed 24601 ms (> 20000 ms)
before being called (GSource: 0x5e6a18)
Jul  5 07:57:46 sparky1 heartbeat: [1450]: WARN: Gmain_timeout_dispatch:
Dispatch function for client audit was delayed 23132 ms (> 5000 ms) before
being called (GSource: 0x5e6618)
Jul  5 08:44:34 sparky1 heartbeat: [1450]: WARN: Late heartbeat: Node
sparky1.domainit.com: interval 18742 ms
Jul  5 08:44:34 sparky1 heartbeat: [1450]: WARN: Gmain_timeout_dispatch:
Dispatch function for send local status took too long to execute: 13734 ms
(> 2510 ms) (GSource: 0x5e6018)
Jul  5 08:44:34 sparky1 heartbeat: [1450]: WARN: Gmain_timeout_dispatch:
Dispatch function for send local status was delayed 8734 ms (> 2510 ms)
before being called (GSource: 0x5e6018)
Jul  5 08:44:34 sparky1 heartbeat: [1450]: WARN: Gmain_timeout_dispatch:
Dispatch function for check for signals was delayed 13742 ms (> 2510 ms)
before being called (GSource: 0x5e6818)
Jul  5 08:44:34 sparky1 heartbeat: [1450]: WARN: Gmain_timeout_dispatch:
Dispatch function for client audit was delayed 6734 ms (> 5000 ms) before
being called (GSource: 0x5e6618)

On 7/3/07, Matt Wilder <grewaru at gmail.com> wrote:
>
> I have updated my configuration to use logd and will report back with the
> results.
>
> Thanks,
>
> Matt
>
> On 7/3/07, Lars Marowsky-Bree < lmb at suse.de> wrote:
> >
> > On 2007-07-03T11:51:18, Matt Wilder < grewaru at gmail.com> wrote:
> >
> > > my ha.cf:
> > >
> > > bcast em0
> > > logfacility local7
> >
> > As a first guess, use the logging daemon by setting "use_logd yes", to
> > isolate heartbeat from logging being slow.
> >
> >
> > Regards,
> >     Lars
> >
> > --
> > Teamlead Kernel, SuSE Labs, Research and Development
> > SUSE LINUX Products GmbH, GF: Markus Rex, HRB 16746 (AG Nürnberg)
> > "Experience is the name everyone gives to their mistakes." -- Oscar
> > Wilde
> >
> > _______________________________________________
> > Linux-HA mailing list
> > Linux-HA at lists.linux-ha.org
> > http://lists.linux-ha.org/mailman/listinfo/linux-ha
> > See also: http://linux-ha.org/ReportingProblems
> >
>
>



More information about the Linux-HA mailing list