[Linux-ha-dev] heartbeat-1.2.2 failure to stop

Tuomo Soini tis at foobar.fi
Mon Jun 7 05:22:10 MDT 2004


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Tuomo Soini wrote:

| Can't check from old failure. I think 16174 was master control process.
| And problem is this is not easy to reproduce because it will happen only
| after some days of running.

Ok. Here is more info, but from a different failed stop. This time I
saved more details from the failure.

/etc/init.d/heartbeat restart gets jammed:

root      5323  0.0  0.2  2760 2728 ?        S    May19   0:01 \
heartbeat: heartbeat: master control process
nobody    5333  0.0  0.1  2484 2484 ?        SL   May19   0:00 \
heartbeat: heartbeat: FIFO reader
nobody    5334  0.0  0.1  2440 2440 ?        SL   May19   0:00 \
heartbeat: heartbeat: write: serial /dev/ttyS0
nobody    5335  0.0  0.1  2440 2440 ?        SL   May19   0:00 \
heartbeat: heartbeat: read: serial /dev/ttyS0
nobody    5336  0.0  0.1  2440 2440 ?        SL   May19   0:00 \
heartbeat: heartbeat: write: bcast eth1
nobody    5337  0.0  0.1  2440 2440 ?        SL   May19   0:00 \
heartbeat: heartbeat: read: bcast eth1
root     26799  0.0  0.0  4428 1092 pts/0    S    11:46   0:00 /bin/sh \
/sbin/service heartbeat restart
root     26802  0.0  0.0  4608 1284 pts/0    S    11:46   0:00 /bin/sh \
/etc/init.d/heartbeat restart
root     26811  0.0  0.0  2224  852 pts/0    S    11:46   0:00 \
heartbeat: heartbeat

From ha.log:

Jun  1 11:46:41 ssgw1 heartbeat[5323]: info: Heartbeat shutdown in \
progress. (5323)
Jun  1 11:46:41 ssgw1 heartbeat[26812]: info: Giving up all HA \
resources.
Jun  1 11:46:55 ssgw1 heartbeat[26812]: info: All HA resources \
relinquished.
Jun  1 11:46:55 ssgw1 heartbeat[26812]: info: MSG: Dumping message \
with 2 fields
Jun  1 11:46:55 ssgw1 heartbeat[26812]: info: MSG[0] : [t=shutdone]
Jun  1 11:46:55 ssgw1 heartbeat[26812]: info: MSG[1] : [st=dead]

From ha_debug.log:

Jun  1 11:46:55 ssgw1 heartbeat[26812]: ERROR: Cannot write message to \
/var/lib/heartbeat/fifo [26812 vs 5323]: No such device or address

So: 5323 is the master control process, and 26812 is not running any more.

And the restart is jammed.
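
For reference, "No such device or address" is the text for ENXIO, which
is what the kernel returns when a FIFO is opened write-only and
non-blocking while no process has it open for reading. Whether that is
what happened to 26812 is only an assumption on my part; I have not
checked how heartbeat opens the FIFO. A minimal sketch of the kernel
behaviour, using a temporary FIFO (the path and the function name are
made up for illustration, this is not heartbeat code):

```python
import errno
import os
import tempfile


def open_fifo_without_reader():
    """Return the errno from a non-blocking write-open of a FIFO with no reader.

    Returns 0 if the open unexpectedly succeeds.
    """
    with tempfile.TemporaryDirectory() as tmpdir:
        # Hypothetical path for the demo; not /var/lib/heartbeat/fifo.
        fifo_path = os.path.join(tmpdir, "fifo")
        os.mkfifo(fifo_path)
        try:
            # Nobody has the FIFO open for reading, so a non-blocking
            # write-only open fails immediately instead of waiting.
            fd = os.open(fifo_path, os.O_WRONLY | os.O_NONBLOCK)
        except OSError as exc:
            return exc.errno
        os.close(fd)
        return 0


if __name__ == "__main__":
    err = open_fifo_without_reader()
    print("got ENXIO:", err == errno.ENXIO)
    print("message:", os.strerror(err))
```

If this is the same situation, it would mean nothing had the FIFO open
for reading at 11:46:55, even though 5323 and the FIFO reader 5333 still
show up in the process list.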

| There will never be dump after that line.

That's true in this case too.

- --
Tuomo Soini <tis at foobar.fi>
Linux and network services
+358 40 5240030
Foobar Oy <http://foobar.fi/>
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.1 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFAxE/iTlrZKzwul1ERAp1PAJ9I0j/2pplqOvmLUWpSJiIG8W1y3QCeO/Qc
+Qbx2IcxfstdFQA0xCaZtBY=
=8Y0+
-----END PGP SIGNATURE-----


