Error message received in Heartbeat log

Pamela Schmidt Pamela.Schmidt-1@ksc.nasa.gov
Wed, 2 Oct 2002 02:49:22 -0400


After completing the tcpdump the following is the data information:  The
request and reply information appears to be fine.  Is the software adding an
extra carriage return? or not picking up the proper syntax? and The "at s"
error is no where to be found.

13:42:53.134302 DNS_NAME.gov > 154.219.101.6: icmp: echo request (DF)
0x0000  4500 007b 0000 4000 4001 6cc2 80d9 6607 E..{..@.@.l...f.
0x0010  80d9 6606 0800 8629 8d09 0127 3e3e 3e0a ..f....)...'>>>.
0x0020  743d 4e53 5f73 740a 7374 3d70 696e 670a      t=NS_st.st=ping.
0x0030  7372 633d 3132 382e 3231 372e 3130 322e     src=154.219.101.
0x0040  360a 7473 3d33 6439 3964 6539 640a 6175     6.ts=3d99de9d.au
0x0050  7468 3d33 2061 3935 6233 3134 6530 6530     th=1.a95b314e0e0
0x0060  3039 6266 3438 6362 6632 3131 6332 6165     09bf48cbf211c2ae
0x0070  6637 6263 630a 3c3c 3c0a 00             f7bcc.<<<..
13:42:53.134302 154.219.101.6 > DNS_NAME.gov: icmp: echo reply
0x0000  4500 007b eff1 0000 ff01 fdcf 80d9 6606 E..{..........f.
0x0010  80d9 6607 0000 8e29 8d09 0127 3e3e 3e0a ..f....)...'>>>.
0x0020  743d 4e53 5f73 740a 7374 3d70 696e 670a      t=NS_st.st=ping.
0x0030  7372 633d 3132 382e 3231 372e 3130 322e     src=154.219.101.
0x0040  360a 7473 3d33 6439 3964 6539 640a 6175     6.ts=3d99de9d.au
0x0050  7468 3d33 2061 3935 6233 3134 6530 6530     th=1.a95b314e0e0
0x0060  3039 6266 3438 6362 6632 3131 6332 6165     09bf48cbf211c2ae
0x0070  6637 6263 630a 3c3c 3c0a 00             f7bcc.<<<..
13:42:53.524302 154.219.101.6 > DNS_NAME.gov: icmp: echo request (DF)
0x0000  4500 007b 0000 4000 4001 6cc2 80d9 6606 E..{..@.@.l...f.
0x0010  80d9 6607 0800 b63d 0e0d 012a 3e3e 3e0a ..f....=...*>>>.
0x0020  743d 4e53 5f73 740a 7374 3d70 696e 670a       t =NS_st.st=ping.
0x0030  7372 633d 3132 382e 3231 372e 3130 322e      src=154.219.101.
0x0040  370a 7473 3d33 6439 3964 6539 660a 6175      7.ts=3d99de9f.au
0x0050  7468 3d33 2035 3438 3232 6137 6337 6164     th=1.54822a7c7ad
0x0060  3537 6338 3533 3039 3661 6539 6361 3965     57c853096ae9ca9e
0x0070  6630 3934 330a 3c3c 3c0a 00             f0943.<<<..
13:42:53.524302 DNS_NAME.gov > 154.219.101.6: icmp: echo reply
0x0000  4500 007b 639f 0000 ff01 8a22 80d9 6607 E..{c......"..f.
0x0010  80d9 6606 0000 be3d 0e0d 012a 3e3e 3e0a ..f....=...*>>>.
0x0020  743d 4e53 5f73 740a 7374 3d70 696e 670a       t=NS_st.st=ping.
0x0030  7372 633d 3132 382e 3231 372e 3130 322e      src=154.219.101.
0x0040  370a 7473 3d33 6439 3964 6539 660a 6175      7.ts=3d99de9f.au
0x0050  7468 3d33 2035 3438 3232 6137 6337 6164      th=1.54822a7c7ad
0x0060  3537 6338 3533 3039 3661 6539 6361 3965      57c853096ae9ca9e
0x0070  6630 3934 330a 3c3c 3c0a 00             f0943.<<<..


Help...
Pam Schmidt

----- Original Message -----
From: "Soffen, Matthew" <msoffen@iso-ne.com>
To: "Wallwork, Nathan" <nwallwo@pnm.com>
Cc: <linux-ha@muc.de>
Sent: Tuesday, October 01, 2002 1:21 PM
Subject: RE: Error message received in Heartbeat log


> I'm not using ping, I'm only using bcast.  This is happening with Ethernet
> and/or serial communications.
>
> Alan, would having another box in promiscuous mode listening to the
> heartbeat channel work ?
>
> Matt
>
> -----Original Message-----
> From: Wallwork, Nathan [mailto:nwallwo@pnm.com]
> Sent: Tuesday, October 01, 2002 12:10 PM
> Cc: linux-ha@muc.de
> Subject: RE: Error message received in Heartbeat log
>
>
> On Tue, 1 Oct 2002, Soffen, Matthew wrote:
> > Hmmmm... Digital Signature ?
> >
> > When I'm having the problems on FreeBSD, all I am appearing to get IS
the
> > digital signature line ( '<<< ' ).
>
> Best bet is probably to use tcpdump and capture the ping packet in
> both directions, then post those.
>
> Use something like `tcpdump tcpdump icmp -X -x -s 1024 > filename`
>
> This will allow someone to see just how the ping packet is being
> mangled, and demonstrate that this is happening, assuming that's
> what's going on.

----- Original Message -----
Schmidt-1, Pamela wrote:
> The following error message is received in the /var/log/ha-log file.  Can
>  you please explain what this error message means and how to correct it?
It
>  only appears on the slave side of the cluster.
>
>  WARN: ha_msg_add_nv: line doesn't contain '='
>  info: at s
>  ERROR: NV Failure (string2msg)
>  ERROR: >>>
>  t=NS_st
>  st=ping
>  at s
>
>  This error message is displayed exactly as shown in the log.  It also is
>  displayed 16 times every 2 seconds.
>
>  Any and all suggestions are greately appreciated,
>  Pam Schmidt

OK.

This is often the result of characters being lost in a serial connection.
But, in your case, it looks like you have a ping node configured which is
mangling the ping response.

It would have soon discarded the packet anyway because it's been mangled.

In particular the line "at s" should not occur in any heartbeat message.
There also doesn't appear to be any digital signature, or <<< line.

Either you can make the node you're pinging stop mangling the ping response
somehow, or you'll have to take it out of your configuration.

-- Alan Robertson
   alanr@unix.sh