[Linux-ha-dev] CTS result
Guochun Shi
gshi at ncsa.uiuc.edu
Sun Nov 20 20:32:40 MST 2005
Nov 19 07:59:55 Overall Results:{'auditfail': 1, 'failure': 0,
'success': 1000, 'BadNews': 9}
(I did not run the stonithd test because stonithd still hung in the
previous run. sunjd told me it has been fixed by Alan, so I will try it
again.)
5 bad news come from an unclean start and can thus be ignored:
Nov 17 18:10:16 BadNews: Nov 17 18:02:51 posic042 crmd: [26313]: ERROR:
mask(lrm.c:do_lrm_event): Detected active resource: DcIPaddr
Nov 17 18:10:16 BadNews: Nov 17 18:02:51 posic042 crmd: [26313]: ERROR:
mask(lrm.c:do_lrm_event): Detected active resource: rsc_posic042
Nov 17 18:10:16 BadNews: Nov 17 18:03:55 posic043 crmd: [14607]: ERROR:
mask(lrm.c:do_lrm_event): Detected active resource: rsc_posic043
Nov 17 18:10:17 BadNews: Nov 17 18:04:43 posic044 crmd: [22659]: ERROR:
mask(lrm.c:do_lrm_event): Detected active resource: rsc_posic044
Nov 17 18:10:19 BadNews: Nov 17 18:05:31 posic045 crmd: [13804]: ERROR:
mask(lrm.c:do_lrm_event): Detected active resource: rsc_posic045
1 bad news from monitoring:
Nov 18 03:35:37 Running test standby2 (posic044) [257]
Nov 18 03:37:14 BadNews: Nov 18 03:29:12 posic042 crmd: [6270]: ERROR:
mask(lrm.c:do_lrm_event): LRM operation (13) monitor_5000 on DcIPaddr
Error: not running
Actually, grepping the log shows many such ERROR messages, so maybe
they are expected. I have created a bug for it (bug 970).
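For anyone who wants to reproduce the count, here is a minimal Python sketch of the kind of grep described above. It assumes each log entry sits on a single unwrapped line in the same layout as the BadNews line quoted here; the function name count_lrm_errors and the regular expression are illustrative only and not part of CTS.

```python
import re
from collections import Counter

# Pattern modeled on the crmd line quoted above; the exact layout is an assumption.
LRM_ERROR = re.compile(
    r"(?P<node>\S+) crmd: \[\d+\]: ERROR: .*LRM operation \(\d+\) "
    r"(?P<op>\S+) on (?P<rsc>\S+) Error: (?P<reason>.+)$"
)

def count_lrm_errors(lines):
    """Count LRM operation errors, keyed by (node, resource, operation, reason)."""
    counts = Counter()
    for line in lines:
        m = LRM_ERROR.search(line.rstrip("\n"))
        if m:
            key = (m["node"], m["rsc"], m["op"], m["reason"].strip())
            counts[key] += 1
    return counts
```

Passing an open log file to count_lrm_errors and printing counts.most_common() would show whether the "not running" error is concentrated on one resource or spread across the cluster.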
3 bad news from ccm:
Nov 18 09:39:41 BadNews: Nov 18 09:32:52 posic042 ccm: [2371]: ERROR:
cl_log: 42 messages were dropped
Nov 18 09:53:46 BadNews: Nov 18 09:47:03 posic045 ccm: [9525]: ERROR:
cl_log: 13 messages were dropped
Nov 18 18:08:44 BadNews: Nov 18 18:03:54 posic043 ccm: [21561]: ERROR:
cl_log: 20 messages were dropped
CCM works well apart from those error messages. Since the number of
dropped messages is not that large, I think I can eliminate them by
increasing the channel queue size.
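To judge how much headroom the queue needs, the dropped-message counts can be totalled per node. The following is only a sketch, assuming unwrapped log lines in the format of the three ccm entries above; dropped_per_node is a hypothetical helper, not an existing tool.

```python
import re
from collections import defaultdict

# Pattern modeled on the ccm lines quoted above; the exact layout is an assumption.
DROPPED = re.compile(
    r"(?P<node>\S+) ccm: \[\d+\]: ERROR: cl_log: (?P<n>\d+) messages were dropped"
)

def dropped_per_node(lines):
    """Sum the 'messages were dropped' counts reported by ccm on each node."""
    totals = defaultdict(int)
    for line in lines:
        m = DROPPED.search(line)
        if m:
            totals[m["node"]] += int(m["n"])
    return dict(totals)
```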
1 audit failure:
Nov 19 04:45:59 Running test Restart (posic044) [909]
Nov 19 04:46:55 Warn: Node posic045 dissappeared: cant determin epoche
Nov 19 04:47:24 Resource {IPaddr::rsc_posic045} not served anywhere.
Nov 19 04:47:38 Audit HAResourceAudit FAILED.
I have created a bug for it (bug 971)
-Guochun