[LinuxFailSafe] Some news about my problem ....

Giulius blizzards@libero.it
Mon, 01 Jul 2002 09:35:02 +0200


Hi all again and sorry for my "infinite" mails.

I found something new on my server.

I tried manually starting only the cad and cdbd services, and now 
everything works correctly.
If I start with fs_cluster, it gives an error on the definition of a node.

The cdbd log repeats the following many times:

Mon Jul  1 09:20:14.303 ftp1.bper.it cdbd  - Added new pool machine ftp1 (1)
Mon Jul  1 09:20:14.304 ftp1.bper.it cdbd  - Last remote quorum entry for machine ftp1 (1) is quorum: cluster id: 0x00000000.0x3d2001ff2d1561c0, master: 1, sequence: 40, member count: 1, members:  1
Mon Jul  1 09:20:14.305 ftp1.bper.it cdbd  - started (pid 27324)
Mon Jul  1 09:20:14.305 ftp1.bper.it cdbd  - fs2d main thread ready
Mon Jul  1 09:20:14.306 ftp1.bper.it cdbd  - Setting local machine id to 1, old value = 0
Mon Jul  1 09:20:14.306 ftp1 cdbd  - New quorum requested (not forced)
Mon Jul  1 09:20:14.306 ftp1 cdbd  - Need replacement for old quorum: cluster id: 0x00000000.0x3d2001ff2d1561c0, master: 1, sequence: 40, member count: 1, members:  1
Mon Jul  1 09:20:14.306 ftp1 cdbd  - Proposed new quorum: cluster id: 0x00000000.0x3d2001ff2d1561c0, master: 1, sequence: 41, member count: 1, members:  1
Mon Jul  1 09:20:14.349 ftp1 cdbd  - New quorum: cluster id: 0x00000000.0x3d2001ff2d1561c0, master: 1, sequence: 41, member count: 1, members:  1
Mon Jul  1 09:20:14.349 ftp1 cdbd  - New quorum: cluster id: 0x00000000.0x3d2001ff2d1561c0, master: 1, sequence: 41, member count: 1, members:  1
Mon Jul  1 09:20:14.349 ftp1 cdbd  - Ready and valid new quorum: cluster id: 0x00000000.0x3d2001ff2d1561c0, master: 1, sequence: 41, member count: 1, members:  1
Mon Jul  1 09:20:14.377 ftp1 cdbd  - terminating on signal 15 (pid 27328)



THE CAD LOG GIVES ME:

Mon Jul  1 09:20:13.970 <cad 27096:7176> cfs_fs_connect: fs_cam_register failed with error FailSafe is not ready to accept admin requests.
Mon Jul  1 09:20:13.974 <cad 27096:7176> Could not determine cluster name
Mon Jul  1 09:20:13.974 <cad 27096:7176> cfs_fs_inventory: resource groups enumeration failed: Query Cluster's () Resource Group List failed: not found
Mon Jul  1 09:20:13.974 <cad 27096:7176> Could not send resource group inventory error 16393
Mon Jul  1 09:20:14.101 <cam_cascdb 27092:3076> cdb key not found #global#machines. cdb error 10
Mon Jul  1 09:20:14.101 <cam_cascdb 27092:3076> cdb key not found #cluster. cdb error 10
Mon Jul  1 09:20:14.101 <cam_cascdb 27092:3076> cdb key not found #local#logging. cdb error 10
Mon Jul  1 09:20:14.101 <cam_cascdb 27092:3076> cdb key not found #global#logging. cdb error 10
Mon Jul  1 09:20:14.101 <cam_cascdb 27092:3076> cdb key not found #local#ClusterAdmin#crs. cdb error 10
Mon Jul  1 09:20:14.101 <cam_cascdb 27092:3076> cdb key not found #global#ClusterAdmin#crs. cdb error 10
Mon Jul  1 09:20:14.101 <cam_cicdb 27093:4101> cdb key not found #local#HA#resources. cdb error 10
Mon Jul  1 09:20:14.101 <cam_cicdb 27093:4101> cdb key not found #local#HA#ResourceTypes. cdb error 10
Mon Jul  1 09:20:14.101 <cam_cicdb 27093:4101> cdb key not found #local#HA#services. cdb error 10
Mon Jul  1 09:20:14.101 <cam_cicdb 27093:4101> cdb key not found #cluster. cdb error 10
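
To see at a glance which configuration keys are reported missing, the "cdb key not found" lines can be grouped by the process that logged them. This is only a rough Python sketch: the regular expression and the function name missing_keys are my own assumptions based on the lines above, not anything shipped with FailSafe, and the sample holds just four of the lines with the mail wrapping undone.

```python
import re

# A few lines copied from the cad log in this mail, with the wrapping undone.
SAMPLE = """\
Mon Jul  1 09:20:14.101 <cam_cascdb 27092:3076> cdb key not found #global#machines. cdb error 10
Mon Jul  1 09:20:14.101 <cam_cascdb 27092:3076> cdb key not found #cluster. cdb error 10
Mon Jul  1 09:20:14.101 <cam_cicdb 27093:4101> cdb key not found #local#HA#resources. cdb error 10
Mon Jul  1 09:20:14.101 <cam_cicdb 27093:4101> cdb key not found #cluster. cdb error 10
"""

# Assumed line shape: "<process pid:tid> cdb key not found KEY. cdb error N"
PATTERN = re.compile(r"<(\w+) \d+:\d+> cdb key not found (\S+)\. cdb error (\d+)")


def missing_keys(text):
    """Return {process name: [keys]} in first-seen order, without duplicates."""
    found = {}
    for line in text.splitlines():
        match = PATTERN.search(line)
        if match is None:
            continue  # not a "cdb key not found" line
        process, key, _error = match.groups()
        keys = found.setdefault(process, [])
        if key not in keys:
            keys.append(key)
    return found


if __name__ == "__main__":
    for process, keys in missing_keys(SAMPLE).items():
        print(process, keys)
```

Feeding it the whole cad log instead of SAMPLE should list every key that cam_cascdb and cam_cicdb could not read.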



THE CMOND LOG GIVES:

Mon Jul  1 09:19:33.122 <cmond 27080:1024> <cmond.c:377> Cmond restarted, using log level info.
Mon Jul  1 09:19:33.230 <cmond 27080:1024> <cmond.c:389> Creating process group table.
Mon Jul  1 09:19:33.230 <cmond 27080:1024> <cmond.c:397> Enabling client requests.
Mon Jul  1 09:19:33.230 <cmond 27080:1024> <cmond.c:405> Installing signal handlers.
Mon Jul  1 09:19:33.230 <cmond 27080:1024> <cmond.c:417> Attempting cdb registration.
Mon Jul  1 09:19:33.236 <cmond 27080:1024> <cmond.c:426> Initiating autoactions.
Mon Jul  1 09:19:33.237 <cmond 27080:1024> <cmond_config.c:179> Reading configuration information for process group cluster_failsafe.
Mon Jul  1 09:19:33.237 <cmond 27080:1024> <cmond_config.c:255> Configuration for process group cluster_failsafe.
Mon Jul  1 09:19:33.237 <cmond 27080:1024> <cmond_config.c:257>         Type = cluster_ha
Mon Jul  1 09:19:33.237 <cmond 27080:1024> <cmond_config.c:263>         Procs = ha_fsd
Mon Jul  1 09:19:33.237 <cmond 27080:1024> <cmond_config.c:272>         Actions = start stop restart detach attach status
Mon Jul  1 09:19:33.238 <cmond 27080:1024> <cmond_config.c:179> Reading configuration information for process group cluster_admin.
Mon Jul  1 09:19:33.238 <cmond 27080:1024> <cmond_config.c:255> Configuration for process group cluster_admin.
Mon Jul  1 09:19:33.238 <cmond 27080:1024> <cmond_config.c:257>         Type = cluster_admin
Mon Jul  1 09:19:33.238 <cmond 27080:1024> <cmond_config.c:263>         Procs = cad
Mon Jul  1 09:19:33.238 <cmond 27080:1024> <cmond_config.c:272>         Actions = start stop restart detach attach status
Mon Jul  1 09:19:33.238 <cmond 27080:1024> <cmond_config.c:179> Reading configuration information for process group cluster_control.
Mon Jul  1 09:19:33.239 <cmond 27080:1024> <cmond_config.c:255> Configuration for process group cluster_control.
Mon Jul  1 09:19:33.239 <cmond 27080:1024> <cmond_config.c:257>         Type = cluster_control
Mon Jul  1 09:19:33.239 <cmond 27080:1024> <cmond_config.c:263>         Procs = crsd
Mon Jul  1 09:19:33.239 <cmond 27080:1024> <cmond_config.c:272>         Actions = start stop restart detach attach status
Mon Jul  1 09:19:33.239 <cmond 27080:1024> <cmond_config.c:179> Reading configuration information for process group cluster_hainfra.
Mon Jul  1 09:19:33.239 <cmond 27080:1024> <cmond_config.c:255> Configuration for process group cluster_hainfra.
Mon Jul  1 09:19:33.239 <cmond 27080:1024> <cmond_config.c:257>         Type = cluster_hainfra
Mon Jul  1 09:19:33.239 <cmond 27080:1024> <cmond_config.c:263>         Procs = ha_cmsd ha_gcd ha_srmd
Mon Jul  1 09:19:33.239 <cmond 27080:1024> <cmond_config.c:272>         Actions = start stop restart detach attach status
Mon Jul  1 09:19:33.240 <cmond 27080:1024> <cmond_config.c:179> Reading configuration information for process group ip_addresses.
Mon Jul  1 09:19:33.240 <cmond 27080:1024> <cmond_config.c:255> Configuration for process group ip_addresses.
Mon Jul  1 09:19:33.240 <cmond 27080:1024> <cmond_config.c:257>         Type = cluster_agent
Mon Jul  1 09:19:33.240 <cmond 27080:1024> <cmond_config.c:263>         Procs = ha_ifd
Mon Jul  1 09:19:33.240 <cmond 27080:1024> <cmond_config.c:272>         Actions = start stop restart detach attach status
Mon Jul  1 09:19:33.240 <cmond 27080:1024> <cmond_pg.c:117> Beginning autoaction cluster_admin .
Mon Jul  1 09:19:33.240 <cmond 27080:1024> <cmond_pg.c:168> autoaction is start action.
Mon Jul  1 09:19:33.241 <cmond 27080:1024> <cmond_proc.c:178> Starting process cad.
Mon Jul  1 09:19:33.241 <cmond 27080:1024> <cmond_proc.c:97> Going to fork/exec new process "cad -l -lf /var/log/failsafe/cad_log --append_log".
Mon Jul  1 09:19:33.242 <cmond 27080:1024> <cmond_proc.c:140> New process cad pid 27087
Mon Jul  1 09:19:33.243 <cmond 27080:1024> <cmond_pg.c:229> Successfully finishing autoaction cluster_admin .
Mon Jul  1 09:19:33.243 <cmond 27080:1024> <cmond_pg.c:117> Beginning autoaction cluster_control .
Mon Jul  1 09:19:33.243 <cmond 27080:1024> <cmond_pg.c:168> autoaction is attach action.
Mon Jul  1 09:19:33.243 <cmond 27080:1024> <cmond_proc.c:175> Looking for process crsd to attach to.
Mon Jul  1 09:19:33.245 <cmond 27080:1024> <cmond_pg.c:215> autoaction cluster_control  failed - could not access object.
Mon Jul  1 09:19:33.245 <cmond 27080:1024> <cmond_pg.c:117> Beginning autoaction ip_addresses .
Mon Jul  1 09:19:33.245 <cmond 27080:1024> <cmond_pg.c:168> autoaction is attach action.
Mon Jul  1 09:19:33.245 <cmond 27080:1024> <cmond_proc.c:175> Looking for process ha_ifd to attach to.
Mon Jul  1 09:19:33.245 <cmond 27080:1024> <cmond_pg.c:215> autoaction ip_addresses  failed - could not access object.
Mon Jul  1 09:19:33.245 <cmond 27080:1024> <cmond_pg.c:117> Beginning autoaction cluster_hainfra .
Mon Jul  1 09:19:33.245 <cmond 27080:1024> <cmond_pg.c:168> autoaction is attach action.
Mon Jul  1 09:19:33.245 <cmond 27080:1024> <cmond_proc.c:175> Looking for process ha_cmsd to attach to.
Mon Jul  1 09:19:33.245 <cmond 27080:1024> <cmond_proc.c:175> Looking for process ha_gcd to attach to.
Mon Jul  1 09:19:33.246 <cmond 27080:1024> <cmond_proc.c:175> Looking for process ha_srmd to attach to.
Mon Jul  1 09:19:33.246 <cmond 27080:1024> <cmond_pg.c:215> autoaction cluster_hainfra  failed - could not access object.
Mon Jul  1 09:19:33.246 <cmond 27080:1024> <cmond_pg.c:117> Beginning autoaction cluster_failsafe .
Mon Jul  1 09:19:33.246 <cmond 27080:1024> <cmond_pg.c:168> autoaction is attach action.
Mon Jul  1 09:19:33.246 <cmond 27080:1024> <cmond_proc.c:175> Looking for process ha_fsd to attach to.
Mon Jul  1 09:19:33.246 <cmond 27080:1024> <cmond_pg.c:215> autoaction cluster_failsafe  failed - could not access object.
Mon Jul  1 09:19:33.246 <cmond 27080:1024> <cmond.c:432> Autoactions done.
Mon Jul  1 09:19:34.127 <cmond 27080:1024> <cmond_request.c:110> New client request have arrived.
Mon Jul  1 09:19:34.127 <cmond 27080:1024> <cmond_request.c:145> Serving request #1.
Mon Jul  1 09:19:34.127 <cmond 27080:1024> <cmond_request.c:202> Request = start cluster_admin REPLY.
Mon Jul  1 09:19:34.127 <cmond 27080:1024> <cmond_pg.c:117> Beginning start cluster_admin REPLY.
Mon Jul  1 09:19:34.127 <cmond 27080:1024> <cmond_pg.c:123> Process group cluster_admin is already running and is being tracked status = running.
Mon Jul  1 09:19:34.127 <cmond 27080:1024> <cmond_request.c:255> Process group cluster_admin is in running state.
Mon Jul  1 09:19:34.127 <cmond 27080:1024> <cmond_request.c:267> Request served successfully.
Mon Jul  1 09:19:34.128 <cmond 27080:1024> <cmond_request.c:150> Sending reply for request #1.
Mon Jul  1 09:19:34.128 <cmond 27080:1024> <cmond_cdb.c:816> Stale CDB handle.
Mon Jul  1 09:19:34.258 <cmond 27080:1024> <cmond.c:538> Cdb registration complete.
Mon Jul  1 09:19:34.258 <cmond 27080:1024> <cmond_request.c:110> New client request have arrived.
Mon Jul  1 09:19:34.258 <cmond 27080:1024> <cmond_request.c:145> Serving request #1.
Mon Jul  1 09:19:34.258 <cmond 27080:1024> <cmond_request.c:202> Request = start cluster_control REPLY.
Mon Jul  1 09:19:34.258 <cmond 27080:1024> <cmond_pg.c:117> Beginning start cluster_control REPLY.
Mon Jul  1 09:19:34.259 <cmond 27080:1024> <cmond_proc.c:178> Starting process crsd.
Mon Jul  1 09:19:34.259 <cmond 27080:1024> <cmond_proc.c:97> Going to fork/exec new process "crsd -l ".
Mon Jul  1 09:19:34.260 <cmond 27080:1024> <cmond_proc.c:140> New process crsd pid 27146
Mon Jul  1 09:19:34.260 <cmond 27080:1024> <cmond_pg.c:229> Successfully finishing start cluster_control REPLY.
Mon Jul  1 09:19:34.260 <cmond 27080:1024> <cmond_request.c:255> Process group cluster_control is in running state.
Mon Jul  1 09:19:34.260 <cmond 27080:1024> <cmond_request.c:267> Request served successfully.
Mon Jul  1 09:19:34.260 <cmond 27080:1024> <cmond_request.c:150> Sending reply for request #1.
Mon Jul  1 09:19:34.294 <cmond 27080:1024> <cmond_sig.c:274> 0 processes have exited.
Mon Jul  1 09:19:44.290 <cmond 27080:1024> <cmond_cdb.c:816> Stale CDB handle.
Mon Jul  1 09:19:44.296 <cmond 27080:1024> <cmond.c:538> Cdb registration complete.
Mon Jul  1 09:19:54.290 <cmond 27080:1024> <cmond_cdb.c:816> Stale CDB handle.
Mon Jul  1 09:19:54.296 <cmond 27080:1024> <cmond.c:538> Cdb registration complete.
Mon Jul  1 09:20:04.290 <cmond 27080:1024> <cmond_cdb.c:816> Stale CDB handle.
Mon Jul  1 09:20:04.416 <cmond 27080:1024> <cmond.c:538> Cdb registration complete.
Mon Jul  1 09:20:08.768 <cmond 27080:1024> <cmond_request.c:110> New client request have arrived.
Mon Jul  1 09:20:08.768 <cmond 27080:1024> <cmond_request.c:145> Serving request #1.
Mon Jul  1 09:20:08.768 <cmond 27080:1024> <cmond_request.c:202> Request = stop cluster_control REPLY|FORCE.
Mon Jul  1 09:20:08.768 <cmond 27080:1024> <cmond_pg.c:247> Beginning stop cluster_control REPLY|FORCE.
Mon Jul  1 09:20:08.768 <cmond 27080:1024> <cmond_proc.c:231> Killing crsd:27146, sending SIGTERM.
Mon Jul  1 09:20:13.760 <cmond 27080:1024> <cmond_proc.c:217> Killing crsd:27146, sending SIGKILL.
Mon Jul  1 09:20:13.760 <cmond 27080:1024> <cmond_pg.c:321> Successfully finishing stop cluster_control REPLY|FORCE.
Mon Jul  1 09:20:13.760 <cmond 27080:1024> <cmond_request.c:255> Process group cluster_control is in stopped state.
Mon Jul  1 09:20:13.760 <cmond 27080:1024> <cmond_request.c:267> Request served successfully.
Mon Jul  1 09:20:13.760 <cmond 27080:1024> <cmond_request.c:150> Sending reply for request #1.
Mon Jul  1 09:20:13.760 <cmond 27080:1024> <cmond_cdb.c:816> Stale CDB handle.
Mon Jul  1 09:20:13.760 <cmond 27080:1024> <cmond_sig.c:274> 0 processes have exited.
Mon Jul  1 09:20:13.896 <cmond 27080:1024> <cmond.c:538> Cdb registration complete.
Mon Jul  1 09:20:13.896 <cmond 27080:1024> <cmond_request.c:110> New client request have arrived.
Mon Jul  1 09:20:13.896 <cmond 27080:1024> <cmond_request.c:145> Serving request #1.
Mon Jul  1 09:20:13.896 <cmond 27080:1024> <cmond_request.c:202> Request = stop cluster_admin REPLY|FORCE.
Mon Jul  1 09:20:13.896 <cmond 27080:1024> <cmond_pg.c:247> Beginning stop cluster_admin REPLY|FORCE.
Mon Jul  1 09:20:13.896 <cmond 27080:1024> <cmond_proc.c:231> Killing cad:27087, sending SIGTERM.
Mon Jul  1 09:20:14.220 <cmond 27080:1024> <cmond_pg.c:321> Successfully finishing stop cluster_admin REPLY|FORCE.
Mon Jul  1 09:20:14.220 <cmond 27080:1024> <cmond_request.c:255> Process group cluster_admin is in stopped state.
Mon Jul  1 09:20:14.220 <cmond 27080:1024> <cmond_request.c:267> Request served successfully.
Mon Jul  1 09:20:14.220 <cmond 27080:1024> <cmond_request.c:150> Sending reply for request #1.
Mon Jul  1 09:20:14.220 <cmond 27080:1024> <cmond_cdb.c:816> Stale CDB handle.
Mon Jul  1 09:20:14.220 <cmond 27080:1024> <cmond_sig.c:270> Process with pid 27146 has exited with status 9
Mon Jul  1 09:20:14.220 <cmond 27080:1024> <cmond_sig.c:274> 1 processes have exited.
Mon Jul  1 09:20:14.352 <cmond 27080:1024> <cmond.c:538> Cdb registration complete.
Mon Jul  1 09:20:14.352 <cmond 27080:1024> <cmond_request.c:110> New client request have arrived.
Mon Jul  1 09:20:14.352 <cmond 27080:1024> <cmond_request.c:145> Serving request #1.
Mon Jul  1 09:20:14.352 <cmond 27080:1024> <cmond_request.c:202> Request = exit REPLY|FORCE.
Mon Jul  1 09:20:14.352 <cmond 27080:1024> <cmond_misc.c:73> Received exit REPLY|FORCE request.
Mon Jul  1 09:20:14.352 <cmond 27080:1024> <cmond_misc.c:99> Getting ready to exit.
Mon Jul  1 09:20:14.352 <cmond 27080:1024> <cmond_request.c:267> Request served successfully.
Mon Jul  1 09:20:14.352 <cmond 27080:1024> <cmond_request.c:150> Sending reply for request #1.
Mon Jul  1 09:20:14.352 <cmond 27080:1024> <cmond_request.c:166> Cleaning up and exiting as requested by client.
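
The cmond log is long, so it may help to pull out only the autoactions that failed. The following is a small Python sketch under my own assumptions: the pattern is guessed from the "autoaction ... failed - ..." lines in this log, and failed_autoactions is a name I made up for illustration, not a FailSafe tool.

```python
import re

# Lines copied from the cmond log in this mail, with the wrapping undone.
SAMPLE = """\
Mon Jul  1 09:19:33.243 <cmond 27080:1024> <cmond_pg.c:168> autoaction is attach action.
Mon Jul  1 09:19:33.245 <cmond 27080:1024> <cmond_pg.c:215> autoaction cluster_control  failed - could not access object.
Mon Jul  1 09:19:33.245 <cmond 27080:1024> <cmond_pg.c:215> autoaction ip_addresses  failed - could not access object.
Mon Jul  1 09:19:33.246 <cmond 27080:1024> <cmond_pg.c:215> autoaction cluster_hainfra  failed - could not access object.
Mon Jul  1 09:19:33.246 <cmond 27080:1024> <cmond_pg.c:215> autoaction cluster_failsafe  failed - could not access object.
"""

# Assumed line shape: "autoaction GROUP  failed - REASON"
FAILED = re.compile(r"autoaction (\w+)\s+failed - (.+)$")


def failed_autoactions(text):
    """Return a list of (process group, reason) for every failed autoaction."""
    results = []
    for line in text.splitlines():
        match = FAILED.search(line)
        if match:
            results.append((match.group(1), match.group(2)))
    return results


if __name__ == "__main__":
    for group, reason in failed_autoactions(SAMPLE):
        print(group, "->", reason)
```

On the lines above it reports the four process groups whose attach autoaction failed (cluster_control, ip_addresses, cluster_hainfra and cluster_failsafe); cluster_admin does not appear because its autoaction finished successfully.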




THE NEWLY CREATED CRED_FTP1 LOG GIVES ME:

                * * * L o g g i n g    R e s t a r t e d * * *

Mon Jul  1 09:19:34.450 <N crsd crs 27146:0 crsd_main.c:200> Crsd restarted.
Mon Jul  1 09:19:34.678 <W crsd crs 27146:0 crs_config.c:665> CI_ERR_NOTFOUND, SystemController information for node ftp1 not found, requests will be ignored.
Mon Jul  1 09:19:45.083 <W crsd crs 27146:0 crs_config.c:665> CI_ERR_NOTFOUND, SystemController information for node ftp1 not found, requests will be ignored.
Mon Jul  1 09:19:49.321 <W crsd crs 27146:0 crs_config.c:665> CI_ERR_NOTFOUND, SystemController information for node ftp1 not found, requests will be ignored.
Mon Jul  1 09:19:53.581 <W crsd crs 27146:0 crs_config.c:665> CI_ERR_NOTFOUND, SystemController information for node ftp1 not found, requests will be ignored.
Mon Jul  1 09:19:57.693 <W crsd crs 27146:0 crs_config.c:665> CI_ERR_NOTFOUND, SystemController information for node ftp1 not found, requests will be ignored.
Mon Jul  1 09:20:01.934 <W crsd crs 27146:0 crs_config.c:665> CI_ERR_NOTFOUND, SystemController information for node ftp1 not found, requests will be ignored.
Mon Jul  1 09:20:05.163 <E crsd crs 27146:0 crs_config.c:1214> CI_ERR_HDL_STALE, Database root node not found.
Mon Jul  1 09:20:05.164 <W crsd crs 27146:0 crsd_config.c:249> CI_ERR_HDL_STALE, Could not read new config.
Mon Jul  1 09:20:09.362 <W crsd crs 27146:0 crs_config.c:665> CI_ERR_NOTFOUND, SystemController information for node ftp1 not found, requests will be ignored.
Mon Jul  1 09:20:13.601 <W crsd crs 27146:0 crs_config.c:665> CI_ERR_NOTFOUND, SystemController information for node ftp1 not found, requests will be ignored.