[LinuxFailSafe] Newbie Question: Status UNKNOWN

Tabitha Taylor tabtaylor@excite.com
Mon, 22 Jul 2002 10:46:17 -0400 (EDT)


Hi,

I am trying to set up failsafe on a two node cluster running RedHat 7.3.  I have started ha_services on both nodes and tried to bring the resource group online.  When I look at the status of the nodes/cluster it is UNKNOWN.  Any help/hints would be greatly appreciated.  I am using the latest CVS.  Below is my configuration.


The cad_log shows 

Mon Jul 22 10:05:10.670  cfs_fs_connect: fs_cam_register failed with error FailSafe is not ready to accept admin requests.

---------------------------------------------------------
My host files are as follows:

HA1 host file

172.27.5.22             HA1 HA1.company.com
127.0.0.1               HA1 localhost.localdomain localhost
172.27.5.23             HA2 HA2.company.com
192.168.1.2             priv.HA2.company.com
192.168.1.1             priv.HA1.company.com

HA2 host file

172.27.5.23             HA2 HA2.company.com
127.0.0.1               HA2 localhost.localdomain localhost
172.27.5.22             HA1 HA1.company.com
192.168.1.2             priv.HA2.company.com
192.168.1.1             priv.HA1.company.com
---------------------------------------------------------

When I try to bring the resource group online I get ...

cmgr> admin online resource_group extgroup in cluster ha-cluster
FailSafe daemon (ha_fsd) is not running on this local node or it is not ready to accept admin commands.
Resource Group (extgroup) is online-ready.

Failed to admin:
	online


----------------------------------------------------------
My configuration is as follows...


Thu Jul 18 13:26:03 EDT 2002
 
 
Cluster ha-cluster:
 
 
 
Node HA1:
 
        Logical Machine Name: HA1
        Hostname: HA1.company.com
        Is FailSafe: true
        Nodeid: 1
        Reset type: powerCycle
        System Controller: stonith
        System Controller status: enabled
        System Controller owner: HA2
        System Controller owner device: ssh
        System Controller owner type: tty
        ControlNet Ipaddr: 192.168.1.1
        ControlNet HB: true
        ControlNet Control: true
        ControlNet Priority: 1
        ControlNet Ipaddr: 172.27.5.22
        ControlNet HB: true
        ControlNet Control: false
        ControlNet Priority: 2
 
 
Node HA2:
 
        Logical Machine Name: HA2
        Hostname: HA2.company.com
        Is FailSafe: true
        Nodeid: 2
        Reset type: powerCycle
        System Controller: stonith
        System Controller status: enabled
        System Controller owner: HA1
        System Controller owner device: ssh
        System Controller owner type: tty
        ControlNet Ipaddr: 192.168.1.2
        ControlNet HB: true
        ControlNet Control: true
        ControlNet Priority: 1
        ControlNet Ipaddr: 172.27.5.23
        ControlNet HB: true
        ControlNet Control: false
        ControlNet Priority: 2
 
 
Resource_group extgroup:
 
	Failover Policy: HA1-primary
        	Version: 1
        	Script: ordered
        	Attributes: Auto_Recovery Auto_Failback 
        	Initial AFD: HA1 HA2 
 
        Resources: 
        	172.27.5.24	(type: IP_address)
        	extfs	(type: Filesystem)
 
 
Resource extfs (type Filesystem):
 
        FSType: ext3
        force_umount: yes
        mount_options: defaults
        Device: /dev/sdb1
        No resource dependencies
 
 
Resource 172.27.5.24 (type IP_address):
 
        BroadcastAddress: 172.27.5.255
        interfaces: eth1
        NetworkMask: 0xffffff00
        No resource dependencies
 
 
Failover_policy HA1-primary:
 
	Version: 1
	Script: ordered
	Attributes: Auto_Recovery Auto_Failback 
	Initial AFD: HA1 HA2 
 
 
Failover_policy HA2-primary:
 
	Version: 1
	Script: ordered
	Attributes: InPlace_Recovery Auto_Failback 
	Initial AFD: HA2 HA1 
 
 
Sincerely,

Tabitha Taylor





------------------------------------------------
Join Excite! - http://www.excite.com
The most personalized portal on the Web!