Failed to start GSD on local node
DBAsupport.com Forums - Powered by vBulletin

Thread: Failed to start GSD on local node

  1. #1
    Join Date
    Nov 2006
    Posts
    158

    Failed to start GSD on local node

    Hi all,

    I just finished an Oracle 9i RAC install (2 nodes) on AIX 5.3 using HACMP as the clusterware. However, when I try to create a cluster database, DBCA prompts me to start the GSD as follows: /u01/app/oracle/product/9.2.0/bin/gsdctl start. When I issue this command as the oracle user, I get the following message:

    (oracle@cteora1) /u01/app/oracle/product/9.2.0/bin > gsdctl start
    Failed to start GSD on local node
    (oracle@cteora1) /u01/app/oracle/product/9.2.0/bin >

    As root,

    cteora1:/var/opt >/u01/app/oracle/product/9.2.0/bin/gsdctl start
    PRKC-1021 : Problem in the clusterware
    Failed to get list of active nodes from clusterware
    cteora1:/var/opt >

    Now, when I run the gsd.sh script for tracing, I get the following:

    (oracle@cteora1) /u01/app/oracle/product/9.2.0/bin > gsd.sh
    [main] [12:1:46:594] [line# N/A] Daemon: argument is /u01/app/oracle/product/9.2.0
    [main] [12:1:46:598] [line# N/A] got node name from the network: cteora1
    [main] [12:1:46:599] [line# N/A] tracing is true; at level 2
    [main] [12:1:46:600] [line# N/A] trace file is /u01/app/oracle/product/9.2.0/srvm/log/gsdaemon_cteora1.log
    [main] [12:1:46:619] [line# N/A] my property portnum=null
    [main] [12:1:46:635] [NativeSystem.:123] Detected Cluster
    [main] [12:1:46:637] [NativeSystem.:124] Cluster existence = true
    [main] [12:1:46:639] [UnixSystem.:118] Going to load SRVM library
    [main] [12:1:46:642] [UnixSystem.:118] loaded libraries
    [main] [12:1:46:644] [OPSMDaemon.main:726] Initializing the daemon ...
    [main] [12:1:46:645] [OPSMDaemon.:188] NativeDebug is set to true
    [main] [12:1:46:647] [OPSMDaemon.:188] UnixSystem.initializeDaemon: groupName is opsm

    Native Code Debug is ON
    initializeDaemon: inside libsrvm.so
    initializeDaemon: The portnumber obtained is 2006
    Native Code Debug is ON
    inside initializeDaemon2
    initializeDaemon2: coordGroupName = opsmCoord
    initializeDaemon: Local Node number is 1
    Native Code Debug is ON
    inside registerInGroup
    registerInGroup: size of private or public member data is 96
    registerInGroup: length of group name = 4
    registerInGroup: Group handle length is 1480
    registerInGroup: Calling skgxnreg to register in a skgxn groupcannot register in group

    Could not register in group opsm with the clusterware
    initializeDaemon: status = 0
    initializeDaemon: localNodeName = [main] [12:1:46:692] [OPSMDaemon.:188]

    initializeDaemon status = false
    [main] [12:1:46:693] [OPSMDaemon.main:726] initializeDaemon status = false
    [main] [12:1:46:695] [OPSMDaemon.main:726] Failed to initialize and register with clusterware
    [main] [12:1:46:696] [OPSMDaemon.main:726] OPSMErrCode = 1002
    [main] [12:1:46:698] [OPSMDaemon.main:729] java.rmi.RemoteException: Could not register with the clusterware
    java.rmi.RemoteException: Could not register with the clusterware
    at oracle.ops.mgmt.daemon.OPSMDaemon.<init>(OPSMDaemon.java:195)
    at oracle.ops.mgmt.daemon.OPSMDaemon.main(OPSMDaemon.java:726)

    [main] [12:1:46:700] [line# N/A] Exiting from main..no point trying to start the daemon
    NONODE(oracle@cteora1) /u01/app/oracle/product/9.2.0/bin >
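
A trace this long is easier to read once it is filtered down to the failure lines. The following is a minimal sketch only: `gsd_trace_summary` is a hypothetical helper name, and the patterns are just the ones that appear in the trace above, so a different failure may need different patterns.

```shell
#!/bin/sh
# Hypothetical helper: reads a GSD trace on stdin and keeps only the
# lines that report a failure. The patterns come from the trace in
# this thread and are not an exhaustive list of GSD error messages.
gsd_trace_summary() {
    grep -E 'Could not register|cannot register|OPSMErrCode|PRKC-[0-9]+|Failed to'
}

# Example usage (assumes gsd.sh is on the PATH, as in the session above):
#   gsd.sh 2>&1 | gsd_trace_summary
```

Run against the trace above, this would leave the "Could not register in group opsm with the clusterware" and "OPSMErrCode = 1002" lines, which point at the clusterware registration step rather than at the Java daemon itself.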

    I've searched Metalink, but none of the proposed solutions have helped. Can anyone help?
    Regards,

    divroro12

  2. #2
    Join Date
    Jan 2001
    Posts
    2,828
    Hi

    You need to raise a ticket with Metalink for this, as I believe Oracle overwrote some of the HACMP libraries during the relinking phase, which is why it couldn't load a library and register itself with HACMP.

    It could also be a permission problem

    Code:
    Could not register in group opsm with the clusterware
    Referring to my old notes, this is what I had jotted down:

    Ensure that the oracle user is added to the HAGSUSER UNIX group.
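
To check the membership before and after the change, something like the following could be used. It is a minimal sketch: `in_group_list` is a hypothetical helper, the group name is written in lowercase as `hagsuser` (an assumption about how it appears on the system), and `id -Gn` is the standard POSIX way to list a user's group names.

```shell
#!/bin/sh
# Hypothetical helper: succeeds if group name $1 appears as a whole
# word in the whitespace-separated group list $2.
in_group_list() {
    for g in $2; do
        [ "$g" = "$1" ] && return 0
    done
    return 1
}

# Example usage (not run here; assumes an "oracle" user exists):
#   if in_group_list hagsuser "$(id -Gn oracle)"; then
#       echo "oracle is a member of hagsuser"
#   else
#       echo "oracle is NOT a member of hagsuser"
#   fi
```

Comparing whole names in a loop, rather than using a substring match, avoids a false positive on a similarly named group.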

    hope it helps

    regards
    Hrishy
    Last edited by hrishy; 04-04-2008 at 07:14 AM.

  3. #3
    Join Date
    Nov 2006
    Posts
    158
    hrishy,

    I just read this document from IBM on installing 9i RAC on AIX with HACMP:

    http://t4biweb01.t4bi.com/pls/htmldb...50311706566234

    It talks about the hagsuser group, which I believe is the issue because I don't even have this group on my system. I've collected all the necessary details about adding this group and making the oracle user a member of it; I'm still waiting for my sysadmin to implement it.
    I will keep you posted on any developments.
    Regards,

    divroro12

  4. #4
    Join Date
    Nov 2006
    Posts
    158
    Well, the hagsuser group was successfully created using the steps outlined in the document I linked earlier, and I was able to start the GSD on both cluster nodes.

    Thanks for your input.
    Regards,

    divroro12
