Thursday, September 8, 2016

How to Proceed from Failed 11gR2 Grid Infrastructure (CRS) Installation?

A. OUI Failure 
B. GI Standalone Deconfigure and Reconfigure (Oracle Restart):
C. GI Cluster Deconfigure and Reconfigure
D. Grid Infrastructure Deinstall
E. OUI Plug-in Failure or OUI Window Closed While Running root.sh



A. OUI Failure

Case applies here is when runInstaller fails before user is prompted to execute rootupgrade.sh.
When runInstaller fails, identify the cause by looking at screen output and installation logs (located in last section "Installation log locations" of this note). 
 Solution can be identified from default logfile
Once root cause is identified and the issue is resolved, if nothing is installed yet (before clicking on the install button on last page of OUI), continue with OUI; otherwise clean up the failed installation and prepare for another run.
To clean up the failed installation and re-run:
  1. As owner of the failed home, remove the failed installation by running 
$NEW_HOME/deinstall/deinstall. If deinstall fails, go to Step 2, otherwise go to Step 3

Note:
  if deinstalls asks to execute any script as root, skip it as it may cause issue to existing clusterware.

2. Manually remove failed home as home owner on all nodes:
export ORACLE_HOME=$OLD_HOME

## detach failed NEW_HOME:
$OLD_HOME/oui/bin/runInstaller -detachHome -silent -local ORACLE_HOME=$NEW_HOME

## confirm $NEW_HOME is removed from central inventory:
$OLD_HOME/OPatch/opatch lsinventory -all 

## remove files in failed NEW_HOME manually:
/bin/rm -rf $NEW_HOME

unset ORACLE_HOME

Note: OLD_HOME refers to any existing home on the node
3. If upgrading Oracle Clusterware (CRS or Grid Infrastructure), set CRS flag to true for pre-upgrade clusterware home. Refer to note 1053393.1 for details.

4. Restart runInstaller from unzipped location.

B. GI Standalone Deconfigure and Reconfigure (Oracle Restart):

 To deconfigure:
  As root, execute "$GRID_HOME/crs/install/roothas.pl -deconfig -force -verbose"
  If it fails, please disable GI, reboot the node and try the same command:
 As root, execute "$GRID_HOME/bin/crsctl disable has"
 As root, reboot the node; once the node comes backup, execute above deconfigure command again.

  To reconfigure:
    As root, execute "$GRID_HOME/root.sh"      

C. GI Cluster Deconfigure and Reconfigure

Identify cause of root.sh failure by reviewing logs in $GRID_HOME/cfgtoollogs/crsconfig and $GRID_HOME/log, once cause is identified and problem is fixed, deconfigure and reconfigure with steps below - keep in mind that you will need wait till each step finishes successfully before move to next one:

Step 0: For 11.2.0.2 and above, root.sh is restartable.

Once cause is identified and the problem is fixed, root.sh can be executed again on the failed node. If it succeeds, continue with your planned installation procedure; otherwise as root
sequentially execute "$GRID_HOME/crs/install/rootcrs.pl -verbose -deconfig -force" and $GRID_HOME/root.sh on local node, if it succeeds, continue with your planned installation procedure, otherwise proceed to next step (Step 1) of the note.

Step 1: As root, run "$GRID_HOME/crs/install/rootcrs.pl -verbose -deconfig -force" on all nodes, except the last one.

Step 2: As root, run "$GRID_HOME/crs/install/rootcrs.pl -verbose -deconfig -force -lastnode" on last node. This command will zero out OCR, Voting Disk and the ASM diskgroup for OCR and Voting Disk
    Note:
 a. Step1 and 2 can be skipped on node(s) where root.sh haven't been executed this time.
 b. Step1 and 2 should remove checkpoint file. To verify:

     ls -l $ORACLE_BASE/Clusterware/ckptGridHA_.xml
    If it's still there, please remove it manually with "rm" command on all nodes

c. If GPNP profile is different between nodes/setup, clean it up on all nodes as grid user

          $ rm -rf $GRID_HOME/gpnp/*
          $ mkdir -p $GRID_HOME/gpnp/profiles/peer $GRID_HOME/gpnp/wallets/peer                   $GRID_HOME/gpnp/wallets/prdr $GRID_HOME/gpnp/wallets/pa                                         $GRID_HOME/gpnp/wallets/root

    The profile needs to be cleaned up:

    c1. If root.sh is executed concurrently - one should not execute root.sh on any other nodes before it finishes on first node.
    c2. If network info, location of OCR or Voting Disk etc changed after Grid is installed - rare

Step 3: As root, run $GRID_HOME/root.sh on first node
Step 4: As root, run $GRID_HOME/root.sh on all other node(s), except last one.
Step 5: As root, run $GRID_HOME/root.sh on last node.

D. Grid Infrastructure Deinstall

As grid user, execute "$GRID_HOME/deinstall/deinstall" and follow the instructions, for details, refer to the following documentation for your platform:

Oracle Grid Infrastructure Installation Guide

How to Modify or Deinstall Oracle Grid Infrastructure

If there's any error, deconfig the failed GI with steps in Section B or C, and deinstall manually with MOS 
note 1364419.1

E. OUI Plug-in Failure or OUI Window Closed While Running root.sh

Case applies here is OUI plug-in failure after root.sh has been executed successfully on all nodes or OUI window closed while running root.sh

Identify the cause of failed plug-in. Once root cause is identified and the issue is resolved, refer to MOS 
note 1360798.1


No comments:

Post a Comment