Home > Oracle in general > 11g Release 2 for Windows – Grid Infrastructure Installation Buggy?

11g Release 2 for Windows – Grid Infrastructure Installation Buggy?

I just tried installing Oracle 11g Relese 2 on Windows 2008 R2 64-bit and Windows 2008. The system were installed with Windows 2008 and 2008 R2 standard edition 64-bit on a VMWARE server with 4 core with two core assigned to each virtual machine.

Until now i was unable to install it successfully. It always failed with:

[INS-20802] Grid Infrastructure Configuration failed

This is a short blog post about my findinds. Maybe someone else experience this issue as well  and leaves a comment.

After the installation ran for some time it failed on the second node. Some debugging and re-installation hours later i guess the grid installation is buggy at least on Windows 2008 and Windows 2008 R2. Hows that? Well: Look at the following pictures all taken from the SECOND node. The services on the first (= installation node) are fine and running. All log files look fine. On the second node i am completely unable to find any error messages in the log files. Everything looks fine… but the windows event log shows some failed services which are most probably causing the error.

During installation on the second node some services are created by the oracle installer. The first service to be created is names OracleOUIVC8Service which seems to install some kind of VC libraries:

Picture #1: Oracle seems to install some VC++ Runtime libraries during cluster installation. For doing so a service is created and then started. Starting this service shall install the VC runtime library.  But unfortunately the service seems to be unable to start within 30 seconds, causing the following error:

Picture #1: A timeout was reached (30000 milliseconds) while waiting for
the OracleOUIVC8Service service to connect

Picture #2: Note that ONE second after the OracleOUIVC8Service failed to start the windows installer entered the running state. What was installed then? The answer is in picture #3.

Picture #3: According to the windows installer the Microsoft Visual C++ 2005 Redistributable was installed. It took 3 seconds from the windows installer to enter the running start (= start of the installation at 07:27:24 show in the picture above) to the end of the installation which was successful. This is shown in the following pictures:

Picture #4 shows the failing installation service for the grid infrastructure home and/or configuration which fails with:

A timeout was reached (30000 milliseconds) while waiting for the OracleOUIOraCrs11g_home1Service service to connect

This does not surprise me at all. If there are some libraries missing the installer will most probably fail. But i guess the point is here that the service are created in a incorrect way. I DO see the java.exe from the installer in the process list – but only AFTER the start of the service failed….

For debugging i tried several thing:

  • Install on Windows 2008 R2 64-bit
  • Install on Windows 2008 64-bit
  • Pre-Install the VC Runtime
  • Increase the allowed time to start for a service to 180 seconds

Increasing the service start timeout value

Trying to increase the maximum allowed time for a service to start from 30 to 180 seconds didn’t even help. The node itself was completely idle during the service start. But the VC installation started RIGHT AFTER the service start time out after 180 seconds and was completed within 4 seconds. This makes me believe in some kind of error. And so i decided to blog about this problem because i have not yet found anything on the web.

In the following picture you can see the service OracleOUIVC8Service trying to start.

The command line being used for that was:

D:\Temp\2\OraInstall2010-04-17_02-20-52AM\ext\bin\vcredist_x64.exe
/q:a /c:"VCREDI~1.EXE /q:a /c:""msiexec /i vcredist.msi  /qn"" "

As you can see the system was completely idle during service startup with plenty of free memory.

An except from the windows event log shows the service failed to start after 180 seconds (the value specified in milliseconds was increased by adding ServicesPipeTimeout to HKEY_LOCAL_MACHINE\System\CurrentControlSet\Control; so 180000 means 180 seconds):

Right after the service failed to start the windows installer entered the running state and installed the VC redistributable:

So this also leads me to the assumtion there is some bug in the grid infrastructure installer.

Another possibility is that this error is caused by using VMWARE virtualization. But at the moment i dont have a reason to think so.

Categories: Oracle in general Tags:
  1. Phani
    April 29th, 2010 at 12:17 | #1

    Hey ,

    even am facing the same problem, here am trying a single node installation , stilll it fails with same error.

    just keep updating if you come across some info.

    Thanks,
    Phani

  2. jason
    May 4th, 2010 at 16:24 | #2

    I tried to install Oracle 11gR2 Grid Infrastructure on Windows 2008 R2 Hyper-V virtual machine (2 nodes), I got the exact the same error. Still working on it.

  3. NANCY
    May 7th, 2010 at 20:35 | #3

    Have the same issues on a single node install. Contacted Oracle support. No help a week later from them.

  4. Louis
    June 11th, 2010 at 12:56 | #4

    I tried the same on Windows 2003 R2 Enterprise x64 SP2 ( 2 nodes )with the same result. Let me know if you have a resolution.

  5. Lenoy M Joe
    June 20th, 2010 at 07:56 | #5

    Please install oracle 11g 32 bits on 2003 windows compatible mode on windows 2008 64-bit system it will work fine.

    Please install oracle 11g 64 bit

  6. Ronny Egner
    June 23rd, 2010 at 07:11 | #6

    > Please install oracle 11g 32 bits on 2003 windows compatible mode on windows 2008 64-bit system it will work fine.

    Why should i even do that? Mixing 32-bit and 64-bit is not even supported. By the way there is no 32-bit 11g R2 clusterware.

  7. June 23rd, 2010 at 09:17 | #7

    Same problem here, SR opened at Oracle, no answer from them, not even a status report … Did they even try to install Grid Infra on R2 ? Did someone here managed to install it ?

  8. Robert Durgin
    June 25th, 2010 at 01:51 | #8

    I ran into the same issue, make sure that the host name specified in listener.ora matches that of your machine (/etc/hosts on linux) (%WINDIR%\system32\drivers\etc\hosts on windows).

  9. juandie
    June 30th, 2010 at 22:48 | #9

    Same problem here.. I disabled de UAC and it looks like its working. I’ll update when it finish.

    to disable UAC : http://technet.microsoft.com/en-us/library/cc709691%28WS.10%29.aspx

  10. juandie
    July 1st, 2010 at 19:28 | #10

    @juandie
    not sure was that…. y did 3 installs
    1- your error
    (disable UAC)
    2- deinstall and then i didnt hit that error but hit another one while configuring listener. i thougth that something was not properly clean after the first install. So i uninstall again, But this time i also erase the asm disks…
    3- i hit the same error again…

    im cleaning… reinstalling again… clueless

  11. juandie
    July 1st, 2010 at 23:55 | #11

    @juandie
    UAC is not the answer….
    hosts file has the info and both node resolve fine..

    still clueless

  12. Ronny Egner
    July 2nd, 2010 at 07:46 | #12

    I appreciate your tests. I´d like to test as well. Unfortunately at the moment my available time is very limited.

  13. juandie
    July 5th, 2010 at 02:41 | #13

    @Ronny Egner
    i manage to install it by changing the diskgroup… i was using a 1.5G and change it to a 13G

    :)

  14. G
    July 14th, 2010 at 04:53 | #14

    We encountered a similar problem, and found the following things

    - The failure of the “Grid Installation Configuration” step could be caused by many different kinds of errors, some of them silent (no useful information in any install logs)

    - Useful logs to look at are in \app\11.2.0\grid\cfgtoollogs\crsconfig – rootcrs_.log_OUT and rootcrs_.log

    - We saw the same service errors as described in your post, as well as these error messages in the rootcrs_.log file:

    StartService(OracleRemoteExecService) waiting for service to complete
    service status 4, still waiting

    service status 4, still waiting
    service terminated (0×0)
    Warning failed to Stop (OracleRemoteExecService) service. The service has not been started.

    These errors do not seem to be a problem. We eventually were able to install RAC and then the DB successfully, and we still saw those errors. Everything seems to be working fine otherwise.

    - In one case where the “Grid Installation Configuration” step failed, we saw these error messages:

    DiskGroup DATA creation failed with the following message: ORA-15018: diskgroup cannot be createdORA-15307: disk DATA_0000 not discoverable by CSS

    This was because the required disks to be used for OCR/voting ASM volume were not ‘online’ on the remote node.

    - In another case where the “Grid Installation Configuration” step failed, we could see no useful error messages in any of the logs, but it turned out that our node hostnames could not be resolved by DNS, or by hosts file. Fixing the hostname resolution allowed us to install RAC successfully.

    Our conclusion is that the RAC installer is very poor at providing useful error messages :( I was expecting much better from Oracle…

    Anyway, I hope this information helps someone!

  15. Oliver
    July 29th, 2010 at 18:15 | #15

    Same Problem … Contacted Oracle support … and that was the answer:

    RAC on Windows 2008: Oracle Grid Infrastructure / Clusterware Currently Incompatible with Microsoft Failover Clustering (MSFC) (Doc ID 1157711.1)

    Now the Installation works.

    There is also a Problem when IPv6 is enabled !

  16. stefan
    July 29th, 2010 at 21:08 | #16

    Hi there,

    stumbling across the same errors. Did you install with advanced options using the new GNS or did you manually set the scan/vip ips?

  17. Ronny Egner
    August 2nd, 2010 at 16:57 | #17

    Hi,

    i installed with automatic setup and NO GNS enabled.

  18. Diogo Fernandes
    August 3rd, 2010 at 11:30 | #18

    Hi

    Facing similar problem on Windows 2008. All pre-requisites check are fine. But dont seems to understand the error. Is it something to do with Windows 2008 priviledges or with Oracle.

    The Installation fails at the end while doing Grid infrastructure configuration. Realised that create the registry entries has some issues according to the logs and Windows event logs.

    If there are some answers please update.

    Thanks and Regards

    Diogo

  19. Diogo Fernandes
    August 8th, 2010 at 10:20 | #19

    Hi

    Had to give up on Windows 2008 at the moment. It works fine on Windows 2003.

    My installation is not VMWARE but was getting the same error as yours on 2008 R2.

    Thanks and Regards

    Diogo Fernandes

  20. Vivek Achar
    August 9th, 2010 at 13:50 | #20

    Hi,

    I am also exactly facing the same error. I am trying on Windows 2008 SP2 X64

    Any help will be appreciated.

    Thanks!
    ~Vivek

  21. Olli
    August 18th, 2010 at 11:00 | #21

    @Diogo Fernandes

    RAC on Windows 2008: Oracle Grid Infrastructure / Clusterware Currently Incompatible with Microsoft Failover Clustering (MSFC) (Doc ID 1157711.1)

    Deinstall the MS CLusterware !!!

  22. Tuan
    August 20th, 2010 at 20:44 | #22

    I have encountered the same problem when installing Grid Infrastructure for RAC 11gR2 on W2008 R2 in VMWare environment.

    After so many tries, I have successfully installed Grid Infrastructure and RAC 11gR2 on W2008 R2 with two Virtual nodes.

    You can review the following link:

    http://forums.oracle.com/forums/thread.jspa?threadID=1060060&start=0&tstart=0

    Tuan Tran

  23. Toni Surrender
    August 22nd, 2010 at 20:55 | #23

    Why do you wonder about errors during installation of Oracle RAC? All Oracle products are buggy!

  24. Ronny Egner
    August 23rd, 2010 at 08:10 | #24

    Indeed :-) But normally installation works fine.

  25. Ronny Egner
    August 23rd, 2010 at 08:12 | #25

    I´ve seen the note as well. I double checked i didnt have MS Cluster installed….. error persisted.

  26. TT
    September 27th, 2010 at 23:45 | #26

    @G
    Hi,

    We are having the same error DiskGroup OCR-Voting creation failed with the following message: ORA-15018: diskgroup cannot be createdORA-15307: disk DATA_0000 not discoverable by CSS. ALl the disks on the remote node are offline. How did you resolve this “G”? Did you stop the install mount the drives and then restart the install. I don’t see how we can restart the install once this fails.

    Thanks

  27. Ronny Egner
    October 4th, 2010 at 10:51 | #27

    I´d suggest testing the disks.. are all disks accessible from all nodes in parallel? It seems one node is missing a disk.

  28. Lokesh Luthra
    October 21st, 2010 at 08:55 | #28

    Please advise if “Windows Server 2008 R2 x64 Edition” and “Oracle 11g Release 1 (11.1.0.7.0)” combination is compatible . Appreciate your Quick response on this.
    Many Thanks. Lokesh

  29. Ronny Egner
    October 21st, 2010 at 09:07 | #29

    Hi,

    Oracle 11g R1 (this includes 11.1.0.7.0) is NOT certified on Windows 2008 R2 x64.

  30. Gyan Vidyarthi
    October 21st, 2010 at 22:00 | #30

    Finally, I was able to overcome this issue “[INS-20802] Grid Infrastructure Configuration failed” and able to install Oracle 11g R2 Grid Infrastructure at Windows 2008 R2 Standard edition. It took more than a week time to really get to the bottom of the issue. Earlier, I had tried at least 5-6 times with different combinations and options.

    In my case the issue was teaming of the NIC network adapter. After breaking the teaming of network, the cluster utility check also changed and reduced the “failed message” at only one place. I had two node setup – I run the cluster utility check at both the nodes. At node one it gave just one failed message and at node 2 – no failed message.

    Thanks,
    Gyan

  31. Borges
    November 21st, 2010 at 20:57 | #31

    Hi,

    I tried to install Oracle 11gR2 Grid Infrastructure on Windows 2008 R2 x64 and gives the same error ([INS-20802] Grid Infrastructure Configuration failed).

    I searched for patchs or some solution in Metalink and Google but no sucess.

    Anybody have news ?

    God bless.

  32. Scott
    November 29th, 2010 at 19:50 | #32

    Finally was able to install the grid infrastructure successfully.

    1. Make sure that the disk drives are not lettered. After using the diskpart utility to create the partitions be sure to do a refresh in Disk Manager facility of Server Manager to be sure that none of the dirve has letters on all nodes. If they do, remove them and do a refresh again on all nodes to be sure that the drives are not lettered. This prevented us from succesfully configuring ASM.
    2. issue a route 0.0.0.0 delete on all nodes. Caution be sure that you save your defualt gateway address before executing this command. You will need to re-enter it after issuing this command. You may also need to login from the console as you will not be able to access the server untill you enter the default gateway.
    3. remove all network adapters leaving only the private and public interfaces and all references to ipv6 in all of the network adapters. When you issue an ipconfig /all you should only see the private and public interfaces. We had to issue the following commands in order to achieve this.
    netsh Interface 6to4 set state state=disabled
    netsh interface isatap set state disabled
    netsh interface teredo set state disabled
    plus we had to remove all of the ipv6 references in each of the remaining network adapters. You may need to enter further netsh commands depending on what interfaces are installed on your servers.

  33. Shery
    December 8th, 2010 at 17:09 | #33

    well , i am hit by the same error , but metalink says the error is not related to the service.
    the service is a dummy service and can be ignored.
    The erorr may be related to something else.

    WIN: During Installation The Event Log Shows Failure To Start Some Services

    Solution
    OracleOUIVC8Service is a dummy service and it is actually not supposed to start. It is used by the installer to install VC8 binaries on the remote node. It’s not a proper Windows service and it is not expected to start, and in fact the Installer expects and handles the exceptions about the service not starting during the install process.

    The same is true for the OracleOUIOraCrs11g_home1Service service and any other OracleOUI[homename]Service . These are dummy services used by the installer to run an executable and do install work. They are not actually supposed to start. So you can ignore these errors in the Windows event log.

  34. Shery
    December 8th, 2010 at 17:39 | #34

    Now
    the clufy is ok,
    the disks are online on both servers ,
    tried removing teaming ,
    followed the best practices installation from metalink ,,
    but still no luck ..
    support is working for more than a week now ,
    I think i found something new ..

  35. Jerome
    December 15th, 2010 at 12:27 | #35

    Have the same problem. Tried at least 15 times. Shery did you find a solution ?

  36. Mike A.
    December 20th, 2010 at 22:54 | #36

    I was experiencing the same problem. I am using vmware workstation and doing a two node install on Windows 2008 with 11GR2 clusterware just like most people here on this thread.

    On the second node I set up a Windows DNS server with a forward and two reverse lookup zones (the two reverse lookup zones handled the vip and private network subnets). At first neither of my nodes were setup to use any kind of generic DNS name. So my hosts were just RAC1.localdomain, RAC1-VIP.localdomain, etc. And my forward lookup zone on my DNS configuration just used localdomain.

    This appeared to be ok with the initial scan and node validation that the installer was doing. But the grid infrastructure part of the install appeared to have problems resolving the names at it just used RAC1 and RAC1-vip instead of a fully qualified domain. And even though I was using a DNS server my nslookups without the “.localdomain” were not resolving.

    In the %GRID_HOME%\\cfgtoollogs\crsconfig\rootcrs_[node1] log file I saw the following errors:
    PRKC-1023 : Invalid IP address format: rac1-vip
    add nodeapps -n rac1 -A rac1-vip/255.255.255.0/eth0 on node=rac1 … failed
    PRCR-1001 : Resource ora.net1.network does not exist

    So I put both nodes in a domain called cookie.local. Then reconfigured my DNS so that the foward lookup zone referenced cookie.local. Then I put the entries for all server names back in. Once I did this everything worked.

  37. Jong
    December 23rd, 2010 at 03:20 | #37

    We’re have a same error ([INS-20802] Grid Infrastructure Configuration failed) on Windows 2008R2 64bits

    Anybody have a good news ?

  38. Thomas
    December 27th, 2010 at 15:16 | #38

    this was the solution:

    3. remove all network adapters leaving only the private and public interfaces and all references to ipv6 in all of the network adapters. When you issue an ipconfig /all you should only see the private and public interfaces. We had to issue the following commands in order to achieve this.
    netsh Interface 6to4 set state state=disabled
    netsh interface isatap set state disabled
    netsh interface teredo set state disabled

    thx Scott

  39. Jerome
    January 4th, 2011 at 15:57 | #39

    This solution worked for me ! Thanks Thomas !
    Be carefull : these interfaces aren’t displayed in the control panel.
    Now another problem because setup failed on another step :-(

  40. January 11th, 2011 at 17:47 | #40

    I ran into this is issue as well -

    DiskGroup DATA creation failed with the following message: ORA-15018: diskgroup cannot be createdORA-15307: disk DATA_0000 not discoverable by CSS

    The problem was in my case the binding order of the iSCSI LUNs. The device order between the two nodes was different. I had created 6 x 500GB iSCSI Volumes which looked to be discovered OK. Then I ran into the installation errors above. I then increased the size of only one volume to 1GB and spotted that it was discovered as DISK5 of node A and as DISK2 on node B. I have deleted all RAW devices and re-discovered them.

    After that the installation completed successfully.

    It has only cost me 10 tries. ;-)

  41. Baqar Ali Khan
    February 14th, 2011 at 11:23 | #41

    I’ve a problem which installing Grid Infrastructure,
    Grid Infrastructe configuration failed,The Plug-in failed in its perform method

  42. Ronny Egner
    April 11th, 2011 at 10:54 | #42

    Sorry but i am unable to help because you did not supply any error messages.

  43. Leonard
    April 21st, 2011 at 23:52 | #43

    For initial timeouts i disabled ipV6 in Windows 2008 R2 and added database ip name in host. Been fine since then. I had four other network cards, i disabled three leaving the only 1 one i’m using.

    It’s working now, thanks guys. Spent 6 useful hours on that already.

  44. Jane
    July 10th, 2011 at 15:32 | #44

    We are facing the problem with Grid Configuration step is failing in Grid Infrastructure install on production system where using OCFS on Windows 2008 R2 64bit. As Follows:

    [INS-20802] Grid Infrastructure Configuration failed.

    Please help if someone gone through this,

    thanks,
    Jane

  45. Ronny Egner
    August 19th, 2011 at 09:21 | #45

    Hi Jane,
    you have to be more specifiy. Did you look through the various log files? Any errors?

  46. TJ
    December 13th, 2011 at 19:04 | #46

    I had a similar problem (unable to configure grid infrastructure) and it was a simple fix. The setup:
    Windows 2008R2 running on Hyper-V
    Oracle 11.2.0.3 grid infrastructure single instance setup

    The error messages in the install log indicated that my user did not have administrative privileges. The privilege was granted through a group. I added my domain user directly to the Administrators group on the server and I was able to successfully configure Grid & ASM.

    HTH!
    -T. J.

  47. Robert
    January 3rd, 2012 at 17:30 | #47

    I also have problems while installing on 2 VMware nodes, eachone is running on a different vmware server.

    It seems that there is an issue during installation with the hdd on node 2

    on each node there is a online Raw disk with 15GB, they have both the same location (bus and id), what have i done wrong?

    2012-01-03 15:41:40: Executing cmd: E:\oracle\11.2.0\grid\bin\asmca -silent -diskGroupName DATA -diskList ‘\\.\ORCLDISKDATA0′ -redundancy EXTERNAL -configureLocalASM
    2012-01-03 15:42:05: Command output:
    >
    > Disk Group DATA creation failed with the following message:
    > ORA-15018: diskgroup cannot be created
    > ORA-15031: disk specification ‘\\.\ORCLDISKDATA0′ matches no disks
    >
    >
    >End Command output
    2012-01-03 15:42:05: Configuration of ASM … failed

  48. Ronny Egner
    January 6th, 2012 at 09:51 | #48

    I´d start checking if both disks can be seen from all nodes.

  1. No trackbacks yet.