前言:最近在给rac添加节点,遇到几个问题,记录下,也发现自己的问题,理论基础很重要,遇到问题只能靠试,所以要是扎实的基础,非常有助于问题的分析。
一、添加节点是安装Oracle软件报错
其实就是XML文件记录信息有误,更新对了就可以。
SEVERE:Cannot perform add node procedure as the value of CLUSTER_NEW_VIRTUAL_HOSTNAMES or CLUSTER_NEW_NODES or both could not be obtained from the command line or response file(s). Silent install cannot continue.
Adding new node from database home results SEVERE:Cannot perform add node procedure as the value of CLUSTER_NEW_VIRTUAL_HOSTNAMES or CLUSTER_NEW_NODES or both could not be obtained from the command line or response file(s) (文档 ID 2345332.1)
SOLUTION
/oui/bin/runInstaller -updateNodeList ORACLE_HOME=/u01/app/test01/product/11.2.0.4/db_4 “CLUSTER_NODES={test01,test02}” -silent CRS=“false”
Adding new node from database home results SEVERE:Cannot perform add node procedure as the value of CLUSTER_NEW_VIRTUAL_HOSTNAMES or CLUSTER_NEW_NODES or both could not be obtained from the command line or response file(s) (文档 ID 2345332.1)

二、read7 RAC dd 错误
Redaht7/Oracle Linux7 + ORA11g : ohasd fails to start(Doc ID 1959008.1)
Document 1951613.1 Installation walk-through - Oracle Grid/RAC 11.2.0.4 on Oracle Linux 7 (Doc ID 1951613.1)
三、添加实例错误
–添加节点报错密码不一致,两个节点都修改,不必同步密码文件

二、安装错误
Addnode.sh or Fresh Install Fails With PRKC-1025 and Java.Lang.OutOfMemoryError (文档 ID 1085893.1)
[[email protected] bin]$ more /oracle/app/oraInventory/logs/oraInstall2019-05-23_02-24-39PM.err
Exception java.lang.OutOfMemoryError: Java heap space occurred…
java.lang.OutOfMemoryError: Java heap space
at java.lang.StringCodingCharsetSD.decode(StringCoding.java:183)atjava.lang.StringCoding.decode(StringCoding.java:228)atjava.lang.String.<init>(String.java:405)atjava.lang.String.<init>(String.java:433)atjava.io.UnixFileSystem.list(NativeMethod)atjava.io.File.list(File.java:937)atoracle.sysman.oii.oiic.OiicAddNodeSummaryInformation.getDiskSpaceOccupied(OiicAddNodeSummaryInformation.java:551)atoracle.sysman.oii.oiic.OiicAddNodeSummaryInformation.getDiskSpaceOccupied(OiicAddNodeSummaryInformation.java:561)atoracle.sysman.oii.oiic.OiicAddNodeSummaryInformation.getDiskSpaceOccupied(OiicAddNodeSummaryInformation.java:561)atoracle.sysman.oii.oiic.OiicAddNodeSummaryInformation.computeSpaceInfo(OiicAddNodeSummaryInformation.java:444)atoracle.sysman.oii.oiic.OiicAddNodeSummaryInformation.initializeAddNodeSession(OiicAddNodeSummaryInformation.java:383)atoracle.sysman.oii.oiic.OiicAddNodeSummaryInformation.<init>(OiicAddNodeSummaryInformation.java:140)atoracle.sysman.oii.oiif.oiifw.OiifwAddNodeSummaryWCDE.writeSummaryInformation(OiifwAddNodeSummaryWCDE.java:212)atoracle.sysman.oii.oiif.oiifw.OiifwAddNodeSummaryWCDE.logDialog(OiifwAddNodeSummaryWCDE.java:204)atoracle.sysman.oii.oiif.oiifb.OiifbWizChainDlgElem.doOperation(OiifbWizChainDlgElem.java:702)atoracle.sysman.oii.oiif.oiifw.OiifwAddNodeSummaryWCDE.doOperation(OiifwAddNodeSummaryWCDE.java:180)atoracle.sysman.oii.oiif.oiifb.OiifbCondIterator.iterate(OiifbCondIterator.java:171)atoracle.sysman.oii.oiic.OiicPullSession.doOperation(OiicPullSession.java:1380)atoracle.sysman.oii.oiic.OiicSessionWrapper.doOperation(OiicSessionWrapper.java:295)atoracle.sysman.oii.oiic.OiicInstaller.run(OiicInstaller.java:579)atoracle.sysman.oii.oiic.OiicInstaller.runInstaller(OiicInstaller.java:969)atoracle.sysman.oii.oiic.OiicInstaller.main(OiicInstaller.java:906)[grid@itsmdb2bin]
四、ACFS安装失败,导致root.sh执行失败
遇到ACFS安装失败试了好多次,后来单独安装ACFS时发现是缺少目录和三个文件,解决方法就是把集群deconfig掉(不会把删除文件)然后再重新执行root.sh
1. 添加节点报错

提示在安装ACFS时报错
根据日志信息,安装了相关补丁,重新perl rootcrs.pl deconfig问题依然存在
已存在的节点安装201904最新PSU,但是这个PSU也存在对ACFS的bug,所以决定将其patch回滚掉,尝试之后依旧报错相同内容。
由于当时是acfs安装失败所以决定手动安装acfs,报错是缺少三个文件,
手动创建相关目录并赋予正常节点相同的权限,将文件copy到新节点中,安装成功并online
集群启动之后,资源可以正常识别到,但是acfs是offlin也无法看到vip,instance等资源,手动添加实例,vip报错
解决方法:
从正常节点中将文件copy过去并创建安装时缺少的目录
[[email protected]: /opt/app/11.2.0/grid/product/11.2.0/db_1/install/usm/Oracle/EL6/x86_64/2.6.32-696/2.6.32-696-x86_64/bin]$ ls -ll
total 41828
-rwxr-xr-x 1 grid oinstall 29906728 May 22 13:53 oracleacfs.ko
-rwxr-xr-x 1 grid oinstall 5341646 May 22 13:53 oracleadvm.ko
-rwxr-xr-x 1 grid oinstall 7576942 May 22 13:53 oracleoks.ko
[[email protected]: /opt/app/11.2.0/grid/product/11.2.0/db_1/install/usm/Oracle/EL6/x86_64/2.6.32-696/2.6.32-696-x86_64/bin]$ exit
logout
[[email protected]: /opt/app/11.2.0/grid/product/11.2.0/db_1/bin]# ./acfsroot install
ls: cannot access /opt/app/11.2.0/grid/product/11.2.0/db_1/install/usm/Oracle/EL6/x86_64/2.6.32-696/2.6.32-696.el6-x86_64/bin/oracleoks.ko:
No such file or directory
ACFS-9459: ADVM/ACFS is not supported on this OS version: ‘2.6.32-754.11.1.el6.x86_64’
[[email protected]: /opt/app/11.2.0/grid/product/11.2.0/db_1/bin]#
srvctl add instance -d rac -i rac1 -n rac1
/opt/app/11.2.0/grid/product/11.2.0/db_1/jdk/bin/java: symbol lookup error: /opt/app/11.2.0/grid/product/11.2.0/db_1/lib/ libclntsh.so.11.1: undefined symbol: ztca_RNGInit
对比正常节点与失败节点,大小相差3g,手动添加资源不现实,决定删除节点重新添加。
删除节点:
rm -rf /etc/oracle/
rm -f /etc/init.d/init.cssd
rm -f /etc/init.d/init.crs
rm -f /etc/init.d/init.crsd
rm -f /etc/init.d/init.evmd
rm -f /etc/rc2.d/K96init.crs
rm -f /etc/rc2.d/S96init.crs
rm -f /etc/rc3.d/K96init.crs
rm -f /etc/rc3.d/S96init.crs
rm -f /etc/rc5.d/K96init.crs
rm -f /etc/rc5.d/S96init.crs
rm -Rf /etc/oracle/scls_scr
rm -f /etc/inittab.crs
rm -f /etc/ohasd
rm -f /etc/oraInst.loc
rm -f /etc/oratab
rm -rf /tmp/.oracle
rm -rf /tmp/ora*
rm -rf /var/tmp/.oracle
rm -rf /tmp/CVU*
rm -rf /tmp/Ora*
rm -rf /home/grid/.oracle
rm -rf /usr/app/*
mv /etc/init.d/init.ohasd /etc/init.d/init.ohasd.bak