ORA-29702


Instances Abort With ORA-29702 When The Server is rebooted or shut down [ID 752399.1]  

--------------------------------------------------------------------------------
 
  Modified 23-MAR-2009     Type PROBLEM     Status PUBLISHED  

In this Document
  Symptoms
  Cause
  Solution
  References

 

--------------------------------------------------------------------------------

 

Applies to:
Oracle Server - Enterprise Edition - Version: 10.2.0.1 to 10.2.0.4
Oracle Server - Enterprise Edition - Version: 11.1.0.6 to 11.1.0.7
Linux x86-64

Symptoms
When the server is rebooted or shutdown all instances on the server abort with ORA-29702.

Last log entries in the alert.log look like:
Error: KGXGN aborts the instance (6)
Errors in file /HOME/oracle/admin/+ASM/bdump/+asm1_lmon_8981.trc:
ORA-29702: error occurred in Cluster Group Service operation
LMON: terminating instance due to error 29702
System state dump is made for local instance
System State dumped to trace file /HOME/oracle/admin/+ASM/bdump/+asm1_diag_8977.
trc
Trace dumping is performing id=[cdmp_20080929110159]


Cause
When doing a reboot or a shutdown of a server the K96init.crs is called after the operating system stops other services like network, therefore instances on the server crash due to losing the private interconnect.

Solution
Replace the K96 links with K19 links.

Please run the following steps on all nodes in the cluster:


Shutdown the clusterware.
Run this script to replace K96 links with K19 links as root.
Start up the Clusterware

Script:
RC_START=S96
RC_KILL=K19
RC_KILL_OLD=K96
RCSDIR="/etc/rc.d/rc3.d /etc/rc.d/rc5.d"
RCKDIR="/etc/rc.d/rc0.d /etc/rc.d/rc1.d /etc/rc.d/rc2.d /etc/rc.d/rc4.d
/etc/rc.d/rc6.d"
RCALLDIR="/etc/rc.d/rc0.d /etc/rc.d/rc1.d /etc/rc.d/rc2.d /etc/rc.d/rc3.d
/etc/rc.d/rc4.d /etc/rc.d/rc5.d /etc/rc.d/rc6.d"
ID=/etc/init.d
if [ -z "$RMF" ]; then RMF="/bin/rm -f"; fi
if [ -z "$LNS" ]; then LNS="/bin/ln -s"; fi
if [ -z "$ECHO" ]; then ECHO=/bin/echo; fi
# Clean up any old init.crs scripts
for rc in $RCALLDIR
do
$RMF $rc/"$RC_START"init.crs
$RMF $rc/"$RC_KILL"init.crs
$RMF $rc/"$RC_KILL_OLD"init.crs
done
# Install new ones
for rc in $RCSDIR
do
$LNS $ID/init.crs $rc/"$RC_START"init.crs || { $ECHO $?; exit 1; }
done
for rc in $RCKDIR
do
$LNS $ID/init.crs $rc/"$RC_KILL"init.crs || { $ECHO $?; exit 1; }
done

References
BUG:7326677 - WHEN NODE IS REBOOTED, RAC ASM CRASHES DURING SHUTDOWN WITH ORA-29702
BUG:7496341 - FIX FOR 4587300 DOESN'T EXIST IN 11.1.0.7
//////////////////////////////////////////////////////////////////////////////////////
One of the Instances Fails to Start After Reboot With ORA-29702 [ID 788455.1]  

--------------------------------------------------------------------------------
 
  Modified 26-JUN-2009     Type PROBLEM     Status REVIEWED  

In this Document
  Symptoms
  Cause
  Solution
  References

 

--------------------------------------------------------------------------------

 

Applies to:
Oracle Server - Enterprise Edition - Version: 10.2.0.4 to 10.2.0.4
This problem can occur on any platform.
Oracle Server Enterprise Edition - Version: 10.2.0.4 to 10.2.0.4
Symptoms
After a node is rebooted, one of the instances fails to start automatically, but can be started manually.

Database Alert log reports:
==================
Thu Feb 5 15:19:07 2009
Error: KGXGN aborts the instance (6)
Thu Feb 5 15:19:07 2009
Errors in file
/oneport/apps/oracle/admin/dbname/bdump/<instance>_lmon_14390.trc:
ORA-29702: error occurred in Cluster Group Service operation
LMON: terminating instance due to error 29702
Thu Feb 5 15:19:07 2009
System state dump is made for local instance
System State dumped to trace file
/oneport/apps/oracle/admin/dbname/bdump/<instance>_diag_14386.trc
Thu Feb 5 15:19:08 2009
Trace dumping is performing id=[cdmp_20090205151907]
Thu Feb 5 15:19:13 2009
Instance terminated by LMON, pid = 14390
Thu Feb 5 15:38:08 2009
Starting ORACLE instance (normal)

From <instance>_diag_14386.trc, we see:
===========================
*** 2009-02-05 15:19:07.218
2009-02-05 15:19:07.217: [ CSSCLNT]clsssRecvMsg: comm error received, comrc
11, con (1068dea10), msg (ffffffff7fffd9e8), msgl 144
2009-02-05 15:19:07.289: [ CSSCLNT]clssgsGGetStatus: communications failed
(0/3/1)
2009-02-05 15:19:07.289: [ CSSCLNT]clssgsGGetStatus: returning 8
kgxgnpstat: received ABORT event from CLSS
CM problem, please abort
*** 2009-02-05 15:19:07.289
Node monitor becomes unavailable for service
2009-02-05 15:19:07.497: [ CSSCLNT]clsssRecvMsg: comm error received, comrc
11, con (1068dea10), msg (ffffffff7fffd9e8), msgl 144
2009-02-05 15:19:07.498: [ CSSCLNT]clssgsGGetStatus: communications failed
(0/3/1)
2009-02-05 15:19:07.498: [ CSSCLNT]clssgsGGetStatus: returning 8
kgxgnpstat: received ABORT event from CLSS
CM problem, please abort


The crsd.log shows the database instances being started before the ASM instances.


ora.<dbname>.<instance>.inst.log shortly after the reboot, reports:


startup
ORA-1565 error in identifying +<ASM disk>/../<spfile>


Cause
The instances were missing the ASM dependency.

 

crs_stat -p ora.<dbname>.<instance>.inst  shows that REQUIRED_RESOURCE is empty. It should contain the name of the ASM resource to ensure that ASM is started before the database instance.

Solution
Add ASM dependency to instances manually:

srvctl modify instance -d <database name> -i <instance name> -s <ASM instance name>

References
NOTE:387217.1 - INSTANCE NOT STARTING AUTOMATICALLY BY CRS WHEN ASM IS USED

你遇到的错误是在启动 Oracle 实例时出现的问题,尤其是在使用 `startup nomount` 并指定 PFILE 的情况下。我们来逐一分析这些错误,并提供解决方案。 --- ### ❌ 错误信息解析: ```sql ORA-32006: BACKGROUND_DUMP_DEST initialization parameter has been deprecated ORA-32006: USER_DUMP_DEST initialization parameter has been deprecated ``` 这两个是**警告性错误(实际为提示)**,不是致命错误。从 Oracle 11g 开始,`BACKGROUND_DUMP_DEST` 和 `USER_DUMP_DEST` 已被弃用,取而代之的是统一的诊断目的地:`DIAGNOSTIC_DEST`。 > ✅ **说明**:你的 pfile 中可能仍然包含这两个旧参数,虽然它们已被忽略,但 Oracle 会报出 ORA-32006 提示你清理配置。 --- ```sql ORA-29702: error occurred in Cluster Group Service operation ``` 这是关键错误! 这个错误通常出现在你试图在 **启用了 RAC(Real Application Clusters)模式的实例中启动单实例数据库**,或者 **CSS(Cluster Synchronization Services)服务未运行**。 常见原因包括: - Oracle 被安装为 RAC 环境,但现在尝试以单实例方式启动。 - `$ORACLE_HOME/bin/crsctl` 没有正确配置或 CSS 守护进程未启动。 - 实例试图加入集群组服务(CGS),但无法连接到 CSS daemon。 --- ## ✅ 解决方案 ### 步骤 1:确认是否为 RAC 安装环境 运行以下命令检查 Oracle 是否编译为支持 RAC: ```bash $ $ORACLE_HOME/bin/crsctl check css ``` 如果返回类似 “CSS appears healthy” 或者可以正常执行,则说明 CRS 正在运行。 如果没有该命令,或提示 “cannot communicate with CRS” 或 “CSS not running”,则需要处理 CSS。 --- ### 步骤 2:禁用集群模式(适用于单实例) 如果你只是想作为单实例启动数据库,请确保初始化参数中设置: ```sql cluster_database = FALSE ``` 并在你的 PFILE (`/u01/app/oracle/pfile_backup.ora`) 中添加或修改这一行: ```ini *.cluster_database=FALSE ``` 此外,还建议添加: ```ini *.instance_type='single' ``` 这明确告诉 Oracle 这是一个单实例环境,避免尝试连接 CSS。 --- ### 步骤 3:更新废弃参数(可选但推荐) 将旧的 dump 目录参数替换为 `DIAGNOSTIC_DEST`: ```ini *.diagnostic_dest='/u01/app/oracle' ``` 并移除或注释掉: ```ini # *.background_dump_dest='/u01/app/oracle/admin/db/bdump' # *.user_dump_dest='/u01/app/oracle/admin/db/udump' # *.core_dump_dest='/u01/app/oracle/admin/db/cdump' ``` 因为这些已经被 `diagnostic_dest` 统一管理: - ADR Home 将位于:`$DIAGNOSTIC_DEST/rdbms/$DBNAME/$INSTANCE_NAME` --- ### 步骤 4:确保 CSSD 正常运行(如使用 Oracle Restart 或 RAC) 如果是独立服务器且不需要 RAC,你可以使用 `localconfig` 来配置本地 CSS: 以 **root 用户** 执行: ```bash # $ORACLE_HOME/bin/localconfig add ``` 这会创建并启动本地 CSS 守护进程(仅适用于非 RAC 单机环境)。 > ⚠️ 注意:此命令只能存在一个节点上运行,在标准单实例中常用。 --- ### 步骤 5:重新启动实例 修改完 PFILE 后,再次尝试启动: ```sql SQL> startup nomount pfile='/u01/app/oracle/pfile_backup.ora'; ``` 应不再报 ORA-29702--- ## 🛠 示例修正后的 PFILE 内容 ```ini db_name='orcl' memory_target=1G processes=150 db_block_size=8192 diagnostic_dest='/u01/app/oracle' cluster_database=FALSE instance_type='single' # 可选:显式定义其他路径 control_files=('/u01/app/oracle/oradata/orcl/control01.ctl') undo_tablespace='UNDOTBS1' ``` --- ## 🔍 总结 | 错误 | 原因 | 解决方法 | |------|------|---------| | ORA-32006 | 使用了过时的 dump 参数 | 改用 `diagnostic_dest` | | ORA-29702 | 实例尝试连接 CSS 失败 | 设置 `cluster_database=FALSE` + `instance_type='single'`,或启动本地 CSS | --- ## ✅ 推荐操作顺序 1. 编辑 PFILE,加入 `cluster_database=FALSE` 和 `instance_type='single'` 2. 替换 `background_dump_dest` 等为 `diagnostic_dest` 3. (以 root)运行 `localconfig add` 启动本地 CSS(如果尚未运行) 4. 再次尝试 `startup nomount pfile=...` ---
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值