RAC环境替换多路径软件后cssd服务无法启动的恢复

原创

已于 2024-08-27 16:22:26 修改 · 3k 阅读

12 ·

CC 4.0 BY-SA版权

文章标签：

#oracle

于 2022-03-28 14:57:55 首次发布

系统环境：Centos6.8
数据库版本：11.2.0.4.0

由华为多路径替换为Centos自带的device-mapper-multipath后，RAC集群启动一直卡在CSSD服务，状态一直是starting。

由于11gR2中CRS服务依赖于ASM，因为ocr存放在ASM中，所以ASM若无法有效启动，这导致CRS服务也无法正常工作:

集群日志：

2022-03-26 12:08:02.469: 
[ohasd(28938)]CRS-2112:The OLR service started on node RAC01.
2022-03-26 12:08:02.486: 
[ohasd(28938)]CRS-1301:Oracle High Availability Service started on node RAC01.
2022-03-26 12:08:02.493: 
[ohasd(28938)]CRS-8017:location: /etc/oracle/lastgasp has 2 reboot advisory log files, 0 were announced and 0 errors occurred
2022-03-26 12:08:06.212: 
[/u01/app/11.2.0/grid/bin/orarootagent.bin(29065)]CRS-2302:Cannot get GPnP profile. Error CLSGPNP_NO_DAEMON (GPNPD daemon is not running). 
2022-03-26 12:08:10.405: 
[ohasd(28938)]CRS-2302:Cannot get GPnP profile. Error CLSGPNP_NO_DAEMON (GPNPD daemon is not running). 
2022-03-26 12:08:10.412: 
[gpnpd(29482)]CRS-2328:GPNPD started on node RAC01. 
2022-03-26 12:08:12.745: 
[cssd(29564)]CRS-1713:CSSD daemon is started in clustered mode
2022-03-26 12:08:14.621: 
[ohasd(28938)]CRS-2767:Resource state recovery not attempted for 'ora.diskmon' as its target state is OFFLINE
2022-03-26 12:08:14.621: 
[ohasd(28938)]CRS-2769:Unable to failover resource 'ora.diskmon'.
2022-03-26 12:08:17.884: 
[cssd(29564)]CRS-1714:Unable to discover any voting files, retrying discovery in 15 seconds; Details at (:CSSNM00070:) in /u01/app/11.2.0/grid/log/RAC01/cssd/ocssd.log
2022-03-26 12:08:32.900: 
[cssd(29564)]CRS-1714:Unable to discover any voting files, retrying discovery in 15 seconds; Details at (:CSSNM00070:) in /u01/app/11.2.0/grid/log/RAC01/cssd/ocssd.log
2022-03-26 12:08:47.916: 
[cssd(29564)]CRS-1714:Unable to discover any voting files, retrying discovery in 15 seconds; Details at (:CSSNM00070:) in /u01/app/11.2.0/grid/log/RAC01/cssd/ocssd.log
2022-03-26 12:09:02.932: 
[cssd(29564)]CRS-1714:Unable to discover any voting files, retrying discovery in 15 seconds; Details at (:CSSNM00070:) in /u01/app/11.2.0/grid/log/RAC01/cssd/ocssd.log
2022-03-26 12:09:17.949: 
[cssd(29564)]CRS-1714:Unable to discover any voting files, retrying discovery in 15 seconds; Details at (:CSSNM00070:) in /u01/app/11.2.0/grid/log/RAC01/cssd/ocssd.log
2022-03-26 12:09:32.965: 
[cssd(29564)]CRS-1714:Unable to discover any voting files, retrying discovery in 15 seconds; Details at (:CSSNM00070:) in /u01/app/11.2.0/grid/log/RAC01/cssd/ocssd.log
2022-03-26 12:09:47.981: 
[cssd(29564)]CRS-1714:Unable to discover any voting files, retrying discovery in 15 seconds; Details at (:CSSNM00070:) in /u01/app/11.2.0/grid/log/RAC01/cssd/ocssd.log
2022-03-26 12:10:02.998: 
[cssd(29564)]CRS-1714:Unable to discover any voting files, retrying discovery in 15 seconds; Details at (:CSSNM00070:) in /u01/app/11.2.0/grid/log/RAC01/cssd/ocssd.log

ocssd.log


2022-03-26 12:09:47.981: [    CLSF][3717953280]checksum failed for disk:/dev/asm-datadisk01-new:
2022-03-26 12:09:47.981: [    CLSF][3717953280]Error: obj 2147483658 blk 0 name 'check_kfbh' num1 1289612970 num2 2751807285
2022-03-26 12:09:47.981: [    CLSF][3717953280]bh: ptr 0x7f5dc8138e00 size 512
2022-03-26 12