概述
今天分享下RAC实例崩溃的4个常见bug,下面版本适用于oracle 版本11.2.0.1 和更高版本,主要是做一个备忘,方便以后直接找解决方式。

ORA-29770 LMHB终止实例

官方解释
1、报错:
LMON (ospid:31216) waits for event 'control file sequential read' for 88 secs.Errors in file /Oracle/base/diag/rdbms/prod/prod3/trace/prod3_lmhb_31304.trc(incident=2329):ORA-29770: global enqueue process LMON (OSID 31216) is hung for more than 70secondsLMHB (ospid: 31304) is terminating the instance.或LMON (ospid: 8594) waits for event 'control file sequential read' for 118 secs.ERROR: LMON is not healthy and has no heartbeat.ERROR: LMHB (ospid: 8614) is terminating the instance.
2、思路:
LMON 等待读取控制文件,导致LMHB 使实例崩溃
3、解决方案:
Bug 8888434 已在 11.2.0.2 及以上版本 中得到修正
Bug 11890804 已在 11.2.0.3及以上版本中得到修正
ORA-481导致的实例崩溃
1、报错:
1. PMON (ospid:12585): terminating the instance due to error 481
LMON 进程跟踪文件显示:
Begin DRM(107) (swin 0)* drm quiesce
LMS 进程跟踪文件显示:
2011-07-05 10:53:44.218905 : Start affinity expansion for pkey 81885.02011-07-05 10:53:44.498923 : Expand failed: pkey 81885.0, 229 shadowstraversed, 153 replayed 1