Surprise! Oracle 11.2.0.4 Still Has a Problem This Serious
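Today I helped a customer analyze an Oracle RAC failure. The database was reported to be running on our company's zData distributed storage, yet we ruled out zData itself right away. Why?
Because three databases are deployed in that environment and only one of them had the problem. If the shared storage were at fault, the other two would most likely have been affected as well.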
This customer's database version is Oracle 11.2.0.4. As Oracle DBAs all know, it is widely regarded as the most stable release of Oracle 11g, bar none.
Let's look at the relevant logs. Node 1 was the first to report errors:
Sat Dec 02 01:21:51 2023
Errors in file u01/app/oracle/diag/rdbms/xxxx/xxxx1/trace/xxxx1_ora_241648.trc (incident=585920):
ORA-00600: internal error code, arguments: [17182], [0x7F48EF4B4F88], [], [], [], [], [], [], [], [], [], []
Incident details in: u01/app/oracle/diag/rdbms/xxxx/xxxx1/incident/incdir_585920/xxxx1_ora_241648_i585920.trc
.......
Sat Dec 02 01:24:09 2023
Block recovery from logseq 187420, block 393115 to scn 16343981745481
Recovery of Online Redo Log: Thread 1 Group 13 Seq 187420 Reading mem 0
Mem# 0: +FRADG/xxxx/onlinelog/group_13.290.1045325125
Block recovery completed at rba 187420.393158.16, scn 3805.1631184222
Block recovery from logseq 187420, block 390945 to scn 16343981745481
Recovery of Online Redo Log: Thread 1 Group 13 Seq 187420 Reading mem 0
Mem# 0: +FRADG/xxxx/onlinelog/group_13.290.1045325125
Block recovery completed at rba 187420.393158.16, scn 3805.1631184222
Errors in file u01/app/oracle/diag/rdbms/xxxx/xxxx1/trace/xxxx1_ora_241648.trc (incident=585921):
ORA-00600: internal error code, arguments: [KSMFPG2], [0x7F48EF4B4000], [], [], [], [], [], [], [], [], [], []
ORA-00600: internal error code, arguments: [17182], [0x7F48EF4B4F88], [], [], [], [], [], [], [], [], [], []
Incident details in: u01/app/oracle/diag/rdbms/xxxx/xxxx1/incident/incdir_585921/xxxx1_ora_241648_i585921.trc
LOGMINER: summary for session# = 2150937857
.......
Errors in file u01/app/oracle/diag/rdbms/xxxx/xxxx1/trace/xxxx1_ora_241648.trc:
ORA-00600: internal error code, arguments: [KSMFPG2], [0x7F48EF4B4000], [], [], [], [], [], [], [], [], [], []
ORA-00600: internal error code, arguments: [17182], [0x7F48EF4B4F88], [], [], [], [], [], [], [], [], [], []
.......
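A side note on reading the block recovery lines above: the target SCN is printed as a plain decimal (16343981745481), while the completion SCN is printed in wrap.base form (3805.1631184222). The two can be compared by converting wrap.base to a single number as wrap * 2^32 + base. The small Python sketch below does only that arithmetic; the function name is my own and is not part of any Oracle tool.

```python
# Minimal sketch: convert an SCN printed as "wrap.base" in the alert log
# into a single decimal number, so it can be compared with the target SCN.
# scn_from_wrap_base is a hypothetical helper name, not an Oracle API.

def scn_from_wrap_base(text: str) -> int:
    """Return wrap * 2**32 + base for a string such as '3805.1631184222'."""
    wrap_str, base_str = text.split(".")
    wrap, base = int(wrap_str), int(base_str)
    if not 0 <= base < 2**32:
        raise ValueError(f"base part out of 32-bit range: {base}")
    return wrap * 2**32 + base


if __name__ == "__main__":
    target_scn = 16343981745481                      # "to scn ..." in the log
    completed_scn = scn_from_wrap_base("3805.1631184222")
    print(completed_scn)                             # decimal form of 3805.1631184222
    print(completed_scn - target_scn)                # distance past the target SCN
```

With the values from this log, the completion SCN lands just past the requested target, which is consistent with the block recovery having finished normally before the ORA-00600 errors were raised again.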