标签云
asm恢复 bbed bootstrap$ dul In Memory kcbzib_kcrsds_1 kccpb_sanity_check_2 kfed MySQL恢复 ORA-00312 ORA-00607 ORA-00704 ORA-01110 ORA-01555 ORA-01578 ORA-08103 ORA-600 2131 ORA-600 2662 ORA-600 2663 ORA-600 3020 ORA-600 4000 ORA-600 4137 ORA-600 4193 ORA-600 4194 ORA-600 16703 ORA-600 kcbzib_kcrsds_1 ORA-600 KCLCHKBLK_4 ORA-15042 ORA-15196 ORACLE 12C oracle dul ORACLE PATCH Oracle Recovery Tools oracle加密恢复 oracle勒索 oracle勒索恢复 oracle异常恢复 Oracle 恢复 ORACLE恢复 ORACLE数据库恢复 oracle 比特币 OSD-04016 YOUR FILES ARE ENCRYPTED 勒索恢复 比特币加密文章分类
- Others (2)
- 中间件 (2)
- WebLogic (2)
- 操作系统 (102)
- 数据库 (1,671)
- DB2 (22)
- MySQL (73)
- Oracle (1,533)
- Data Guard (52)
- EXADATA (8)
- GoldenGate (21)
- ORA-xxxxx (159)
- ORACLE 12C (72)
- ORACLE 18C (6)
- ORACLE 19C (14)
- ORACLE 21C (3)
- Oracle 23ai (7)
- Oracle ASM (65)
- Oracle Bug (8)
- Oracle RAC (52)
- Oracle 安全 (6)
- Oracle 开发 (28)
- Oracle 监听 (28)
- Oracle备份恢复 (560)
- Oracle安装升级 (92)
- Oracle性能优化 (62)
- 专题索引 (5)
- 勒索恢复 (78)
- PostgreSQL (18)
- PostgreSQL恢复 (6)
- SQL Server (27)
- SQL Server恢复 (8)
- TimesTen (7)
- 达梦数据库 (2)
- 生活娱乐 (2)
- 至理名言 (11)
- 虚拟化 (2)
- VMware (2)
- 软件开发 (37)
- Asp.Net (9)
- JavaScript (12)
- PHP (2)
- 小工具 (20)
-
最近发表
- Kylin Linux 安装19c
- ORA-600 krse_arc_complete.4
- Oracle 19c 202410补丁(RUs+OJVM)
- ntfs MFT损坏(ntfs文件系统故障)导致oracle异常恢复
- .mkp扩展名oracle数据文件加密恢复
- 清空redo,导致ORA-27048: skgfifi: file header information is invalid
- A_H_README_TO_RECOVER勒索恢复
- 通过alert日志分析客户自行对一个数据库恢复的来龙去脉和点评
- ORA-12514: TNS: 监听进程不能解析在连接描述符中给出的SERVICE_NAME
- ORA-01092 ORA-00604 ORA-01558故障处理
- ORA-65088: database open should be retried
- Oracle 19c异常恢复—ORA-01209/ORA-65088
- ORA-600 16703故障再现
- 数据库启动报ORA-27102 OSD-00026 O/S-Error: (OS 1455)
- .[metro777@cock.li].Elbie勒索病毒加密数据库恢复
- 应用连接错误,初始化mysql数据库恢复
- RAC默认服务配置优先节点
- Oracle 19c RAC 替换私网操作
- 监听报TNS-12541 TNS-12560 TNS-00511错误
- drop tablespace xxx including contents恢复
分类目录归档:Oracle
ORA-27154 ORA-27300 ORA-27301 ORA-27302故障处理
根据经验对系统的内核参数做了一些调整,结果导致数据库启动失败提示报ORA-27154 ORA-27300 ORA-27301 ORA-27302错误
ORA-27154: post/wait create failed ORA-27300: OS system dependent operation:semget failed with status: 28 ORA-27301: OS failure message: No space left on device ORA-27302: failure occurred at: sskgpcreates
根据官方描述:Database Startup Fails with ORA-27300: OS system dependent operation:semget failed with status: 28 (Doc ID 949468.1),出现该问题原因可能是由于kernel.sem参数配置不合适当时,该库的processes配置为:20000,kernel.sem参数配置为:kernel.sem = 250 32000 100 128,参数说明:
kernel.sem = SEMMSL SEMMNS SEMOPM SEMMNI SEMMSL - max semaphores per array SEMMNS - max semaphores system wide SEMOPM - max ops per semop call SEMMNI - max number of arrays
理论上每个这样的配置最大值SEMMSL*SEMMNI=32000大于process的20000的设置,可是实际上控制每个信号集的信号数量没有达到250,而是只有156,通过ipcs命令可以看
[oracle@xifenfei ~]$ ipcs ------ Shared Memory Segments -------- key shmid owner perms bytes nattch status 0x00000000 32768 oracle 640 33554432 30 0x00000000 65537 oracle 640 4261412864 30 0xc2d167d0 98306 oracle 640 2097152 30 0x00000072 131075 root 444 1 1 ------ Semaphore Arrays -------- key semid owner perms nsems 0x450e15bd 0 root 666 1 0x0000cace 32769 root 666 1 0x358b172c 327683 oracle 660 104 0x9053d038 11075588 oracle 660 156 0x9053d039 11108357 oracle 660 156 0x9053d03a 11141126 oracle 660 156 0x9053d03b 11173895 oracle 660 156
从而使得SEMMSL*SEMMNI小于processes值,进而数据库启动报ORA-27154 ORA-27300 ORA-27301 ORA-27302,修改kernel.sem = 250 64000 128 256,数据库启动成功
awr创建snapshot等待library cache: mutex X
客户一个11.2.0.4的库,在准备收集awr的时候发现没有snap id
人工创建snapshot发现hang住了
查询该会话等待事件为:library cache: mutex X,查看以前mmon的子进程m000/1的trace信息
Trace file /u01/app/oracle/diag/rdbms/xff/xff/trace/xff_m000_6241.trc Oracle Database 11g Enterprise Edition Release 11.2.0.4.0 - 64bit Production With the Partitioning, OLAP, Data Mining and Real Application Testing options ORACLE_HOME = /u01/app/oracle/product/11.2.0/db_1 System name: Linux Node name: HIS Release: 5.4.17-2102.201.3.el7uek.x86_64 Version: #2 SMP Fri Apr 23 09:05:55 PDT 2021 Machine: x86_64 VM name: VMWare Version: 6 Instance name: xff Redo thread mounted by this instance: 1 Oracle process number: 5714 Unix process pid: 6241, image: oracle@HIS (M000) *** 2024-06-19 11:44:39.483 *** SESSION ID:(8709.38013) 2024-06-19 11:44:39.483 *** CLIENT ID:() 2024-06-19 11:44:39.483 *** SERVICE NAME:(SYS$BACKGROUND) 2024-06-19 11:44:39.483 *** MODULE NAME:(MMON_SLAVE) 2024-06-19 11:44:39.483 *** ACTION NAME:(Auto-Flush Slave Action) 2024-06-19 11:44:39.483 DDE rules only execution for: ORA 12751 ----- START Event Driven Actions Dump ---- ---- END Event Driven Actions Dump ---- ----- START DDE Actions Dump ----- Executing SYNC actions Executing ASYNC actions ----- START DDE Action: 'ORA_12751_DUMP' (Sync) ----- Runtime exceeded 900 seconds Time limit violation detected at: ksedsts()+465<-kspol_12751_dump()+145<-dbgdaExecuteAction()+1065<-dbgerRunAction()+109<-dbgerRunActions() +4134<-dbgexPhaseII()+1873<-dbgexProcessError()+2680<-dbgeExecuteForError()+88<-dbgePostErrorKGE()+2136<- dbkePostKGE_kgsf()+71<-kgeselv()+276<-kgesecl0()+139<-kgxWait()+1412<-kgxExclusive()+447<- kglGetMutex()+140<-kglGetHandleReference()+69<-kglic0()+319<-kksIterCursorStat()+330<-kewrrtsq_rank_topsql() +240<-kewrbtsq_build_topsql()+128<-kewrftsq_flush_topsql()+679<-kewrft_flush_table()+397<- kewrftec_flush_table_ehdlcx()+766<-kewrfat_flush_all_tables()+1406<-kewrfos_flush_onesnap()+170 <-kewrfsc_flush_snapshot_c()+623<-kewrafs_auto_flush_slave()+769<-kebm_slave_main()+586<-ksvrdp()+1766 <-opirip()+674<-opidrv()+603<-sou2o()+103<-opimai_real()+250<-ssthrdmain()+265<-main()+201 <-__libc_start_main()+245 Current Wait Stack: 0: waiting for 'library cache: mutex X' idn=0x644e2de0, value=0xf3a00000000, where=0x7c wait_id=1189 seq_num=1190 snap_id=1 wait times: snap=3 min 0 sec, exc=3 min 0 sec, total=3 min 0 sec wait times: max=infinite, heur=15 min 3 sec wait counts: calls=16376 os=16376 in_wait=1 iflags=0x15b2 There is at least one session blocking this session. Dumping 1 direct blocker(s): inst: 1, sid: 3898, ser: 47299 Dumping final blocker: inst: 1, sid: 3898, ser: 47299 Wait State: fixed_waits=0 flags=0x22 boundary=(nil)/-1 Session Wait History: elapsed time of 0.000016 sec since current wait 0: waited for 'library cache: mutex X' idn=0x644e2de0, value=0xf3a00000000, where=0x7c wait_id=1188 seq_num=1189 snap_id=1 wait times: snap=12 min 2 sec, exc=12 min 2 sec, total=12 min 2 sec wait times: max=infinite wait counts: calls=65535 os=65535 occurred after 0.327543 sec of elapsed time 1: waited for 'db file sequential read' file#=0x2, block#=0x1a5b, blocks=0x1 wait_id=1187 seq_num=1188 snap_id=1 wait times: snap=0.000420 sec, exc=0.000420 sec, total=0.000420 sec wait times: max=infinite wait counts: calls=0 os=0 occurred after 0.000251 sec of elapsed time 2: waited for 'db file sequential read' file#=0x1, block#=0x82e6, blocks=0x1 wait_id=1186 seq_num=1187 snap_id=1 wait times: snap=0.000429 sec, exc=0.000429 sec, total=0.000429 sec wait times: max=infinite wait counts: calls=0 os=0 occurred after 0.001085 sec of elapsed time 3: waited for 'db file sequential read' file#=0x2, block#=0x11344, blocks=0x1 wait_id=1185 seq_num=1186 snap_id=1 wait times: snap=0.000356 sec, exc=0.000356 sec, total=0.000356 sec wait times: max=infinite wait counts: calls=0 os=0 occurred after 0.000008 sec of elapsed time 4: waited for 'db file sequential read' file#=0x2, block#=0x19eb, blocks=0x1 wait_id=1184 seq_num=1185 snap_id=1 wait times: snap=0.000397 sec, exc=0.000397 sec, total=0.000397 sec wait times: max=infinite wait counts: calls=0 os=0 occurred after 0.000044 sec of elapsed time 5: waited for 'db file sequential read' file#=0x2, block#=0xb1659, blocks=0x1 wait_id=1183 seq_num=1184 snap_id=1 wait times: snap=0.000003 sec, exc=0.000003 sec, total=0.000003 sec wait times: max=infinite wait counts: calls=0 os=0 occurred after 0.000010 sec of elapsed time 6: waited for 'db file sequential read' file#=0x2, block#=0xb1658, blocks=0x1 wait_id=1182 seq_num=1183 snap_id=1 wait times: snap=0.000453 sec, exc=0.000453 sec, total=0.000453 sec wait times: max=infinite wait counts: calls=0 os=0 occurred after 0.000009 sec of elapsed time 7: waited for 'db file sequential read' file#=0x2, block#=0x19e1, blocks=0x1 wait_id=1181 seq_num=1182 snap_id=1 wait times: snap=0.000388 sec, exc=0.000388 sec, total=0.000388 sec wait times: max=infinite wait counts: calls=0 os=0 occurred after 0.000017 sec of elapsed time 8: waited for 'db file sequential read' file#=0x2, block#=0x19e2, blocks=0x1 wait_id=1180 seq_num=1181 snap_id=1 wait times: snap=0.000415 sec, exc=0.000415 sec, total=0.000415 sec wait times: max=infinite wait counts: calls=0 os=0 occurred after 0.004826 sec of elapsed time 9: waited for 'db file sequential read' file#=0x2, block#=0x2ffc0c, blocks=0x1 wait_id=1179 seq_num=1180 snap_id=1 wait times: snap=0.000404 sec, exc=0.000404 sec, total=0.000404 sec wait times: max=infinite wait counts: calls=0 os=0 occurred after 0.000007 sec of elapsed time Sampled Session History of session 8709 serial 38013 --------------------------------------------------- The sampled session history is constructed by sampling the target session every 1 second. The sampling process captures at each sample if the session is in a non-idle wait, an idle wait, or not in a wait. If the session is in a non-idle wait then one interval is shown for all the samples the session was in the same non-idle wait. If the session is in an idle wait or not in a wait for consecutive samples then one interval is shown for all the consecutive samples. Though we display these consecutive samples in a single interval the session may NOT be continuously idle or not in a wait (the sampling process does not know). The history is displayed in reverse chronological order. sample interval: 1 sec, max history 120 sec --------------------------------------------------- [118 samples, 11:42:39 - 11:44:39] waited for 'library cache: mutex X', seq_num: 1190 p1: 'idn'=0x644e2de0 p2: 'value'=0xf3a00000000 p3: 'where'=0x7c time_waited: >= 120 sec (still in wait) [3 samples, 11:42:39 - 11:42:38] idle wait at each sample --------------------------------------------------- Sampled Session History Summary: longest_non_idle_wait: 'library cache: mutex X' [118 samples, 11:42:39 - 11:44:39] time_waited: >= 120 sec (still in wait) --------------------------------------------------- ----- END DDE Action: 'ORA_12751_DUMP' (SUCCESS, 8 csec) ----- ----- END DDE Actions Dump (total 8 csec) ----- KGX cleanup... KGX Atomic Operation Log 0x1de44da670 Mutex 0x1d113cf7c8(8709, 0) idn 2de0 oper EXCL(6) Library Cache uid 8709 efd 7 whr 49 slp 0 oper=0 pt1=(nil) pt2=(nil) pt3=(nil) pt4=(nil) pt5=(nil) ub4=0 KGX cleanup... KGX Atomic Operation Log 0x1de44da6c8 Mutex 0x1de9468550(3898, 0) idn 644e2de0 oper GET_EXCL(5) Library Cache uid 8709 efd 7 whr 124 slp 16376 oper=0 pt1=0x1de9468410 pt2=(nil) pt3=(nil) pt4=(nil) pt5=(nil) ub4=0 *** KEWRAFM1: Error=12751 encountered by kewrfteh *** KEWRAFS: Error=12751 encountered by Auto Flush Slave. KEBM: MMON slave action policy violation. kewrmafsa_; viol=1; err=12751
一般来说类似这样的系统自动任务被阻塞很可能是由于那种bug导致,找到相关mos文档: library cache: mutex x waits during AWR Flush High Cursor Scan (Doc ID 2382741.1),确认为:Bug 19294556 AWR Flush Waiting For Cursor Scan, Library Cache : Mutex X,目前没有好的workaround,而且在11.2.0.4基础版上没有对应的patch
exadata换flash卡的一些操作
客户有一套oracle exadata x3-2的1/4配置(采用高容量磁盘)的机器,反馈由于flash卡异常导致性能很慢,通过临时关闭异常卡所在机器业务恢复正常
相关版本信息
[root@oa0cel03 ~]# imageinfo Kernel version: 2.6.32-400.11.1.el5uek #1 SMP Thu Nov 22 03:29:09 PST 2012 x86_64 Cell version: OSS_11.2.3.2.1_LINUX.X64_130109 Cell rpm version: cell-11.2.3.2.1_LINUX.X64_130109-1 Active image version: 11.2.3.2.1.130109 Active image activated: 2013-06-27 02:24:19 -0700 Active image status: success Active system partition on device: /dev/md6 Active software partition on device: /dev/md8 In partition rollback: Impossible Cell boot usb partition: /dev/sdm1 Cell boot usb version: 11.2.3.2.1.130109 Inactive image version: 11.2.3.2.0.120713 Inactive image activated: 2012-10-14 06:46:16 -0700 Inactive image status: success Inactive system partition on device: /dev/md5 Inactive software partition on device: /dev/md7 [root@oa0cel03 ~]# cellcli CellCLI: Release 11.2.3.2.1 - Production on Thu Jun 20 18:28:37 CST 2024 Copyright (c) 2007, 2012, Oracle. All rights reserved. Cell Efficiency Ratio: 3,617 CellCLI> list cell detail name: oa0cel03 bbuTempThreshold: 60 bbuChargeThreshold: 800 bmcType: IPMI cellVersion: OSS_11.2.3.2.1_LINUX.X64_130109 cpuCount: 24 diagHistoryDays: 7 fanCount: 8/8 fanStatus: normal flashCacheMode: WriteBack id: 1238FM507A interconnectCount: 3 interconnect1: bondib0 iormBoost: 0.0 ipaddress1: 192.168.10.5/22 kernelVersion: 2.6.32-400.11.1.el5uek locatorLEDStatus: off makeModel: Oracle Corporation SUN FIRE X4270 M3 SAS metricHistoryDays: 7 notificationMethod: snmp notificationPolicy: critical,warning,clear offloadEfficiency: 3,616.5 powerCount: 2/2 powerStatus: normal releaseVersion: 11.2.3.2.1 releaseTrackingBug: 14522699 snmpSubscriber: host=oa0db02.qhsrmyy.com,port=3872,community=cell host=oa0db01.qhsrmyy.com,port=3872,community=cell status: online temperatureReading: 28.0 temperatureStatus: normal upTime: 0 days, 3:49 cellsrvStatus: running msStatus: running rsStatus: running
客户第一次换盘之后,依旧有性能问题,先把griddisk给inactive
[root@oa0cel03 ~]# cellcli -e list metriccurrent attributes name,metricvalue where name like \'FC_BY_DIRTY.*\' FC_BY_DIRTY 38,820 MB [root@oa0cel03 ~]# cellcli -e "alter flashcache all flush" Flash cache on FD_00_oa0cel03 successfully altered Flash cache on FD_01_oa0cel03 successfully altered Flash cache on FD_02_oa0cel03 successfully altered Flash cache on FD_03_oa0cel03 successfully altered Flash cache on FD_04_oa0cel03 successfully altered Flash cache on FD_05_oa0cel03 successfully altered Flash cache on FD_06_oa0cel03 successfully altered Flash cache on FD_07_oa0cel03 successfully altered Flash cache on FD_09_exastlx01 successfully altered Flash cache on FD_10_exastlx01 successfully altered Flash cache on FD_11_oa0cel03 skipped because FD_11_oa0cel03 is degraded Flash cache on FD_12_oa0cel03 successfully altered Flash cache on FD_13_oa0cel03 successfully altered Flash cache on FD_14_oa0cel03 successfully altered Flash cache on FD_15_oa0cel03 successfully altered [root@oa0cel03 ~]# cellcli -e list metriccurrent attributes name,metricvalue where name like \'FC_BY_DIRTY.*\' FC_BY_DIRTY 0.000 MB [root@oa0cel03 ~]# cellcli -e "alter griddisk all inactive" GridDisk DATA_oa0_CD_00_oa0cel03 successfully altered GridDisk DATA_oa0_CD_01_oa0cel03 successfully altered GridDisk DATA_oa0_CD_02_oa0cel03 successfully altered GridDisk DATA_oa0_CD_03_oa0cel03 successfully altered GridDisk DATA_oa0_CD_04_oa0cel03 successfully altered GridDisk DATA_oa0_CD_05_oa0cel03 successfully altered GridDisk DATA_oa0_CD_06_oa0cel03 successfully altered GridDisk DATA_oa0_CD_07_oa0cel03 successfully altered GridDisk DATA_oa0_CD_08_oa0cel03 successfully altered GridDisk DATA_oa0_CD_09_oa0cel03 successfully altered GridDisk DATA_oa0_CD_10_oa0cel03 successfully altered GridDisk DATA_oa0_CD_11_oa0cel03 successfully altered GridDisk DBFS_DG_CD_02_oa0cel03 successfully altered GridDisk DBFS_DG_CD_03_oa0cel03 successfully altered GridDisk DBFS_DG_CD_04_oa0cel03 successfully altered GridDisk DBFS_DG_CD_05_oa0cel03 successfully altered GridDisk DBFS_DG_CD_06_oa0cel03 successfully altered GridDisk DBFS_DG_CD_07_oa0cel03 successfully altered GridDisk DBFS_DG_CD_08_oa0cel03 successfully altered GridDisk DBFS_DG_CD_09_oa0cel03 successfully altered GridDisk DBFS_DG_CD_10_oa0cel03 successfully altered GridDisk DBFS_DG_CD_11_oa0cel03 successfully altered GridDisk RECO_oa0_CD_00_oa0cel03 successfully altered GridDisk RECO_oa0_CD_01_oa0cel03 successfully altered GridDisk RECO_oa0_CD_02_oa0cel03 successfully altered GridDisk RECO_oa0_CD_03_oa0cel03 successfully altered GridDisk RECO_oa0_CD_04_oa0cel03 successfully altered GridDisk RECO_oa0_CD_05_oa0cel03 successfully altered GridDisk RECO_oa0_CD_06_oa0cel03 successfully altered GridDisk RECO_oa0_CD_07_oa0cel03 successfully altered GridDisk RECO_oa0_CD_08_oa0cel03 successfully altered GridDisk RECO_oa0_CD_09_oa0cel03 successfully altered GridDisk RECO_oa0_CD_10_oa0cel03 successfully altered GridDisk RECO_oa0_CD_11_oa0cel03 successfully altered [root@oa0cel03 ~]# cellcli -e list griddisk DATA_oa0_CD_00_oa0cel03 inactive DATA_oa0_CD_01_oa0cel03 inactive DATA_oa0_CD_02_oa0cel03 inactive DATA_oa0_CD_03_oa0cel03 inactive DATA_oa0_CD_04_oa0cel03 inactive DATA_oa0_CD_05_oa0cel03 inactive DATA_oa0_CD_06_oa0cel03 inactive DATA_oa0_CD_07_oa0cel03 inactive DATA_oa0_CD_08_oa0cel03 inactive DATA_oa0_CD_09_oa0cel03 inactive DATA_oa0_CD_10_oa0cel03 inactive DATA_oa0_CD_11_oa0cel03 inactive DBFS_DG_CD_02_oa0cel03 inactive DBFS_DG_CD_03_oa0cel03 inactive DBFS_DG_CD_04_oa0cel03 inactive DBFS_DG_CD_05_oa0cel03 inactive DBFS_DG_CD_06_oa0cel03 inactive DBFS_DG_CD_07_oa0cel03 inactive DBFS_DG_CD_08_oa0cel03 inactive DBFS_DG_CD_09_oa0cel03 inactive DBFS_DG_CD_10_oa0cel03 inactive DBFS_DG_CD_11_oa0cel03 inactive RECO_oa0_CD_00_oa0cel03 inactive RECO_oa0_CD_01_oa0cel03 inactive RECO_oa0_CD_02_oa0cel03 inactive RECO_oa0_CD_03_oa0cel03 inactive RECO_oa0_CD_04_oa0cel03 inactive RECO_oa0_CD_05_oa0cel03 inactive RECO_oa0_CD_06_oa0cel03 inactive RECO_oa0_CD_07_oa0cel03 inactive RECO_oa0_CD_08_oa0cel03 inactive RECO_oa0_CD_09_oa0cel03 inactive RECO_oa0_CD_10_oa0cel03 inactive RECO_oa0_CD_11_oa0cel03 inactive [root@oa0cel03 ~]# cellcli -e list griddisk attributes name,asmmodestatus,asmdeactivationoutcome DATA_oa0_CD_00_oa0cel03 OFFLINE Yes DATA_oa0_CD_01_oa0cel03 OFFLINE Yes DATA_oa0_CD_02_oa0cel03 OFFLINE Yes DATA_oa0_CD_03_oa0cel03 OFFLINE Yes DATA_oa0_CD_04_oa0cel03 OFFLINE Yes DATA_oa0_CD_05_oa0cel03 OFFLINE Yes DATA_oa0_CD_06_oa0cel03 OFFLINE Yes DATA_oa0_CD_07_oa0cel03 OFFLINE Yes DATA_oa0_CD_08_oa0cel03 OFFLINE Yes DATA_oa0_CD_09_oa0cel03 OFFLINE Yes DATA_oa0_CD_10_oa0cel03 OFFLINE Yes DATA_oa0_CD_11_oa0cel03 OFFLINE Yes DBFS_DG_CD_02_oa0cel03 OFFLINE Yes DBFS_DG_CD_03_oa0cel03 OFFLINE Yes DBFS_DG_CD_04_oa0cel03 OFFLINE Yes DBFS_DG_CD_05_oa0cel03 OFFLINE Yes DBFS_DG_CD_06_oa0cel03 OFFLINE Yes DBFS_DG_CD_07_oa0cel03 OFFLINE Yes DBFS_DG_CD_08_oa0cel03 OFFLINE Yes DBFS_DG_CD_09_oa0cel03 OFFLINE Yes DBFS_DG_CD_10_oa0cel03 OFFLINE Yes DBFS_DG_CD_11_oa0cel03 OFFLINE Yes RECO_oa0_CD_00_oa0cel03 OFFLINE Yes RECO_oa0_CD_01_oa0cel03 OFFLINE Yes RECO_oa0_CD_02_oa0cel03 OFFLINE Yes RECO_oa0_CD_03_oa0cel03 OFFLINE Yes RECO_oa0_CD_04_oa0cel03 OFFLINE Yes RECO_oa0_CD_05_oa0cel03 OFFLINE Yes RECO_oa0_CD_06_oa0cel03 OFFLINE Yes RECO_oa0_CD_07_oa0cel03 OFFLINE Yes RECO_oa0_CD_08_oa0cel03 OFFLINE Yes RECO_oa0_CD_09_oa0cel03 OFFLINE Yes RECO_oa0_CD_10_oa0cel03 OFFLINE Yes RECO_oa0_CD_11_oa0cel03 OFFLINE Yes
客户继续换卡尝试,最终确认4号卡槽损坏,放弃这个槽位重建flashcache
[root@oa0cel03 ~]# cellcli -e list celldisk CD_00_oa0cel03 normal CD_01_oa0cel03 normal CD_02_oa0cel03 normal CD_03_oa0cel03 normal CD_04_oa0cel03 normal CD_05_oa0cel03 normal CD_06_oa0cel03 normal CD_07_oa0cel03 normal CD_08_oa0cel03 normal CD_09_oa0cel03 normal CD_10_oa0cel03 normal CD_11_oa0cel03 normal FD_00_oa0cel03 not present FD_01_oa0cel03 not present FD_02_oa0cel03 not present FD_03_oa0cel03 not present FD_04_oa0cel03 normal FD_05_oa0cel03 normal FD_06_oa0cel03 normal FD_07_oa0cel03 normal FD_08_oa0cel03 normal FD_09_oa0cel03 normal FD_10_oa0cel03 normal FD_10_exastlx01 normal FD_12_oa0cel03 normal FD_13_oa0cel03 normal FD_14_oa0cel03 normal FD_15_oa0cel03 normal
这个里面FD_10_exastlx01名字是以前老的卡上面留下来的,太影响视觉感官了,删除重建
[root@oa0cel03 ~]# cellcli -e drop celldisk FD_10_exastlx01 CellDisk FD_10_exastlx01 successfully dropped [root@oa0cel03 ~]# cellcli -e create celldisk all flashdisk CellDisk FD_11_oa0cel03 successfully created [root@oa0cel03 ~]# cellcli -e list celldisk CD_00_oa0cel03 normal CD_01_oa0cel03 normal CD_02_oa0cel03 normal CD_03_oa0cel03 normal CD_04_oa0cel03 normal CD_05_oa0cel03 normal CD_06_oa0cel03 normal CD_07_oa0cel03 normal CD_08_oa0cel03 normal CD_09_oa0cel03 normal CD_10_oa0cel03 normal CD_11_oa0cel03 normal FD_00_oa0cel03 not present FD_01_oa0cel03 not present FD_02_oa0cel03 not present FD_03_oa0cel03 not present FD_04_oa0cel03 normal FD_05_oa0cel03 normal FD_06_oa0cel03 normal FD_07_oa0cel03 normal FD_08_oa0cel03 normal FD_09_oa0cel03 normal FD_10_oa0cel03 normal FD_11_oa0cel03 normal FD_12_oa0cel03 normal FD_13_oa0cel03 normal FD_14_oa0cel03 normal FD_15_oa0cel03 normal
删除flashlog和flashcache
[root@oa0cel03 ~]# cellcli -e drop flashlog Flash log oa0cel03_FLASHLOG successfully dropped [root@oa0cel03 ~]# [root@oa0cel03 ~]# [root@oa0cel03 ~]# [root@oa0cel03 ~]# cellcli -e drop flashcache Flash cache oa0cel03_FLASHCACHE successfully dropped
尝试重建flashlog和flashcache
[root@oa0cel03 ~]# cellcli -e create flashlog all size=512M Flash log oa0cel03_FLASHLOG successfully created, but the following cell disks were degraded because their statuses are not normal: FD_00_oa0cel03, FD_01_oa0cel03, FD_02_oa0cel03, FD_03_oa0cel03
由于有一些celldisk实际硬盘不存在,无法直接创建成功,需要删除对应的celldisk
[root@oa0cel03 ~]# cellcli -e drop celldisk FD_00_oa0cel03 CELL-04519: Cannot complete the drop of cell disk: FD_00_oa0cel03. Received error: CELL-04516: LUN Object cannot be obtained for cell disk: FD_00_oa0cel03 Cell disks not dropped: FD_00_oa0cel03 --强制删除 [root@oa0cel03 ~]# cellcli -e drop celldisk FD_00_oa0cel03 force CellDisk FD_00_oa0cel03 successfully dropped [root@oa0cel03 ~]# cellcli -e drop celldisk FD_01_oa0cel03 force CellDisk FD_01_oa0cel03 successfully dropped [root@oa0cel03 ~]# cellcli -e drop celldisk FD_02_oa0cel03 force CellDisk FD_02_oa0cel03 successfully dropped [root@oa0cel03 ~]# cellcli -e drop celldisk FD_03_oa0cel03 force CellDisk FD_03_oa0cel03 successfully dropped [root@oa0cel03 ~]# cellcli -e list celldisk CD_00_oa0cel03 normal CD_01_oa0cel03 normal CD_02_oa0cel03 normal CD_03_oa0cel03 normal CD_04_oa0cel03 normal CD_05_oa0cel03 normal CD_06_oa0cel03 normal CD_07_oa0cel03 normal CD_08_oa0cel03 normal CD_09_oa0cel03 normal CD_10_oa0cel03 normal CD_11_oa0cel03 normal FD_04_oa0cel03 normal FD_05_oa0cel03 normal FD_06_oa0cel03 normal FD_07_oa0cel03 normal FD_08_oa0cel03 normal FD_09_oa0cel03 normal FD_10_oa0cel03 normal FD_11_oa0cel03 normal FD_12_oa0cel03 normal FD_13_oa0cel03 normal FD_14_oa0cel03 normal FD_15_oa0cel03 normal
创建flashlog和flashcache
[root@oa0cel03 ~]# cellcli -e create flashlog all size=512M Flash log oa0cel03_FLASHLOG successfully created [root@oa0cel03 ~]# cellcli -e list flashlog detail name: oa0cel03_FLASHLOG cellDisk: ………… creationTime: 2024-06-21T18:20:51+08:00 degradedCelldisks: effectiveSize: 384M efficiency: 100.0 id: f3ab3882-fa03-4f49-b0ca-879ef3f2ac05 size: 384M status: normal [root@oa0cel03 ~]# cellcli -e create flashcache all Flash cache oa0cel03_FLASHCACHE successfully created [root@oa0cel03 ~]# cellcli -e list flashcache detail name: oa0cel03_FLASHCACHE cellDisk: ………… creationTime: 2024-06-21T18:21:24+08:00 degradedCelldisks: effectiveCacheSize: 1116.5625G id: 2195ac46-3021-461f-a6d5-5f64ff1da546 size: 1116.5625G status: normal [root@oa0cel03 ~]# cellcli -e list cell detail | grep flashCacheMode flashCacheMode: WriteBack
active griddisk,把这个cell的griddisk加入到asm磁盘组中
[root@oa0cel03 ~]# cellcli -e "alter griddisk all active" GridDisk DATA_oa0_CD_00_oa0cel03 successfully altered GridDisk DATA_oa0_CD_01_oa0cel03 successfully altered GridDisk DATA_oa0_CD_02_oa0cel03 successfully altered GridDisk DATA_oa0_CD_03_oa0cel03 successfully altered GridDisk DATA_oa0_CD_04_oa0cel03 successfully altered GridDisk DATA_oa0_CD_05_oa0cel03 successfully altered GridDisk DATA_oa0_CD_06_oa0cel03 successfully altered GridDisk DATA_oa0_CD_07_oa0cel03 successfully altered GridDisk DATA_oa0_CD_08_oa0cel03 successfully altered GridDisk DATA_oa0_CD_09_oa0cel03 successfully altered GridDisk DATA_oa0_CD_10_oa0cel03 successfully altered GridDisk DATA_oa0_CD_11_oa0cel03 successfully altered GridDisk DBFS_DG_CD_02_oa0cel03 successfully altered GridDisk DBFS_DG_CD_03_oa0cel03 successfully altered GridDisk DBFS_DG_CD_04_oa0cel03 successfully altered GridDisk DBFS_DG_CD_05_oa0cel03 successfully altered GridDisk DBFS_DG_CD_06_oa0cel03 successfully altered GridDisk DBFS_DG_CD_07_oa0cel03 successfully altered GridDisk DBFS_DG_CD_08_oa0cel03 successfully altered GridDisk DBFS_DG_CD_09_oa0cel03 successfully altered GridDisk DBFS_DG_CD_10_oa0cel03 successfully altered GridDisk DBFS_DG_CD_11_oa0cel03 successfully altered GridDisk RECO_oa0_CD_00_oa0cel03 successfully altered GridDisk RECO_oa0_CD_01_oa0cel03 successfully altered GridDisk RECO_oa0_CD_02_oa0cel03 successfully altered GridDisk RECO_oa0_CD_03_oa0cel03 successfully altered GridDisk RECO_oa0_CD_04_oa0cel03 successfully altered GridDisk RECO_oa0_CD_05_oa0cel03 successfully altered GridDisk RECO_oa0_CD_06_oa0cel03 successfully altered GridDisk RECO_oa0_CD_07_oa0cel03 successfully altered GridDisk RECO_oa0_CD_08_oa0cel03 successfully altered GridDisk RECO_oa0_CD_09_oa0cel03 successfully altered GridDisk RECO_oa0_CD_10_oa0cel03 successfully altered GridDisk RECO_oa0_CD_11_oa0cel03 successfully altered [root@oa0cel03 ~]# cellcli -e list griddisk attributes name,asmmodestatus,asmdeactivationoutcome,status DATA_oa0_CD_00_oa0cel03 SYNCING Yes active DATA_oa0_CD_01_oa0cel03 SYNCING Yes active DATA_oa0_CD_02_oa0cel03 SYNCING Yes active DATA_oa0_CD_03_oa0cel03 SYNCING Yes active DATA_oa0_CD_04_oa0cel03 SYNCING Yes active DATA_oa0_CD_05_oa0cel03 SYNCING Yes active DATA_oa0_CD_06_oa0cel03 SYNCING Yes active DATA_oa0_CD_07_oa0cel03 SYNCING Yes active DATA_oa0_CD_08_oa0cel03 SYNCING Yes active DATA_oa0_CD_09_oa0cel03 SYNCING Yes active DATA_oa0_CD_10_oa0cel03 SYNCING Yes active DATA_oa0_CD_11_oa0cel03 SYNCING Yes active DBFS_DG_CD_02_oa0cel03 ONLINE Yes active DBFS_DG_CD_03_oa0cel03 ONLINE Yes active DBFS_DG_CD_04_oa0cel03 ONLINE Yes active DBFS_DG_CD_05_oa0cel03 ONLINE Yes active DBFS_DG_CD_06_oa0cel03 ONLINE Yes active DBFS_DG_CD_07_oa0cel03 ONLINE Yes active DBFS_DG_CD_08_oa0cel03 ONLINE Yes active DBFS_DG_CD_09_oa0cel03 ONLINE Yes active DBFS_DG_CD_10_oa0cel03 ONLINE Yes active DBFS_DG_CD_11_oa0cel03 ONLINE Yes active RECO_oa0_CD_00_oa0cel03 SYNCING Yes active RECO_oa0_CD_01_oa0cel03 SYNCING Yes active RECO_oa0_CD_02_oa0cel03 SYNCING Yes active RECO_oa0_CD_03_oa0cel03 SYNCING Yes active RECO_oa0_CD_04_oa0cel03 SYNCING Yes active RECO_oa0_CD_05_oa0cel03 SYNCING Yes active RECO_oa0_CD_06_oa0cel03 SYNCING Yes active RECO_oa0_CD_07_oa0cel03 SYNCING Yes active RECO_oa0_CD_08_oa0cel03 SYNCING Yes active RECO_oa0_CD_09_oa0cel03 SYNCING Yes active RECO_oa0_CD_10_oa0cel03 SYNCING Yes active RECO_oa0_CD_11_oa0cel03 SYNCING Yes active [root@oa0cel03 ~]# cellcli -e list metriccurrent attributes name,metricvalue where name like \'FC_BY_DIRTY.*\' FC_BY_DIRTY 585 MB