联系:手机/微信(+86 17813235971) QQ(107644445)
标题:asm磁盘类似_DROPPED_0001_DATA名称故障处理
作者:惜分飞©版权所有[未经本人同意,不得以任何形式转载,否则有进一步追究法律责任的权利.]
发现一客户数据库的asm磁盘组中有磁盘掉线(通过分析日志确认2016年就已经掉线,而且不在做rebalance)
进一步检查
SQL> / NAME PATH GROUP_NUMBER DISK_NUMBER MOUNT_STATUS HEADER_STATUS ------------------------------ --------------------- ------------ ----------- -------------- ------------------------ MODE_STATUS STATE FAILGROUP -------------- ---------------- -------------------- ORCL:DATA2 0 0 CLOSED MEMBER ONLINE NORMAL ORCL:FLASH1 0 1 CLOSED MEMBER ONLINE NORMAL ORCL:GRID3 0 2 CLOSED MEMBER ONLINE NORMAL _DROPPED_0000_FLASH 2 0 MISSING UNKNOWN OFFLINE FORCING FLASH1 _DROPPED_0001_DATA 1 1 MISSING UNKNOWN OFFLINE FORCING DATA2 DATA1 ORCL:DATA1 1 0 CACHED MEMBER ONLINE NORMAL DATA1 FLASH2 ORCL:FLASH2 2 1 CACHED MEMBER ONLINE NORMAL FLASH2 GRID1 ORCL:GRID1 3 0 CACHED MEMBER ONLINE NORMAL GRID1 GRID2 ORCL:GRID2 3 1 CACHED MEMBER ONLINE NORMAL GRID2 GRID4 ORCL:GRID4 3 3 CACHED MEMBER ONLINE NORMAL GRID4 GRID5 ORCL:GRID5 3 4 CACHED MEMBER ONLINE NORMAL GRID5 GRID6 ORCL:GRID6 3 5 CACHED MEMBER ONLINE NORMAL GRID6 12 rows selected. SQL> select NAME,STATE,TYPE,OFFLINE_DISKS from v$asm_diskgroup; NAME ------------------------------------------------------------ STATE TYPE OFFLINE_DISKS ---------------------- ------------ ------------- DATA MOUNTED NORMAL 1 FLASH MOUNTED NORMAL 1 GRID MOUNTED NORMAL 0
主要问题是由于ORCL:FLASH1和ORCL:DATA2磁盘掉线导致处于_DROPPED_0000_FLASH和_DROPPED_0001_DATA状态.底层检查,确定现在这些磁盘都正常.然后使用force命令进行强制增加掉线的磁盘到对应的磁盘组中
SQL> alter diskgroup FLASH add failgroup flg1 disk 'ORCL:FLASH1' force; Diskgroup altered. SQL> alter diskgroup data add failgroup dg2 disk 'ORCL:DATA2' force; Diskgroup altered.
观察asm 日志,等rebalance完成
Sat Dec 05 16:48:10 2020 SQL> alter diskgroup FLASH add failgroup flg1 disk 'ORCL:FLASH1' force NOTE: GroupBlock outside rolling migration privileged region NOTE: Assigning number (2,2) to disk (ORCL:FLASH1) NOTE: requesting all-instance membership refresh for group=2 NOTE: initializing header on grp 2 disk FLASH1 NOTE: requesting all-instance disk validation for group=2 Sat Dec 05 16:48:13 2020 NOTE: skipping rediscovery for group 2/0x58e713e7 (FLASH) on local instance. NOTE: requesting all-instance disk validation for group=2 NOTE: skipping rediscovery for group 2/0x58e713e7 (FLASH) on local instance. Sat Dec 05 16:48:19 2020 GMON updating for reconfiguration, group 2 at 14 for pid 34, osid 12203 NOTE: group 2 PST updated. NOTE: initiating PST update: grp = 2 GMON updating group 2 at 15 for pid 34, osid 12203 NOTE: cache closing disk 0 of grp 2: (not open) _DROPPED_0000_FLASH NOTE: group FLASH: updated PST location: disk 0001 (PST copy 0) NOTE: group FLASH: updated PST location: disk 0002 (PST copy 1) NOTE: PST update grp = 2 completed successfully NOTE: membership refresh pending for group 2/0x58e713e7 (FLASH) GMON querying group 2 at 16 for pid 18, osid 41180 NOTE: cache closing disk 0 of grp 2: (not open) _DROPPED_0000_FLASH NOTE: cache opening disk 2 of grp 2: FLASH1 label:FLASH1 NOTE: Attempting voting file refresh on diskgroup FLASH NOTE: Refresh completed on diskgroup FLASH. No voting file found. GMON querying group 2 at 17 for pid 18, osid 41180 NOTE: cache closing disk 0 of grp 2: (not open) _DROPPED_0000_FLASH Sat Dec 05 16:48:25 2020 SUCCESS: refreshed membership for 2/0x58e713e7 (FLASH) Sat Dec 05 16:48:25 2020 SUCCESS: alter diskgroup FLASH add failgroup flg1 disk 'ORCL:FLASH1' force NOTE: starting rebalance of group 2/0x58e713e7 (FLASH) at power 1 Starting background process ARB0 Sat Dec 05 16:48:26 2020 ARB0 started with pid=36, OS id=12451 NOTE: assigning ARB0 to group 2/0x58e713e7 (FLASH) with 1 parallel I/O cellip.ora not found. NOTE: F1X0 copy 2 relocating from 0:2 to 2:2 for diskgroup 2 (FLASH) NOTE: Attempting voting file refresh on diskgroup FLASH NOTE: Refresh completed on diskgroup FLASH. No voting file found. Sat Dec 05 16:48:45 2020 NOTE: Rebalance has restored redundancy for any existing control file or redo log in disk group FLASH Sat Dec 05 16:49:06 2020 NOTE: stopping process ARB0 SUCCESS: rebalance completed for group 2/0x58e713e7 (FLASH) Sat Dec 05 16:49:08 2020 NOTE: GroupBlock outside rolling migration privileged region NOTE: requesting all-instance membership refresh for group=2 Sat Dec 05 16:49:11 2020 GMON updating for reconfiguration, group 2 at 18 for pid 36, osid 12681 NOTE: cache closing disk 0 of grp 2: (not open) _DROPPED_0000_FLASH NOTE: group FLASH: updated PST location: disk 0001 (PST copy 0) NOTE: group FLASH: updated PST location: disk 0002 (PST copy 1) NOTE: group 2 PST updated. SUCCESS: grp 2 disk _DROPPED_0000_FLASH going offline GMON updating for reconfiguration, group 2 at 19 for pid 36, osid 12681 NOTE: cache closing disk 0 of grp 2: (not open) _DROPPED_0000_FLASH NOTE: group FLASH: updated PST location: disk 0001 (PST copy 0) NOTE: group FLASH: updated PST location: disk 0002 (PST copy 1) NOTE: group 2 PST updated. NOTE: membership refresh pending for group 2/0x58e713e7 (FLASH) GMON querying group 2 at 20 for pid 18, osid 41180 GMON querying group 2 at 21 for pid 18, osid 41180 NOTE: Disk _DROPPED_0000_FLASH in mode 0x0 marked for de-assignment SUCCESS: refreshed membership for 2/0x58e713e7 (FLASH) Sat Dec 05 16:51:56 2020 SQL> alter diskgroup data add failgroup dg2 disk 'ORCL:DATA2' force NOTE: GroupBlock outside rolling migration privileged region NOTE: Assigning number (1,2) to disk (ORCL:DATA2) NOTE: requesting all-instance membership refresh for group=1 NOTE: initializing header on grp 1 disk DATA2 NOTE: requesting all-instance disk validation for group=1 Sat Dec 05 16:51:57 2020 NOTE: skipping rediscovery for group 1/0x58d713e6 (DATA) on local instance. NOTE: requesting all-instance disk validation for group=1 NOTE: skipping rediscovery for group 1/0x58d713e6 (DATA) on local instance. Sat Dec 05 16:52:02 2020 GMON updating for reconfiguration, group 1 at 22 for pid 34, osid 12203 NOTE: group 1 PST updated. NOTE: initiating PST update: grp = 1 GMON updating group 1 at 23 for pid 34, osid 12203 NOTE: cache closing disk 1 of grp 1: (not open) _DROPPED_0001_DATA NOTE: group DATA: updated PST location: disk 0000 (PST copy 0) NOTE: group DATA: updated PST location: disk 0002 (PST copy 1) NOTE: PST update grp = 1 completed successfully NOTE: membership refresh pending for group 1/0x58d713e6 (DATA) GMON querying group 1 at 24 for pid 18, osid 41180 NOTE: cache closing disk 1 of grp 1: (not open) _DROPPED_0001_DATA NOTE: cache opening disk 2 of grp 1: DATA2 label:DATA2 Sat Dec 05 16:52:08 2020 NOTE: Attempting voting file refresh on diskgroup DATA NOTE: Refresh completed on diskgroup DATA. No voting file found. GMON querying group 1 at 25 for pid 18, osid 41180 NOTE: cache closing disk 1 of grp 1: (not open) _DROPPED_0001_DATA SUCCESS: refreshed membership for 1/0x58d713e6 (DATA) Sat Dec 05 16:52:08 2020 SUCCESS: alter diskgroup data add failgroup dg2 disk 'ORCL:DATA2' force NOTE: starting rebalance of group 1/0x58d713e6 (DATA) at power 1 Starting background process ARB0 Sat Dec 05 16:52:08 2020 ARB0 started with pid=37, OS id=13463 NOTE: assigning ARB0 to group 1/0x58d713e6 (DATA) with 1 parallel I/O NOTE: Attempting voting file refresh on diskgroup DATA NOTE: Refresh completed on diskgroup DATA. No voting file found. Sat Dec 05 16:52:44 2020 cellip.ora not found. NOTE: F1X0 copy 2 relocating from 1:2 to 2:2 for diskgroup 1 (DATA) Sat Dec 05 16:53:22 2020 NOTE: Rebalance has restored redundancy for any existing control file or redo log in disk group DATA NOTE: membership refresh pending for group 1/0x58d713e6 (DATA) GMON querying group 1 at 27 for pid 18, osid 41180 NOTE: cache closing disk 1 of grp 1: (not open) _DROPPED_0001_DATA SUCCESS: refreshed membership for 1/0x58d713e6 (DATA) SUCCESS: alter diskgroup data rebalance power 11 NOTE: starting rebalance of group 1/0x58d713e6 (DATA) at power 11 Starting background process ARB0 Sat Dec 05 17:27:52 2020 ARB0 started with pid=35, OS id=23318 NOTE: assigning ARB0 to group 1/0x58d713e6 (DATA) with 11 parallel I/Os NOTE: Attempting voting file refresh on diskgroup DATA NOTE: Refresh completed on diskgroup DATA. No voting file found. Sat Dec 05 17:28:29 2020 cellip.ora not found. Sat Dec 05 17:28:45 2020 NOTE: Rebalance has restored redundancy for any existing control file or redo log in disk group DATA Sat Dec 05 18:48:10 2020 NOTE: GroupBlock outside rolling migration privileged region NOTE: requesting all-instance membership refresh for group=1 Sat Dec 05 18:48:32 2020 GMON updating for reconfiguration, group 1 at 28 for pid 36, osid 47454 NOTE: cache closing disk 1 of grp 1: (not open) _DROPPED_0001_DATA NOTE: group DATA: updated PST location: disk 0000 (PST copy 0) NOTE: group DATA: updated PST location: disk 0002 (PST copy 1) Sat Dec 05 18:48:32 2020 NOTE: group 1 PST updated. SUCCESS: grp 1 disk _DROPPED_0001_DATA going offline GMON updating for reconfiguration, group 1 at 29 for pid 36, osid 47454 NOTE: cache closing disk 1 of grp 1: (not open) _DROPPED_0001_DATA NOTE: group DATA: updated PST location: disk 0000 (PST copy 0) NOTE: group DATA: updated PST location: disk 0002 (PST copy 1) NOTE: group 1 PST updated. Sat Dec 05 18:48:32 2020 NOTE: membership refresh pending for group 1/0x58d713e6 (DATA) GMON querying group 1 at 30 for pid 18, osid 41180 GMON querying group 1 at 31 for pid 18, osid 41180 NOTE: Disk _DROPPED_0001_DATA in mode 0x0 marked for de-assignment SUCCESS: refreshed membership for 1/0x58d713e6 (DATA) NOTE: Attempting voting file refresh on diskgroup DATA NOTE: Refresh completed on diskgroup DATA. No voting file found. Sat Dec 05 18:52:24 2020 NOTE: stopping process ARB0 SUCCESS: rebalance completed for group 1/0x58d713e6 (DATA)
总结:对于normal磁盘组由于某种原因磁盘从磁盘组中掉,v$asm_disk.name类似_DROPPED_0001_DATA,v$asm_disk.state为FORCING,可以通过类似alter diskgroup data add failgroup dg2 disk ‘ORCL:DATA2′ force;方式强制增加掉线的磁盘进入磁盘组,然后待rebalance完成,问题修复