标签云
asm恢复 bbed bootstrap$ dul In Memory kcbzib_kcrsds_1 kccpb_sanity_check_2 kfed MySQL恢复 ORA-00312 ORA-00607 ORA-00704 ORA-01110 ORA-01555 ORA-01578 ORA-08103 ORA-600 2131 ORA-600 2662 ORA-600 2663 ORA-600 3020 ORA-600 4000 ORA-600 4137 ORA-600 4193 ORA-600 4194 ORA-600 16703 ORA-600 kcbzib_kcrsds_1 ORA-600 KCLCHKBLK_4 ORA-15042 ORA-15196 ORACLE 12C oracle dul ORACLE PATCH Oracle Recovery Tools oracle加密恢复 oracle勒索 oracle勒索恢复 oracle异常恢复 Oracle 恢复 ORACLE恢复 ORACLE数据库恢复 oracle 比特币 OSD-04016 YOUR FILES ARE ENCRYPTED 勒索恢复 比特币加密文章分类
- Others (2)
- 中间件 (2)
- WebLogic (2)
- 操作系统 (102)
- 数据库 (1,683)
- DB2 (22)
- MySQL (73)
- Oracle (1,545)
- Data Guard (52)
- EXADATA (8)
- GoldenGate (24)
- ORA-xxxxx (159)
- ORACLE 12C (72)
- ORACLE 18C (6)
- ORACLE 19C (15)
- ORACLE 21C (3)
- Oracle 23ai (7)
- Oracle ASM (68)
- Oracle Bug (8)
- Oracle RAC (53)
- Oracle 安全 (6)
- Oracle 开发 (28)
- Oracle 监听 (28)
- Oracle备份恢复 (565)
- Oracle安装升级 (92)
- Oracle性能优化 (62)
- 专题索引 (5)
- 勒索恢复 (79)
- PostgreSQL (18)
- PostgreSQL恢复 (6)
- SQL Server (27)
- SQL Server恢复 (8)
- TimesTen (7)
- 达梦数据库 (2)
- 生活娱乐 (2)
- 至理名言 (11)
- 虚拟化 (2)
- VMware (2)
- 软件开发 (37)
- Asp.Net (9)
- JavaScript (12)
- PHP (2)
- 小工具 (20)
-
最近发表
- ORA-15411: Failure groups in disk group DATA have different number of disks.
- 断电引起的ORA-08102: 未找到索引关键字, 对象号 39故障处理
- ORA-00227: corrupt block detected in control file
- 手工删除19c rac
- 解决oracle数据文件路径有回车故障
- .wstop扩展名勒索数据库恢复
- Oracle Recovery Tools工具一键解决ORA-00376 ORA-01110故障(文件offline)
- OGG-02771 Input trail file format RELEASE 19.1 is different from previous trail file form at RELEASE 11.2.
- OGG-02246 Source redo compatibility level 19.0.0 requires trail FORMAT 12.2 or higher
- GoldenGate 19安装和打patch
- dd破坏asm磁盘头恢复
- 删除asmlib磁盘导致磁盘组故障恢复
- Kylin Linux 安装19c
- ORA-600 krse_arc_complete.4
- Oracle 19c 202410补丁(RUs+OJVM)
- ntfs MFT损坏(ntfs文件系统故障)导致oracle异常恢复
- .mkp扩展名oracle数据文件加密恢复
- 清空redo,导致ORA-27048: skgfifi: file header information is invalid
- A_H_README_TO_RECOVER勒索恢复
- 通过alert日志分析客户自行对一个数据库恢复的来龙去脉和点评
分类目录归档:Linux
注意系统bug—linux在E5、E5 V2、E7 V2 cpu之上的bug 765720
今天晚上群里面兄弟说了一个linux 6上面bug,会导致系统在运行200天以上(hardware uptime),然后进行热重启后,可能在几分钟或者几个小时内出发该bug,导致系统异常.
主要影响条件为:
Red Hat Enterprise Linux 6.1 (kernel-2.6.32-131.26.1.el6 and newer)
Red Hat Enterprise Linux 6.2 (kernel-2.6.32-220.4.2.el6 and newer)
Red Hat Enterprise Linux 6.3 (kernel-2.6.32-279 series)
Red Hat Enterprise Linux 6.4 (kernel-2.6.32-358 series)
Any Intel® Xeon® E5, Intel® Xeon® E5 v2, or Intel® Xeon® E7 v2 series processor
从这里可以看出来该问题主要影响E5、E5 V2、E7 V2 cpu上的redhat 6.1-6.4版本,在6.5版本中修复,具体参考:bug 765720
另外对已ORACLE Linux,如果使用EL Kernel影响和redhat一致,如果使用Unbreakable Enterprise Kernel则在6.2版本中进行了修复该问题。
MOS上类似文章:Oracle Linux 6 RHCK system hang: processes blocked in ext4_file_open(), pick_next_task_fair()
补充说明:
1. 在Red Hat/OEL 5.x版本中不存在。
2. 在32和64位操作系统都有可能发生
3. 鉴于该bug短期内无法修复,而且真的发生了,考虑冷重启主机,临时规避
再次提醒:系统版本选定也很重要,大家在选择Linux版本之时尽量选择避开该bug(el kernel 6.5及其以后版本,uek kernel 6.2及其以后版本)。个人倾向:如果是部署ORACLE db,而且还是redhat系列Linux,更加倾向OEL(省事,相信Oracle)
发表在 Linux
评论关闭
记录一次rm -rf 删除数据文件异常恢复
因为人员离职闹得不愉快,系统工程师离职后,由于公司未及时关闭其vpn,数据库服务器(Linux 6.5 Oracle 11.2.0.1)帐号未及时被修改,最后直接上去rm ORACLE_BASE给干掉,悲剧的是ORADATA目录也在里面,更加悲剧的是所有数据文件都在里面.也就是说数据库彻底被删除,而且没有任何备份.朋友咨询了我,让我给予支持.最后比较幸运,文件没有被覆盖,inode都还在,通过extundelete顺利恢复所有数据文件,控制文件,redo文件(extundelete恢复Linux被删除文件),数据库顺利打开,实现0丢失,算是一次完美的恢复
[root@DB1 tmp]# tar xvf extundelete-0.2.4.tar extundelete-0.2.4/ extundelete-0.2.4/acinclude.m4 extundelete-0.2.4/missing extundelete-0.2.4/autogen.sh extundelete-0.2.4/aclocal.m4 extundelete-0.2.4/configure extundelete-0.2.4/LICENSE extundelete-0.2.4/README extundelete-0.2.4/install-sh extundelete-0.2.4/config.h.in extundelete-0.2.4/src/ extundelete-0.2.4/src/extundelete.cc extundelete-0.2.4/src/block.h extundelete-0.2.4/src/kernel-jbd.h extundelete-0.2.4/src/insertionops.cc extundelete-0.2.4/src/block.c extundelete-0.2.4/src/cli.cc extundelete-0.2.4/src/extundelete-priv.h extundelete-0.2.4/src/extundelete.h extundelete-0.2.4/src/jfs_compat.h extundelete-0.2.4/src/Makefile.in extundelete-0.2.4/src/Makefile.am extundelete-0.2.4/configure.ac extundelete-0.2.4/depcomp extundelete-0.2.4/Makefile.in extundelete-0.2.4/Makefile.am [root@DB1 tmp]# cd extundelete-0.2.4 [root@DB1 extundelete-0.2.4]# ./configure Configuring extundelete 0.2.4 Writing generated files to disk [root@DB1 extundelete-0.2.4]# make && make install make -s all-recursive Making all in src Making install in src /usr/bin/install -c extundelete '/usr/local/bin' [root@DB1 extundelete-0.2.4]# df -h Filesystem Size Used Avail Use% Mounted on /dev/sda3 244G 11G 221G 5% / tmpfs 16G 72K 16G 1% /dev/shm /dev/sda1 190M 62M 119M 35% /boot /dev/sdb1 2.0T 71M 1.9T 1% /home [root@DB1 extundelete-0.2.4]# umount /dev/sdb1 umount: /home: device is busy. (In some cases useful info about processes that use the device is found by lsof(8) or fuser(1)) [root@DB1 extundelete-0.2.4]# fuser -m -u /home /home: 3914c(oracle) 8372c(oracle) [root@DB1 extundelete-0.2.4]# kill -9 3914 [root@DB1 extundelete-0.2.4]# fuser -m -u /home /home: 8372c(oracle) [root@DB1 extundelete-0.2.4]# kill -9 8372 [root@DB1 extundelete-0.2.4]# fuser -m -u /home [root@DB1 extundelete-0.2.4]# umount /dev/sdb1 [root@DB1 extundelete-0.2.4]# df -h Filesystem Size Used Avail Use% Mounted on /dev/sda3 244G 11G 221G 5% / tmpfs 16G 72K 16G 1% /dev/shm /dev/sda1 190M 62M 119M 35% /boot [root@DB1 extundelete-0.2.4]# extundelete /dev/sdb1 --restore-all NOTICE: Extended attributes are not restored. Loading filesystem metadata ... 16384 groups loaded. Loading journal descriptors ... 26542 descriptors loaded. Searching for recoverable inodes in directory / ... 18896 recoverable inodes found. Looking through the directory structure for deleted files ... 2 recoverable inodes still lost. Unable to restore inode 43778050 (file.43778050): Space has been reallocated. [root@DB1 extundelete-0.2.4]# ls acinclude.m4 autogen.sh config.h.in config.status configure.ac install-sh Makefile Makefile.in aclocal.m4 config.h config.log configure depcomp LICENSE Makefile.am missing [root@DB1 extundelete-0.2.4]# cd RECOVERED_FILES/ [root@DB1 RECOVERED_FILES]# ls app file.43778051 oracle oraInventory [root@DB1 RECOVERED_FILES]# cd app [root@DB1 app]# ls admin cfgtoollogs diag oracle oradata orcl ORCL [root@DB1 app]# cd oradata [root@DB1 oradata]# ls orcl [root@DB1 oradata]# cd orcl [root@DB1 orcl]# ls control01.ctl redo01.log redo02.log redo03.log sysaux01.dbf system01.dbf undotbs01.dbf users01.dbf [root@DB1 orcl]# ls -ltr total 2908776 -rw-r--r--. 1 root root 734011392 Nov 18 02:06 system01.dbf -rw-r--r--. 1 root root 1069555712 Nov 18 02:06 sysaux01.dbf -rw-r--r--. 1 root root 120594432 Nov 18 02:06 undotbs01.dbf -rw-r--r--. 1 root root 887365632 Nov 18 02:06 users01.dbf -rw-r--r--. 1 root root 9748480 Nov 18 02:06 control01.ctl -rw-r--r--. 1 root root 52429312 Nov 18 02:06 redo01.log -rw-r--r--. 1 root root 52429312 Nov 18 02:06 redo02.log -rw-r--r--. 1 root root 52429312 Nov 18 02:06 redo03.log [root@DB1 orcl]#
再次提醒各位:数据库备份重于一切,防天灾的同时还要防人灾,也希望圈子里面以后不要听到类似故障.
multipath实现设备用户组设置
现在的Linux系统中,很多都会使用系统自带的multipath多路径软件,在以前的版本中,我们一般通过multipath+udev或者multipath+rc.local来实现多路径和权限设置,而在redhat 5.3-5.11的版本中multipath就直接可以实现多路径聚合、设备持久化、用户组设置
操作系统版本
[root@rac1 dev]# uname -r 2.6.39-300.26.1.el5uek [root@rac1 dev]# more /etc/issue Oracle Linux Server release 5.9 Kernel \r on an \m
fdisk记录
[root@rac1 dev]# fdisk -l ………… Disk /dev/sdh: 134.2 GB, 134217728000 bytes 255 heads, 63 sectors/track, 16317 cylinders Units = cylinders of 16065 * 512 = 8225280 bytes Device Boot Start End Blocks Id System Disk /dev/sdi: 33.5 GB, 33554432000 bytes 64 heads, 32 sectors/track, 32000 cylinders Units = cylinders of 2048 * 512 = 1048576 bytes Device Boot Start End Blocks Id System
multipath包
检查安装multipath相关包(该版本系统默认安装)
[root@rac1 dev]# rpm -aq|grep mapper device-mapper-multipath-libs-0.4.9-56.0.3.el5 device-mapper-event-1.02.67-2.el5 device-mapper-1.02.67-2.el5 device-mapper-multipath-0.4.9-56.0.3.el5
获取wwid值
[root@rac1 dev]# /sbin/scsi_id -g -u -s /block/sdh 14f504e46494c45527049754962662d395751372d68356743 [root@rac1 dev]# /sbin/scsi_id -g -u -s /block/sdi 14f504e46494c4552484d486249782d464471382d354f4b58
获取uid和gid
[root@rac1 dev]# id grid uid=1100(grid) gid=54321(oinstall) groups=54321(oinstall),1020(asmadmin),1021(asmdba)
multipath.conf配置
[root@rac1 dev]# vi /etc/multipath.conf defaults { user_friendly_names no queue_without_daemon no flush_on_last_del yes max_fds max } blacklist { devnode "^hd[a-z]" devnode "^(ram|raw|loop|fd|md|dm-|sr|scd|st)[0-9]*" devnode "^cciss.*" } devices { device { vendor "OPNFILER " product "LUN" path_grouping_policy group_by_prio features "3 queue_if_no_path pg_init_retries 50" getuid_callout "/sbin/scsi_id -g -u -s /block/%n" path_checker tur path_selector "round-robin 0" hardware_handler "1 alua" failback immediate rr_weight uniform rr_min_io 128 } } multipaths { multipath { wwid 14f504e46494c45527049754962662d395751372d68356743 #wwid alias xifenfei128 uid 1100 #uid gid 1020 #gid } multipath { wwid 14f504e46494c4552484d486249782d464471382d354f4b58 #wwid alias xifenfei32 uid 1100 #uid gid 1020 #gid } }
启动multipath
[root@rac1 dev]# modprobe dm-multipath [root@rac1 dev]# modprobe dm-round-robin [root@rac1 dev]# chkconfig multipathd on [root@rac1 dev]# service multipathd start Starting multipathd daemon: [ OK ] [root@rac1 dev]# multipath -F [root@rac1 dev]# multipath -v2 create: xifenfei128 (14f504e46494c45527049754962662d395751372d68356743) undef OPNFILER,VIRTUAL-DISK size=125G features='0' hwhandler='0' wp=undef `-+- policy='round-robin 0' prio=1 status=undef `- 3:0:0:9 sdh 8:112 undef ready running create: xifenfei32 (14f504e46494c4552484d486249782d464471382d354f4b58) undef OPNFILER,VIRTUAL-DISK size=31G features='0' hwhandler='0' wp=undef `-+- policy='round-robin 0' prio=1 status=undef `- 3:0:0:10 sdi 8:128 undef ready running
查看生成多路径设备
注意设备名称、组、用户
[root@rac1 dev]# ls -l /dev/mapper/xifenfei* brw-rw---- 1 grid asmadmin 252, 2 Jan 7 21:21 /dev/mapper/xifenfei128 brw-rw---- 1 grid asmadmin 252, 3 Jan 7 21:21 /dev/mapper/xifenfei32
补充Linux 6.x中udev设置所属组和权限
对于linux 6.x,multipath不能设置磁盘所属组和权限,可以通过udev进行实现,类似配置如下
[root@bxrac03 mapper]#cat 99-diskownership.rules SUBSYSTEM!="block", GOTO="quickexit" KERNEL!="dm-*", GOTO="quickexit" PROGRAM=="/sbin/dmsetup info -c --noheadings -o name -m %m -j %M" RESULT=="*ocr*", OWNER="grid", GROUP="oinstall", MODE="0660" RESULT=="*oradata", OWNER="grid", GROUP="oinstall", MODE="0660" RESULT=="*backup", OWNER="grid", GROUP="oinstall", MODE="0660" LABEL="quickexit"
其中RESULT和dm的别名向匹配