ASM diskgroup cann’t mount and drop

If you cannot recover the data by yourself, ask Parnassusdata, the professional ORACLE database recovery team for help.

Parnassusdata Software Database Recovery Team

Service Hotline:  +86 13764045638 E-mail: service@parnassusdata.com

 

IHAC who encounter an error as below after restart database and storage .

 

 

SQL> alter diskgroup DATA mount;
alter diskgroup DATA mount
*
ERROR at line 1:
ORA-15032: not all alterations performed
ORA-15040: diskgroup is incomplete
ORA-15042: ASM disk "3" is missing
 
Then we found /dev/raw/raw9 is in candidate status
SQL> select path,HEADER_STATUS,MOUNT_STATUS,MODE_STATUS from v$asm_disk;
 
PATH            HEADER_STATU MOUNT_S MODE_ST
--------------- ------------ ------- -------
/dev/raw/raw9   CANDIDATE    CLOSED  ONLINE
/dev/raw/raw6   MEMBER       CLOSED  ONLINE
/dev/raw/raw7   MEMBER       CLOSED  ONLINE
/dev/raw/raw8   MEMBER       CLOSED  ONLINE
/dev/raw/raw1   FOREIGN      CLOSED  ONLINE
/dev/raw/raw4   FOREIGN      CLOSED  ONLINE
/dev/raw/raw3   FOREIGN      CLOSED  ONLINE
/dev/raw/raw2   FOREIGN      CLOSED  ONLINE
/dev/raw/raw5   FOREIGN      CLOSED  ONLINE 
 
While we are using kfed checking the status , we found /dev/raw/raw9 was invalid.
[oracle@DCSDB2 ~]$ kfed read /dev/raw/raw9 blkn=2 | grep KFBTYP
kfbh.type:                            0 ; 0x002: KFBTYP_INVALID
[oracle@DCSDB2 ~]$ kfed read /dev/raw/raw9 blkn=4 | grep KFBTYP
kfbh.type:                            0 ; 0x002: KFBTYP_INVALID
[oracle@DCSDB2 ~]$ kfed read /dev/raw/raw9 blkn=10 | grep KFBTYP
kfbh.type:                            0 ; 0x002: KFBTYP_INVALID
[oracle@DCSDB2 ~]$ kfed read /dev/raw/raw9 blkn=100 | grep KFBTYP
kfbh.type: 
 

Right now we want to remove or drop /dev/raw/raw9 and bring the database up , But we can’t drop it in normal because /dev/raw/raw9 can’t mount.

My question is  how can we drop or remove /dev/raw/raw9 (CT clear that they is no data or important data in this device) and bring the database /ASM up.

 

[root@DCSDB1 ~]#   ls -l /dev/raw/*
crw------- 1 root   oinstall 162, 1 01-27 06:37 /dev/raw/raw1
crw------- 1 root   oinstall 162, 2 01-27 06:37 /dev/raw/raw2
crw------- 1 oracle oinstall 162, 3 01-27 06:37 /dev/raw/raw3
crw------- 1 oracle oinstall 162, 4 01-27 06:37 /dev/raw/raw4
crw------- 1 oracle oinstall 162, 5 01-27 06:37 /dev/raw/raw5
crw------- 1 oracle oinstall 162, 6 01-27 06:37 /dev/raw/raw6
crw------- 1 oracle oinstall 162, 7 01-27 06:37 /dev/raw/raw7
crw------- 1 oracle oinstall 162, 8 01-27 06:37 /dev/raw/raw8
crw------- 1 oracle oinstall 162, 9 01-27 06:37 /dev/raw/raw9

[root@DCSDB1 ~]# cat /etc/sysconfig/rawdevices
# raw device bindings
# format:    
#           
# example: /dev/raw/raw1 /dev/sda1
#          /dev/raw/raw2 8 5
/dev/raw/raw1      /dev/mapper/oravg-ocr1
/dev/raw/raw2      /dev/mapper/oravg-ocr2
/dev/raw/raw3      /dev/mapper/oravg-vot1
/dev/raw/raw4      /dev/mapper/oravg-vot2
/dev/raw/raw5      /dev/mapper/oravg-vot3
/dev/raw/raw6     /dev/mapper/oravg-data1
/dev/raw/raw7     /dev/mapper/oravg-data2
/dev/raw/raw8     /dev/mapper/oravg-data3
/dev/raw/raw9     /dev/mapper/oravg-data5

[root@DCSDB1 tmp]# cat /proc/partitions
major minor  #blocks  name

  8     0 1754880000 sda
  8     1     514048 sda1
  8     2 1754362260 sda2
  8    16  262144000 sdb
  8    32  262144000 sdc
  8    48  262144000 sdd
  8    64  262144000 sde
  8    80  262144000 sdf
  8    96  262144000 sdg
  8   112  262144000 sdh
  8   128  262144000 sdi
  8   144  262144000 sdj
  8   160  262144000 sdk
  8   176  262144000 sdl
  8   192  262144000 sdm
  8   208  262144000 sdn
  8   224  262144000 sdo
  8   240  262144000 sdp
 65     0  262144000 sdq
 65    16  262144000 sdr
 65    32  262144000 sds
 65    48  262144000 sdt
 65    64  262144000 sdu
 65    80  262144000 sdv
 65    96  262144000 sdw
 65   112  262144000 sdx
 65   128  262144000 sdy
 65   144  262144000 sdz
 65   160  262144000 sdaa
 65   176  262144000 sdab
 65   192  262144000 sdac
 65   208  262144000 sdad
 65   224  262144000 sdae
 65   240  262144000 sdaf
 66     0  262144000 sdag
 66    16  262144000 sdah
 66    32  262144000 sdai
 66    48  262144000 sdaj
 66    64  262144000 sdak
 66    80  262144000 sdal
 66    96  262144000 sdam
 66   112  262144000 sdan
 66   128  262144000 sdao
 66   144  262144000 sdap
 66   160  262144000 sdaq
 66   176  262144000 sdar
 66   192  262144000 sdas
 66   208  262144000 sdat
 66   224  262144000 sdau
 66   240  262144000 sdav
 67     0  262144000 sdaw
253     0    1048576 dm-0
253     1   52428800 dm-1
253     2   10485760 dm-2
253     3   10485760 dm-3
253     4   10485760 dm-4
253     5   10485760 dm-5
253     6   10485760 dm-6
253     7   33554432 dm-7
253     8 1073741824 dm-8
253     9  262144000 dm-9
253    10  262144000 dm-10
253    11  262144000 dm-11
253    12  262144000 dm-12
253    13  262144000 dm-13
253    14  262144000 dm-14
253    15  262144000 dm-15
253    16  262144000 dm-16
253    17  262144000 dm-17
253    18  262144000 dm-18
253    19  262144000 dm-19
253    20  262144000 dm-20
253    21     512000 dm-21
253    22     512000 dm-22
253    23     512000 dm-23
253    24     512000 dm-24
253    25     512000 dm-25
253    26  157286400 dm-26
253    27  157286400 dm-27
253    28  157286400 dm-28
253    29  157286400 dm-29
253    30  157286400 dm-30
253    31  157286400 dm-31
253    32  157286400 dm-32
253    33  157286400 dm-33
253    34  157286400 dm-34
253    35  157286400 dm-35
253    36  157286400 dm-36
253    37  157286400 dm-37
253    38  157286400 dm-38
253    39  157286400 dm-39
253    40  157286400 dm-40
253    41  157286400 dm-41
253    42  157286400 dm-42
253    43  157286400 dm-43
253    44  157286400 dm-44
 
[oracle@DCSDB2 dbs]$ cat /app/admin/+ASM/pfile/init.ora
 
 
##############################################################################
# Copyright (c) 1991, 2001, 2002 by Oracle Corporation
##############################################################################
 
###########################################
# Cluster Database
###########################################
cluster_database=true
 
###########################################
# Diagnostics and Statistics
###########################################
background_dump_dest=/app/admin/+ASM/bdump
core_dump_dest=/app/admin/+ASM/cdump
user_dump_dest=/app/admin/+ASM/udump
 
###########################################
# Miscellaneous
###########################################
instance_type=asm
 
###########################################
# Pools
###########################################
large_pool_size=12M
 
###########################################
# Security and Auditing
###########################################
remote_login_passwordfile=exclusive
 
 
asm_diskgroups='DATA'
 
+ASM2.instance_number=2
+ASM1.instance_number=1

 

 

‘m assuming the following are true:

–  that the DATA diskgroup redundancy is either NORMAL or HIGH.
–  the redundancy is NOT external
–  You have a recent backup of the database.

If this is the case, then do the following:

1.  Mount FORCE

alter diskgroup DATA mount force;

Let it mount.

2.  Inspect that everything is there.

3.  Drop force the disk

alter diskgorup drop disk ‘/dev/raw/raw9’ force;

3.  Issue a rebalance if one does not kick off automatcally.

alter diskgroup DATA rebalance;

and let it finish.

From the SQL language documentation for ALTER DISKGROUP:

  • In the FORCE mode, Oracle ASM attempts to mount the disk group even if it cannot discover all of the devices that belong to the disk group. This setting is useful if some of the disks in a normal or high redundancy disk group became unavailable while the disk group was dismounted. When MOUNT FORCE succeeds, Oracle ASM takes the missing disks offline.

    If Oracle ASM discovers all of the disks in the disk group, then MOUNT FORCE fails. Therefore, use the MOUNT FORCE setting only if some disks are unavailable. Otherwise, useNOFORCE.

    In normal- and high-redundancy disk groups, disks from one failure group can be unavailable and MOUNT FORCE will succeed. Also in high-redundancy disk groups, two disks in two different failure groups can be unavailable and MOUNT FORCE will succeed. Any other combination of unavailable disks causes the operation to fail, because Oracle ASM cannot guarantee that a valid copy of all user data or metadata exists on the available disks.

 

Are you sure its external?  I don’t mean to ask that like you wouldn’t know but here is a sure way to know.

There is a tool called amdu and its in your grid home.  This is 11gR2, correct?  If so, you can do the following:

$ORACLE_HOME/bin/amdu

It will create a amdu directory with the current date and in that directory it creates a file called report.txt.  It will report out all of the disks belonging to the DATA disk groups.  One of the fields for each disk
is redundancy.  If its set to 0 or 1, I believe your external. If its set to 2 or 3, your NORMAL or HIGH.

I don’t know how an external redundant AMS diskgroup can be recovered.

From the ASM doc:

  • External redundancy

    Oracle ASM does not provide mirroring redundancy and relies on the storage system to provide RAID functionality. Any write error causes a forced dismount of the disk group. All disks must be located to successfully mount the disk group.

I will let someone else comment but, you may have to restore and recover the database.

if only the ASM diskheader was corrupted, and not the whole disk, it might be worth a try to only recover the disk header. This does make sense in an external DG, since you can’t access the Data anyways. Search in MOS how to do this.

 

[Urgent] ORA-15042: ASM disk “76” is missing

If you cannot recover the data by yourself, ask Parnassusdata, the professional ORACLE database recovery team for help.

Parnassusdata Software Database Recovery Team

Service Hotline:  +86 13764045638 E-mail: service@parnassusdata.com

 

One customer has a ASM problem about ORA-15042.
O/S: Linux X86 64bit 2.6.18-194.el5
DB Version : 10.2.0.5

Although we can access the ASM header using kfed & dd, the asm instance cannot read these devices.
For example, the ASM instance can read the 75th disk, but cannot read the 76th disk.

Do you have this experience?

# Environment
LGEDGDMS01:/engn001/orasvc01/product/10.2.0] uname -a
Linux LGEDGDMS01 2.6.18-194.el5 #1 SMP Tue Mar 16 21:52:39 EDT 2010 x86_64 x86_64 x86_64 GNU/Linux

# Error
SQL> startup
ASM instance started

Total System Global Area  130023424 bytes
Fixed Size                  2094544 bytes
Variable Size             102763056 bytes
ASM Cache                  25165824 bytes
ORA-15042: ASM disk “23” is missing
ORA-15042: ASM disk “22” is missing
ORA-15040: diskgroup is incomplete
ORA-15066: offlining disk “” may result in a data loss
ORA-15042: ASM disk “88” is missing

ORA-15042: ASM disk “77” is missing
ORA-15042: ASM disk “76” is missing   ==> 76 th device
ORA-15040: diskgroup is incomplete
ORA-15066: offlining disk “” may result in a data loss
ORA-15042: ASM disk “88” is missing
ORA-15042: ASM disk “87” is missing

ORA-15042: ASM disk “81” is missing

SQL> show parameter asm_diskstring
NAME                                 TYPE                              VALUE
———————————— ——————————— ——————————
asm_diskstring                       string                    /dev/mapper/mpath_asm*

# v$asm_disks results.
select name, group_number,disk_number, path, state, header_status from v$asm_disk order by disk_number
/
NAME       GROUP_NUMBER DISK_NUMBER PATH                                     STATE                    HEADER_STATUS
———- ———— ———– —————————————- ———————— ————————————
0          66 /dev/mapper/mpath_asm129p1               NORMAL                   MEMBER
0          67 /dev/mapper/mpath_asm130p1               NORMAL                   MEMBER
0          68 /dev/mapper/mpath_asm131p1               NORMAL                   MEMBER
0          69 /dev/mapper/mpath_asm132p1               NORMAL                   MEMBER
0          70 /dev/mapper/mpath_asm133p1               NORMAL                   MEMBER
0          71 /dev/mapper/mpath_asm134p1               NORMAL                   MEMBER
0          72 /dev/mapper/mpath_asm135p1               NORMAL                   MEMBER
0          73 /dev/mapper/mpath_asm136p1               NORMAL                   MEMBER
0          74 /dev/mapper/mpath_asm137p1               NORMAL                   MEMBER
0          75 /dev/mapper/mpath_asm138p1               NORMAL                   MEMBER
0          89 /dev/mapper/mpath_asm063p1               NORMAL                   MEMBER   ==> Cannot see the 76 th device
0          90 /dev/mapper/mpath_asm064p1               NORMAL                   MEMBER
0          91 /dev/mapper/mpath_asm065p1               NORMAL                   MEMBER
0          92 /dev/mapper/mpath_asm066p1               NORMAL                   MEMBER
# Permission – OK
* 75th asm file (Good Device)
LGEDGDMS01:/engn001/orasvc01/product/10.2.0] ls -al /dev/mapper/mpath_asm138p1
brw-rw—- 1 orasvc01 dba 253, 248 Jan 30 17:06 /dev/mapper/mpath_asm138p1

* 76th the asm file (Cannot read this device)
LGEDGDMS01:/engn001/orasvc01/product/10.2.0] ls -al /dev/mapper/mpath_asm175
brw-rw—- 1 orasvc01 dba 253, 197 Jan 30 17:06 /dev/mapper/mpath_asm175

# kfed result – OK
* 75th asm file (Good Device)
+ /engn001/orasvc01/product/10.2.0/bin/kfed read /dev/mapper/mpath_asm138p1    
kfbh.endian:                          1 ; 0x000: 0x01                          
kfbh.hard:                          130 ; 0x001: 0x82                          
kfbh.type:                            1 ; 0x002: KFBTYP_DISKHEAD               
kfbh.datfmt:                          1 ; 0x003: 0x01                          
kfbh.block.blk:                       0 ; 0x004: T=0 NUMB=0x0                      
kfbh.block.obj:              2147483723 ; 0x008: TYPE=0x8 NUMB=0x4b
kfbh.check:                  2774762225 ; 0x00c: 0xa56382f1                    
kfbh.fcn.base:                        0 ; 0x010: 0x00000000                    
kfbh.fcn.wrap:                        0 ; 0x014: 0x00000000                    
kfbh.spare1:                          0 ; 0x018: 0x00000000                    
kfbh.spare2:                          0 ; 0x01c: 0x00000000                    
kfdhdb.driver.provstr:ORCLDISKASMDISK138 ; 0x000: length=18                     
kfdhdb.driver.reserved[0]:   1145918273 ; 0x008: 0x444d5341                    
kfdhdb.driver.reserved[1]:    827020105 ; 0x00c: 0x314b5349                    
kfdhdb.driver.reserved[2]:        14387 ; 0x010: 0x00003833                    
kfdhdb.driver.reserved[3]:            0 ; 0x014: 0x00000000                    
kfdhdb.driver.reserved[4]:            0 ; 0x018: 0x00000000                    
kfdhdb.driver.reserved[5]:            0 ; 0x01c: 0x00000000                    
kfdhdb.compat:                168820736 ; 0x020: 0x0a100000                    
kfdhdb.dsknum:                       75 ; 0x024: 0x004b                       ==> The 75th device
kfdhdb.grptyp:                        2 ; 0x026: KFDGTP_NORMAL                 
kfdhdb.hdrsts:                        3 ; 0x027: KFDHDR_MEMBER                 
kfdhdb.dskname:           DGDATA01_0075 ; 0x028: length=13                     
kfdhdb.grpname:                DGDATA01 ; 0x048: length=8                      
kfdhdb.fgname:            DGDATA01_0075 ; 0x068: length=13                     
kfdhdb.capname:                         ; 0x088: length=0                      
kfdhdb.crestmp.hi:             32973218 ; 0x0a8: HOUR=0x2 DAYS=0xd MNTH=0x8 YEAR=0x7dc
kfdhdb.crestmp.lo:           1898247168 ; 0x0ac: USEC=0x0 MSEC=0x13d SECS=0x12 MINS=0x1c
kfdhdb.mntstmp.hi:             32973219 ; 0x0b0: HOUR=0x3 DAYS=0xd MNTH=0x8 YEAR=0x7dc
kfdhdb.mntstmp.lo:           1163180032 ; 0x0b4: USEC=0x0 MSEC=0x12e SECS=0x15 MINS=0x11
kfdhdb.secsize:                     512 ; 0x0b8: 0x0200                        
kfdhdb.blksize:                    4096 ; 0x0ba: 0x1000                        
kfdhdb.ausize:                  1048576 ; 0x0bc: 0x00100000                    
kfdhdb.mfact:                    113792 ; 0x0c0: 0x0001bc80                    
kfdhdb.dsksize:                   13892 ; 0x0c4: 0x00003644                    

* 76th the asm file (Cannot read this device)
+ /engn001/orasvc01/product/10.2.0/bin/kfed read /dev/mapper/mpath_asm175      
kfbh.endian:                          1 ; 0x000: 0x01                          
kfbh.hard:                          130 ; 0x001: 0x82                          
kfbh.type:                            1 ; 0x002: KFBTYP_DISKHEAD               
kfbh.datfmt:                          1 ; 0x003: 0x01                          
kfbh.block.blk:                       0 ; 0x004: T=0 NUMB=0x0                      
kfbh.block.obj:              2147483724 ; 0x008: TYPE=0x8 NUMB=0x4c
kfbh.check:                  2433973412 ; 0x00c: 0x91137ca4                    
kfbh.fcn.base:                        0 ; 0x010: 0x00000000                    
kfbh.fcn.wrap:                        0 ; 0x014: 0x00000000                    
kfbh.spare1:                          0 ; 0x018: 0x00000000                    
kfbh.spare2:                          0 ; 0x01c: 0x00000000                    
kfdhdb.driver.provstr:ORCLDISKASMDISK175 ; 0x000: length=18                     
kfdhdb.driver.reserved[0]:   1145918273 ; 0x008: 0x444d5341                    
kfdhdb.driver.reserved[1]:    827020105 ; 0x00c: 0x314b5349                    
kfdhdb.driver.reserved[2]:        13623 ; 0x010: 0x00003537                    
kfdhdb.driver.reserved[3]:            0 ; 0x014: 0x00000000                    
kfdhdb.driver.reserved[4]:            0 ; 0x018: 0x00000000                    
kfdhdb.driver.reserved[5]:            0 ; 0x01c: 0x00000000                    
kfdhdb.compat:                168820736 ; 0x020: 0x0a100000                    
kfdhdb.dsknum:                       76 ; 0x024: 0x004c            ==> the 76th device, ASM instance cannot read this device.
kfdhdb.grptyp:                        2 ; 0x026: KFDGTP_NORMAL                 
kfdhdb.hdrsts:                        3 ; 0x027: KFDHDR_MEMBER                 
kfdhdb.dskname:           DGDATA01_0076 ; 0x028: length=13                     
kfdhdb.grpname:                DGDATA01 ; 0x048: length=8                      
kfdhdb.fgname:            DGDATA01_0076 ; 0x068: length=13                     
kfdhdb.capname:                         ; 0x088: length=0                      
kfdhdb.crestmp.hi:             32982981 ; 0x0a8: HOUR=0x5 DAYS=0x1e MNTH=0x1 YEAR=0x7dd
kfdhdb.crestmp.lo:            366295040 ; 0x0ac: USEC=0x0 MSEC=0x14e SECS=0x1d MINS=0x5
kfdhdb.mntstmp.hi:             32982981 ; 0x0b0: HOUR=0x5 DAYS=0x1e MNTH=0x1 YEAR=0x7dd
kfdhdb.mntstmp.lo:            366307328 ; 0x0b4: USEC=0x0 MSEC=0x15a SECS=0x1d MINS=0x5
kfdhdb.secsize:                     512 ; 0x0b8: 0x0200                        
kfdhdb.blksize:                    4096 ; 0x0ba: 0x1000                        
kfdhdb.ausize:                  1048576 ; 0x0bc: 0x00100000                    
kfdhdb.mfact:                    113792 ; 0x0c0: 0x0001bc80                    
kfdhdb.dsksize:                   55572 ; 0x0c4: 0x0000d914                    

# Check the dd results – OK
* the 75th asm device (Good)
dd if=/dev/mapper/mpath_asm138p1 bs=4096|od -tx1z|more                    
                              
0000000 01 82 01 01 00 00 00 00 4b 00 00 80 f1 82 63 a5  >……..K…..c.<
0000020 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  >…………….<
0000040 4f 52 43 4c 44 49 53 4b 41 53 4d 44 49 53 4b 31  >ORCLDISKASMDISK1<
0000060 33 38 00 00 00 00 00 00 00 00 00 00 00 00 00 00  >38…………..<
0000100 00 00 10 0a 4b 00 02 03 44 47 44 41 54 41 30 31  >….K…DGDATA01<
0000120 5f 30 30 37 35 00 00 00 00 00 00 00 00 00 00 00  >_0075………..<
0000140 00 00 00 00 00 00 00 00 44 47 44 41 54 41 30 31  >……..DGDATA01<
0000160 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  >…………….<
0000200 00 00 00 00 00 00 00 00 44 47 44 41 54 41 30 31  >……..DGDATA01<
0000220 5f 30 30 37 35 00 00 00 00 00 00 00 00 00 00 00  >_0075………..<
0000240 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  >…………….<
*                                                           
0000300 00 00 00 00 00 00 00 00 a2 21 f7 01 00 f4 24 71  >………!….$q<
0000320 a3 21 f7 01 00 b8 54 45 00 02 00 10 00 00 10 00  >.!….TE……..<
0000340 80 bc 01 00 44 36 00 00 02 00 00 00 01 00 00 00  >….D6……….<
0000360 02 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  >…………….<
0000400 00 00 10 0a 14 cd f6 01 00 2c 95 00 00 00 00 00  >………,……<
0000420 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  >…………….<
*                                                           
0010000 01 82 02 01 01 00 00 00 4b 00 00 80 de 63 17 81  >……..K….c..<
0010020 af e0 35 00 00 00 00 00 00 00 00 00 00 00 00 00  >..5………….<
0010040 00 00 00 00 fe 00 20 00 c0 01 00 01 c0 01 00 01  >…… ………<
0010060 c0 01 00 01 c0 01 00 01 c0 01 00 01 c0 01 01 01  >…………….<
0010100 c0 01 00 01 c0 01 00 01 c0 01 00 01 c0 01 00 01  >…………….<
0010120 c0 01 00 01 c0 01 01 01 c0 01 01 01 c0 01 01 01  >…………….<
0010140 c0 01 01 01 c0 01 01 01 c0 01 01 01 c0 01 01 01  >…………….<
*                                                           
0010240 c0 01 01 01 04 00 01 01 00 00 00 00 00 00 00 00  >…………….<
0010260 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  >…………….<
*                                                           
0020000 01 82 03 01 02 00 00 00 4b 00 00 80 ce 10 bd 80  >……..K…….<
0020020 df ad 1e 00 00 00 00 00 00 00 00 00 00 00 00 00  >…………….<
0020040 00 00 00 00 c0 01 00 00 08 00 08 00 00 00 c0 01  >…………….<
0020060 10 00 10 00 00 00 00 00 18 00 18 00 00 00 00 00  >…………….<
0020100 20 00 20 00 00 00 00 00 00 00 00 00 00 00 80 00  > . ………….<
0020120 00 00 00 00 00 00 80 00 d9 0b 00 00 18 01 80 00  >…………….<

* 76th device (Read Failure)                                                    
dd if=/dev/mapper/mpath_asm175 bs=4096|od -tx1z|more                    
                              
0000000 01 82 01 01 00 00 00 00 4c 00 00 80 a4 7c 13 91  >……..L….|..<
0000020 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  >…………….<
0000040 4f 52 43 4c 44 49 53 4b 41 53 4d 44 49 53 4b 31  >ORCLDISKASMDISK1<
0000060 37 35 00 00 00 00 00 00 00 00 00 00 00 00 00 00  >75…………..<
0000100 00 00 10 0a 4c 00 02 03 44 47 44 41 54 41 30 31  >….L…DGDATA01<
0000120 5f 30 30 37 36 00 00 00 00 00 00 00 00 00 00 00  >_0076………..<
0000140 00 00 00 00 00 00 00 00 44 47 44 41 54 41 30 31  >……..DGDATA01<
0000160 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  >…………….<
0000200 00 00 00 00 00 00 00 00 44 47 44 41 54 41 30 31  >……..DGDATA01<
0000220 5f 30 30 37 36 00 00 00 00 00 00 00 00 00 00 00  >_0076………..<
0000240 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  >…………….<
*                                                           
0000300 00 00 00 00 00 00 00 00 c5 47 f7 01 00 38 d5 15  >………G…8..<
0000320 c5 47 f7 01 00 68 d5 15 00 02 00 10 00 00 10 00  >.G…h……….<
0000340 80 bc 01 00 14 d9 00 00 02 00 00 00 01 00 00 00  >…………….<
0000360 02 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  >…………….<
0000400 00 00 10 0a 14 cd f6 01 00 2c 95 00 00 00 00 00  >………,……<
0000420 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  >…………….<
*                                                           
0000660 00 00 00 00 00 00 00 00 02 ec 44 ff 00 00 00 00  >……….D…..<
0000700 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  >…………….<
*                                                           
0010000 01 82 02 01 01 00 00 00 4c 00 00 80 c3 62 4b 80  >……..L….bK.<
0010020 65 e0 35 00 00 00 00 00 00 00 00 00 00 00 00 00  >e.5………….<
0010040 00 00 00 00 fe 00 7d 00 c0 01 00 01 c0 01 00 01  >……}………<
0010060 c0 01 00 01 c0 01 00 01 c0 01 00 01 c0 01 00 01  >…………….<
*                                                           
0010460 c0 01 01 01 c0 01 01 01 c0 01 01 01 c0 01 01 01  >…………….<
*                                                           
0011020 c0 01 01 01 c0 01 01 01 14 00 01 01 00 00 00 00  >…………….<
0011040 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  >…………….<
*                                                           
0020000 01 82 03 01 02 00 00 00 4c 00 00 80 f0 45 ff 80  >……..L….E..<
0020020 9a d8 1c 00 00 00 00 00 00 00 00 00 00 00 00 00  >…………….<
0020040 00 00 00 00 c0 01 00 00 08 00 08 00 00 00 c0 01  >…………….<
0020060 10 00 10 00 00 00 00 00 18 00 18 00 00 00 00 00  >…………….<

 

kfod status=true asm_diskstring=’/dev/mapper/mpath_asm*’ disk=ALL
——————————————————————————–
Disk          Size Header    Path
================================================================================
1:      13893 Mb CANDIDATE /dev/mapper/mpath_asm001
2:      13892 Mb MEMBER    /dev/mapper/mpath_asm001p1
3:      13893 Mb CANDIDATE /dev/mapper/mpath_asm002
4:      13892 Mb MEMBER    /dev/mapper/mpath_asm002p1
5:      13893 Mb CANDIDATE /dev/mapper/mpath_asm003
6:      13892 Mb MEMBER    /dev/mapper/mpath_asm003p1
7:      13893 Mb CANDIDATE /dev/mapper/mpath_asm004

274:      13892 Mb MEMBER    /dev/mapper/mpath_asm137p1
275:      13893 Mb CANDIDATE /dev/mapper/mpath_asm138
276:      13892 Mb MEMBER    /dev/mapper/mpath_asm138p1   ==> MEMBER
277:      13893 Mb CANDIDATE /dev/mapper/mpath_asm139
278:      13892 Mb MEMBER    /dev/mapper/mpath_asm139p1

343:      62400 Mb CANDIDATE /dev/mapper/mpath_asm172
344:      62393 Mb MEMBER    /dev/mapper/mpath_asm172p1
345:      62400 Mb CANDIDATE /dev/mapper/mpath_asm173
346:      62393 Mb MEMBER    /dev/mapper/mpath_asm173p1
347:      62400 Mb CANDIDATE /dev/mapper/mpath_asm174
348:      62393 Mb MEMBER    /dev/mapper/mpath_asm174p1
349:      55572 Mb CANDIDATE /dev/mapper/mpath_asm175    ==> CANDIDATE~!
350:      55572 Mb CANDIDATE /dev/mapper/mpath_asm176
351:      55572 Mb CANDIDATE /dev/mapper/mpath_asm177
352:      55572 Mb CANDIDATE /dev/mapper/mpath_asm178
353:      55572 Mb CANDIDATE /dev/mapper/mpath_asm179

 

I could find out one. The added partition don’t have any partition tables, but the original asm disks do it.
Due to storage engineer fault, I suppose that the KFED results is “MEMBER” and the KFOD result is “CANDICATE’ status.
I’ll replace it to the additional disks with partition tables.
If it successful, I will reply it.

# Reference
(Doc ID 580153.1) How To Setup ASM on Linux Using ASMLIB Disks, Raw Devices or Block Devices?
In order to use a disk (e.g. SAN) in Automatic Storage Management, the disk must have a partition table.

Need urgent help on ASM issue – disk header status problem

If you cannot recover the data by yourself, ask Parnassusdata, the professional ORACLE database recovery team for help.

Parnassusdata Software Database Recovery Team

Service Hotline:  +86 13764045638 E-mail: service@parnassusdata.com

 

 

 
ODA system
+In order to workaround a known issue (startup hang when using Hitachi disks) FE/customer was in the process of replacing Hitachi drives on the system.
+ They pulled 2 disks out simultaneously and new disks put in
+ Diskgroups (DATA and RECO) dismounted  – as diskgroups built in  NORMAL redundancy .
+ Clusterware went down and realizing the problem- customer reinstated the original disks.

Current issue:
Disk groups are not mounting.
ASM disks from slot 0 are not being seen by ASM
ASM disks from slot 1 are being seen, but reported as new disks ( Header status=CANDIDATE)

Mounting the diskgroup with FORCE option has also not helped ( because there is 1 disk from slot 0 missing and 1 disk from slot 1 being reported as candidate)
** Customer has no backup and he needs to find out if it is fixable, or the system needs to be rebuilt from scratch.

 

--------------------------------------------------------------------------------
 Disk          Size Header    Path                                     Disk Group   User     Group   
================================================================================
   1:     491520 Mb CANDIDATE /dev/mapper/HDD_E0_S01_717882548p1       #            grid     asmadmin
   2:      75080 Mb CANDIDATE /dev/mapper/HDD_E0_S01_717882548p2       #            grid     asmadmin
   3:     491520 Mb MEMBER    /dev/mapper/HDD_E0_S04_717894368p1       DATA         grid     asmadmin
   4:      75080 Mb MEMBER    /dev/mapper/HDD_E0_S04_717894368p2       RECO         grid     asmadmin
   5:     491520 Mb MEMBER    /dev/mapper/HDD_E0_S05_717844560p1       DATA         grid     asmadmin
   6:      75080 Mb MEMBER    /dev/mapper/HDD_E0_S05_717844560p2       RECO         grid     asmadmin
   7:     491520 Mb MEMBER    /dev/mapper/HDD_E0_S08_717882264p1       DATA         grid     asmadmin
   8:      75080 Mb MEMBER    /dev/mapper/HDD_E0_S08_717882264p2       RECO         grid     asmadmin
   9:     491520 Mb MEMBER    /dev/mapper/HDD_E0_S09_717844480p1       DATA         grid     asmadmin
  10:      75080 Mb MEMBER    /dev/mapper/HDD_E0_S09_717844480p2       RECO         grid     asmadmin
  11:     491520 Mb MEMBER    /dev/mapper/HDD_E0_S12_717844976p1       DATA         grid     asmadmin
  12:      75080 Mb MEMBER    /dev/mapper/HDD_E0_S12_717844976p2       RECO         grid     asmadmin
  13:     491520 Mb MEMBER    /dev/mapper/HDD_E0_S13_717845048p1       DATA         grid     asmadmin
  14:      75080 Mb MEMBER    /dev/mapper/HDD_E0_S13_717845048p2       RECO         grid     asmadmin
  15:     491520 Mb MEMBER    /dev/mapper/HDD_E0_S16_717895116p1       DATA         grid     asmadmin
  16:      75080 Mb MEMBER    /dev/mapper/HDD_E0_S16_717895116p2       RECO         grid     asmadmin
  17:     491520 Mb MEMBER    /dev/mapper/HDD_E0_S17_717888848p1       DATA         grid     asmadmin
  18:      75080 Mb MEMBER    /dev/mapper/HDD_E0_S17_717888848p2       RECO         grid     asmadmin
  19:     491520 Mb MEMBER    /dev/mapper/HDD_E1_S02_717825396p1       DATA         grid     asmadmin
  20:      75080 Mb MEMBER    /dev/mapper/HDD_E1_S02_717825396p2       RECO         grid     asmadmin
  21:     491520 Mb MEMBER    /dev/mapper/HDD_E1_S03_717894252p1       DATA         grid     asmadmin
  22:      75080 Mb MEMBER    /dev/mapper/HDD_E1_S03_717894252p2       RECO         grid     asmadmin
  23:     491520 Mb MEMBER    /dev/mapper/HDD_E1_S06_717886840p1       DATA         grid     asmadmin
  24:      75080 Mb MEMBER    /dev/mapper/HDD_E1_S06_717886840p2       RECO         grid     asmadmin
  25:     491520 Mb MEMBER    /dev/mapper/HDD_E1_S07_717888592p1       DATA         grid     asmadmin
  26:      75080 Mb MEMBER    /dev/mapper/HDD_E1_S07_717888592p2       RECO         grid     asmadmin
  27:     491520 Mb MEMBER    /dev/mapper/HDD_E1_S10_717843708p1       DATA         grid     asmadmin
  28:      75080 Mb MEMBER    /dev/mapper/HDD_E1_S10_717843708p2       RECO         grid     asmadmin
  29:     491520 Mb MEMBER    /dev/mapper/HDD_E1_S11_717852256p1       DATA         grid     asmadmin
  30:      75080 Mb MEMBER    /dev/mapper/HDD_E1_S11_717852256p2       RECO         grid     asmadmin
  31:     491520 Mb MEMBER    /dev/mapper/HDD_E1_S14_717895376p1       DATA         grid     asmadmin
  32:      75080 Mb MEMBER    /dev/mapper/HDD_E1_S14_717895376p2       RECO         grid     asmadmin
  33:     491520 Mb MEMBER    /dev/mapper/HDD_E1_S15_717843800p1       DATA         grid     asmadmin
  34:      75080 Mb MEMBER    /dev/mapper/HDD_E1_S15_717843800p2       RECO         grid     asmadmin
  35:     491520 Mb MEMBER    /dev/mapper/HDD_E1_S18_717882696p1       DATA         grid     asmadmin
  36:      75080 Mb MEMBER    /dev/mapper/HDD_E1_S18_717882696p2       RECO         grid     asmadmin
  37:     491520 Mb MEMBER    /dev/mapper/HDD_E1_S19_717849420p1       DATA         grid     asmadmin
  38:      75080 Mb MEMBER    /dev/mapper/HDD_E1_S19_717849420p2       RECO         grid     asmadmin
  39:      70005 Mb MEMBER    /dev/mapper/SSD_E0_S20_805725574p1       REDO         grid     asmadmin
  40:      70005 Mb MEMBER    /dev/mapper/SSD_E0_S21_805708282p1       REDO         grid     asmadmin
  41:      70005 Mb MEMBER    /dev/mapper/SSD_E1_S22_805706766p1       REDO         grid     asmadmin
  42:      70005 Mb MEMBER    /dev/mapper/SSD_E1_S23_805706623p1       REDO         grid     asmadmin
--------------------------------------------------------------------------------
ORACLE_SID ORACLE_HOME                                                          
================================================================================
     +ASM1 /u01/app/11.2.0.3/grid                                               
     +ASM2 /u01/app/11.2.0.3/grid                                               

 
What is the backup block status ,

kfed read <device_name> aunum=1 blknum=254

Does it shows proper header ,if so the run kfed repair command.

if other blocks are fine except header this will work … else on next mount while doing COD recovery ,it will crash .

 

ASM log file info
==============

NOTE: cache closing disk 0 of grp 1: (not open) _DROPPED_0000_DATA
ERROR: Disk 1 cannot be offlined, since all the disks [1, 0] with mirrored data would be offline.
ERROR: too many offline disks in PST (grp 1)

Here the disks 0,1 have been put back in respective slots but still same issue.

 

Seems similar to the one described in ORA-15042: ASM disk is missing after add disk took place (Doc ID 1529397.1)

 

ASMdisk Status – Candidate disk after reboot [ recover ASM header files ]

If you cannot recover the data by yourself, ask Parnassusdata, the professional ORACLE database recovery team for help.

Parnassusdata Software Database Recovery Team

Service Hotline:  +86 13764045638 E-mail: service@parnassusdata.com

 

Customer has  migrated  oracle databases running on old SAN to new SAN using ASM  rebalance operation. Customer is using External   redundancy. After completing rebalance operation, customer rebooted all server and removed old SAN device entry on weekend.  Customer is unable to bring databases online on  One of the 4 servers. Customer is getting following error :
ORA-15032: not all alterations performed
ORA-15017: diskgroup “DATA” cannot be mounted
ORA-15063: ASM discovered an insufficient number of disks for diskgroup “DATA”

 

 

I have attached asm-kfed result for your reference. Is it possible to recover ASM header files without so customer doesn’t need to backup/restore  5 TB database ?

 

Total System Global Area  284008448 bytes
Fixed Size                  2158616 bytes
Variable Size             256684008 bytes
ASM Cache                  25165824 bytes
ORA-15032: not all alterations performed
ORA-15017: diskgroup "DATA" cannot be mounted
ORA-15063: ASM discovered an insufficient number of disks for diskgroup "DATA"
 

SQL> select group_number,disk_number,MOUNT_STATUS,HEADER_STATUS,MODE_STATUS,STATE,NAME,PATH from V$asm_disk;
 
GROUP_NUMBER DISK_NUMBER MOUNT_S HEADER_STATU MODE_ST STATE    NAME                           PATH
------------ ----------- ------- ------------ ------- -------- ------------------------------ ----------------------------------------
           0           0 CLOSED  CANDIDATE    ONLINE  NORMAL                                  /dev/rdsk/c0d26s6
           0          23 IGNORED MEMBER       ONLINE  NORMAL                                  /dev/rdsk/c0d22s6
           0           2 IGNORED MEMBER       ONLINE  NORMAL                                  /dev/rdsk/san03dp_dbs05dp_ASM01
           0           3 IGNORED MEMBER       ONLINE  NORMAL                                  /dev/rdsk/san03dp_dbs05dp_ASM12
           0           4 CLOSED  CANDIDATE    ONLINE  NORMAL                                  /dev/rdsk/c0d27s6
           0           5 CLOSED  CANDIDATE    ONLINE  NORMAL                                  /dev/rdsk/c0d25s6
           0           6 CLOSED  CANDIDATE    ONLINE  NORMAL                                  /dev/rdsk/san03dp_dbs05dp_ASM05
           0           7 IGNORED MEMBER       ONLINE  NORMAL                                  /dev/rdsk/san03dp_dbs05dp_ASM02
           0           8 CLOSED  CANDIDATE    ONLINE  NORMAL                                  /dev/rdsk/san03dp_dbs05dp_ASM08
           0           9 CLOSED  CANDIDATE    ONLINE  NORMAL                                  /dev/rdsk/c0d24s6
           0          10 IGNORED MEMBER       ONLINE  NORMAL                                  /dev/rdsk/san03dp_dbs05dp_ASM11
           0          11 IGNORED MEMBER       ONLINE  NORMAL                                  /dev/rdsk/c0d30s6
           0          12 CLOSED  CANDIDATE    ONLINE  NORMAL                                  /dev/rdsk/san03dp_dbs05dp_ASM07
           0          13 IGNORED MEMBER       ONLINE  NORMAL                                  /dev/rdsk/c0d31s6
           0          14 IGNORED MEMBER       ONLINE  NORMAL                                  /dev/rdsk/c0d21s6
           0          15 IGNORED MEMBER       ONLINE  NORMAL                                  /dev/rdsk/san03dp_dbs05dp_ASM10
           0          16 CLOSED  CANDIDATE    ONLINE  NORMAL                                  /dev/rdsk/c0d23s6
           0          17 IGNORED MEMBER       ONLINE  NORMAL                                  /dev/rdsk/san03dp_dbs05dp_ASM09
           0          18 CLOSED  CANDIDATE    ONLINE  NORMAL                                  /dev/rdsk/san03dp_dbs05dp_ASM03
           0          19 IGNORED MEMBER       ONLINE  NORMAL                                  /dev/rdsk/c0d29s6
           0          20 CLOSED  CANDIDATE    ONLINE  NORMAL                                  /dev/rdsk/c0d28s6
           0          21 CLOSED  CANDIDATE    ONLINE  NORMAL                                  /dev/rdsk/san03dp_dbs05dp_ASM04
           0          22 IGNORED MEMBER       ONLINE  NORMAL                                  /dev/rdsk/c0d32s6
           0           1 CLOSED  CANDIDATE    ONLINE  NORMAL                                  /dev/rdsk/san03dp_dbs05dp_ASM06





dev/rdsk/san03dp_dbs05dp_ASM03
kfbh.endian: 0 ; 0x000: 0x00
kfbh.hard: 0 ; 0x001: 0x00
kfbh.type: 0 ; 0x002: KFBTYP_INVALID
kfbh.datfmt: 0 ; 0x003: 0x00
kfbh.block.blk: 0 ; 0x004: blk=0
kfbh.block.obj: 0 ; 0x008: file=0
kfbh.check: 0 ; 0x00c: 0x00000000
kfbh.fcn.base: 0 ; 0x010: 0x00000000
kfbh.fcn.wrap: 0 ; 0x014: 0x00000000
kfbh.spare1: 0 ; 0x018: 0x00000000
kfbh.spare2: 0 ; 0x01c: 0x00000000
10037FE00 00000000 00000000 00000000 00000000 [................]
Repeat 255 times
KFED-00322: Invalid content encountered during block traversal: [kfbtTraverseBlock][Invalid OSM block type][][0]

/dev/rdsk/san03dp_dbs05dp_ASM04
kfbh.endian: 0 ; 0x000: 0x00
kfbh.hard: 0 ; 0x001: 0x00
kfbh.type: 0 ; 0x002: KFBTYP_INVALID
kfbh.datfmt: 0 ; 0x003: 0x00
kfbh.block.blk: 0 ; 0x004: blk=0
kfbh.block.obj: 0 ; 0x008: file=0
kfbh.check: 0 ; 0x00c: 0x00000000
kfbh.fcn.base: 0 ; 0x010: 0x00000000
kfbh.fcn.wrap: 0 ; 0x014: 0x00000000
kfbh.spare1: 0 ; 0x018: 0x00000000
kfbh.spare2: 0 ; 0x01c: 0x00000000
10037FE00 00000000 00000000 00000000 00000000 [................]
Repeat 255 times
KFED-00322: Invalid content encountered during block traversal: [kfbtTraverseBlock][Invalid OSM block type][][0]

/dev/rdsk/san03dp_dbs05dp_ASM05
kfbh.endian: 0 ; 0x000: 0x00
kfbh.hard: 0 ; 0x001: 0x00
kfbh.type: 0 ; 0x002: KFBTYP_INVALID
kfbh.datfmt: 0 ; 0x003: 0x00
kfbh.block.blk: 0 ; 0x004: blk=0
kfbh.block.obj: 0 ; 0x008: file=0
kfbh.check: 0 ; 0x00c: 0x00000000
kfbh.fcn.base: 0 ; 0x010: 0x00000000
kfbh.fcn.wrap: 0 ; 0x014: 0x00000000
kfbh.spare1: 0 ; 0x018: 0x00000000
kfbh.spare2: 0 ; 0x01c: 0x00000000
10037FE00 00000000 00000000 00000000 00000000 [................]
Repeat 255 times
KFED-00322: Invalid content encountered during block traversal: [kfbtTraverseBlock][Invalid OSM block type][][0]

/dev/rdsk/san03dp_dbs05dp_ASM06
kfbh.endian: 0 ; 0x000: 0x00
kfbh.hard: 0 ; 0x001: 0x00
kfbh.type: 0 ; 0x002: KFBTYP_INVALID
kfbh.datfmt: 0 ; 0x003: 0x00
kfbh.block.blk: 0 ; 0x004: blk=0
kfbh.block.obj: 0 ; 0x008: file=0
kfbh.check: 0 ; 0x00c: 0x00000000
kfbh.fcn.base: 0 ; 0x010: 0x00000000
kfbh.fcn.wrap: 0 ; 0x014: 0x00000000
kfbh.spare1: 0 ; 0x018: 0x00000000
kfbh.spare2: 0 ; 0x01c: 0x00000000
10037FE00 00000000 00000000 00000000 00000000 [................]
Repeat 255 times
KFED-00322: Invalid content encountered during block traversal: [kfbtTraverseBlock][Invalid OSM block type][][0]

/dev/rdsk/san03dp_dbs05dp_ASM07
kfbh.endian: 0 ; 0x000: 0x00
kfbh.hard: 0 ; 0x001: 0x00
kfbh.type: 0 ; 0x002: KFBTYP_INVALID
kfbh.datfmt: 0 ; 0x003: 0x00
kfbh.block.blk: 0 ; 0x004: blk=0
kfbh.block.obj: 0 ; 0x008: file=0
kfbh.check: 0 ; 0x00c: 0x00000000
kfbh.fcn.base: 0 ; 0x010: 0x00000000
kfbh.fcn.wrap: 0 ; 0x014: 0x00000000
kfbh.spare1: 0 ; 0x018: 0x00000000
kfbh.spare2: 0 ; 0x01c: 0x00000000
10037FE00 00000000 00000000 00000000 00000000 [................]
Repeat 255 times
KFED-00322: Invalid content encountered during block traversal: [kfbtTraverseBlock][Invalid OSM block type][][0]

/dev/rdsk/san03dp_dbs05dp_ASM08
kfbh.endian: 0 ; 0x000: 0x00
kfbh.hard: 0 ; 0x001: 0x00
kfbh.type: 0 ; 0x002: KFBTYP_INVALID
kfbh.datfmt: 0 ; 0x003: 0x00
kfbh.block.blk: 0 ; 0x004: blk=0
kfbh.block.obj: 0 ; 0x008: file=0
kfbh.check: 0 ; 0x00c: 0x00000000
kfbh.fcn.base: 0 ; 0x010: 0x00000000
kfbh.fcn.wrap: 0 ; 0x014: 0x00000000
kfbh.spare1: 0 ; 0x018: 0x00000000
kfbh.spare2: 0 ; 0x01c: 0x00000000
10037FE00 00000000 00000000 00000000 00000000 [................]
Repeat 255 times
KFED-00322: Invalid content encountered during block traversal: [kfbtTraverseBlock][Invalid OSM block type][][0]

 

 

ASM disks – ASM03/04/05/06/07/08 –  showing the status as “CANDIDATE” is a bit worry.  But if the devices were not part of DATA diskgroup before, these devices are not the major cause of ORA-15063.
Please check ASM alert.log whether these 5 devices were belong to DATA.

I’m more concerned about the following 4 devices as they show the status as “IGNORED” which indicates there are other devices showing the same disk information given asm_diskstring parameter.
– ASM01/02/0910/11

Chances are that the following devices below show the same disk information as ASM01/02/0910/11 and there is good chance that these different path point to the same physical devices.
~~
/dev/rdsk/c0d22s6
/dev/rdsk/c0d30s6
/dev/rdsk/c0d31s6
/dev/rdsk/c0d21s6
/dev/rdsk/c0d29s6
/dev/rdsk/c0d32s6
~~

Please check all disk header which disks show the duplicate ASM disk information using the following perl script in the note below
And if duplicate paths point to the same physical device, the additional device path should be disabled using “chmod 000 <device_path>”
KFED.PL for diagnosing – ORA-15063 ORA-15042 ORA-15020 (Doc ID 1346190.1)

ora-600 [4000]异常导致Open Database失败 使用bbed解决一例

如果自己搞不定可以找诗檀软件专业ORACLE数据库修复团队成员帮您恢复!

诗檀软件专业数据库修复团队

服务热线 : 13764045638   QQ号:47079569    邮箱:service@parnassusdata.com

 

近日在客户这边,测试环境遇到ora-600 [4000]异常,无法打开数据库.

 

造成ORA-00600 [4000]错误的主要原因是因为系统掉电,或者数据库恢复不完全,导致system表空间的一些位的状态异常,导致bootstrap无法正常情况.

以下是通过bbed方法,修改相关标志位,强制打开数据库的一种方法.

但是由于数据库实际上已经处于非一致状态,强烈建议用户打开数据库后,通过将生产环境的数据导出后,重建数据库.

 

 

造成ORA-00600 [4000]错误的主要原因是因为系统掉电,或者数据库恢复不完全,导致system表空间的一些位的状态异常,导致bootstrap无法正常情况.
以下是通过bbed方法,修改相关标志位,强制打开数据库的一种方法.
但是由于数据库实际上已经处于非一致状态,强烈建议用户打开数据库后,通过将生产环境的数据导出后,重建数据库.

--alert log相关的trace如下:
Tue Jun  4 15:41:34 2013
Errors in file /u02/oracle/app/oracle/admin/zjrpt/udump/zjrpt1_ora_770332.trc:
ORA-00600: internal error code, arguments: [4000], [41], [], [], [], [], [], []

trace文件 /u02/oracle/app/oracle/admin/zjrpt/udump/zjrpt1_ora_770332.trc,可以找到类似的相关内容
*** 2013-06-04 15:41:35.055
ksedmp: internal or fatal error
ORA-00600: internal error code, arguments: [4000], [41], [], [], [], [], [], []
Current SQL statement for this session:
select ctime, mtime, stime from obj$ where obj# = :1
----- Call Stack Trace -----
calling              call     entry                argument values in hex      
location             type     point                (? means dubious value)     
-------------------- -------- -------------------- ----------------------------
ksedst+001c          bl       ksedst1              0FFFFFFFF ? 000000020 ?
ksedmp+0290          bl       ksedst               1048EBB20 ?
ksfdmp+0018          bl       03F3BE0C             
kgeriv+0108          bl       _ptrgl               
kgeasi+0118          bl       kgeriv               1019AAD00 ? 000000000 ?
                                                   000000000 ?
                                                   26414080048E1630 ?
                                                   300000000 ?
...................................................                                                   
Block header dump:  0x0040007a
 Object id on Block? Y
 seg/obj: 0x12  csc: 0xc20.26114c0d  itc: 1  flg: -  typ: 1 - DATA
     fsl: 0  fnx: 0x0 ver: 0x01
 Itl           Xid                  Uba         Flag  Lck        Scn/Fsc
0x01   0x0029.026.0021d127  0x0091a949.3d91.01  --U-    1  fsc 0x0000.26114c0f
data_block_dump,data header at 0x700000b77a4e044
===============
tsiz: 0x1fb8
hsiz: 0xea
pbl: 0x700000b77a4e044
bdba: 0x0040007a
     76543210
flag=--------
ntab=1
nrow=108
frre=-1
fsbo=0xea
fseo=0x40c
avsp=0x368
tosp=0x368

根据trace文件的bdba: 0x0040007a可以找到对应的文件号和块ID
select dbms_utility.data_block_address_file(TO_NUMBER('40007a', 'XXXXXXXX')) file_id, 
dbms_utility.data_block_address_block(TO_NUMBER('40007a', 'XXXXXXXX')) block_id 
from dual;
   FILE_ID   BLOCK_ID
---------- ----------
         1        122
         
!!!!在后续操作中,因为要修改system表空间的数据文件内容;建议在修改前,在数据库关闭状态下,先将当前的system表空间看的数据文件进行备份


bbed可执行文件的生成:
linux和unix平台,10g中可能需要手工编译出bbed工具
cd $ORACLE_HOME/rdbms/lib
make -f ins_rdbms.mk $ORACLE_HOME/rdbms/lib/bbed
mv bbed $ORACLE_HOME/bin
在windows平台,只有8i上有bbed,可以通过将8i的bbed复制到10g的环境,进行使用

使用bbed需要2个配置文件
A : file.txt文件
这个文件对应的3个列为,file_id,文件名的和路径,数据文件大小,可以使用SQL生成:
set linesize 300
set pagesize 9999
select file#||' '||name||' '||bytes from v$datafile where FILE#=1;
FILE#||''||NAME||''||BYTES
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
1 /oradata/sys/zjrpt/system01.dbf 503316480

vi file.txt
1 /oradata/sys/zjrpt/system01.dbf 503316480

B :bbed.txt:
这个文件是bbed的配置文件,配置了blocksize,listfile,mode ,password参数
blocksize:数据文件所在表空间的blocksize
listfile:指向前面生成的文件file.txt
mode:edit说明是以编辑模式打开bbed
password:blockedit这个是定值,调用bbed的时候会
SQL> show parameter db_block_size
NAME                                 TYPE        VALUE
----------------------------- ----------- -------------
db_block_size                        integer     8192
SQL>

vi bbed.txt
blocksize=8192
listfile=/home/oracle/oracle_work/file.txt
mode=edit
password=blockedit

--正式调用bbed
bbed parfile=/home/oracle/oracle_work/bbed.txt

--指定需要编辑的文件号和块号
BBED> set file 1 block 122
        FILE#           1
        BLOCK#          122

--打印块的结构
BBED> p ktbbh
struct ktbbh, 48 bytes                      @20
   ub1 ktbbhtyp                             @20       0x01 (KDDBTDATA)
   union ktbbhsid, 4 bytes                  @24
      ub4 ktbbhsg1                          @24       0x00000012
      ub4 ktbbhod1                          @24       0x00000012
   struct ktbbhcsc, 8 bytes                 @28
      ub4 kscnbas                           @28       0x26114c0d
      ub2 kscnwrp                           @32       0x0c20
   b2 ktbbhict                              @36       1
   ub1 ktbbhflg                             @38       0x02 (NONE)
   ub1 ktbbhfsl                             @39       0x00
   ub4 ktbbhfnx                             @40       0x00000000
   struct ktbbhitl[0], 24 bytes             @44
      struct ktbitxid, 8 bytes              @44
         ub2 kxidusn                        @44       0x0029
         ub2 kxidslt                        @46       0x0026
         ub4 kxidsqn                        @48       0x0021d127
      struct ktbituba, 8 bytes              @52
         ub4 kubadba                        @52       0x0091a949
         ub2 kubaseq                        @56       0x3d91
         ub1 kubarec                        @58       0x01
      ub2 ktbitflg                          @60       0x2001 (KTBFUPB)
      union _ktbitun, 2 bytes               @62
         b2 _ktbitfsc                       @62       0
         ub2 _ktbitwrp                      @62       0x0000
      ub4 ktbitbas                          @64       0x26114c0f

上面的ktbitxid 即为XID
ktbituba即为UBA
这里主要是要修改 ktbitflg,该结构占据了2个offset,@60表示这个结构的偏移量从60开始,0x2001说明当前值为2001。
修改的时候需要注意一下的是要看big endian还是little endian,对于little endian,其字节序是反的。

查看相关文件的信息
BBED> info
 File#  Name                                                        Size(blks)
 -----  ----                                                        ----------
     1  /oradata/sys/zjrpt/system01.dbf                                  61440


--查看当前的详细信息,最开始的时候offset=20
BBED> show
        FILE#           1
        BLOCK#          122
        OFFSET          20
        DBA             0x0040007a (4194426 1,122)
        FILENAME        /oradata/sys/zjrpt/system01.dbf
        BIFILE          bifile.bbd
        LISTFILE        /home/oracle/oracle_work/file.txt
        BLOCKSIZE       8192
        MODE            Edit
        EDIT            Unrecoverable
        IBASE           Dec
        OBASE           Dec
        WIDTH           80
        COUNT           512
        LOGFILE         log.bbd
        SPOOL           No


--指定offset=60
BBED> set offset 60
        OFFSET          60

--查看当前的详细信息,offset已经从前面的60修改为20了
BBED> show
        FILE#           1
        BLOCK#          122
        OFFSET          60
        DBA             0x0040007a (4194426 1,122)
        FILENAME        /oradata/sys/zjrpt/system01.dbf
        BIFILE          bifile.bbd
        LISTFILE        /home/oracle/oracle_work/file.txt
        BLOCKSIZE       8192
        MODE            Edit
        EDIT            Unrecoverable
        IBASE           Dec
        OBASE           Dec
        WIDTH           80
        COUNT           512
        LOGFILE         log.bbd
        SPOOL           No

--查看相关的内容,可以看到修改前Offsets=60的情况下,最开始的2个字节为2001,也就是第一个字节为20,第二个字节为01;
--在这里需要再确认一下开头的值是否是2001(big endian)或者0120(little endian)
--如果发现最开始4个字符不是上述值,而是第5到8个字符是上述值
--那么差异可能是因为数据文件存放在裸设备还是文件系统上,块头占了1个字节的差异造成
--那么说明标志位的修改需要从offset=61开始,
BBED> dump
 File: /oradata/sys/zjrpt/system01.dbf (1)
 Block: 122              Offsets:   60 to  571           Dba:0x0040007a
------------------------------------------------------------------------
 20010000 26114c0f 0001006c ffff00ea 040c0368 03680000 006c1f7c 1f3c1efb
 1ebc1e7c 1e3c1e00 1dbe1d82 1d461d06 1cc51c84 1c431c02 1bc21b74 1b351af4
 1ab51a67 1a2b19eb 19aa196a 192a18ed 18ac186f 040c17ed 17a51764 172116e0
 1698165c 162015df 15a21567 152714e6 14a6145e 141d13e0 139f1362 132212e2
 12a11265 122411e7 11a01160 112110de 109b105f 10230fe6 0fa30f62 0f1d0ed8
 0e9c0e5a 0e140dd3 0d940d54 0d130cd3 0c930c57 0c170bd9 0b980b58 0b150ad0
 0a8b0a46 0a0109c1 09810941 090108c0 087d083e 080107bc 077e0742 06fc06bb
 06740632 05f005aa 0564051e 04d80495 04530000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000

--以下命令中offset 60还是61,取决与前一步的查询结果
--对于big endian平台来说,将前2个字节由2001修改为8001;如果是little endian平台来说,将前2个字节由0120修改为0180
BBED> modify /x 8001 offset 60
 File: /oradata/sys/zjrpt/system01.dbf (1)
 Block: 122              Offsets:   60 to  571           Dba:0x0040007a
------------------------------------------------------------------------
 80010000 26114c0f 0001006c ffff00ea 040c0368 03680000 006c1f7c 1f3c1efb
 1ebc1e7c 1e3c1e00 1dbe1d82 1d461d06 1cc51c84 1c431c02 1bc21b74 1b351af4
 1ab51a67 1a2b19eb 19aa196a 192a18ed 18ac186f 040c17ed 17a51764 172116e0
 1698165c 162015df 15a21567 152714e6 14a6145e 141d13e0 139f1362 132212e2
 12a11265 122411e7 11a01160 112110de 109b105f 10230fe6 0fa30f62 0f1d0ed8
 0e9c0e5a 0e140dd3 0d940d54 0d130cd3 0c930c57 0c170bd9 0b980b58 0b150ad0
 0a8b0a46 0a0109c1 09810941 090108c0 087d083e 080107bc 077e0742 06fc06bb
 06740632 05f005aa 0564051e 04d80495 04530000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000


--确认修改后的相关的内容
BBED> dump
 File: /oradata/sys/zjrpt/system01.dbf (1)
 Block: 122              Offsets:   60 to  571           Dba:0x0040007a
------------------------------------------------------------------------
 80010000 26114c0f 0001006c ffff00ea 040c0368 03680000 006c1f7c 1f3c1efb
 1ebc1e7c 1e3c1e00 1dbe1d82 1d461d06 1cc51c84 1c431c02 1bc21b74 1b351af4
 1ab51a67 1a2b19eb 19aa196a 192a18ed 18ac186f 040c17ed 17a51764 172116e0
 1698165c 162015df 15a21567 152714e6 14a6145e 141d13e0 139f1362 132212e2
 12a11265 122411e7 11a01160 112110de 109b105f 10230fe6 0fa30f62 0f1d0ed8
 0e9c0e5a 0e140dd3 0d940d54 0d130cd3 0c930c57 0c170bd9 0b980b58 0b150ad0
 0a8b0a46 0a0109c1 09810941 090108c0 087d083e 080107bc 077e0742 06fc06bb
 06740632 05f005aa 0564051e 04d80495 04530000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000

 

--提交修改和校验
sum apply

然后退出即可

后续出现ora-600 [2662],ora-600 [4993]等异常,通过设置隐含参数等比较容易过去

某企业ASM存储变更导致ASM DISKGROUP上的多套库出现损坏

某南方变更ASM所使用底层存储的RAID盘后,ASM 马上出现fail I/O问题,该环境中ASM Diskgroup上有多套数据库,全部受到影响。

Warning:  IO failed group :2 disk number incarnation disk AU disk_offset io_size opeartion:Write result: I/O error process_ID

 

 

QQ截图20150117214132

其中一套库的UNDO明显损坏,OPEN Database失败出现了ORA-00600[4193]错误:

QQ截图20150117214158

其他多套出现了大量的坏块:

Corrupt block relative dba:  0x99806a3b file 2 block 31291

 

QQ截图20150117214216

 

此外有库虽然能打开 但访问ASM上的数据时会出现ORA-15055 、ORA-01115错误。

 

 

QQ截图20150117214024

 

2015-01-09 00.51.31

如果自己搞不定可以找诗檀软件专业ORACLE数据库修复团队成员帮您恢复!

诗檀软件专业数据库修复团队

服务热线 : 13764045638   QQ号:47079569    邮箱:service@parnassusdata.com

ORA-00600 [KRSCINT.NO_CUR_LOG]数据库无法打开一例

 
SQL> alter database open;
alter database open
*
ERROR at line 1:
ORA-600: internal error code, arguments: [krscint.no_cur_log], [], [], [],

以上ORA-00600 [krscint.no_cur_log] 错误中的krscint 代表 Kernel Recovery Standby Crash recovery support,此错误可能为在线日志文件损坏所致:

 alter system dump logfile 

进一步诊断需要通过DUMP LOGFILE来获得更多信息。

 

如果自己搞不定可以找诗檀软件专业ORACLE数据库修复团队成员帮您恢复!

诗檀软件专业数据库修复团队

服务热线 : 13764045638   QQ号:47079569    邮箱:service@parnassusdata.com

ORA-600 [12820] 导致alter database open打开数据库失败一例

如果打开数据库open database时遇到如下报错:

ORA-00600: internal error code, arguments: [12820]

且其stack call 为:

 

 

ksedmp kgeriv kgeasi kkmevw kkmfcbvw kkmfcblo kkmpfcbk qcsprfro qcspafq qcspqb qcspqb kkmdrv 
opiSem opiDeferredSem opitca  rpiswu2 kksSetBindType kksfbc opiexe opoal8 opiodr kpoodrc 
rpiswu2 kpoodr upirtrc kpurcsc kpuexecv8 kpuexec OCIStmtExecute ktsmg_execute_sql ktsmgfru 
ktusmout_online_ut ktusmiut_init_ut ktuini adbdrv opiexe opiosq0 kpooprx kpoal8 opiodr ttcpip 
opitsk opiino opiodr  opidrv sou2o main start

 

ORA-00600 [12820]在这里 表达的是 数据库打开过程中发现在数据字典上 视图的定义长度和视图定义文本的实际长度之间有差别。

在一个特定的例子中,可能是如下的查询引发的报错:

select max(maxconcurrency) from sys.wrh$_undostat where instance_number = :1 and dbid = :2 and snap_id in (select snap_id from dba_hist_snapshot where end_interval_time > (select max(end_interval_time)-7 from dba_hist_snapshot))

 

sys.wrh$_undostat 是一张表

dba_hist_snapshot是视图上的同义词

 

有的时候却是可能是数据字典上的对应记录存在损坏的情况。尝试重建视图即可绕过这些问题。但如果是大量数据库视图都出现了问题,那么要重新运行catalog.sql。

针对该问题的解决需要设置部分隐藏参数,并重新运行数据库核心脚本catalog.sql。

 

ORA-00600 [2665]导致ALTER DATABASE OPEN RESETLOGS 失败一例

当对一个克隆的数据库做Open resetlogs 时可能出现ORA-00600: internal error code, arguments: [2665] …报错。

 

出现该报错的原因可能是因为LOG_BUFFER参数过小,可以考虑扩大参数LOG_BUFFER来绕过该问题。

Ora-00600[kskopen1] Oracle数据库无法打开OPEN一例

如果遇到了Ora-00600[kskopen1] 且是在Open database 打开数据库时出现该错误,导致数据库无法顺利打开。那么一般可以参考这个 case。

对于该Ora-00600[kskopen1]报错考虑是否是Resource_Manager的内部计划被不当修改了。对于该问题可以考虑临时性关闭Resource_Manager特性来绕过数据库无法打开的问题。

沪ICP备14014813号-2

沪公网安备 31010802001379号