ERROR: failed to update diskgroup resource ora.OCR_NEW.dg

1 创建磁盘组


SQL>  CREATE DISKGROUP OCR_NEW NORMAL REDUNDANCY  DISK '/dev/oracledisk/ssd_ocr01',

'/dev/oracledisk/ssd_ocr02','/dev/oracledisk/ssd_ocr03'

 ATTRIBUTE 'compatible.asm'='11.2.0.0.0','au_size'='1M' ;


Diskgroup created.


2 发现两个节点的磁盘组信息不一致

节点1 [grid@host1 ~]$ asmcmd

ASMCMD> lsdg

State    Type    Rebal  Sector  Block       AU  Total_MB  Free_MB  Req_mir_free_MB  Usable_file_MB  Offline_disks  Voting_files  Name

MOUNTED  EXTERN  N         512   4096  1048576   1024000   955947                0          955947              0             N  ARCH/

MOUNTED  EXTERN  N         512   4096  1048576   3072000  1392206                0         1392206              0             N  DATA/

MOUNTED  NORMAL  N         512   4096  1048576     30720    29794            10240            9777              0             Y  OCR/

MOUNTED  NORMAL  N         512   4096  1048576     30720    30561            10240           10160              0             N  OCR_NEW/


节点2 未发现相关磁盘组信息:

[grid@host2 ~]$ asmcmd

ASMCMD> lsdg

State    Type    Rebal  Sector  Block       AU  Total_MB  Free_MB  Req_mir_free_MB  Usable_file_MB  Offline_disks  Voting_files  Name

MOUNTED  EXTERN  N         512   4096  1048576   1024000   955947                0          955947              0             N  ARCH/

MOUNTED  EXTERN  N         512   4096  1048576   3072000  1392206                0         1392206              0             N  DATA/

MOUNTED  NORMAL  N         512   4096  1048576     30720    29794            10240            9777              0             Y  OCR/


3 查看ASM实例的日志信息,发现在节点1报ERROR: failed to update diskgroup resource ora.OCR_NEW.dg 错误:


节点1:

Wed Sep 14 16:14:09 2022

SQL>  CREATE DISKGROUP OCR_NEW NORMAL REDUNDANCY  DISK '/dev/oracledisk/ssd_ocr01',

'/dev/oracledisk/ssd_ocr02',

'/dev/oracledisk/ssd_ocr03' ATTRIBUTE 'compatible.asm'='11.2.0.0.0','au_size'='1M'  

NOTE: Assigning number (4,0) to disk (/dev/oracledisk/ssd_ocr01)

NOTE: Assigning number (4,1) to disk (/dev/oracledisk/ssd_ocr02)

NOTE: Assigning number (4,2) to disk (/dev/oracledisk/ssd_ocr03)

NOTE: initializing header on grp 4 disk OCR_NEW_0000

NOTE: initializing header on grp 4 disk OCR_NEW_0001

NOTE: initializing header on grp 4 disk OCR_NEW_0002

Wed Sep 14 16:14:09 2022

GMON updating for reconfiguration, group 4 at 43 for pid 34, osid 46138

NOTE: group 4 PST updated.

NOTE: initiating PST update: grp = 4

GMON updating group 4 at 44 for pid 34, osid 46138

NOTE: group OCR_NEW: initial PST location: disk 0000 (PST copy 0)

NOTE: group OCR_NEW: initial PST location: disk 0001 (PST copy 1)

NOTE: group OCR_NEW: initial PST location: disk 0002 (PST copy 2)

NOTE: PST update grp = 4 completed successfully 

NOTE: cache registered group OCR_NEW number=4 incarn=0x3a020595

NOTE: cache began mount (first) of group OCR_NEW number=4 incarn=0x3a020595

NOTE: cache opening disk 0 of grp 4: OCR_NEW_0000 path:/dev/oracledisk/ssd_ocr01

NOTE: cache opening disk 1 of grp 4: OCR_NEW_0001 path:/dev/oracledisk/ssd_ocr02

NOTE: cache opening disk 2 of grp 4: OCR_NEW_0002 path:/dev/oracledisk/ssd_ocr03

Wed Sep 14 16:14:09 2022

* allocate domain 4, invalid = TRUE 

kjbdomatt send to inst 2

Wed Sep 14 16:14:09 2022

NOTE: attached to recovery domain 4

NOTE: cache creating group 4/0x3A020595 (OCR_NEW)

NOTE: cache mounting group 4/0x3A020595 (OCR_NEW) succeeded

NOTE: allocating F1X0 on grp 4 disk OCR_NEW_0000

NOTE: allocating F1X0 on grp 4 disk OCR_NEW_0001

NOTE: allocating F1X0 on grp 4 disk OCR_NEW_0002

NOTE: Created Used Space Directory for 1 threads

NOTE: diskgroup must now be re-mounted prior to first use

NOTE: cache dismounting (clean) group 4/0x3A020595 (OCR_NEW) 

NOTE: messaging CKPT to quiesce pins Unix process pid: 46138, image: oracle@host1 (TNS V1-V3)

NOTE: lgwr not being msg'd to dismount

kjbdomdet send to inst 2

detach from dom 4, sending detach message to inst 2

freeing rdom 4

NOTE: detached from domain 4

NOTE: cache dismounted group 4/0x3A020595 (OCR_NEW) 

GMON dismounting group 4 at 45 for pid 34, osid 46138

GMON dismounting group 4 at 46 for pid 34, osid 46138

NOTE: Disk OCR_NEW_0000 in mode 0x7f marked for de-assignment

NOTE: Disk OCR_NEW_0001 in mode 0x7f marked for de-assignment

NOTE: Disk OCR_NEW_0002 in mode 0x7f marked for de-assignment

SUCCESS: diskgroup OCR_NEW was created

NOTE: cache deleting context for group OCR_NEW 4/0x3a020595

NOTE: cache registered group OCR_NEW number=4 incarn=0x42e20597

NOTE: cache began mount (first) of group OCR_NEW number=4 incarn=0x42e20597

NOTE: Assigning number (4,2) to disk (/dev/oracledisk/ssd_ocr03)

NOTE: Assigning number (4,1) to disk (/dev/oracledisk/ssd_ocr02)

NOTE: Assigning number (4,0) to disk (/dev/oracledisk/ssd_ocr01)

NOTE: GMON heartbeating for grp 4

GMON querying group 4 at 49 for pid 34, osid 46138

NOTE: cache opening disk 0 of grp 4: OCR_NEW_0000 path:/dev/oracledisk/ssd_ocr01

NOTE: F1X0 found on disk 0 au 2 fcn 0.0

NOTE: cache opening disk 1 of grp 4: OCR_NEW_0001 path:/dev/oracledisk/ssd_ocr02

NOTE: F1X0 found on disk 1 au 2 fcn 0.0

NOTE: cache opening disk 2 of grp 4: OCR_NEW_0002 path:/dev/oracledisk/ssd_ocr03

NOTE: F1X0 found on disk 2 au 2 fcn 0.0

NOTE: cache mounting (first) normal redundancy group 4/0x42E20597 (OCR_NEW)

* allocate domain 4, invalid = TRUE 

kjbdomatt send to inst 2

NOTE: attached to recovery domain 4

NOTE: cache recovered group 4 to fcn 0.0

NOTE: redo buffer size is 256 blocks (1053184 bytes)

Wed Sep 14 16:14:16 2022

NOTE: LGWR attempting to mount thread 1 for diskgroup 4 (OCR_NEW)

NOTE: LGWR found thread 1 closed at ABA 0.10750

NOTE: LGWR mounted thread 1 for diskgroup 4 (OCR_NEW)

NOTE: LGWR opening thread 1 at fcn 0.0 ABA 2.0

NOTE: setting 11.2 start ABA for group OCR_NEW thread 1 to 2.0

NOTE: cache mounting group 4/0x42E20597 (OCR_NEW) succeeded

NOTE: cache ending mount (success) of group OCR_NEW number=4 incarn=0x42e20597

Wed Sep 14 16:14:17 2022

NOTE: Instance updated compatible.asm to 11.2.0.0.0 for grp 4

SUCCESS: diskgroup OCR_NEW was mounted

SUCCESS:  CREATE DISKGROUP OCR_NEW NORMAL REDUNDANCY  DISK '/dev/oracledisk/ssd_ocr01','/dev/oracledisk/ssd_ocr02',

'/dev/oracledisk/ssd_ocr03' ATTRIBUTE 'compatible.asm'='11.2.0.0.0','au_size'='1M' 

Wed Sep 14 16:14:17 2022

NOTE: diskgroup resource ora.OCR_NEW.dg is online

ERROR: failed to update diskgroup resource ora.OCR_NEW.dg



节点2:

Wed Sep 14 09:57:30 2022

NOTE: membership refresh pending for group 2/0xe9470f1d (DATA)

GMON querying group 2 at 29 for pid 18, osid 413705

GMON querying group 2 at 30 for pid 18, osid 413705

NOTE: Disk DATA_0000 in mode 0x0 marked for de-assignment

NOTE: Disk DATA_0001 in mode 0x0 marked for de-assignment

NOTE: Disk DATA_0002 in mode 0x0 marked for de-assignment

NOTE: Disk DATA_0003 in mode 0x0 marked for de-assignment

SUCCESS: refreshed membership for 2/0xe9470f1d (DATA)

NOTE: Attempting voting file refresh on diskgroup DATA

NOTE: Refresh completed on diskgroup DATA. No voting file found.

Wed Sep 14 16:14:09 2022

* allocate domain 4, invalid = TRUE 

freeing rdom 4

 Received detach msg from inst 1 for dom 4

* allocate domain 4, invalid = TRUE 



4 查看集群相关磁盘组信息,发现节点2的磁盘信息为offline


[grid@host1 ~]$ crsctl stat res -t

--------------------------------------------------------------------------------

NAME           TARGET  STATE        SERVER                   STATE_DETAILS       

--------------------------------------------------------------------------------

Local Resources

--------------------------------------------------------------------------------

ora.ARCH.dg

               ONLINE  ONLINE       host1                                    

               ONLINE  ONLINE       host2                                    

ora.DATA.dg

               ONLINE  ONLINE       host1                                    

               ONLINE  ONLINE       host2                                    

ora.LISTENER.lsnr

               ONLINE  ONLINE       host1                                    

               ONLINE  ONLINE       host2                                    

ora.OCR.dg

               ONLINE  ONLINE       host1                                    

               ONLINE  ONLINE       host2                                    

ora.OCR_NEW.dg

               ONLINE  ONLINE       host1                                    

               OFFLINE OFFLINE      host2 


5 使用如下命令查看磁盘组的集群信息,只在节点1启动了             

 srvctl status diskgroup -g OCR_NEW -a     

 

 [grid@host1 ~]$  srvctl status diskgroup -g OCR_NEW -a

Disk Group OCR_NEW is running on host1

Disk Group OCR_NEW is enabled    


6 执行如下命令启动节点2上的磁盘组

 srvctl start diskgroup -g OCR_NEW -n host2  

 

 节点1asm实例的日志信息:

 Wed Sep 14 16:26:09 2022

NOTE: enlarging ACD to 2 threads for group 4/0x42e20597 (OCR_NEW)

SUCCESS: ACD enlarged for group 4/0x42e20597 (OCR_NEW)


节点2asm实例的日志信息:

Wed Sep 14 16:26:09 2022

NOTE: Instance updated compatible.asm to 11.2.0.0.0 for grp 4

SUCCESS: diskgroup OCR_NEW was mounted

SUCCESS: ALTER DISKGROUP OCR_NEW MOUNT  /* asm agent *//* {1:55916:4774} */

Wed Sep 14 16:26:10 2022

NOTE: diskgroup resource ora.OCR_NEW.dg is updated


7 查看磁盘组相关信息,一切恢复正常

[grid@host1 ~]$ srvctl status diskgroup -g OCR_NEW -a

Disk Group OCR_NEW is running on host1,host2

Disk Group OCR_NEW is enabled


[grid@host1 ~]$ crsctl stat res -t

--------------------------------------------------------------------------------

NAME           TARGET  STATE        SERVER                   STATE_DETAILS       

--------------------------------------------------------------------------------

Local Resources

--------------------------------------------------------------------------------

ora.ARCH.dg

               ONLINE  ONLINE       host1                                    

               ONLINE  ONLINE       host2                                    

ora.DATA.dg

               ONLINE  ONLINE       host1                                    

               ONLINE  ONLINE       host2                                    

ora.LISTENER.lsnr

               ONLINE  ONLINE       host1                                    

               ONLINE  ONLINE       host2                                    

ora.OCR.dg

               ONLINE  ONLINE       host1                                    

               ONLINE  ONLINE       host2                                    

ora.OCR_NEW.dg

               ONLINE  ONLINE       host1                                    

               ONLINE  ONLINE       host2 

               

[grid@host2 ~]$ asmcmd

ASMCMD> lsdg

State    Type    Rebal  Sector  Block       AU  Total_MB  Free_MB  Req_mir_free_MB  Usable_file_MB  Offline_disks  Voting_files  Name

MOUNTED  EXTERN  N         512   4096  1048576   1024000   954153                0          954153              0             N  ARCH/

MOUNTED  EXTERN  N         512   4096  1048576   3072000  1392206                0         1392206              0             N  DATA/

MOUNTED  NORMAL  N         512   4096  1048576     30720    29794            10240            9777              0             Y  OCR/

MOUNTED  NORMAL  N         512   4096  1048576     30720    30435            10240           10097              0             N  OCR_NEW/  


8 本文解决方法参照Oracle的官方文档《ALERT.LOG for ASM Shows "WARNING: failed to online diskgroup 

resource ora..dg (unable to communicate with CRSD/OHASD)" (Doc ID 1060920.1)》,此文

说明从11.2.01以后,集群的启动顺序发现变化,且不依赖与ASM磁盘组,故导致刷新失败。

             


请使用浏览器的分享功能分享到微信等