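Starting with 11.2.0.2, Grid Infrastructure natively supports a redundant interconnect without any third-party IP failover technology (bonding, IPMP, or similar). Multiple private network adapters can be defined either during installation or afterward with oifcfg. The Oracle Database, CSS, OCR, CRS, CTSS, and EVM components in 11.2.0.2 use this feature automatically.
Grid Infrastructure can activate at most four private network adapters at a time, even if more are defined. The ora.cluster_interconnect.haip resource starts one to four link-local HAIP addresses on the private network adapters for interconnect communication by Oracle RAC, Oracle ASM, Oracle ACFS, and so on.
Grid automatically picks unused link-local addresses for HAIP from the reserved 169.254.*.* subnet. According to RFC 3927, the link-local subnet 169.254.*.* must not be used for any other purpose. With HAIP, interconnect traffic is by default load balanced across all active interconnect interfaces, and if one of them fails or becomes non-communicative, the corresponding HAIP address fails over transparently to another adapter.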
After GI is configured, you can use “
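Note: If you use the redundant interconnect/HAIP feature of 11.2.0.2 and above (as described in Case 2 below), all interconnect interfaces currently need to be placed on separate subnets. If the interfaces are all on the same subnet and the cable is pulled from the first NIC in the routing table, a restart or node reboot will occur.
At the time of writing, a redundant private network requires a different subnet for each network adapter. For example, if eth1, eth2, and eth3 are used for the private network, they should be on different subnets; see Case 2.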
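The separate-subnet requirement above can be checked before the interfaces are registered. The following is a minimal sketch, not an Oracle tool: the function name `find_shared_subnets` and the sample addresses are hypothetical, and it only compares the IPv4 networks derived from each address and netmask.

```python
import ipaddress
from collections import defaultdict


def find_shared_subnets(interfaces):
    """Group interface names by IPv4 subnet and return the subnets
    that contain more than one interface.

    `interfaces` maps an interface name to an "address/netmask" string.
    """
    by_subnet = defaultdict(list)
    for name, cidr in interfaces.items():
        # strict=False accepts a host address and derives its network.
        network = ipaddress.ip_network(cidr, strict=False)
        by_subnet[str(network)].append(name)
    return {net: sorted(names) for net, names in by_subnet.items() if len(names) > 1}


# Hypothetical addresses for illustration only (not taken from the note).
good = {
    "eth1": "10.10.0.168/255.255.255.128",
    "eth6": "10.11.0.188/255.255.255.128",
    "eth7": "10.12.0.190/255.255.255.128",
}
# Same layout, but eth6 is moved onto the subnet already used by eth1.
bad = dict(good, eth6="10.10.0.170/255.255.255.128")

print(find_shared_subnets(good))
print(find_shared_subnets(bad))
```

An empty result means every private interface sits on its own subnet; any entry in the result names the interfaces that share one.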
When Oracle Clusterware is fully up, the haip resource should show as ONLINE:
$ $GRID_HOME/bin/crsctl stat res -t -init
..
ora.cluster_interconnect.haip
      1        ONLINE  ONLINE
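For scripted health checks, the status shown above can be read from captured command output. This is a rough sketch that assumes the tabular layout shown here (the resource name on one line, followed by rows that start with an ordinal, then the target and state columns); the function name `haip_is_online` and both sample strings are illustrative, not part of any Oracle API.

```python
def haip_is_online(crsctl_output, resource="ora.cluster_interconnect.haip"):
    """Return True if every instance row listed under `resource`
    reports target ONLINE and state ONLINE."""
    lines = crsctl_output.splitlines()
    for index, line in enumerate(lines):
        if line.strip() != resource:
            continue
        states = []
        for row in lines[index + 1:]:
            fields = row.split()
            # Instance rows start with a numeric ordinal; anything else ends the block.
            if len(fields) < 3 or not fields[0].isdigit():
                break
            states.append((fields[1], fields[2]))
        return bool(states) and all(s == ("ONLINE", "ONLINE") for s in states)
    return False


# Illustrative samples modelled on the output above.
SAMPLE_UP = "ora.cluster_interconnect.haip\n      1        ONLINE  ONLINE\n"
SAMPLE_DOWN = "ora.cluster_interconnect.haip\n      1        ONLINE  OFFLINE\n"

print(haip_is_online(SAMPLE_UP))
print(haip_is_online(SAMPLE_DOWN))
```

In practice the text would come from running `$GRID_HOME/bin/crsctl stat res -t -init` and capturing its standard output, for example with `subprocess.run(..., capture_output=True, text=True)`.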
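Case 1: Single Private Network Adapter
If multiple physical network adapters are bonded together at the OS level and presented as a single device name (for example, bond0), the environment is still considered a single network adapter environment. A single private network adapter cannot provide true HAIP because there is only one adapter; at least two are recommended for true HAIP. If only one private network adapter is defined, such as eth1 in the example below, HAIP creates one virtual IP on it. The following is the expected output when Grid is up and running: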
$ $GRID_HOME/bin/oifcfg getif
eth1 10.x.x.128 global cluster_interconnect
eth3 10.1.x.x global public

$ $GRID_HOME/bin/oifcfg iflist -p -n
eth1 10.x.x.128 PRIVATE 255.255.255.128
eth1 169.254.0.0 UNKNOWN 255.255.0.0
eth3 10.1.x.x PRIVATE 255.255.255.128

Note:
1. Subnet 169.254.0.0 on eth1 is started by resource haip.
2. Refer to note 1386709.1 for an explanation of the output.
ifconfig
..
eth1      Link encap:Ethernet  HWaddr 00:16:3E:11:11:22
          inet addr:10.x.x.168  Bcast:10.1.0.255  Mask:255.255.255.128
          inet6 addr: fe80::216:3eff:fe11:1122/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:6369306 errors:0 dropped:0 overruns:0 frame:0
          TX packets:4270790 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:3037449975 (2.8 GiB)  TX bytes:2705797005 (2.5 GiB)

eth1:1    Link encap:Ethernet  HWaddr 00:16:3E:11:22:22
          inet addr:169.254.x.x  Bcast:169.254.255.255  Mask:255.255.0.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
Instance alert.log (ASM and database):
Private Interface 'eth1:1' configured from GPnP for use as a private interconnect.
[name='eth1:1', type=1, ip=169.254.x.x, mac=00-16-3e-11-11-22, net=169.254.0.0/16, mask=255.255.0.0, use=haip:cluster_interconnect/62]
Public Interface 'eth3' configured from GPnP for use as a public interface.
[name='eth3', type=1, ip=10.x.x.168, mac=00-16-3e-11-11-44, net=10.1.x.x/25, mask=255.255.255.128, use=public/1]
..
Shared memory segment for instance monitoring created
Picked latch-free SCN scheme 3
..
Cluster communication is configured to use the following interface(s) for this instance
169.254.x.x

Note: The interconnect will use the virtual private IP 169.254.x.x instead of the real private IP. A pre-11.2.0.2 instance will still use the real private IP by default; to take advantage of the new feature, the init.ora parameter cluster_interconnects can be updated each time Grid is restarted.

For 11.2.0.2 and above, v$cluster_interconnects will show the HAIP information:
SQL> select name,ip_address from v$cluster_interconnects;

NAME            IP_ADDRESS
--------------- ----------------
eth1:1          169.254.x.x
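The HAIP entries in the `oifcfg iflist -p -n` output above can be picked out by testing whether each subnet falls inside the link-local range. The sketch below assumes four whitespace-separated columns per row (interface, subnet, type, netmask), as in the sample; the function name `haip_subnets` is hypothetical, and rows whose addresses are masked with `x` are skipped because they cannot be parsed.

```python
import ipaddress

# Link-local range reserved by RFC 3927.
LINK_LOCAL = ipaddress.ip_network("169.254.0.0/16")


def haip_subnets(iflist_output):
    """Return (interface, network) pairs whose subnet lies in 169.254.0.0/16."""
    found = []
    for line in iflist_output.splitlines():
        fields = line.split()
        if len(fields) != 4:
            continue
        name, subnet, _iftype, mask = fields
        try:
            network = ipaddress.ip_network(f"{subnet}/{mask}", strict=False)
        except ValueError:
            # Masked placeholders such as 10.x.x.128 are not valid addresses.
            continue
        if network.subnet_of(LINK_LOCAL):
            found.append((name, str(network)))
    return found


# Rows copied from the Case 1 output above.
SAMPLE = """\
eth1 10.x.x.128 PRIVATE 255.255.255.128
eth1 169.254.0.0 UNKNOWN 255.255.0.0
eth3 10.1.x.x PRIVATE 255.255.255.128
"""

print(haip_subnets(SAMPLE))
```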
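Case 2: Multiple Private Network Adapters
If each node has multiple private network adapters, multiple switches can be deployed. If one network adapter fails, the HAIP on that network segment fails over to the other adapters on all nodes.
2.1 Default Status
The following is an example with three private networks, eth1, eth6, and eth7, when Grid is up and running: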
$ $GRID_HOME/bin/oifcfg getif
eth1 10.x.x.128 global cluster_interconnect
eth3 10.1.x.x global public
eth6 10.11.x.x global cluster_interconnect
eth7 10.12.x.x global cluster_interconnect

$ $GRID_HOME/bin/oifcfg iflist -p -n
eth1 10.x.x.128 PRIVATE 255.255.255.128
eth1 169.254.0.x UNKNOWN 255.255.192.0
eth1 169.254.192.x UNKNOWN 255.255.192.0
eth3 10.1.x.x PRIVATE 255.255.255.128
eth6 10.11.x.x PRIVATE 255.255.255.128
eth6 169.254.64.x UNKNOWN 255.255.192.0
eth7 10.12.x.x PRIVATE 255.255.255.128
eth7 169.254.128.x UNKNOWN 255.255.192.0

Note: Resource haip started four virtual private IPs: two on eth1, and one each on eth6 and eth7.
ifconfig
..
eth1      Link encap:Ethernet  HWaddr 00:16:3E:11:11:22
          inet addr:10.x.x.168  Bcast:10.1.0.255  Mask:255.255.255.128
          inet6 addr: fe80::216:3eff:fe11:1122/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:15176906 errors:0 dropped:0 overruns:0 frame:0
          TX packets:10239298 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:7929246238 (7.3 GiB)  TX bytes:5768511630 (5.3 GiB)

eth1:1    Link encap:Ethernet  HWaddr 00:16:3E:11:11:22
          inet addr:169.254.x.x  Bcast:169.254.63.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1

eth1:2    Link encap:Ethernet  HWaddr 00:16:3E:11:11:22
          inet addr:169.254.x.x  Bcast:169.254.255.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1

eth6      Link encap:Ethernet  HWaddr 00:16:3E:11:11:77
          inet addr:10.11.x.x  Bcast:10.11.0.255  Mask:255.255.255.128
          inet6 addr: fe80::216:3eff:fe11:1177/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:7068185 errors:0 dropped:0 overruns:0 frame:0
          TX packets:595746 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:2692567483 (2.5 GiB)  TX bytes:382357191 (364.6 MiB)

eth6:1    Link encap:Ethernet  HWaddr 00:16:3E:11:11:77
          inet addr:169.254.x.x  Bcast:169.254.127.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1

eth7      Link encap:Ethernet  HWaddr 00:16:3E:11:11:88
          inet addr:10.12.x.x  Bcast:10.12.0.255  Mask:255.255.255.128
          inet6 addr: fe80::216:3eff:fe11:1188/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:6435829 errors:0 dropped:0 overruns:0 frame:0
          TX packets:314780 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:2024577502 (1.8 GiB)  TX bytes:172461585 (164.4 MiB)

eth7:1    Link encap:Ethernet  HWaddr 00:16:3E:11:11:88
          inet addr:169.254.x.x  Bcast:169.254.191.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
Instance alert.log (ASM and database):
Private Interface 'eth1:1' configured from GPnP for use as a private interconnect.
[name='eth1:1', type=1, ip=169.254.xx.xx, mac=00-16-3e-11-11-22, net=169.254.x.0/18, mask=255.255.192.0, use=haip:cluster_interconnect/62]
Private Interface 'eth6:1' configured from GPnP for use as a private interconnect.
[name='eth6:1', type=1, ip=169.254.xx.xx, mac=00-16-3e-11-11-77, net=169.254.x.0/18, mask=255.255.192.0, use=haip:cluster_interconnect/62]
Private Interface 'eth7:1' configured from GPnP for use as a private interconnect.
[name='eth7:1', type=1, ip=169.254.x.x, mac=00-16-3e-11-11-88, net=169.254.x.0/18, mask=255.255.192.0, use=haip:cluster_interconnect/62]
Private Interface 'eth1:2' configured from GPnP for use as a private interconnect.
[name='eth1:2', type=1, ip=169.254.x.x, mac=00-16-3e-11-11-22, net=169.254.x.0/18, mask=255.255.192.0, use=haip:cluster_interconnect/62]
Public Interface 'eth3' configured from GPnP for use as a public interface.
[name='eth3', type=1, ip=10.x.x.68, mac=00-16-3e-11-11-44, net=10.1.x.x/25, mask=255.255.255.128, use=public/1]
Picked latch-free SCN scheme 3
..
Cluster communication is configured to use the following interface(s) for this instance
169.254.x.98
169.254.x.250
169.254.x.237
169.254.x.103

Note: Interconnect communication will use all four virtual private IPs; in case of network failure, as long as one private network adapter is functioning, all four IPs will remain active.
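The mask 255.255.192.0 and the /18 networks in the output above are consistent with the 169.254.0.0/16 link-local range being divided into four equal parts. The sketch below only reproduces that arithmetic with Python's standard `ipaddress` module; it is an inference from the sample output, not a description of how Grid allocates addresses internally.

```python
import ipaddress

# The link-local range reserved by RFC 3927.
link_local = ipaddress.ip_network("169.254.0.0/16")

# Splitting a /16 into /18 networks yields exactly four ranges.
haip_ranges = list(link_local.subnets(new_prefix=18))

for net in haip_ranges:
    print(net, net.netmask, net.broadcast_address)
```

The four broadcast addresses this produces (169.254.63.255, 169.254.127.255, 169.254.191.255, and 169.254.255.255) are the same ones shown on the eth1:1, eth6:1, eth7:1, and eth1:2 aliases in the ifconfig output.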
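2.2 When a Private Network Adapter Fails
If one private network adapter fails (eth6 in this example), the virtual private IP on eth6 is automatically relocated to a healthy adapter, and the move is transparent to the instances (ASM or database).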
$ $GRID_HOME/bin/oifcfg iflist -p -n
eth1 10.x.x.128 PRIVATE 255.255.255.128
eth1 169.254.0.x UNKNOWN 255.255.192.0
eth1 169.254.128.x UNKNOWN 255.255.192.0
eth7 10.12.x.x PRIVATE 255.255.255.128
eth7 169.254.64.x UNKNOWN 255.255.192.0
eth7 169.254.192.x UNKNOWN 255.255.192.0

Note: The virtual private IP on eth6 (subnet 169.254.64.x) was relocated to eth7.
ifconfig
..
eth1      Link encap:Ethernet  HWaddr 00:16:3E:11:11:22
          inet addr:10.x.x.168  Bcast:10.1.0.255  Mask:255.255.255.128
          inet6 addr: fe80::216:3eff:fe11:1122/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:15183840 errors:0 dropped:0 overruns:0 frame:0
          TX packets:10245071 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:7934311823 (7.3 GiB)  TX bytes:5771878414 (5.3 GiB)

eth1:1    Link encap:Ethernet  HWaddr 00:16:3E:11:11:22
          inet addr:169.254.x.x  Bcast:169.254.63.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1

eth1:3    Link encap:Ethernet  HWaddr 00:16:3E:11:11:22
          inet addr:169.254.x.x  Bcast:169.254.191.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1

eth7      Link encap:Ethernet  HWaddr 00:16:3E:11:11:88
          inet addr:10.12.x.x  Bcast:10.12.0.255  Mask:255.255.255.128
          inet6 addr: fe80::216:3eff:fe11:1188/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:6438985 errors:0 dropped:0 overruns:0 frame:0
          TX packets:315877 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:2026266447 (1.8 GiB)  TX bytes:173101641 (165.0 MiB)

eth7:2    Link encap:Ethernet  HWaddr 00:16:3E:11:11:88
          inet addr:169.254.x.x  Bcast:169.254.127.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1

eth7:3    Link encap:Ethernet  HWaddr 00:16:3E:11:11:88
          inet addr:169.254.x.x  Bcast:169.254.255.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
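To reason about where HAIP addresses can end up after a failure, a toy model may help. The sketch below is an illustration only: it uses a simple least-loaded reassignment rule that is an assumption of this example, not Oracle's actual algorithm, and the function name `relocate` is hypothetical. It does not reproduce the additional moves among surviving adapters that the real output shows.

```python
def relocate(assignments, failed, healthy):
    """Return a new HAIP-to-adapter map after adapter `failed` goes down.

    Each HAIP on the failed adapter is moved to the healthy adapter that
    currently carries the fewest HAIPs (ties broken by adapter name).
    """
    if not healthy:
        raise ValueError("no healthy private adapter left")
    result = dict(assignments)
    # Count how many HAIPs each healthy adapter already hosts.
    load = {nic: 0 for nic in healthy}
    for nic in result.values():
        if nic in load:
            load[nic] += 1
    for haip in sorted(result):
        if result[haip] == failed:
            target = min(healthy, key=lambda nic: (load[nic], nic))
            result[haip] = target
            load[target] += 1
    return result


# Starting layout taken from the 2.1 oifcfg output (HAIP subnets as labels).
start = {
    "169.254.0.x": "eth1",
    "169.254.64.x": "eth6",
    "169.254.128.x": "eth7",
    "169.254.192.x": "eth1",
}

after_eth6 = relocate(start, failed="eth6", healthy=["eth1", "eth7"])
after_eth1 = relocate(after_eth6, failed="eth1", healthy=["eth7"])
print(after_eth6)
print(after_eth1)
```

Under this rule the 169.254.64.x address moves from eth6 to eth7, which agrees with the note above, and once eth1 also fails every address lands on eth7. All four addresses stay assigned as long as one adapter survives.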
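2.3 When Another Private Network Adapter Fails
If another private network adapter goes down (eth1 in this example), the virtual private IPs on it are automatically relocated to the remaining healthy adapter, with no impact on the instances (ASM or database).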
$ $GRID_HOME/bin/oifcfg iflist -p -n
eth7 10.12.x.x PRIVATE 255.255.255.128
eth7 169.254.64.x UNKNOWN 255.255.192.0
eth7 169.254.192.x UNKNOWN 255.255.192.0
eth7 169.254.0.x UNKNOWN 255.255.192.0
eth7 169.254.128.x UNKNOWN 255.255.192.0
ifconfig
..
eth7      Link encap:Ethernet  HWaddr 00:16:3E:11:11:88
          inet addr:10.12.x.x  Bcast:10.12.0.255  Mask:255.255.255.128
          inet6 addr: fe80::216:3eff:fe11:1188/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:6441559 errors:0 dropped:0 overruns:0 frame:0
          TX packets:317271 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:2027824788 (1.8 GiB)  TX bytes:173810658 (165.7 MiB)

eth7:1    Link encap:Ethernet  HWaddr 00:16:3E:11:11:88
          inet addr:169.254.x.x  Bcast:169.254.63.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1

eth7:2    Link encap:Ethernet  HWaddr 00:16:3E:11:11:88
          inet addr:169.254.x.x  Bcast:169.254.127.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1

eth7:3    Link encap:Ethernet  HWaddr 00:16:3E:11:11:88
          inet addr:169.254.x.x  Bcast:169.254.255.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1

eth7:4    Link encap:Ethernet  HWaddr 00:16:3E:11:11:88
          inet addr:169.254.x.x  Bcast:169.254.191.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
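2.4 When a Private Network Adapter Is Restored
If private network adapter eth6 is restored, it is activated automatically, as virtual private IPs are assigned to it: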
$ $GRID_HOME/bin/oifcfg iflist -p -n
..
eth6 10.11.x.x PRIVATE 255.255.255.128
eth6 169.254.128.x UNKNOWN 255.255.192.0
eth6 169.254.0.x UNKNOWN 255.255.192.0
eth7 10.12.x.x PRIVATE 255.255.255.128
eth7 169.254.64.x UNKNOWN 255.255.192.0
eth7 169.254.192.x UNKNOWN 255.255.192.0
ifconfig
..
eth6      Link encap:Ethernet  HWaddr 00:16:3E:11:11:77
          inet addr:10.11.x.x  Bcast:10.11.0.255  Mask:255.255.255.128
          inet6 addr: fe80::216:3eff:fe11:1177/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:398 errors:0 dropped:0 overruns:0 frame:0
          TX packets:121 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:185138 (180.7 KiB)  TX bytes:56439 (55.1 KiB)

eth6:1    Link encap:Ethernet  HWaddr 00:16:3E:11:11:77
          inet addr:169.254.x.x  Bcast:169.254.191.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1

eth6:2    Link encap:Ethernet  HWaddr 00:16:3E:11:11:77
          inet addr:169.254.x.x  Bcast:169.254.63.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1

eth7      Link encap:Ethernet  HWaddr 00:16:3E:11:11:88
          inet addr:10.12.x.x  Bcast:10.12.0.255  Mask:255.255.255.128
          inet6 addr: fe80::216:3eff:fe11:1188/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:6442552 errors:0 dropped:0 overruns:0 frame:0
          TX packets:317983 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:2028404133 (1.8 GiB)  TX bytes:174103017 (166.0 MiB)

eth7:2    Link encap:Ethernet  HWaddr 00:16:3E:11:11:88
          inet addr:169.254.x.x  Bcast:169.254.127.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1

eth7:3    Link encap:Ethernet  HWaddr 00:16:3E:11:11:88
          inet addr:169.254.x.x  Bcast:169.254.255.255  Mask:255.255.192.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
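Note:
Disabling or stopping HAIP while the cluster is up and running is not supported unless Oracle Support/Development advises otherwise.
- The feature is disabled in 11.2.0.2/11.2.0.3 if Sun Cluster is present.
- The feature does not exist in Windows 11.2.0.2/11.2.0.3.
- The feature is disabled in 11.2.0.2/11.2.0.3 if Fujitsu PRIMECLUSTER is present.
- With the fix for bug 11077756 (fixed in 11.2.0.2 GI PSU6 and 11.2.0.3), HAIP is disabled if it fails to start while the root script (root.sh or rootupgrade.sh) is running; see bug 11077756 for details.
- The feature is disabled on Solaris 11 if IPMP is used for the private network. Tracking [...]
- The feature is disabled on HP-UX and AIX if the cluster_interconnect ("private network") is InfiniBand.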
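HAIP Log Files
Resource haip is managed by ohasd.bin. The resource logs are located in $GRID_HOME/log/ohasd/ohasd.log and $GRID_HOME/log/agent/ohasd/orarootagent_root/orarootagent_root.log.
Log Sample When a Private Network Adapter Fails
In an environment with multiple private network adapters, if one of the adapters fails: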
- ohasd.log
2010-09-24 09:10:00.891: [GIPCHGEN][1083025728]gipchaInterfaceFail: marking interface failing 0x2aaab0269a10 { host '', haName 'CLSFRAME_a2b2', local (nil), ip '10.11.x.x', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x4d }
2010-09-24 09:10:00.902: [GIPCHGEN][1138145600]gipchaInterfaceDisable: disabling interface 0x2aaab0269a10 { host '', haName 'CLSFRAME_a2b2', local (nil), ip '10.11.x.x', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x1cd }
2010-09-24 09:10:00.902: [GIPCHDEM][1138145600]gipchaWorkerCleanInterface: performing cleanup of disabled interface 0x2aaab0269a10 { host '', haName 'CLSFRAME_a2b2', local (nil), ip '10.11.0.188', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x1ed
- orarootagent_root.log
2010-09-24 09:09:57.708: [ USRTHRD][1129138496] {0:0:2} failed to receive ARP request
2010-09-24 09:09:57.708: [ USRTHRD][1129138496] {0:0:2} Assigned IP 169.254.x.x no longer valid on inf eth6
2010-09-24 09:09:57.708: [ USRTHRD][1129138496] {0:0:2} VipActions::startIp {
2010-09-24 09:09:57.708: [ USRTHRD][1129138496] {0:0:2} Adding 169.254.x.x on eth6:1
2010-09-24 09:09:57.719: [ USRTHRD][1129138496] {0:0:2} VipActions::startIp }
2010-09-24 09:09:57.719: [ USRTHRD][1129138496] {0:0:2} Reassigned IP: 169.254.x.x on interface eth6
2010-09-24 09:09:58.013: [ USRTHRD][1082325312] {0:0:2} HAIP: Updating member info HAIP1;10.11.x.x#0;10.11.x.x#1
2010-09-24 09:09:58.015: [ USRTHRD][1082325312] {0:0:2} HAIP: Moving ip '169.254.x.x' from inf 'eth6' to inf 'eth7'
2010-09-24 09:09:58.015: [ USRTHRD][1082325312] {0:0:2} pausing thread
2010-09-24 09:09:58.015: [ USRTHRD][1082325312] {0:0:2} posting thread
2010-09-24 09:09:58.016: [ USRTHRD][1082325312] {0:0:2} Thread:[NetHAWork]start {
2010-09-24 09:09:58.016: [ USRTHRD][1082325312] {0:0:2} Thread:[NetHAWork]start }
2010-09-24 09:09:58.016: [ USRTHRD][1082325312] {0:0:2} HAIP: Moving ip '169.254.x.x' from inf 'eth1' to inf 'eth7'
2010-09-24 09:09:58.016: [ USRTHRD][1082325312] {0:0:2} pausing thread
2010-09-24 09:09:58.016: [ USRTHRD][1082325312] {0:0:2} posting thread
2010-09-24 09:09:58.016: [ USRTHRD][1082325312] {0:0:2} Thread:[NetHAWork]start {
2010-09-24 09:09:58.016: [ USRTHRD][1082325312] {0:0:2} Thread:[NetHAWork]start }
2010-09-24 09:09:58.016: [ USRTHRD][1082325312] {0:0:2} HAIP: Moving ip '169.254.x.x' from inf 'eth7' to inf 'eth1'
2010-09-24 09:09:58.016: [ USRTHRD][1082325312] {0:0:2} pausing thread
2010-09-24 09:09:58.016: [ USRTHRD][1082325312] {0:0:2} posting thread
2010-09-24 09:09:58.017: [ USRTHRD][1082325312] {0:0:2} Thread:[NetHAWork]start {
2010-09-24 09:09:58.017: [ USRTHRD][1116531008] {0:0:2} [NetHAWork] thread started
2010-09-24 09:09:58.017: [ USRTHRD][1116531008] {0:0:2} Arp::sCreateSocket {
2010-09-24 09:09:58.017: [ USRTHRD][1093232960] {0:0:2} [NetHAWork] thread started
2010-09-24 09:09:58.017: [ USRTHRD][1093232960] {0:0:2} Arp::sCreateSocket {
2010-09-24 09:09:58.017: [ USRTHRD][1082325312] {0:0:2} Thread:[NetHAWork]start }
2010-09-24 09:09:58.018: [ USRTHRD][1143847232] {0:0:2} [NetHAWork] thread started
2010-09-24 09:09:58.018: [ USRTHRD][1143847232] {0:0:2} Arp::sCreateSocket {
2010-09-24 09:09:58.034: [ USRTHRD][1116531008] {0:0:2} Arp::sCreateSocket }
2010-09-24 09:09:58.034: [ USRTHRD][1116531008] {0:0:2} Starting Probe for ip 169.254.x.x
2010-09-24 09:09:58.034: [ USRTHRD][1116531008] {0:0:2} Transitioning to Probe State
2010-09-24 09:09:58.034: [ USRTHRD][1093232960] {0:0:2} Arp::sCreateSocket }
2010-09-24 09:09:58.035: [ USRTHRD][1093232960] {0:0:2} Starting Probe for ip 169.254.x.x
2010-09-24 09:09:58.035: [ USRTHRD][1093232960] {0:0:2} Transitioning to Probe State
2010-09-24 09:09:58.050: [ USRTHRD][1143847232] {0:0:2} Arp::sCreateSocket }
2010-09-24 09:09:58.050: [ USRTHRD][1143847232] {0:0:2} Starting Probe for ip 169.254.x.x
2010-09-24 09:09:58.050: [ USRTHRD][1143847232] {0:0:2} Transitioning to Probe State
2010-09-24 09:09:58.231: [ USRTHRD][1093232960] {0:0:2} Arp::sProbe {
2010-09-24 09:09:58.231: [ USRTHRD][1093232960] {0:0:2} Arp::sSend: sending type 1
2010-09-24 09:09:58.231: [ USRTHRD][1093232960] {0:0:2} Arp::sProbe }
2010-09-24 09:10:04.879: [ USRTHRD][1116531008] {0:0:2} Arp::sAnnounce {
2010-09-24 09:10:04.879: [ USRTHRD][1116531008] {0:0:2} Arp::sSend: sending type 1
2010-09-24 09:10:04.879: [ USRTHRD][1116531008] {0:0:2} Arp::sAnnounce }
2010-09-24 09:10:04.879: [ USRTHRD][1116531008] {0:0:2} Transitioning to Defend State
2010-09-24 09:10:04.879: [ USRTHRD][1116531008] {0:0:2} VipActions::startIp {
2010-09-24 09:10:04.879: [ USRTHRD][1116531008] {0:0:2} Adding 169.254.x.x on eth7:2
2010-09-24 09:10:04.880: [ USRTHRD][1116531008] {0:0:2} VipActions::startIp }
2010-09-24 09:10:04.880: [ USRTHRD][1116531008] {0:0:2} Assigned IP: 169.254.x.x on interface eth7
2010-09-24 09:10:05.150: [ USRTHRD][1143847232] {0:0:2} Arp::sAnnounce {
2010-09-24 09:10:05.150: [ USRTHRD][1143847232] {0:0:2} Arp::sSend: sending type 1
2010-09-24 09:10:05.150: [ USRTHRD][1143847232] {0:0:2} Arp::sAnnounce }
2010-09-24 09:10:05.150: [ USRTHRD][1143847232] {0:0:2} Transitioning to Defend State
2010-09-24 09:10:05.150: [ USRTHRD][1143847232] {0:0:2} VipActions::startIp {
2010-09-24 09:10:05.151: [ USRTHRD][1143847232] {0:0:2} Adding 169.254.x.x on eth1:3
2010-09-24 09:10:05.151: [ USRTHRD][1143847232] {0:0:2} VipActions::startIp }
2010-09-24 09:10:05.151: [ USRTHRD][1143847232] {0:0:2} Assigned IP: 169.254.x.x on interface eth1
2010-09-24 09:10:05.470: [ USRTHRD][1093232960] {0:0:2} Arp::sAnnounce {
2010-09-24 09:10:05.470: [ USRTHRD][1093232960] {0:0:2} Arp::sSend: sending type 1
2010-09-24 09:10:05.470: [ USRTHRD][1093232960] {0:0:2} Arp::sAnnounce }
2010-09-24 09:10:05.470: [ USRTHRD][1093232960] {0:0:2} Transitioning to Defend State
2010-09-24 09:10:05.470: [ USRTHRD][1093232960] {0:0:2} VipActions::startIp {
2010-09-24 09:10:05.471: [ USRTHRD][1093232960] {0:0:2} Adding 169.254.x.x on eth7:3
2010-09-24 09:10:05.471: [ USRTHRD][1093232960] {0:0:2} VipActions::startIp }
2010-09-24 09:10:05.471: [ USRTHRD][1093232960] {0:0:2} Assigned IP: 169.254.x.x on interface eth7
2010-09-24 09:10:06.047: [ USRTHRD][1082325312] {0:0:2} Thread:[NetHAWork]stop {
2010-09-24 09:10:06.282: [ USRTHRD][1129138496] {0:0:2} [NetHAWork] thread stopping
2010-09-24 09:10:06.282: [ USRTHRD][1129138496] {0:0:2} Thread:[NetHAWork]isRunning is reset to false here
2010-09-24 09:10:06.282: [ USRTHRD][1082325312] {0:0:2} Thread:[NetHAWork]stop }
2010-09-24 09:10:06.282: [ USRTHRD][1082325312] {0:0:2} VipActions::stopIp {
2010-09-24 09:10:06.282: [ USRTHRD][1082325312] {0:0:2} NetInterface::sStopIp {
2010-09-24 09:10:06.282: [ USRTHRD][1082325312] {0:0:2} Stopping ip '169.254.x.x', inf 'eth6', mask '10.11.x.x'
2010-09-24 09:10:06.288: [ USRTHRD][1082325312] {0:0:2} NetInterface::sStopIp }
2010-09-24 09:10:06.288: [ USRTHRD][1082325312] {0:0:2} VipActions::stopIp }
2010-09-24 09:10:06.288: [ USRTHRD][1082325312] {0:0:2} Thread:[NetHAWork]stop {
2010-09-24 09:10:06.298: [ USRTHRD][1131239744] {0:0:2} [NetHAWork] thread stopping
2010-09-24 09:10:06.298: [ USRTHRD][1131239744] {0:0:2} Thread:[NetHAWork]isRunning is reset to false here
2010-09-24 09:10:06.298: [ USRTHRD][1082325312] {0:0:2} Thread:[NetHAWork]stop }
2010-09-24 09:10:06.298: [ USRTHRD][1082325312] {0:0:2} VipActions::stopIp {
2010-09-24 09:10:06.298: [ USRTHRD][1082325312] {0:0:2} NetInterface::sStopIp {
2010-09-24 09:10:06.298: [ USRTHRD][1082325312] {0:0:2} Stopping ip '169.254.x.x', inf 'eth7', mask '10.12.x.x'
2010-09-24 09:10:06.299: [ USRTHRD][1082325312] {0:0:2} NetInterface::sStopIp }
2010-09-24 09:10:06.299: [ USRTHRD][1082325312] {0:0:2} VipActions::stopIp }
2010-09-24 09:10:06.299: [ USRTHRD][1082325312] {0:0:2} Thread:[NetHAWork]stop {
2010-09-24 09:10:06.802: [ USRTHRD][1133340992] {0:0:2} [NetHAWork] thread stopping
2010-09-24 09:10:06.802: [ USRTHRD][1133340992] {0:0:2} Thread:[NetHAWork]isRunning is reset to false here
2010-09-24 09:10:06.802: [ USRTHRD][1082325312] {0:0:2} Thread:[NetHAWork]stop }
2010-09-24 09:10:06.802: [ USRTHRD][1082325312] {0:0:2} VipActions::stopIp {
2010-09-24 09:10:06.802: [ USRTHRD][1082325312] {0:0:2} NetInterface::sStopIp {
2010-09-24 09:10:06.802: [ USRTHRD][1082325312] {0:0:2} Stopping ip '169.254.x.x', inf 'eth1', mask '10.1.x.x'
2010-09-24 09:10:06.802: [ USRTHRD][1082325312] {0:0:2} NetInterface::sStopIp }
2010-09-24 09:10:06.802: [ USRTHRD][1082325312] {0:0:2} VipActions::stopIp }
2010-09-24 09:10:06.803: [ USRTHRD][1082325312] {0:0:2} USING HAIP[ 0 ]: eth7 - 169.254.112.x
2010-09-24 09:10:06.803: [ USRTHRD][1082325312] {0:0:2} USING HAIP[ 1 ]: eth1 - 169.254.178.x
2010-09-24 09:10:06.803: [ USRTHRD][1082325312] {0:0:2} USING HAIP[ 2 ]: eth7 - 169.254.244.x
2010-09-24 09:10:06.803: [ USRTHRD][1082325312] {0:0:2} USING HAIP[ 3 ]: eth1 - 169.254.30.x

Note: From the above, even though only NIC eth6 failed, there can be multiple virtual private IP movements among the surviving NICs.
- ocssd.log
2010-09-24 09:09:58.314: [ GIPCNET][1089964352] gipcmodNetworkProcessSend: [network] failed send attempt endp 0xe1b9150 [0000000000000399] { gipcEndpoint : localAddr 'udp://10.11.x.x:60169', remoteAddr '', numPend 5, numReady 1, numDone 0, numDead 0, numTransfer 0, objFlags 0x0, pidPeer 0, flags 0x2, usrFlags 0x4000 }, req 0x2aaab00117f0 [00000000004b0cae] { gipcSendRequest : addr 'udp://10.11.x.x:41486', data 0x2aaab0050be8, len 80, olen 0, parentEndp 0xe1b9150, ret gipcretEndpointNotAvailable (40), objFlags 0x0, reqFlags 0x2 }
2010-09-24 09:09:58.314: [ GIPCNET][1089964352] gipcmodNetworkProcessSend: slos op : sgipcnValidateSocket
2010-09-24 09:09:58.314: [ GIPCNET][1089964352] gipcmodNetworkProcessSend: slos dep : Invalid argument (22)
2010-09-24 09:09:58.314: [ GIPCNET][1089964352] gipcmodNetworkProcessSend: slos loc : address not
2010-09-24 09:09:58.314: [ GIPCNET][1089964352] gipcmodNetworkProcessSend: slos info: addr '10.11.x.x:60169', len 80, buf 0x2aaab0050be8, cookie 0x2aaab00117f0
2010-09-24 09:09:58.314: [GIPCXCPT][1089964352] gipcInternalSendSync: failed sync request, ret gipcretEndpointNotAvailable (40)
2010-09-24 09:09:58.314: [GIPCXCPT][1089964352] gipcSendSyncF [gipchaLowerInternalSend : gipchaLower.c : 755]: EXCEPTION[ ret gipcretEndpointNotAvailable (40) ] failed to send on endp 0xe1b9150 [0000000000000399] { gipcEndpoint : localAddr 'udp://10.11.x.x:60169', remoteAddr '', numPend 5, numReady 0, numDone 0, numDead 0, numTransfer 0, objFlags 0x0, pidPeer 0, flags 0x2, usrFlags 0x4000 }, addr 0xe4e6d10 [00000000000007ed] { gipcAddress : name 'udp://10.11.x.x:41486', objFlags 0x0, addrFlags 0x1 }, buf 0x2aaab0050be8, len 80, flags 0x0
2010-09-24 09:09:58.314: [GIPCHGEN][1089964352] gipchaInterfaceFail: marking interface failing 0xe2bd5f0 { host '', haName 'CSS_a2b2', local 0x2aaaac2098e0, ip '10.11.x.x:41486', subnet '10.11.0.128', mask '255.255.255.128', numRef 0, numFail 0, flags 0x6 }
2010-09-24 09:09:58.314: [GIPCHALO][1089964352] gipchaLowerInternalSend: failed to initiate send on interface 0xe2bd5f0 { host ' ', haName 'CSS_a2b2', local 0x2aaaac2098e0, ip '10.11.x.x:41486', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x86 }, hctx 0xde81d10 [0000000000000010] { gipchaContext : host ' ', name 'CSS_a2b2', luid '4f06f2aa-00000000', numNode 1, numInf 3, usrFlags 0x0, flags 0x7 }
2010-09-24 09:09:58.326: [GIPCHGEN][1089964352] gipchaInterfaceDisable: disabling interface 0x2aaaac2098e0 { host '', haName 'CSS_a2b2', local (nil), ip '10.11.x.x', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 1, flags 0x14d }
2010-09-24 09:09:58.326: [GIPCHGEN][1089964352] gipchaInterfaceDisable: disabling interface 0xe2bd5f0 { host ' ', haName 'CSS_a2b2', local 0x2aaaac2098e0, ip '10.11.x.x:41486', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x86 }
2010-09-24 09:09:58.327: [GIPCHALO][1089964352] gipchaLowerCleanInterfaces: performing cleanup of disabled interface 0xe2bd5f0 { host ' ', haName 'CSS_a2b2', local 0x2aaaac2098e0, ip '10.11.x.x:41486', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0xa6 }
2010-09-24 09:09:58.327: [GIPCHGEN][1089964352] gipchaInterfaceReset: resetting interface 0xe2bd5f0 { host ' ', haName 'CSS_a2b2', local 0x2aaaac2098e0, ip '10.11.x.x:41486', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0xa6 }
2010-09-24 09:09:58.338: [GIPCHDEM][1089964352] gipchaWorkerCleanInterface: performing cleanup of disabled interface 0x2aaaac2098e0 { host '', haName 'CSS_a2b2', local (nil), ip '10.11.x.x', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x16d }
2010-09-24 09:09:58.338: [GIPCHTHR][1089964352] gipchaWorkerUpdateInterface: created remote interface for node ' ', haName 'CSS_a2b2', inf 'udp://10.11.x.x:41486'
2010-09-24 09:09:58.338: [GIPCHGEN][1089964352] gipchaWorkerAttachInterface: Interface attached inf 0xe2bd5f0 { host ' ', haName 'CSS_a2b2', local 0x2aaaac2014f0, ip '10.11.x.x:41486', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x6 }
2010-09-24 09:10:00.454: [ CSSD][1108904256]clssnmSendingThread: sending status msg to all nodes

Note: From the above, ocssd.bin will not fail as long as at least one private network adapter is working.
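When reviewing a failover, the "HAIP: Moving ip" entries in orarootagent_root.log summarize which address went where. The following sketch extracts them with a regular expression; it assumes one log entry per line in the timestamp format shown in the samples above, and the function name `haip_moves` is hypothetical.

```python
import re

# Matches entries such as:
# 2010-09-24 09:09:58.015: [ USRTHRD][...] {0:0:2} HAIP: Moving ip '...' from inf 'eth6' to inf 'eth7'
MOVE_RE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{3}):.*"
    r"HAIP: Moving ip '(?P<ip>[^']+)' from inf '(?P<src>[^']+)' to inf '(?P<dst>[^']+)'"
)


def haip_moves(log_text):
    """Return (timestamp, ip, source_interface, target_interface) tuples."""
    moves = []
    for line in log_text.splitlines():
        match = MOVE_RE.search(line)
        if match:
            moves.append((match["ts"], match["ip"], match["src"], match["dst"]))
    return moves


# Entries copied from the orarootagent_root.log sample above.
SAMPLE_LOG = """\
2010-09-24 09:09:58.015: [ USRTHRD][1082325312] {0:0:2} HAIP: Moving ip '169.254.x.x' from inf 'eth6' to inf 'eth7'
2010-09-24 09:09:58.015: [ USRTHRD][1082325312] {0:0:2} pausing thread
2010-09-24 09:09:58.016: [ USRTHRD][1082325312] {0:0:2} HAIP: Moving ip '169.254.x.x' from inf 'eth1' to inf 'eth7'
2010-09-24 09:09:58.016: [ USRTHRD][1082325312] {0:0:2} HAIP: Moving ip '169.254.x.x' from inf 'eth7' to inf 'eth1'
"""

for ts, ip, src, dst in haip_moves(SAMPLE_LOG):
    print(f"{ts}  {ip}: {src} -> {dst}")
```

On the sample this lists three moves, which matches the note that a single NIC failure can shuffle several virtual private IPs among the surviving adapters.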
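Log Sample When a Private Network Adapter Is Restored
In an environment with multiple private network adapters, if the failed adapter is restored: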
- ohasd.log
2010-09-24 09:14:30.962: [GIPCHGEN][1083025728]gipchaNodeAddInterface: adding interface information for inf 0x2aaaac1a53d0 { host '', haName 'CLSFRAME_a2b2', local (nil), ip '10.11.x.x', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x41 }
2010-09-24 09:14:30.972: [GIPCHTHR][1138145600]gipchaWorkerUpdateInterface: created local bootstrap interface for node '', haName 'CLSFRAME_a2b2', inf 'mcast://230.0.1.0:42424/10.11.x.x'
2010-09-24 09:14:30.972: [GIPCHTHR][1138145600]gipchaWorkerUpdateInterface: created local interface for node ' ', haName 'CLSFRAME_a2b2', inf '10.11.x.x:13235
- ocssd.log
2010-09-24 09:14:30.961: [GIPCHGEN][1091541312] gipchaNodeAddInterface: adding interface information for inf 0x2aaab005af00 { host '', haName 'CSS_a2b2', local (nil), ip '10.11.x.x', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x41 }
2010-09-24 09:14:30.972: [GIPCHTHR][1089964352] gipchaWorkerUpdateInterface: created local bootstrap interface for node '', haName 'CSS_a2b2', inf 'mcast://230.0.1.0:42424/10.11.x.x'
2010-09-24 09:14:30.972: [GIPCHTHR][1089964352] gipchaWorkerUpdateInterface: created local interface for node ' ', haName 'CSS_a2b2', inf '10.11.x.x:10884'
2010-09-24 09:14:30.972: [GIPCHGEN][1089964352] gipchaNodeAddInterface: adding interface information for inf 0x2aaab0035490 { host ' ', haName 'CSS_a2b2', local (nil), ip '10.21.x.x', subnet '10.12.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x42 }
2010-09-24 09:14:30.972: [GIPCHGEN][1089964352] gipchaNodeAddInterface: adding interface information for inf 0x2aaab00355c0 { host ' ', haName 'CSS_a2b2', local (nil), ip '10.11.x.x', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x42 }
2010-09-24 09:14:30.972: [GIPCHTHR][1089964352] gipchaWorkerUpdateInterface: created remote interface for node ' ', haName 'CSS_a2b2', inf 'mcast://230.0.1.0:42424/10.12.x.x'
2010-09-24 09:14:30.972: [GIPCHGEN][1089964352] gipchaWorkerAttachInterface: Interface attached inf 0x2aaab0035490 { host ' ', haName 'CSS_a2b2', local 0x2aaab005af00, ip '10.12.x.x', subnet '10.12.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x46 }
2010-09-24 09:14:30.972: [GIPCHTHR][1089964352] gipchaWorkerUpdateInterface: created remote interface for node ' ', haName 'CSS_a2b2', inf 'mcast://230.0.1.0:42424/10.11.x.x'
2010-09-24 09:14:30.972: [GIPCHGEN][1089964352] gipchaWorkerAttachInterface: Interface attached inf 0x2aaab00355c0 { host ' ', haName 'CSS_a2b2', local 0x2aaab005af00, ip '10.11.x.x', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x46 }
2010-09-24 09:14:31.437: [GIPCHGEN][1089964352] gipchaInterfaceDisable: disabling interface 0x2aaab00355c0 { host ' ', haName 'CSS_a2b2', local 0x2aaab005af00, ip '10.11.x.x', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x46 }
2010-09-24 09:14:31.437: [GIPCHALO][1089964352] gipchaLowerCleanInterfaces: performing cleanup of disabled interface 0x2aaab00355c0 { host ' ', haName 'CSS_a2b2', local 0x2aaab005af00, ip '10.11.x.x', subnet '10.11.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x66 }
2010-09-24 09:14:31.446: [GIPCHGEN][1089964352] gipchaInterfaceDisable: disabling interface 0x2aaab0035490 { host ' ', haName 'CSS_a2b2', local 0x2aaab005af00, ip '10.12.x.x', subnet '10.12.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x46 }
2010-09-24 09:14:31.446: [GIPCHALO][1089964352] gipchaLowerCleanInterfaces: performing cleanup of disabled interface 0x2aaab0035490 { host ' ', haName 'CSS_a2b2', local 0x2aaab005af00, ip '10.12.x.x', subnet '10.12.x.x', mask '255.255.255.128', numRef 0, numFail 0, flags 0x66
Source: Grid Infrastructure Redundant Interconnect and ora.cluster_interconnect.haip (Doc ID 1210883.1)