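Problem Description
An S9700 switch received a large number of TC (topology change) packets, which caused its OSPF neighbor relationships to flap.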
Alarm Information
last Down OSPF Peer
Neighbor Ip Address : 10.174.96.10
Neighbor Area Id : 0.0.0.140
Neighbor Router Id : 10.16.248.201
Interface : Vlanif963
Immediate Reason : Neighbor Down Due to 1-Wayhello Received
Primary Reason : 1-Wayhello Received
Down Time : 2016-04-23 18:03:48+08:00
Neighbor Ip Address : 10.16.247.41
Neighbor Area Id : 0.0.0.140
Neighbor Router Id : 10.16.249.19
Interface : Vlanif1001
Immediate Reason : Neighbor Down Due to Inactivity
Primary Reason : Hello Not Seen
Down Time : 2016-04-23 18:03:36+08:00
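Troubleshooting Procedure
1. Check whether boards 2 and 3 of the S9700 switch show any CPCAR drop counts. In the output below, the ospf-hello drop counter is 0 on the main board and on slots 2/2 and 2/3, so CPCAR did not discard OSPF hello packets.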
=====================================================================
===============display cpu-defend statistics all===============
=====================================================================
Statistics on mainboard:
--------------------------------------------------------------------------------
Packet Type Pass(Packet/Byte) Drop(Packet/Byte) Last-dropping-time
--------------------------------------------------------------------------------
ospf-hello 58251816 0 -
Statistics on slot 2/2:
--------------------------------------------------------------------------------
Packet Type Pass(Packet/Byte) Drop(Packet/Byte) Last-dropping-time
ospf-hello 14186337 0 -
Statistics on slot 2/3:
--------------------------------------------------------------------------------
Packet Type Pass(Packet/Byte) Drop(Packet/Byte) Last-dropping-time
ospf-hello 6740585 0 -
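The check in step 1 can be repeated on a saved copy of the output with a short script. The following is a minimal sketch in Python. It assumes the flattened one-line-per-entry layout shown above, and the function name `drops_by_location` is hypothetical and not part of any Huawei tool.

```python
import re

# Sample copied from the output above (separator lines omitted).
SAMPLE = """\
Statistics on mainboard:
Packet Type Pass(Packet/Byte) Drop(Packet/Byte) Last-dropping-time
ospf-hello 58251816 0 -
Statistics on slot 2/2:
Packet Type Pass(Packet/Byte) Drop(Packet/Byte) Last-dropping-time
ospf-hello 14186337 0 -
Statistics on slot 2/3:
Packet Type Pass(Packet/Byte) Drop(Packet/Byte) Last-dropping-time
ospf-hello 6740585 0 -
"""


def drops_by_location(text, packet_type="ospf-hello"):
    """Return {location: drop_count} for one packet type.

    Assumption: each entry is one line of the form
    '<type> <pass> <drop> <last-dropping-time>', as in the pasted sample.
    """
    drops = {}
    location = None
    for line in text.splitlines():
        line = line.strip()
        header = re.match(r"Statistics on (.+):$", line)
        if header:
            # Remember which board the following entries belong to.
            location = header.group(1)
            continue
        fields = line.split()
        if location and len(fields) >= 3 and fields[0] == packet_type:
            drops[location] = int(fields[2])
    return drops


result = drops_by_location(SAMPLE)
print(result)  # {'mainboard': 0, 'slot 2/2': 0, 'slot 2/3': 0}
```

A non-zero value for any location would point to CPCAR rate limiting as a possible cause. Here every counter is 0, so the investigation moves on to Layer 2.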
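2. Verify Layer 2 physical connectivity and Layer 3 reachability. The logs from the time the OSPF neighbors went down show that the device received a large number of TC packets and that MSTP reconverged: GigabitEthernet2/3/0/23 and Eth-Trunk3 were blocked (the log below also shows GigabitEthernet1/3/0/22 being set to discarding).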
Apr 23 2016 18:02:58+08:00 AHHF-HeFei-CS01 %%01MSTP/6/SET_PORT_DISCARDING(l)[527490]:In MSTP process 0 instance 0, MSTP set port Eth-Trunk3 state as discarding.
Apr 23 2016 18:02:58+08:00 AHHF-HeFei-CS01 %%01MSTP/6/SET_PORT_DISCARDING(l)[527491]:In MSTP process 0 instance 0, MSTP set port GigabitEthernet2/3/0/23 state as discarding.
Apr 23 2016 18:02:58+08:00 AHHF-HeFei-CS01 %%01MSTP/6/SET_PORT_DISCARDING(l)[527492]:In MSTP process 0 instance 0, MSTP set port GigabitEthernet1/3/0/22 state as discarding.
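To see how closely the port blocking and the neighbor failures line up, the timestamps from the alarm information and from the MSTP log can be compared directly. This is a minimal sketch in Python that uses only timestamps quoted in this case. It assumes that the affected VLANs are carried over the blocked ports, which the case does not state explicitly.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S"

# Time at which MSTP set the ports to discarding (from the log above).
port_discarding = datetime.strptime("2016-04-23 18:02:58", FMT)

# "Down Time" of each neighbor (from the alarm information).
neighbor_down = {
    "10.174.96.10 (Vlanif963)": datetime.strptime("2016-04-23 18:03:48", FMT),
    "10.16.247.41 (Vlanif1001)": datetime.strptime("2016-04-23 18:03:36", FMT),
}

# Seconds between the ports being blocked and each neighbor going down.
elapsed = {
    name: int((down - port_discarding).total_seconds())
    for name, down in neighbor_down.items()
}

for name, seconds in elapsed.items():
    print(f"{name}: down {seconds} s after ports were set to discarding")
# 10.174.96.10 (Vlanif963): down 50 s after ports were set to discarding
# 10.16.247.41 (Vlanif1001): down 38 s after ports were set to discarding
```

Both neighbors went down within a minute of the ports being blocked. The 38 s gap for the "Hello Not Seen" neighbor would be in line with a 40 s OSPF dead interval, but that is an assumption, because the case does not give the timer values configured on these interfaces.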
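3. Determine why STP reconverged. GigabitEthernet1/3/0/22 and Eth-Trunk2 received a large number of TC packets, which triggered STP reconvergence and port blocking, and ultimately caused the neighbors to go down.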
2016/04/23 18:03:14 UTC+08:00, process 0 instance 0 port Eth-Trunk2 receive TC from 001d-4574-b088.
2016/04/23 18:03:15 UTC+08:00, process 0 instance 0 port GigabitEthernet1/3/0/22 receive TC from 000b-ab3a-c4f0.
2016/04/23 18:03:15 UTC+08:00, process 0 instance 0 port Eth-Trunk2 receive TC from 001d-4574-b088.
2016/04/23 18:03:24 UTC+08:00, process 0 instance 0 port GigabitEthernet1/3/0/22 receive TC from 000b-ab3a-c4f0.
2016/04/23 18:03:22 UTC+08:00, process 0 instance 0 port GigabitEthernet1/3/0/22 receive TC from 000b-ab3a-c4f0.
2016/04/23 18:03:20 UTC+08:00, process 0 instance 0 port GigabitEthernet1/3/0/22 receive TC from 000b-ab3a-c4f0.
2016/04/23 18:03:19 UTC+08:00, process 0 instance 0 port GigabitEthernet1/3/0/22 receive TC from 000b-ab3a-c4f0.
2016/04/23 18:03:18 UTC+08:00, process 0 instance 0 port GigabitEthernet1/3/0/22 receive TC from 000b-ab3a-c4f0.
2016/04/23 18:03:17 UTC+08:00, process 0 instance 0 port Eth-Trunk2 receive TC from 001d-4574-b088.
2016/04/23 18:03:17 UTC+08:00, process 0 instance 0 port GigabitEthernet1/3/0/22 receive TC from 000b-ab3a-c4f0.
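To identify which downstream devices to investigate, the TC records can be tallied by receiving port and source MAC address. The following is a minimal sketch in Python. It assumes the log line format shown above, and the names `TC_LINE` and `count_tc_sources` are hypothetical.

```python
import re
from collections import Counter

# Matches 'port <name> receive TC from <xxxx-xxxx-xxxx>' as in the log above.
TC_LINE = re.compile(
    r"port (?P<port>\S+) receive TC from "
    r"(?P<mac>[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4})"
)

# The ten records quoted in step 3, in their original order.
LOG = """\
2016/04/23 18:03:14 UTC+08:00, process 0 instance 0 port Eth-Trunk2 receive TC from 001d-4574-b088.
2016/04/23 18:03:15 UTC+08:00, process 0 instance 0 port GigabitEthernet1/3/0/22 receive TC from 000b-ab3a-c4f0.
2016/04/23 18:03:15 UTC+08:00, process 0 instance 0 port Eth-Trunk2 receive TC from 001d-4574-b088.
2016/04/23 18:03:24 UTC+08:00, process 0 instance 0 port GigabitEthernet1/3/0/22 receive TC from 000b-ab3a-c4f0.
2016/04/23 18:03:22 UTC+08:00, process 0 instance 0 port GigabitEthernet1/3/0/22 receive TC from 000b-ab3a-c4f0.
2016/04/23 18:03:20 UTC+08:00, process 0 instance 0 port GigabitEthernet1/3/0/22 receive TC from 000b-ab3a-c4f0.
2016/04/23 18:03:19 UTC+08:00, process 0 instance 0 port GigabitEthernet1/3/0/22 receive TC from 000b-ab3a-c4f0.
2016/04/23 18:03:18 UTC+08:00, process 0 instance 0 port GigabitEthernet1/3/0/22 receive TC from 000b-ab3a-c4f0.
2016/04/23 18:03:17 UTC+08:00, process 0 instance 0 port Eth-Trunk2 receive TC from 001d-4574-b088.
2016/04/23 18:03:17 UTC+08:00, process 0 instance 0 port GigabitEthernet1/3/0/22 receive TC from 000b-ab3a-c4f0.
"""


def count_tc_sources(lines):
    """Count TC records per (receiving port, source MAC) pair."""
    counts = Counter()
    for line in lines:
        match = TC_LINE.search(line)
        if match:
            counts[(match.group("port"), match.group("mac"))] += 1
    return counts


counts = count_tc_sources(LOG.splitlines())
for (port, mac), n in counts.most_common():
    print(f"{port} <- {mac}: {n}")
# GigabitEthernet1/3/0/22 <- 000b-ab3a-c4f0: 7
# Eth-Trunk2 <- 001d-4574-b088: 3
```

In this excerpt, all TC packets come from two source MAC addresses, one per port, so the devices behind 000b-ab3a-c4f0 and 001d-4574-b088 are the natural starting points for finding out why the TC packets are being sent.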
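Root Cause
The device received a large number of TC packets, which caused its ports to be blocked.
Solution
Disable STP on the Layer 3 interconnection interfaces, and determine why the other devices are sending TC packets.
Recommendations and Summary
Standardize the related configurations.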