Hello,
I have a Virtual Chassis (VC) made up of six EX3300 switches.
One member is the master and another is the backup (both have the same mastership priority, 129). The other four are linecard members.
I upgraded from 12.3R9 to 12.3R12 a few days ago.
During this upgrade, I noticed that NSB (nonstop bridging) is not enabled in my preprovisioned VC.
GRES is activated:
root@sw-cloud-ex3300-stack> show configuration chassis
redundancy {
    graceful-switchover;
}
Overnight, all my LACP links flapped (approximately 40 links in the VC). The problem started at 01:42:53.
I can't find anything interesting in the logs apart from the LACP timeouts.
Jun 7 00:53:28 sw-cloud-ex3300-stack xntpd[10092]: NTP Server Unreachable
Jun 7 00:53:44 sw-cloud-ex3300-stack last message repeated 8 times
Jun 7 01:11:01 sw-cloud-ex3300-stack last message repeated 9 times
Jun 7 01:28:03 sw-cloud-ex3300-stack xntpd[10092]: NTP Server Unreachable
Jun 7 01:28:19 sw-cloud-ex3300-stack last message repeated 8 times
Jun 7 01:42:53 sw-cloud-ex3300-stack sfid[1303]: JTASK_SCHED_SLIP_KEVENT: 21 sec 481478 usec kevent block
Jun 7 01:42:53 sw-cloud-ex3300-stack chassism[1302]: JTASK_SCHED_SLIP_KEVENT: 21 sec 486749 usec kevent block
Jun 7 01:42:53 sw-cloud-ex3300-stack lldpd[10106]: JTASK_SCHED_SLIP: 21 sec scheduler slip, user: 0 sec 0 usec, system: 0 sec, 557 usec
Jun 7 01:42:53 sw-cloud-ex3300-stack eswd[10087]: JTASK_SCHED_SLIP_KEVENT: 21 sec 526552 usec kevent block
Jun 7 01:42:53 sw-cloud-ex3300-stack eswd[10087]: Root bridge in context 0 changed from 4:cc:4e:24:3a:e9:b8 to 8192:84:b5:9c:46:79:01
Jun 7 01:42:53 sw-cloud-ex3300-stack cfmd[10091]: JTASK_SCHED_SLIP_KEVENT: 23 sec 280174 usec kevent block
Jun 7 01:42:53 sw-cloud-ex3300-stack /kernel: KERN_LACP_INTF_STATE_CHANGE: lacp_update_state_userspace: cifd ge-2/0/2 - ATTACHED state - acting as standby link
Jun 7 01:42:53 sw-cloud-ex3300-stack lacpd[1329]: LACPD_TIMEOUT: ge-2/0/2: lacp current while timer expired current Receive State: CURRENT
Jun 7 01:42:53 sw-cloud-ex3300-stack sflowd[10107]: JTASK_SCHED_SLIP_KEVENT: 24 sec 297734 usec kevent block
Jun 7 01:42:53 sw-cloud-ex3300-stack mcsnoopd[10108]: JTASK_SCHED_SLIP_KEVENT: 23 sec 337595 usec kevent block
Jun 7 01:42:53 sw-cloud-ex3300-stack rpd[1319]: RPD_SCHED_SLIP_KEVENT: 22 sec 425412 usec kevent block
Jun 7 01:42:53 sw-cloud-ex3300-stack lacpd[1329]: LACPD_TIMEOUT: ge-2/0/10: lacp current while timer expired current Receive State: CURRENT
Jun 7 01:42:53 sw-cloud-ex3300-stack /kernel: KERN_LACP_INTF_STATE_CHANGE: lacp_update_state_userspace: cifd ge-2/0/10 - ATTACHED state - acting as standby link
Jun 7 01:42:53 sw-cloud-ex3300-stack lacpd[1329]: LACPD_TIMEOUT: ge-2/0/9: lacp current while timer expired current Receive State: CURRENT
Jun 7 01:42:53 sw-cloud-ex3300-stack /kernel: KERN_LACP_INTF_STATE_CHANGE: lacp_update_state_userspace: cifd ge-2/0/9 - ATTACHED state - acting as standby link
Jun 7 01:42:53 sw-cloud-ex3300-stack bdbrepd: Subscriber Management is not ready for GRES
Jun 7 01:42:53 sw-cloud-ex3300-stack lacpd[1329]: LACPD_TIMEOUT: ge-3/0/10: lacp current while timer expired current Receive State: CURRENT
Jun 7 01:42:53 sw-cloud-ex3300-stack /kernel: ae_bundlestate_ifd_change: bundle ae31: bundle IFD minimum links not met 0 < 1
Jun 7 01:42:53 sw-cloud-ex3300-stack /kernel: KERN_LACP_INTF_STATE_CHANGE: lacp_update_state_userspace: cifd ge-3/0/10 - ATTACHED state - acting as standby link
Jun 7 01:42:53 sw-cloud-ex3300-stack lacpd[1329]: LACP_INTF_DOWN: ae31: Interface marked down due to lacp timeout on member ge-3/0/10
Jun 7 01:42:54 sw-cloud-ex3300-stack lacpd[1329]: LACPD_TIMEOUT: ge-3/0/9: lacp current while timer expired current Receive State: CURRENT
Jun 7 01:42:54 sw-cloud-ex3300-stack lacpd[1329]: LACP_INTF_DOWN: ae30: Interface marked down due to lacp timeout on member ge-3/0/9
Jun 7 01:42:54 sw-cloud-ex3300-stack /kernel: ae_bundlestate_ifd_change: bundle ae30: bundle IFD minimum links not met 0 < 1
Jun 7 01:42:54 sw-cloud-ex3300-stack /kernel: KERN_LACP_INTF_STATE_CHANGE: lacp_update_state_userspace: cifd ge-3/0/9 - ATTACHED state - acting as standby link
Jun 7 01:42:54 sw-cloud-ex3300-stack lacpd[1329]: LACPD_TIMEOUT: ge-5/0/11: lacp current while timer expired current Receive State: CURRENT
Jun 7 01:42:54 sw-cloud-ex3300-stack /kernel: KERN_LACP_INTF_STATE_CHANGE: lacp_update_state_userspace: cifd ge-5/0/11 - ATTACHED state - acting as standby link
Jun 7 01:42:54 sw-cloud-ex3300-stack eswd[10087]: Root bridge in context 0 changed from 8192:84:b5:9c:46:79:01 to 4:cc:4e:24:3a:e9:b8
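To make the excerpt above easier to read, here is a rough Python sketch that tallies the scheduler-slip durations per process and lists the members that hit an LACP timeout. It is only an illustration: the function name summarize and the regular expressions are my own assumptions, written against the message formats shown in this excerpt, not an official Junos tool.

```python
import re
from collections import Counter

# Assumed syslog layout, taken from the excerpt above:
# "Jun 7 01:42:53 host lacpd[1329]: LACPD_TIMEOUT: ge-2/0/2: ..."
LINE_RE = re.compile(
    r"^(?P<ts>\w{3}\s+\d+\s+\d{2}:\d{2}:\d{2})\s+(?P<host>\S+)\s+"
    r"(?P<proc>[^\s\[:]+)(?:\[\d+\])?:\s+(?P<msg>.*)$"
)
# Covers JTASK_SCHED_SLIP, JTASK_SCHED_SLIP_KEVENT and RPD_SCHED_SLIP_KEVENT.
SLIP_RE = re.compile(r"SCHED_SLIP\w*: (\d+) sec")
TIMEOUT_RE = re.compile(r"LACPD_TIMEOUT: (\S+):")


def summarize(lines):
    """Return (slips, timeouts, per_proc) for an iterable of syslog lines.

    slips:    process name -> longest reported stall, in whole seconds
    timeouts: member interfaces with an LACP timeout, in log order
    per_proc: number of parsed messages per process
    """
    slips = {}
    timeouts = []
    per_proc = Counter()
    for line in lines:
        m = LINE_RE.match(line.strip())
        if not m:
            # e.g. "last message repeated 8 times" has no process field
            continue
        proc, msg = m.group("proc"), m.group("msg")
        per_proc[proc] += 1
        slip = SLIP_RE.search(msg)
        if slip:
            slips[proc] = max(slips.get(proc, 0), int(slip.group(1)))
        timeout = TIMEOUT_RE.search(msg)
        if timeout:
            timeouts.append(timeout.group(1))
    return slips, timeouts, per_proc
```

Feeding it the lines of a saved log file (for example `summarize(open("messages"))`, where the file name is hypothetical) gives one view of which daemons reported a stall and which member links timed out.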
I think the only interesting message is "Subscriber Management is not ready for GRES".
Are the LACP flaps related to NSB not being enabled? How can I debug this overnight issue?
Thanks,