All,
I posted a while back with an issue where we experience network outages at our DR site causing all network devices to drop off monitoring and subsequently all connections in and out are disrupted. Broadcast symptoms.
So after packet sniffing the VLAN generating all this traffic there was a pattern. Traffic was broadcasting meant for a Virt NLB address (Unicast). The outages coincided with elavated levels of traffic being sent to the NLB environment.
The reason for the broadcast is the lookup in the ARP table does not find the virtual MAC so floods to all hosts on the switch, thus wrecking the entire subnet and most switches on the LAN.
We disolved the NLB cluster so all 3 members ran separate and this has resolved the outages.
Now the problem is we need this NLB cluster in place.
I have been speaking to a team at Avaya in India, they dont seem to understand what is happening despite me holding their hand and leading them up the path the cause. Me dissolving the NLB cluster now proves what i was saying all along.
Now i need to know how to configure this correctly, ive seen this before over Extreme Summit X series and there is a work around by looping a cable back in to a non routable private subnet housing the cluster members, this is quite primitive and think Extreme have resolved this now with a SW release.
I have seen an Avaya white paper on this issue but not specifically for 5520's running at layer 2.
We have 2 x Cisco 3560's running HSRP as the Layer 3 engine, below that a stack of 2 x 5520's distributing to 6 x single 5520's housing all hosts (servers). The cluster in question has 3 members running Unicast mode and all reside on the same 5520 switch.
I have been digging around but cannot find anything specific to our issue.
We need to be concentrating on how the switch handles traffic to a virtual address which is not present in its ARP table hence flooding out all interfaces.
I have been thinking of a couple of ideas round this but dont want to lead anyone up those paths as yet.
I would appreciate if any of you guys have seen this before or know how to get round this.
Cheers
Phil