Afternoon
I have a pair of EX3400s, one PoE and one non-PoE, connected together in a Virtual Chassis. The switches were installed around 18 months ago and worked without any issues for around a year. In the last couple of months we have had the same problem twice.
The problem is with DHCP relay: it just stops working, and the only way to resolve it is to reboot the Virtual Chassis. The Virtual Chassis is located in Paris and I am based in London. I have a contact on site who knows a little about IT, so he can do things like plug cables in and run "ipconfig", but all support falls on the London support team. I had my contact in Paris connect directly into one of the switches and run "ipconfig /renew", but the laptop failed to get an IP address. On the DHCP server only around 20% of the available addresses are in use, so I know it is not a DHCP pool issue.
The switches are running Junos 15.1X53-D55.5 (32-bit). While we were experiencing the problem this morning I ran "show dhcp relay statistics" and the counters under "Packets dropped" were increasing. Sorry, I can't remember whether the counters under "Messages received" and "Messages sent" were increasing; I am pretty sure the ones under "Messages received" were.
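Since I could not remember which counters were moving, next time I plan to save two snapshots of "show dhcp relay statistics" a minute or so apart and compare them. Below is a rough Python sketch of that comparison. It assumes the output is laid out as section headings ending in a colon followed by indented "NAME value" lines; the sample text and its numbers are made up for illustration and are not real output from my switches.

```python
import re

# Matches an indented counter line such as "DHCPDISCOVER   40".
COUNTER = re.compile(r"^(\S.*?)\s+(\d+)$")


def parse_counters(text):
    """Return {(section, counter_name): value} from statistics-style text."""
    counters = {}
    section = None
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped:
            continue
        if stripped.endswith(":"):
            # A heading such as "Packets dropped:" starts a new section.
            section = stripped[:-1]
            continue
        match = COUNTER.match(stripped)
        if match and section is not None:
            counters[(section, match.group(1))] = int(match.group(2))
    return counters


def increasing(before, after):
    """Return the counters that grew between two snapshots, with the delta."""
    old, new = parse_counters(before), parse_counters(after)
    return {key: new[key] - old[key]
            for key in new if key in old and new[key] > old[key]}


# Hypothetical snapshots (invented values, assumed layout).
SNAP_1 = """
Packets dropped:
    Total                      10

Messages received:
    BOOTREQUEST                40
    DHCPDISCOVER               40

Messages sent:
    BOOTREQUEST                0
"""

SNAP_2 = """
Packets dropped:
    Total                      25

Messages received:
    BOOTREQUEST                55
    DHCPDISCOVER               55

Messages sent:
    BOOTREQUEST                0
"""

for (section, name), delta in sorted(increasing(SNAP_1, SNAP_2).items()):
    print(f"{section} / {name}: +{delta}")
```

If "Messages received" and "Packets dropped" climb together while "Messages sent" stays flat, that would suggest the relay is seeing the client requests but discarding them rather than forwarding them.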
I also ran "monitor traffic interface ge-1/0/21 no-resolve extensive" and kept getting the same messages:
11:44:38.737204 In
Juniper PCAP Flags [Ext, no-L2, In], PCAP Extension(s) total length 22
Device Media Type Extension TLV #3, length 1, value: Ethernet (1)
Logical Interface Encapsulation Extension TLV #6, length 1, value: Ethernet (14)
Device Interface Index Extension TLV #1, length 2, value: 678
Logical Interface Index Extension TLV #4, length 4, value: 547
Logical Unit Number Extension TLV #5, length 4, value: 102
-----original packet-----
PFE proto 2 (ipv4): (tos 0x0, ttl 255, id 0, offset 0, flags [none], proto: UDP (17), length: 396) 0.0.0.0.68 > 255.255.255.255.67: [udp sum ok] BOOTP/DHCP, Request from ec:f4:bb:0b:80:2c, length 300, xid 0xf2839fee, secs 3072, Flags [none] (0x0000)
Client-Ethernet-Address ec:f4:bb:0b:80:2c
Vendor-rfc1048 Extensions
Magic Cookie 0x63825363
DHCP-Message Option 53, length 1: Discover
Client-ID Option 61, length 7: ether ec:f4:bb:0b:80:2c
Hostname Option 12, length 11: "FRA-L-00019"
Vendor-Class Option 60, length 8: "MSFT 5.0"
Parameter-Request Option 55, length 12:
Subnet-Mask, Domain-Name, Default-Gateway, Domain-Name-Server
Netbios-Name-Server, Netbios-Node, Netbios-Scope, Router-Discovery
Static-Route, Classless-Static-Route, Classless-Static-Route-Microsoft, Vendor-Option
11:44:41.954227 In
Juniper PCAP Flags [Ext, no-L2, In], PCAP Extension(s) total length 22
Device Media Type Extension TLV #3, length 1, value: Ethernet (1)
Logical Interface Encapsulation Extension TLV #6, length 1, value: Ethernet (14)
Device Interface Index Extension TLV #1, length 2, value: 678
Logical Interface Index Extension TLV #4, length 4, value: 547
Logical Unit Number Extension TLV #5, length 4, value: 102
-----original packet-----
PFE proto 2 (ipv4): (tos 0x0, ttl 255, id 0, offset 0, flags [none], proto: UDP (17), length: 396) 0.0.0.0.68 > 255.255.255.255.67: [udp sum ok] BOOTP/DHCP, Request from ec:f4:bb:0b:80:2c, length 300, xid 0xa308b2d, Flags [Broadcast] (0x8000)
Client-Ethernet-Address ec:f4:bb:0b:80:2c
Vendor-rfc1048 Extensions
Magic Cookie 0x63825363
DHCP-Message Option 53, length 1: Discover
Client-ID Option 61, length 7: ether ec:f4:bb:0b:80:2c
Hostname Option 12, length 11: "FRA-L-00019"
Vendor-Class Option 60, length 8: "MSFT 5.0"
Parameter-Request Option 55, length 12:
Subnet-Mask, Domain-Name, Default-Gateway, Domain-Name-Server
Netbios-Name-Server, Netbios-Node, Netbios-Scope, Router-Discovery
Static-Route, Classless-Static-Route, Classless-Static-Route-Microsoft, Vendor-Option
After the reboot I am getting:
12:05:40.275741 In
Juniper PCAP Flags [Ext, no-L2, In], PCAP Extension(s) total length 22
Device Media Type Extension TLV #3, length 1, value: Ethernet (1)
Logical Interface Encapsulation Extension TLV #6, length 1, value: Ethernet (14)
Device Interface Index Extension TLV #1, length 2, value: 678
Logical Interface Index Extension TLV #4, length 4, value: 547
Logical Unit Number Extension TLV #5, length 4, value: 102
-----original packet-----
PFE proto 2 (ipv4): (tos 0x0, ttl 255, id 0, offset 0, flags [none], proto: UDP (17), length: 396) 0.0.0.0.68 > 255.255.255.255.67: [udp sum ok] BOOTP/DHCP, Request from ec:f4:bb:0b:80:2c, length 300, xid 0xa48db9ba, Flags [Broadcast] (0x8000)
Client-Ethernet-Address ec:f4:bb:0b:80:2c
Vendor-rfc1048 Extensions
Magic Cookie 0x63825363
DHCP-Message Option 53, length 1: Discover
Client-ID Option 61, length 7: ether ec:f4:bb:0b:80:2c
Requested-IP Option 50, length 4: 10.33.102.2
Hostname Option 12, length 11: "FRA-L-00019"
Vendor-Class Option 60, length 8: "MSFT 5.0"
Parameter-Request Option 55, length 12:
Subnet-Mask, Domain-Name, Default-Gateway, Domain-Name-Server
Netbios-Name-Server, Netbios-Node, Netbios-Scope, Router-Discovery
Static-Route, Classless-Static-Route, Classless-Static-Route-Microsoft, Vendor-Option
12:05:40.689149 In
Juniper PCAP Flags [Ext, no-L2, In], PCAP Extension(s) total length 22
Device Media Type Extension TLV #3, length 1, value: Ethernet (1)
Logical Interface Encapsulation Extension TLV #6, length 1, value: Ethernet (14)
Device Interface Index Extension TLV #1, length 2, value: 678
Logical Interface Index Extension TLV #4, length 4, value: 547
Logical Unit Number Extension TLV #5, length 4, value: 102
-----original packet-----
PFE proto 2 (ipv4): (tos 0x0, ttl 255, id 0, offset 0, flags [none], proto: UDP (17), length: 427) 0.0.0.0.68 > 255.255.255.255.67: [udp sum ok] BOOTP/DHCP, Request from ec:f4:bb:0b:80:2c, length 331, xid 0xa48db9ba, Flags [Broadcast] (0x8000)
Client-Ethernet-Address ec:f4:bb:0b:80:2c
Vendor-rfc1048 Extensions
Magic Cookie 0x63825363
DHCP-Message Option 53, length 1: Request
Client-ID Option 61, length 7: ether ec:f4:bb:0b:80:2c
Requested-IP Option 50, length 4: 10.33.102.2
Server-ID Option 54, length 4: 10.33.15.11
Hostname Option 12, length 11: "FRA-L-00019"
FQDN Option 81, length 27: "FRA-L-00019.ebiquity.net"
Vendor-Class Option 60, length 8: "MSFT 5.0"
Parameter-Request Option 55, length 12:
Subnet-Mask, Domain-Name, Default-Gateway, Domain-Name-Server
Netbios-Name-Server, Netbios-Node, Netbios-Scope, Router-Discovery
Static-Route, Classless-Static-Route, Classless-Static-Route-Microsoft, Vendor-Option
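The difference I can see between the two captures is that before the reboot the laptop only ever sends Discover, while after the reboot the Discover is followed by a Request carrying a Server-ID, which a client normally only sends once it has received an Offer. For longer captures, a small Python sketch like the one below could tally the message types per client. The function names are my own, and the sample lines are trimmed copies of the packets above.

```python
import re
from collections import defaultdict

# Packet summary line: captures the client MAC and the transaction ID (xid).
HEADER = re.compile(
    r"BOOTP/DHCP, (?:Request|Reply) from ([0-9a-f:]{17}).*?xid (0x[0-9a-f]+)")
# Option 53 line: captures the DHCP message type (Discover, Request, ...).
MSG_TYPE = re.compile(r"DHCP-Message Option 53, length 1: (\w+)")


def summarize(capture):
    """Map (client MAC, xid) to the list of message types, in capture order."""
    seen = defaultdict(list)
    key = None
    for line in capture.splitlines():
        header = HEADER.search(line)
        if header:
            key = (header.group(1), header.group(2))
            continue
        msg = MSG_TYPE.search(line)
        if msg and key is not None:
            seen[key].append(msg.group(1))
    return dict(seen)


def stuck_clients(summary):
    """Return MACs that only ever sent Discover, so never got as far as Request."""
    by_mac = defaultdict(set)
    for (mac, _xid), types in summary.items():
        by_mac[mac].update(types)
    return sorted(mac for mac, types in by_mac.items() if types == {"Discover"})


# Trimmed lines from the capture taken while the problem was occurring.
BEFORE = """\
BOOTP/DHCP, Request from ec:f4:bb:0b:80:2c, length 300, xid 0xf2839fee, secs 3072, Flags [none] (0x0000)
DHCP-Message Option 53, length 1: Discover
BOOTP/DHCP, Request from ec:f4:bb:0b:80:2c, length 300, xid 0xa308b2d, Flags [Broadcast] (0x8000)
DHCP-Message Option 53, length 1: Discover
"""

# Trimmed lines from the capture taken after the reboot.
AFTER = """\
BOOTP/DHCP, Request from ec:f4:bb:0b:80:2c, length 300, xid 0xa48db9ba, Flags [Broadcast] (0x8000)
DHCP-Message Option 53, length 1: Discover
BOOTP/DHCP, Request from ec:f4:bb:0b:80:2c, length 331, xid 0xa48db9ba, Flags [Broadcast] (0x8000)
DHCP-Message Option 53, length 1: Request
"""

print("stuck before reboot:", stuck_clients(summarize(BEFORE)))
print("stuck after reboot: ", stuck_clients(summarize(AFTER)))
```

Because the capture only shows inbound packets on the access port, a client that keeps retrying Discover with new xids is the only visible sign that no Offer is making it back through the relay.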
Before the reboot I ran "show dhcp relay binding" and there were only a handful of addresses in the list, but they all had the state "BOUND". I then ran "clear dhcp relay binding", which removed all the bound entries, and even after 5 minutes nothing appeared in the list even though my contact on site ran "ipconfig /renew" at least 3 or 4 times.
The config I have is as follows
show forwarding-options
storm-control-profiles default {
    all;
}
dhcp-relay {
    server-group {
        FRADHCPGroup {
            <Address1>;
            <Address2>;
        }
        FRAVoiceDHCP {
            <VOIPControllerIPAddress>;
        }
    }
    active-server-group FRADHCPGroup;
    group FRADHCPGroup {
        interface irb.15;
        interface irb.21;
        interface irb.90;
        interface irb.102;
        interface irb.105;
        interface irb.150;
        interface irb.170;
        interface irb.180;
        interface irb.200;
        interface irb.202;
        interface irb.204;
        interface irb.205;
        interface irb.232;
    }
    inactive: group FRAVoiceDHCP {
        active-server-group FRAVoiceDHCP;
        interface irb.98;
    }
}
I have tried adding in the following, but they did not help with the problem.
set forwarding-options dhcp-relay overrides allow-snooped-clients
set forwarding-options dhcp-relay overrides delete-binding-on-renegotiation
set forwarding-options dhcp-relay overrides send-release-on-delete
set forwarding-options dhcp-relay forward-snooped-clients
The DHCP servers run on a pair of Active Directory domain controllers in a "Windows 2012 R2 DHCP Cluster". One domain controller is connected directly to the switches over an LACP aggregated Ethernet interface; the second is a virtual machine on a Hyper-V server, which is also connected to the Virtual Chassis over an LACP aggregated Ethernet interface.
If anyone has any ideas I would be very grateful.
Richard