Rapid Network Redundancy Failover
20230188874 ยท 2023-06-15
Inventors
- Kyle Platts (Huntsville, AL, US)
- Camila Jersonsky (Huntsville, AL, US)
- Darrin L. Gieger (Huntsville, AL, US)
- Andrew T. Ruble (Athens, AL, US)
- Abhijeet Shirgurkar (Huntsville, AL, US)
Cpc classification
H04L43/10
ELECTRICITY
H04Q11/0067
ELECTRICITY
H04L41/0663
ELECTRICITY
H04Q2011/0081
ELECTRICITY
International classification
Abstract
Methods and systems for high speed failover in a network are provided. To provide faster Type C GPON redundancy failover, the disclosure herein describes the use of G.8031 1:1 ELPS in a single ended application to ensure path integrity through the network. Single ended 1:1 ELPS means that a network device is configured with 1:1 ELPS and switches paths in the event of disruption of the working communication path without the other underlying transport entities having knowledge of either the ELPS protocol or state machine. ELPS (Ethernet Linear Protection Switching, ITU G.8031) is a standardized method for protection switching between two point-to-point paths through a network, however its application here is quite novel. During a failure on the working path, traffic will switch over to the protection path. Type C PON protection provides a fully redundant path between the OLT and the ONU (2 separate PONs).
Claims
1. A method of communication resilience in a network, comprising: establishing a working communication path between an aggregation switch and a CPE, wherein the working communication path communicatively traverses an OLT and wherein a MEP of the aggregation switch is communicatively coupled with a MEP of the CPE; establishing a protection communication path between the aggregation switch or a second aggregation switch and the CPE, wherein the protection communication path traverses a second OLT and wherein a second MEP of the aggregation switch or the second aggregation switch is communicatively coupled with a second MEP of the CPE; wherein the CPEs transmit and receive data on the working communication path and monitor the protection communication path in a non-fault state; detecting a network fault on the working communication path based on non-responsiveness of the MEP of the aggregation switch or the MEP of the CPE; and responding to the network fault on the working communication path by promoting, at the aggregation switch, the protection communication path to an active state.
2. The method of claim 1 wherein detecting a network fault on the working communication path based on non-responsiveness of the MEP of the aggregation switch or the MEP of the CPE further comprises: monitoring the working communication path using continuity check messages generated by the MEP of the aggregation switch.
3. The method of claim 2 wherein the continuity check messages include status information about a local port and a physical interface.
4. The method of claim 1 wherein the MEP of the CPE sends an RDI notification to the aggregation switch based on a determination that the CPE has detected a communication fault in the working communication path.
6. The method of claim 1 wherein a communication path exists between each physical interface of aggregation switch and the OLT element.
7. The method of claim 1 wherein promoting, at the aggregation switch, the protection communication path comprises: switching upstream traffic from the CPE to the aggregation switch from the working communication path to the protection communication path; learning a MAC address of a port coupled to the protection path at the CPE; sending downstream traffic from the aggregation switch to the port at the CPE.
8. The method of claim 1 further comprising: sending, by the CPE, a gratuitous ARP containing IP address and MAC address information.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025] Like reference numbers and designations in the various drawings indicate like elements.
DETAILED DESCRIPTION
[0026] Methods and systems for Rapid Type C GPON Redundancy Failover are discussed throughout this document. As will be discussed in more detail with reference to the figures, redundant communication paths exist between a CPE and a network. Network access is via one or more aggregation switches. These redundant communication paths can be viewed as an ELPS protection group whose connectivity is protected from network failures through the use of a unique form of Single Ended 1:1 ELPS processing. Unlike traditional ELPS processing, only one endpoint is directly involved in fault detection and there is no coordination between the two endpoints using APS messages. The network fault detection and rapid failover scheme described herein also decouples the control and data planes. The solution separates the control and data planes such that the control plane is monitored using the unique Single Ended 1:1 ELPS while the data plane uses ELAN resiliency. Thus, as is disclosed herein, the network can individually or collectively control failover, as appropriate. As one example, CCMs associated with one VLAN could detect network faults causing failovers in different VLANs.
[0027] Per ELPS, between two network entities, traffic traverses one of two paths: a working path or a protection path. A given path has two states: active or standby. These two paths and their associated traffic and services, running on VLANs, form an ELPS (Ethernet linear protection switching) group. In normal operation, the traffic and services will traverse the working path, as it is active while the protection path is standby. However, in a fault state, the ELPS group fails over from the active working path such that its traffic and services now traverse the newly active protection path. The ELPS group may revert the active path to the working path when the failure has been corrected, however this is not required.
[0028] As described in the standard, G.8031 1:1 ELPS uses selectors and bridges at upstream and downstream network elements (EAST and WEST endpoints) that are coordinated using state machines tracking the active and standby status of the working transport entity (TE) and the protection transport entity (TE). To detect faults on the working and protection TEs, CCM traffic is sent over both paths. When a fault is detected, APS packets are sent on the protection TE. For clarity, the term working TE and working path refer to the same element, and the term protection TE and protection path refer to the same element.
[0029] G.8031 can be advantageously modified by replacing the selector and bridge at the WEST endpoint with an Ethernet switch. CCM messages are communicated on each of the working and protect paths to monitor path health. CCM endpoints detect network faults and determine the path fault domain. The Ethernet switch generates CCM messages on the working and the protect paths that will inform the CPE of their status and integrity, and ultimately allow it to make a decision as to which path to use. The EAST endpoints then choose which of the working or protect path should be designated the active path. Among the ports assigned to the working and protect paths, the active port of the WEST Ethernet switch is determined to be that port with a MAC address known to the system (e.g., through ARP tables, IP to MAC address mappings, etc.). Unlike G.8031 defined in the standard, no APS packets are used. In other words, this solution can be implemented independent of APS packets.
[0030] In one implementation, the CPEs only transmit and receive on the active path while monitoring both paths using CCMs. The aggregation switches are agnostic to the ELPS group. However, the aggregation switches contain MEPs to generate CCMs to each CPE. The CPEs make the decision as to which path to use based on the CCMs received from the switch, and trigger path changes in the ELPS state machine accordingly. For instance, the absence of received CCM traffic on a MEP of the CPE indicates a network fault on that communication path. On failover, the aggregation switches relearn the traffic MAC addresses on the newly active path as the traffic starts to flow through it, such as through ARP messaging for management or through upstream data packets. Accordingly, rapid fault detection and failover can occur in many embodiments.
[0031] Because the WEST endpoint does not use the ELPS protocol or state machine, it's functionality can be split between multiple aggregation switches, providing additional redundancy.
[0032] As shown in
[0033] As shown in
[0034] Still with respect to
[0035] Still with respect to
[0036]
[0037] An ELPS protection group is established 530 comprising the working path and the protection path. Communication proceeds over the ELPS protection group 540. As communication proceeds over the ELPS protection group 540, CCM traffic is monitored 550 for fault detection 560. A fault on a working path can be detected 560 based on the absence of CCM traffic at a MEP, of a CPE device, associated with the working path. For example, if CCM traffic persists, no fault is indicated 565. If CCM traffic is absent, a fault is detected 570. When a fault is detected 570, the CPE may send an RDI notification to the aggregation switch over the protect path. RDI notifications can be sent over either the working or the protect path, depending on which has the fault. When a fault is detected 570 on the working path, the protection path is promoted to an active state and becomes the active path for that ELPS protection group 580. Communications continue on that ELPS protection group 540 and CCM traffic continues to be monitored 550. For instance, once the protection path is made active, the CPE switches upstream traffic to the aggregation switch from the working communication path to the protection communication path. For downstream traffic, the aggregation switches learn a MAC address of a port coupled to the active path at the CPE. The aggregation switches may learn this MAC address by sending a gratuitous ARP message. The CPE sends a gratuitous ARP for the aggregation switch to learn its management MAC address. Upstream traffic flowing through the CPE causes the aggregation switch to learn other MAC addresses. Once the MAC address of the port coupled to the active path at the CPE is learned, the aggregation switches send downstream traffic on the active path to the port at the CPE.
[0038]
[0039] An ELPS protection group is established 630 comprising the working path and the protection path. Communication proceeds over the ELPS protection group 640. As communication proceeds over the ELPS protection group 640, CCM traffic is monitored 650 for fault detection 560. A fault on both the active and the standby paths can be detected 660 based on the absence of CCM traffic at a MEP, of a CPE device, associated with the working path. For example, if CCM traffic persists, no fault is indicated 665. If CCM traffic is absent, a fault is detected 670. When a fault is detected 670, the CPE may send an RDI notification to the aggregation switch over the protect path. RDI notifications can be sent over either the working or the protect path, depending on which has the fault. When a fault is detected 670 on both the active and the standby paths, the working path is promoted to an active state and becomes the active path for that ELPS protection group 680. Communications continue on that ELPS protection group 640 and CCM traffic continues to be monitored 650. For instance, once the protection path is made active, the CPE switches upstream traffic to the aggregation switch from the working communication path to the protection communication path. For downstream traffic, the aggregation switches learn a MAC address of a port coupled to the active path at the CPE. The aggregation switches may learn this MAC address by sending a gratuitous ARP message. The CPE sends a gratuitous ARP for the aggregation switch to learn its management MAC address. Upstream traffic flowing through the CPE causes the aggregation switch to learn other MAC addresses. Once the MAC address of the port coupled to the active path at the CPE is learned, the aggregation switches send downstream traffic on the active path to the port at the CPE.
[0040] As shown in
[0041] As shown in
[0042] As shown in
[0043] CPE (customer-premises equipment) generally refers to devices such as telephones, routers, network switches, residential gateways, set-top boxes, fixed mobile convergence products, home networking adapters and Internet access gateways that enable consumers to access communication providers' services and distribute them in a residence or business over a local area network.
[0044] While this specification contains many specific implementation details, these should not be construed as limitations on the scope of any inventions or of what may be claimed, but rather as descriptions of features specific to particular embodiments of particular inventions. Certain features that are described in this specification in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination. Moreover, although features may be described above as acting in certain combinations and even initially claimed as such, one or more features from a claimed combination can in some cases be excised from the combination, and the claimed combination may be directed to a subcombination or variation of a subcombination.
[0045] Similarly, while operations are depicted in the drawings in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in sequential order, or that all illustrated operations be performed, to achieve desirable results. In certain circumstances, multitasking and parallel processing may be advantageous. Moreover, the separation of various system components in the embodiments described above should not be understood as requiring such separation in all embodiments, and it should be understood that the described program components and systems can generally be integrated together in a single software product or packaged into multiple software products, or a single hardware product or multiple hardware products, or any combination thereof.
[0046] Thus, particular embodiments of the subject matter have been described. Other embodiments are within the scope of the following claims. In some cases, the actions recited in the claims can be performed in a different order and still achieve desirable results. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In certain implementations, multitasking and parallel processing may be advantageous.