Multi-chassis link aggregation group

Last updated

A multi-chassis link aggregation group (MLAG or MC-LAG) is a type of link aggregation group (LAG) with constituent ports that terminate on separate chassis, primarily for the purpose of providing redundancy in the event one of the chassis fails. The IEEE 802.1AX-2008 industry standard for link aggregation does not mention MC-LAG, but does not preclude it. Its implementation varies by vendor; notably, the protocol existing between the chassis is proprietary.

Contents

Background

A LAG is a method of inverse multiplexing over multiple Ethernet links, thereby increasing bandwidth and providing redundancy. It is defined by the IEEE 802.1AX-2008 standard, which states, "Link Aggregation allows one or more links to be aggregated together to form a Link Aggregation Group, such that a MAC client can treat the Link Aggregation Group as if it were a single link." [1] This layer 2 transparency is achieved by the LAG using a single MAC address for all the device’s ports in the LAG group. LAG can be configured as either static or dynamic. Dynamic LAG uses a peer-to-peer protocol for control, called Link Aggregation Control Protocol (LACP). This LACP protocol is also defined within the 802.1AX-2008 standard.

Multi-chassis

MC-LAG adds node-level redundancy to the normal link-level redundancy that a LAG provides. This allows two or more nodes to share a common LAG endpoint. The multiple nodes present a single logical LAG to the remote end. Note that MC-LAG implementations are vendor-specific, but cooperating chassis remain externally compliant to the IEEE 802.1AX-2008 standard. [2] Nodes in an MC-LAG cluster communicate to synchronize and negotiate automatic switchovers (failover). Some implementations may support administrator-initiated (manual) switchovers.

The diagram here shows four configurations:

Illustration comparing LAG to high-availability MLAG Using Multi-Chassis LAG (MC-LAG) for High Availability.png
Illustration comparing LAG to high-availability MLAG
  1. Switches A and B are each configured to group four discrete links (as indicated in green) into a single logical link with four times the bandwidth. Standard LACP protocol ensures that if any of the links go down, traffic will be distributed among the remaining three.
  2. Switch A is replaced by two chassis, switches A1 and A2. They communicate between themselves using a proprietary protocol and are thereby able to masquerade as a single "virtual" switch A running a shared instance of LACP. Switch B is not aware that it is connected to more than one chassis.
  3. Switch B is also replaced by two chassis B1 and B2. If these switches are from a different vendor, they may use a different proprietary protocol between themselves. But "virtual" switches A and B still communicate using LACP.
  4. Crossing two links to form an X makes no difference logically, any more than crossing links in a normal LAG would. However, physically it provides much improved fault tolerance (high availability). If any of the switches fail, LACP reconfigures paths in as little as a few seconds. Operation continues with paths existing between all sources and destinations, albeit with degraded bandwidth.

The HA configuration is superior to spanning tree. Load can be shared across all links during normal operation, whereas spanning tree must disable some links to prevent loops.

Implementations

The following table lists known vendor implementations of MC-LAG, all of which are proprietary.

VendorImplementation Name
ADVA Optical Networking MC-LAG
Arista Networks MLAG
Aruba Networks (formerly HP ProCurve)Distributed Trunking under Intelligent Resilient Framework switch clustering technology
Avaya Distributed Split Multi-Link Trunking
Ruckus Networks (formerly Brocade)Multi-Chassis Trunking
Ciena MC-LAG
Cisco Catalyst 6500Multichassis Etherchannel (MEC) - Virtual Switching System (VSS)
Cisco Catalyst 3750 (and similar)Cross-Stack EtherChannel
Cisco Catalyst 9000StackWise Virtual
Cisco Nexus Virtual PortChannel (vPC), where a PortChannel is a regular LAG
Cisco IOS XRmLACP (Multichassis Link Aggregation Control Protocol)
Cumulus Networks MLAG (formerly CLAG)
Dell Networking (formerly Force10 Networks, formerly nCore)DNOS6.x Virtual Port Channel (vPC) or Virtual Link Trunking
Edgecore Networks MLAG
Extreme Networks MLAG (Multi Switch Link Aggregation Group)
Ericsson MC-LAG (Multi Chassis Link Aggregation Group)
FS MLAG
Fortinet MC-LAG (Multi Chassis Link Aggregation Group)
H3C Distributed Resilient Network Interconnect
Huawei M-LAG
Juniper MC-LAG
Lenovo Networking (formerly IBM)vLAG
Mellanox Technologies MLAG
MikroTik MLAG [3]
NEC MC-LAG (Openflow to traditional network)
NocsysMLAG
Netgear MLAG
Nokia (Formerly Alcatel-Lucent)MC-LAG
Nortel Split multi-link trunking
Nuage Networks (from Nokia)MC-LAG ; including MCS (Multi-chassis Sync)
Plexxi (now Aruba Networks))vLAG
Pluribus Networks (now Arista Networks)vLAG
ZTE MC-LAG

Alternatives

IEEE 802.1aq (Shortest Path Bridging) is an alternative to MC-LAG that can be used for complex networks. [4]

TRILL (TRansparent Interconnection of Lots of Links) facilitates an Ethernet to have an arbitrary topology, and enables per flow pair-wise load splitting by way of Dijkstra's algorithm, without configuration and user intervention.

Related Research Articles

A network switch is networking hardware that connects devices on a computer network by using packet switching to receive and forward data to the destination device.

The Spanning Tree Protocol (STP) is a network protocol that builds a loop-free logical topology for Ethernet networks. The basic function of STP is to prevent bridge loops and the broadcast radiation that results from them. Spanning tree also allows a network design to include backup links providing fault tolerance if an active link fails.

A virtual local area network (VLAN) is any broadcast domain that is partitioned and isolated in a computer network at the data link layer. In this context, virtual, refers to a physical object recreated and altered by additional logic, within the local area network. VLANs work by applying tags to network frames and handling these tags in networking systems – creating the appearance and functionality of network traffic that is physically on a single network but acts as if it is split between separate networks. In this way, VLANs can keep network applications separate despite being connected to the same physical network, and without requiring multiple sets of cabling and networking devices to be deployed.

<span class="mw-page-title-main">Link aggregation</span> Using multiple network connections in parallel to increase capacity and reliability

In computer networking, link aggregation is the combining of multiple network connections in parallel by any of several methods, in order to increase throughput beyond what a single connection could sustain, to provide redundancy in case one of the links should fail, or both. A link aggregation group (LAG) is the combined collection of physical ports.

<span class="mw-page-title-main">Metro Ethernet</span> Metropolitan area network based on Ethernet standards

A metropolitan-area Ethernet, Ethernet MAN, or metro Ethernet network is a metropolitan area network (MAN) that is based on Ethernet standards. It is commonly used to connect subscribers to a larger service network or the Internet. Businesses can also use metropolitan-area Ethernet to connect their own offices to each other.

<span class="mw-page-title-main">EtherChannel</span> Computer networking link aggregation technology

EtherChannel is a port link aggregation technology or port-channel architecture used primarily on Cisco switches. It allows grouping of several physical Ethernet links to create one logical Ethernet link for the purpose of providing fault-tolerance and high-speed links between switches, routers and servers. An EtherChannel can be created from between two and eight active Fast, Gigabit or 10-Gigabit Ethernet ports, with an additional one to eight inactive (failover) ports which become active as the other active ports fail. EtherChannel is primarily used in the backbone network, but can also be used to connect end user machines.

<span class="mw-page-title-main">Profinet</span>

Profinet is an industry technical standard for data communication over Industrial Ethernet, designed for collecting data from, and controlling equipment in industrial systems, with a particular strength in delivering data under tight time constraints. The standard is maintained and supported by Profibus and Profinet International, an umbrella organization headquartered in Karlsruhe, Germany.

Avionics Full-Duplex Switched Ethernet (AFDX), also ARINC 664, is a data network, patented by international aircraft manufacturer Airbus, for safety-critical applications that utilizes dedicated bandwidth while providing deterministic quality of service (QoS). AFDX is a worldwide registered trademark by Airbus. The AFDX data network is based on Ethernet technology using commercial off-the-shelf (COTS) components. The AFDX data network is a specific implementation of ARINC Specification 664 Part 7, a profiled version of an IEEE 802.3 network per parts 1 & 2, which defines how commercial off-the-shelf networking components will be used for future generation Aircraft Data Networks (ADN). The six primary aspects of an AFDX data network include full duplex, redundancy, determinism, high speed performance, switched and profiled network.

<span class="mw-page-title-main">Computer network</span> Network that allows computers to share resources and communicate with each other

A computer network is a set of computers sharing resources located on or provided by network nodes. The computers use common communication protocols over digital interconnections to communicate with each other. These interconnections are made up of telecommunication network technologies, based on physically wired, optical, and wireless radio-frequency methods that may be arranged in a variety of network topologies.

Port Aggregation Protocol (PAgP) is a Cisco Systems proprietary networking protocol, which is used for the automated, logical aggregation of Ethernet switch ports, known as an EtherChannel. The PAgP is proprietary to Cisco Systems. A similar protocol known as LACP — released by the IEEE and known as 802.3ad or 802.1ax recently — is an industry standard and is not tied to a specific vendor.

<span class="mw-page-title-main">Multi-link trunking</span> Network link aggregation technology

Multi-link trunking (MLT) is a link aggregation technology developed at Nortel in 1999. It allows grouping several physical Ethernet links into one logical Ethernet link to provide fault-tolerance and high-speed links between routers, switches, and servers.

<span class="mw-page-title-main">Storage area network</span> Network which provides access to consolidated, block-level data storage

A storage area network (SAN) or storage network is a computer network which provides access to consolidated, block-level data storage. SANs are primarily used to access data storage devices, such as disk arrays and tape libraries from servers so that the devices appear to the operating system as direct-attached storage. A SAN typically is a dedicated network of storage devices not accessible through the local area network (LAN).

Data center bridging (DCB) is a set of enhancements to the Ethernet local area network communication protocol for use in data center environments, in particular for use with clustering and storage area networks.

Shortest Path Bridging (SPB), specified in the IEEE 802.1aq standard, is a computer networking technology intended to simplify the creation and configuration of Ethernet networks while enabling multipath routing.

Intelligent Resilient Framework (IRF) is a proprietary software virtualization technology developed by H3C (3Com). Its core idea is to connect multiple network devices through physical IRF ports and perform necessary configurations, and then these devices are virtualized into a distributed device. This virtualization technology realizes the cooperation, unified management, and non-stop maintenance of multiple devices. This technology follows some of the same general concepts as Cisco's VSS and vPC technologies.

Virtual Link Trunking (VLT) is a name that has been used for at least two proprietary network protocols. A link aggregation protocol developed by Force10 and an early VLAN tagging capability from 3Com.

Link protection is designed to safeguard networks from failure. Failures in high-speed networks have always been a concern of utmost importance. A single fiber cut can lead to heavy losses of traffic and protection-switching techniques have been used as the key source to ensure survivability in networks. Survivability can be addressed in many layers in a network and protection can be performed at the physical layer, Layer 2 and Layer 3 (IP).

Time-Sensitive Networking (TSN) is a set of standards under development by the Time-Sensitive Networking task group of the IEEE 802.1 working group. The TSN task group was formed in November 2012 by renaming the existing Audio Video Bridging Task Group and continuing its work. The name changed as a result of the extension of the working area of the standardization group. The standards define mechanisms for the time-sensitive transmission of data over deterministic Ethernet networks.

FlexE, also known as Flexible Ethernet is a communications protocol published by the Optical Internetworking Forum (OIF).

<span class="mw-page-title-main">Audio Video Bridging</span> Specifications for synchronized, low-latency streaming through IEEE 802 networks

Audio Video Bridging (AVB) is a common name for the set of technical standards which provide improved synchronization, low-latency, and reliability for switched Ethernet networks. AVB embodies the following technologies and standards:

References

  1. IEEE. IEEE 802.1AX-2008. IEEE.
  2. Bhagat, Amit N. "Multichassis Link Aggregation Group". Google Knowledge Base. Retrieved 15 March 2012.
  3. MikroTik: Multi-chassis Link Aggregation Group
  4. Mike Fratto (2011-03-07). "When MLAG Is Good Enough". Network Computing.