High-availability Seamless Redundancy

Last updated

High-availability Seamless Redundancy (HSR) is a network protocol for Ethernet that provides seamless failover against failure of any single network component. PRP and HSR are independent of the application-protocol and can be used by most Industrial Ethernet protocols in the IEC 61784 suite. HSR does not cover the failure of end nodes, but redundant nodes can be connected via HSR.

Contents

HSR nodes have two ports and act as a bridge, which allows arranging them into a ring or meshed structure without dedicated switches. This is in contrast to the companion standard Parallel Redundancy Protocol (PRP), [1] with which HSR shares the operating principle. PRP and HSR are standardized by the IEC 62439-3:2016. [2]

PRP and HSR are suited for applications that request high availability and short switchover time. [3] For such applications, the recovery time of commonly used protocols such as the Rapid Spanning Tree Protocol (RSTP) is too long. It has been adopted for electrical substation automation in the framework of IEC 61850. [4] It is used in synchronized drives (e.g. in printing machines) and high power inverters. [5]

The cost of HSR is that nodes require hardware support (FPGA or ASIC) to forward or discard frames within microseconds. This cost is compensated because no Ethernet switches are required. Hardware support is anyhow needed when the node supports clock synchronization or security.

Topology

An HSR network node (DANH) has at least two Ethernet ports, each attached to a neighbour HSR node, so that always two paths exist between two nodes. Therefore, as long as one path is operational, the destination application always receives one frame. HSR nodes check the redundancy continuously to detect lurking failures.

HSR is typically used in a ring topology or in another mesh topology.

Nodes with single attachment (such as a printer) are attached through a RedBox (Redundancy Box).

Redundant connections to other networks are possible, especially to a Parallel Redundancy Protocol (PRP) network.

Since HSR and PRP use the same duplicate identification mechanism, PRP and HSR networks can be connected without single point of failure and the same nodes can be built to be used in both PRP and HSR networks.

Operation

Every HSR node is a switching node, i.e. it can forward a frame received on one port to at least one other port in cut-through mode.

A source node sends the same frame over all ports to the neighbour nodes.

A destination node should receive, in the fault-free state, two identical frames within a certain time skew, forward the first frame to the application and discard the second frame when (and if) it comes.

A node forwards a frame unless it detects a frame that it sent itself or that it already sent. To reduce unicast traffic, a node does not forward a frame for which it is the sole destination (Mode U). This does not apply when traffic supervision is needed.

To reduce traffic, a node may refrain from sending a frame that it already received from the opposite direction on the same port (Mode X), [6] but this does not apply to all frames.

Also, several algorithms that relies on network node location learning can serve in the HSR traffic reduction, such as the Port Locking and Enhanced Port Locking, (PL) and (EPL) respectively, which work on closing the ports that leads to a non existed node, [7]

Especially, Precision Time Protocol frames (multicast) are no duplicates of each other since they are modified by each node to correct the time. Such frames can only be retired by the node that originally inserted them, or by another node that already sent them. Also, this mode cannot be used when deterministic operation is required.

A special treatment is given to link-specific frames such as LLDP or Pdelay_Req / Pdelay_Resp Precision Time Protocol frames, for which the HSR tag is ignored, but must be present.

Frame format

To simplify the detection of duplicates, the frames are identified by their source address and a sequence number that is incremented for each frame sent according to the HSR protocol. The sequence number, the frame size and the path identifier are appended in a 6-octet HSR tag (header).

NOTE: all legacy devices should accept Ethernet frames up to 1528 octets, this is below the theoretical limit of 1535 octets.

Performance

In an HSR ring, only about half of the network bandwidth is available to applications for multicast traffic (compared to RSTP). This is because all frames are sent twice over the same network, even when there is no failure. However, since also the network infrastructure is also doubled in closed ring topologies the nominal network bandwidth is available. E.g. in a 100 Mbit/s Ethernet ring 100 Mbit/s are available (but not 200 Mbit/s).

Implementation

Since the forwarding delay of every node in an HSR ring adds to the total network latency, frames are forwarded within microseconds.
In practice, hardware support (FPGA) [8] is required to bring down the per-hop latency to a reasonable value (some 5μs at 100 Mbit/s), using cut-through switching.
To this purpose, each frame has an HSR tag that allows recognition of whether the frame should be forwarded or not, to avoid store-and-forward. This means that corrupted frames will not be removed from the ring until they reach a node that already sent them.

Clock synchronization

IEC 62439-3 Annex C specifies a Precision Time Protocol Industry Profile (PIP L2P2P), that allows a clock synchronization down to an accuracy of 1 μs in a ring of 16 HSR nodes. This PTP protocol also allows operating the HSR ring deterministically for a dedicated class of traffic, for instance Sampled Values in IEC 61850. It has been adopted by IEEE as IEC/IEEE 61850-9-3, . [9]

See also

Related Research Articles

<span class="mw-page-title-main">Ethernet</span> Computer networking technology

Ethernet is a family of wired computer networking technologies commonly used in local area networks (LAN), metropolitan area networks (MAN) and wide area networks (WAN). It was commercially introduced in 1980 and first standardized in 1983 as IEEE 802.3. Ethernet has since been refined to support higher bit rates, a greater number of nodes, and longer link distances, but retains much backward compatibility. Over time, Ethernet has largely replaced competing wired LAN technologies such as Token Ring, FDDI and ARCNET.

<span class="mw-page-title-main">Fiber Distributed Data Interface</span> Standard for data transmission in a local area network

Fiber Distributed Data Interface (FDDI) is a standard for data transmission in a local area network. It uses optical fiber as its standard underlying physical medium, although it was also later specified to use copper cable, in which case it may be called CDDI, standardized as TP-PMD, also referred to as TP-DDI.

<span class="mw-page-title-main">Network topology</span> Arrangement of the elements of a communication network

Network topology is the arrangement of the elements of a communication network. Network topology can be used to define or describe the arrangement of various types of telecommunication networks, including command and control radio networks, industrial fieldbusses and computer networks.

A virtual local area network (VLAN) is any broadcast domain that is partitioned and isolated in a computer network at the data link layer. In this context, virtual refers to a physical object recreated and altered by additional logic, within the local area network. VLANs work by applying tags to network frames and handling these tags in networking systems – creating the appearance and functionality of network traffic that is physically on a single network but acts as if it is split between separate networks. In this way, VLANs can keep network applications separate despite being connected to the same physical network, and without requiring multiple sets of cabling and networking devices to be deployed.

IEC 61850 is an international standard defining communication protocols for intelligent electronic devices at electrical substations. It is a part of the International Electrotechnical Commission's (IEC) Technical Committee 57 reference architecture for electric power systems. The abstract data models defined in IEC 61850 can be mapped to a number of protocols. Current mappings in the standard are to Manufacturing Message Specification (MMS), GOOSE [see section 3, Terms and definitions, term 3.65 on page 14], SV or SMV, and soon to web services. In the previous version of the standard, GOOSE stood for "Generic Object Oriented Substation Event", but this old definition is still very common in IEC 61850 documentation. These protocols can run over TCP/IP networks or substation LANs using high speed switched Ethernet to obtain the necessary response times below four milliseconds for protective relaying.

The Precision Time Protocol (PTP) is a protocol used to synchronize clocks throughout a computer network. On a local area network, it achieves clock accuracy in the sub-microsecond range, making it suitable for measurement and control systems. PTP is employed to synchronize financial transactions, mobile phone tower transmissions, sub-sea acoustic arrays, and networks that require precise timing but lack access to satellite navigation signals.

<span class="mw-page-title-main">Profinet</span> Computer network protocol

Profinet is an industry technical standard for data communication over Industrial Ethernet, designed for collecting data from, and controlling equipment in industrial systems, with a particular strength in delivering data under tight time constraints. The standard is maintained and supported by Profibus and Profinet International, an umbrella organization headquartered in Karlsruhe, Germany.

Open Transport Network (OTN) is a flexible private communication network based on fiber optic technology, manufactured by OTN Systems.

EtherCAT is an Ethernet-based fieldbus system developed by Beckhoff Automation. The protocol is standardized in IEC 61158 and is suitable for both hard and soft real-time computing requirements in automation technology.

<span class="mw-page-title-main">Token Ring</span> Technology for computer networking

Token Ring is a physical and data link layer computer networking technology used to build local area networks. It was introduced by IBM in 1984, and standardized in 1989 as IEEE 802.5. It uses a special three-byte frame called a token that is passed around a logical ring of workstations or servers. This token passing is a channel access method providing fair access for all stations, and eliminating the collisions of contention-based access methods.

A switching loop or bridge loop occurs in computer networks when there is more than one layer 2 path between two endpoints. The loop creates broadcast storms as broadcasts and multicasts are forwarded by switches out every port, the switch or switches will repeatedly rebroadcast the broadcast messages flooding the network. Since the layer-2 header does not include a time to live (TTL) field, if a frame is sent into a looped topology, it can loop forever.

Sercos III is the third generation of the Sercos interface, a standardized open digital interface for the communication between industrial controls, motion devices, input/output devices (I/O), and Ethernet nodes, such as PCs. Sercos III applies the hard real-time features of the Sercos interface to Ethernet. It is based upon and conforms to the Ethernet standard. Work began on Sercos III in 2003, with vendors releasing first products supporting it in 2005.

Synchronous Ethernet, also referred as SyncE, is an ITU-T standard for computer networking that facilitates the transference of clock signals over the Ethernet physical layer. This signal can then be made traceable to an external clock.

Parallel Redundancy Protocol (PRP) is a network protocol standard for Ethernet that provides seamless failover against failure of any network component. This redundancy is invisible to the application.

RAPIEnet is Korea's first Ethernet international standard for real-time data transmission.

<span class="mw-page-title-main">Media Redundancy Protocol</span> Fault recovery system for Ethernet

Media Redundancy Protocol (MRP) is a data network protocol standardized by the International Electrotechnical Commission as IEC 62439-2. It allows rings of Ethernet switches to overcome any single failure with recovery time much faster than achievable with Spanning Tree Protocol. It is suitable to most industrial Ethernet applications.

Time-Sensitive Networking (TSN) is a set of standards under development by the Time-Sensitive Networking task group of the IEEE 802.1 working group. The TSN task group was formed in November 2012 by renaming the existing Audio Video Bridging Task Group and continuing its work. The name changed as a result of the extension of the working area of the standardization group. The standards define mechanisms for the time-sensitive transmission of data over deterministic Ethernet networks.

<span class="mw-page-title-main">IEC/IEEE 61850-9-3</span>

IEC/IEEE 61850-9-3 or PUP is an international standard for precise time distribution and clock synchronization in electrical grids with an accuracy of 1 μs.
It supports precise time stamping of voltage and current measurement for differential protection, wide area monitoring and protection, busbar protection and event recording.
It can be used to ensure deterministic operation of critical functions in the automation system.
It belongs to the IEC 61850 standard suite for communication networks and systems for power utility automation.

Industrial automation systems consisting of several distributed controllers need a precise synchronization for commands, events and process data. For instance, motors for newspaper printing are synchronized within some 5 microseconds to ensure that the color pixels in the different cylinders come within 0.1 mm at a paper speed of some 20 m/s. Similar requirements exist in high-power semiconductors and in drive-by-wire vehicles. This synchronisation is provided by the communication network, in most cases Industrial Ethernet. Many ad-hoc synchronization schemes exist, so IEEE published a standard Precision Time Protocol IEEE 1588 or "PTP", which allows sub-microsecond synchronization of clocks. PTP is formulated generally, so concrete applications need a stricter profile. In particular, PTP does not specify how the clocks should operate when the network is duplicated for better resilience to failures.

Deterministic Networking (DetNet) is an effort by the IETF DetNet Working Group to study implementation of deterministic data paths for real-time applications with extremely low data loss rates, packet delay variation (jitter), and bounded latency, such as audio and video streaming, industrial automation, and vehicle control.

References

  1. IEC 62439-3 Clause 4
  2. International Electrotechnical Commission IEC 62439-3:2016 Industrial communication networks - High availability automation networks - Part 3: Parallel Redundancy Protocol (PRP) and High-availability Seamless Redundancy (HSR)
  3. Kirrmann, Hubert; Dzung, Dacfey. Selecting a Standard Redundancy Method for Highly Available Industrial Networks , 2006 IEEE International Workshop on Factory Communication Systems, June 27, 2006 Page(s):386 – 390
  4. Kirrmann, Hubert Seamless redundancy - bumpless Ethernet redundancy for substations with IEC 61850 ABB review special report, 2013
  5. Pustylnik, Michael; Zafirovic-Vukotic, Mira; Moore, Roger Performance Of The Rapid Spanning Tree Protocol In Ring Network Topology , RUGGEDCOM
  6. Allawi, Saad; Rhee Jong Myung Improvement of high-availability seamless redundancy (HSR) traffic performance for smart grid communications , Journal of Communications and Networks ( Volume: 14, Issue: 6, Dec. 2012 )
  7. Altaha, Ibraheem; Rhee Jong Myung , Journal of Communications and Networks ( Volume: E98, No.:9 Sept. 2015 )
  8. SoCe HSR-PRP Switch IP Core
  9. Kirrmann, H.; Dickerson W. IEC IEEE Precision Time Protocol Archived 2017-02-05 at the Wayback Machine , Pacworld, September 2016

Historical Note

Originally, the protocol was named HASAR for the initial of the five companies working for electrical utilities that created it (Hirschmann, ABB, Siemens, Alstom and RuggedCom). Marketing renamed it HSR, for "High-availability Seamless Ring", but HSR is not limited to a simple ring topology.

Further reading