4B3T

Last updated

4B3T, which stands for 4 (four) binary 3 (three) ternary, is a line encoding scheme used for ISDN PRI interface. 4B3T represents four binary bits using three pulses.

Contents

Description

It uses three output levels:

This means we have 24 = 16 input combinations to represent, using 33 = 27 output combinations. 000 is not used to avoid long periods without a transition. 4B3T uses a paired disparity code to achieve an overall zero DC bias: six triplets are used which have no DC component (0+−, 0−+, +0−, −0+, +−0, −+0), and the remaining 20 are grouped into 10 pairs with differing disparity (e.g. ++− and −−+). When transmitting, the DC bias is tracked and a combination chosen that has a DC component of the opposite sign to the running total.

This mapping from 4 bits to three ternary states is given in a table known as Modified Monitoring State 43 (MMS43). A competing encoding technique, used for the ISDN basic rate interface where 4B3T is not used, is 2B1Q.

The sync sequence used is the 11-symbol Barker code, +++−−−+−−+− or its reverse, −+−−+−−−+++.

Encoding table

Each 4-bit input group is encoded as a 3-symbol group (transmitted left to right) from the following table. Encoding requires keeping track of the accumulated DC offset, the number of + pulses minus the number of − pulses in all preceding groups. The starting value is arbitrary; here we use the values 1 through 4, although −1.5, −0.5, +0.5 and +1.5 is another possibility.

MMS 43 coding table [1]
InputAccumulated DC offset
HexBinary1234
00000+ 0 +   (+2)0−0 (−1)
100010 − +   (+0)
20010+ − 0   (+0)
300110 0 +   (+1)− − 0   (−2)
40100− + 0   (+0)
501010 + +   (+2)− 0 0   (−1)
60110− + +   (+1)− − +   (−1)
70111− 0 +   (+0)
81000+ 0 0   (+1)0 − −   (−2)
91001+ − +   (+1)− − −   (−3)
A1010+ + −   (+1)+ − −   (−1)
B1011+ 0 −   (+0)
C1100+ + +   (+3)− + −   (−1)
D11010 + 0   (+1)− 0 −   (−2)
E11100 + −   (+0)
F1111+ + 0   (+2)0 0 −   (−1)

This code forces a transition after at most five consecutive identical non-zero symbols, or four consecutive zero symbols.

Decoding table

Decoding is simpler, as the decoder does not need to keep track of the encoder state, although doing so allows greater error detection. The 000 triplet is not a legal encoded sequence, but is typically decoded as binary 0000.

TernaryBinaryHexTernaryBinaryHexTernaryBinaryHex
0 0 0− 0 001015+ − −1010A
+ 0 +00000− + +01106+ 0 −1011B
0 − 000000− − +01106+ + +1100C
0 − +00011− 0 +01117− + −1100C
+ − 000102+ 0 0100080 + 01101D
0 0 +001130 − −10008− 0 −1101D
− − 000113+ − +100190 + −1110E
− + 001004− − −10019+ + 01111F
0 + +01015+ + −1010A0 0 −1111F

8B/6T

A similar paired-disparity code was used in the short-lived 100BASE-T4 Fast Ethernet standard. This achieved the same ratio of 4/3 bits per baud while needing to keep track of only two levels of accumulated DC offset by using a larger 256-entry coding table. 6 ternary symbols allow 140 balanced codes (30 permutations of +0000−, 90 permutations of ++00−−, and 20 permutations of +++−−−), and 126 codes with disparity +1 (6 permutations of +00000, 60 permutations of ++000−, and 60 permutations of +++0−−). The codes with disparity +1 are inverted whenever the running disparity is high. From the total of 266 code words, 256 are used as follows:

8B/6T coding table [2]
8B6T8B6T8B6T8B6T8B6T8B6T8B6T8B6T
00+−00+−2000−++−40+0+00−600−0++080+−+00−A00−0++−C0+−+0+−E0+−0++−
010+−+−021−−+00+41++00−06100−+0+81++−0−0A100−+−+C1++−+−0E10+−+−+
02+−0+−022++−0+−42+0+0−0620−0+0+82+−+0−0A20−0+−+C2+−++−0E2+−0+−+
03−0++−023++−0−+430++0−063−00+0+83−++0−0A3−00+−+C3−+++−0E3−0++−+
04−0+0+−2400+0−+440++00−64−00++084−++00−A4−00++−C4−++0+−E4−0+++−
050+−−0+2500+0+−45++0−006500−0++85++−−00A500−−++C5++−−0+E50+−−++
06+−0−0+2600−00+46+0+−00660−00++86+−+−00A60−0−++C6+−+−0+E6+−0−++
07−0+−0+27−−+++−470++−0067−000++87−++−00A7−00−++C7−++−0+E7−0+−++
08−+00+−28−0−++048000+0068−+−++0880+000−A8−+−++−C80+00+−E8−+0++−
090−++−029−−0+0+49000−++69−−++0+8900+0−0A9−−++−+C900++−0E90−++−+
0A−+0+−02A−0−+0+4A000+−+6A−+−+0+8A0+00−0AA−+−+−+CA0+0+−0EA−+0+−+
0B+0−+−02B0−−+0+4B000++−6B+−−+0+8B+000−0AB+−−+−+CB+00+−0EB+0−+−+
0C+0−0+−2C0−−++04C000−+06C+−−++08C+0000−AC+−−++−CC+000+−EC+0−++−
0D0−+−0+2D−−00++4D000−0+6D−−+0++8D00+−00AD−−+−++CD00+−0+ED0−+−++
0E−+0−0+2E−0−0++4E000+−06E−+−0++8E0+0−00AE−+−−++CE0+0−0+EE−+0−++
0F+0−−0+2F0−−0++4F000+0−6F+−−0++8F+00−00AF+−−−++CF+00−0+EF+0−−++
10+0+−−030+−00−+50+0+−−+70−++00090+−+−−+B00−000+D0+−+0−+F0+−000+
11++0−0−310+−−+051++0−+−71+−+00091++−−+−B100−0+0D1++−−+0F10+−0+0
12+0+−0−32+−0−+052+0+−+−72++−00092+−+−+−B20−00+0D2+−+−+0F2+−00+0
130++−0−33−0+−+0530++−+−7300+00093−++−+−B3−000+0D3−++−+0F3−0+0+0
140++−−034−0+0−+540++−−+74−0+00094−++−−+B4−0000+D4−++0−+F4−0+00+
15++00−−350+−+0−55++0+−−750−+00095++−+−−B500−+00D5++−+0−F50+−+00
16+0+0−−36+−0+0−56+0++−−76+0−00096+−++−−B60−0+00D6+−++0−F6+−0+00
170++0−−37−0++0−570+++−−770+−00097−+++−−B7−00+00D7−+++0−F7−0++00
180+−0+−38−+00−+58+++0−−780−−+++980+0−−+B8−+−00+D80+00−+F8−+000+
190+−0−+390−+−+059+++−0−79−0−+++9900+−+−B9−−+0+0D900+−+0F90−+0+0
1A0+−++−3A−+0−+05A+++−−07A−−0+++9A0+0−+−BA−+−0+0DA0+0−+0FA−+00+0
1B0+−00+3B+0−−+05B++0−−07B−−0++09B+00−+−BB+−−0+0DB+00−+0FB+0−0+0
1C0−+00+3C+0−0−+5C++0−−+7C++−00−9C+00−−+BC+−−00+DC+000−+FC+0−00+
1D0−+++−3D0−++0−5D++000−7D00+00−9D00++−−BD−−++00DD00++0−FD0−++00
1E0−+0−+3E−+0+0−5E−−+++07E++−−−+9E0+0+−−BE−+−+00DE0+0+0−FE−+0+00
1F0−+0+−3F+0−+0−5F00−++07F00+−−+9F+00+−−BF+−−+00DF+00+0−FF+0−+00
Unused+++−−−+−0000−+0000+000000+0000
−−−+++0000−+0000+−00000+0000+0

See also

Other line codes that have 3 states:

Related Research Articles

In mathematics and computing, the hexadecimal numeral system is a positional numeral system that represents numbers using a radix (base) of sixteen. Unlike the decimal system representing numbers using ten symbols, hexadecimal uses sixteen distinct symbols, most often the symbols "0"–"9" to represent values 0 to 9, and "A"–"F" to represent values from ten to fifteen.

<span class="mw-page-title-main">Line code</span> Pattern used within a communications system to represent digital data

In telecommunication, a line code is a pattern of voltage, current, or photons used to represent digital data transmitted down a communication channel or written to a storage medium. This repertoire of signals is usually called a constrained code in data storage systems. Some signals are more prone to error than others as the physics of the communication channel or storage medium constrains the repertoire of signals that can be used reliably.

Modified AMI codes are a digital telecommunications technique to maintain system synchronization. Alternate mark inversion (AMI) line codes are modified by deliberate insertion of bipolar violations. There are several types of modified AMI codes, used in various T-carrier and E-carrier systems.

<span class="mw-page-title-main">Non-return-to-zero</span> Telecommunication coding technique

In telecommunication, a non-return-to-zero (NRZ) line code is a binary code in which ones are represented by one significant condition, usually a positive voltage, while zeros are represented by some other significant condition, usually a negative voltage, with no other neutral or rest condition.

In telecommunication, a ternary signal is a signal that can assume, at any given instant, one of three states or significant conditions, such as power level, phase position, pulse duration, or frequency.

The U interface or U reference point is a Basic Rate Interface (BRI) in the local loop of an Integrated Services Digital Network (ISDN), connecting the network terminator (NT1/2) on the customer's premises to the line termination (LT) in the carrier's local exchange, in other words providing the connection from subscriber to central office.

Two-binary, one-quaternary (2B1Q) is a line code used in the U interface of the Integrated Services Digital Network (ISDN) Basic Rate Interface (BRI) and the high-bit-rate digital subscriber line (HDSL). 2B1Q is a four-level pulse-amplitude modulation (PAM-4) scheme without redundancy, mapping two bits (2B) into one quaternary symbol (1Q). Symbol rate is half of data rate.

Truncated binary encoding is an entropy encoding typically used for uniform probability distributions with a finite alphabet. It is parameterized by an alphabet with total size of number n. It is a slightly more general form of binary encoding when n is not a power of two.

UTF-7 is an obsolete variable-length character encoding for representing Unicode text using a stream of ASCII characters. It was originally intended to provide a means of encoding Unicode text for use in Internet E-mail messages that was more efficient than the combination of UTF-8 with quoted-printable.

<span class="mw-page-title-main">Return-to-zero</span> Line code in which the signal value returns to zero between each pulse

Return-to-zero describes a line code used in telecommunications signals in which the signal drops (returns) to zero between each pulse. This takes place even if a number of consecutive 0s or 1s occur in the signal. The signal is self-clocking. This means that a separate clock does not need to be sent alongside the signal, but suffers from using twice the bandwidth to achieve the same data-rate as compared to non-return-to-zero format.

In telecommunications, 8b/10b is a line code that maps 8-bit words to 10-bit symbols to achieve DC balance and bounded disparity, and at the same time provide enough state changes to allow reasonable clock recovery. This means that the difference between the counts of ones and zeros in a string of at least 20 bits is no more than two, and that there are not more than five ones or zeros in a row. This helps to reduce the demand for the lower bandwidth limit of the channel necessary to transfer the signal.

<span class="mw-page-title-main">Transition-minimized differential signaling</span> Digital serial communication standard

Transition-minimized differential signaling (TMDS) is a technology for transmitting high-speed serial data used by the DVI and HDMI video interfaces, as well as by other digital communication interfaces.

<span class="mw-page-title-main">Bipolar encoding</span>

In telecommunication, bipolar encoding is a type of return-to-zero (RZ) line code, where two nonzero values are used, so that the three values are +, −, and zero. Such a signal is called a duobinary signal. Standard bipolar encodings are designed to be DC-balanced, spending equal amounts of time in the + and − states.

de Bruijn sequence Cycle through all length-k sequences

In combinatorial mathematics, a de Bruijn sequence of order n on a size-k alphabet A is a cyclic sequence in which every possible length-n string on A occurs exactly once as a substring (i.e., as a contiguous subsequence). Such a sequence is denoted by B(k, n) and has length kn, which is also the number of distinct strings of length n on A. Each of these distinct strings, when taken as a substring of B(k, n), must start at a different position, because substrings starting at the same position are not distinct. Therefore, B(k, n) must have at leastkn symbols. And since B(k, n) has exactlykn symbols, de Bruijn sequences are optimally short with respect to the property of containing every string of length n at least once.

In a digitally modulated signal or a line code, symbol rate, modulation rate or baud rate is the number of symbol changes, waveform changes, or signaling events across the transmission medium per unit of time. The symbol rate is measured in baud (Bd) or symbols per second. In the case of a line code, the symbol rate is the pulse rate in pulses per second. Each symbol can represent or convey one or several bits of data. The symbol rate is related to the gross bit rate, expressed in bits per second.

In coding theory, a constant-weight code, also called an m-of-n code, is an error detection and correction code where all codewords share the same Hamming weight. The one-hot code and the balanced code are two widely used kinds of constant-weight code.

In telecommunications, the hybrid (H-) ternary line code is a line code that operates on a hybrid principle combining the binary non-return-to-zero-level (NRZL) and the polar return-to-zero (RZ) codes.

Single-precision floating-point format is a computer number format, usually occupying 32 bits in computer memory; it represents a wide dynamic range of numeric values by using a floating radix point.

Pulse-code modulation (PCM) is a method used to digitally represent sampled analog signals. It is the standard form of digital audio in computers, compact discs, digital telephony and other digital audio applications. In a PCM stream, the amplitude of the analog signal is sampled at uniform intervals, and each sample is quantized to the nearest value within a range of digital steps.

Asymmetric numeral systems (ANS) is a family of entropy encoding methods introduced by Jarosław (Jarek) Duda from Jagiellonian University, used in data compression since 2014 due to improved performance compared to previous methods. ANS combines the compression ratio of arithmetic coding, with a processing cost similar to that of Huffman coding. In the tabled ANS (tANS) variant, this is achieved by constructing a finite-state machine to operate on a large alphabet without using multiplication.

References

  1. "Wired Communications T-SMINTO 4B3T Second Gen. Modular ISDN NT (Ordinary)" (PDF) (Data sheet). Version 1.1. Infineon. November 2001. PEF 80902.
  2. Feit, Sidnie (June 19, 2000). "Appendix B.2: 8B/6T Tables". Local Area High Speed Networks. New Riders Publishing. ISBN   1-57870-113-9.