Covering code

In coding theory, a covering code is a set of elements (called codewords) in a space, with the property that every element of the space is within a fixed distance of some codeword.

Definition

Let $q \ge 2$, $n \ge 1$, $R \ge 0$ be integers. A code $C \subseteq Q^n$ over an alphabet Q of size |Q| = q is called a q-ary R-covering code of length n if for every word $y \in Q^n$ there is a codeword $x \in C$ such that the Hamming distance $d(x, y) \le R$. In other words, the spheres (or balls or rook-domains) of radius R with respect to the Hamming metric around the codewords of C have to exhaust the finite metric space $Q^n$. The covering radius of a code C is the smallest R such that C is R-covering. Every perfect code is a covering code of minimal size.
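
To make the definition concrete, here is a minimal Python sketch (ours, not from the source; the function names are illustrative) that computes the covering radius of a code by brute force over all q-ary words of length n:

    from itertools import product

    def hamming(x, y):
        # Number of positions in which the words x and y differ.
        return sum(a != b for a, b in zip(x, y))

    def covering_radius(code, q, n):
        # Smallest R such that every word in Q^n lies within
        # Hamming distance R of some codeword.
        return max(
            min(hamming(word, c) for c in code)
            for word in product(range(q), repeat=n)
        )

    # The binary repetition code {000, 111} has covering radius 1:
    # every length-3 word differs from 000 or 111 in at most one place.
    print(covering_radius([(0, 0, 0), (1, 1, 1)], q=2, n=3))  # prints 1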

Example

C = {0134,0223,1402,1431,1444,2123,2234,3002,3310,4010,4341} is a 5-ary 2-covering code of length 4. [1]
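
As a sanity check, the following brute-force script (a self-contained sketch in the same style as above) confirms that every word in {0,...,4}^4 lies within Hamming distance 2 of one of the 11 codewords:

    from itertools import product

    C = ["0134", "0223", "1402", "1431", "1444", "2123",
         "2234", "3002", "3310", "4010", "4341"]

    def hamming(x, y):
        return sum(a != b for a, b in zip(x, y))

    # Covering radius of C over the alphabet {0,...,4}: the largest
    # distance from any word of length 4 to its nearest codeword.
    radius = max(
        min(hamming(word, c) for c in C)
        for word in ("".join(w) for w in product("01234", repeat=4))
    )
    print(radius)  # prints 2: every word is within distance 2 of C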

Covering problem

The determination of the minimal size $K_q(n, R)$ of a q-ary R-covering code of length n is a very hard problem. In many cases, only upper and lower bounds are known, with a large gap between them. Every construction of a covering code gives an upper bound on $K_q(n, R)$. Lower bounds include the sphere-covering bound $K_q(n, R) \ge q^n / \sum_{i=0}^{R} \binom{n}{i} (q-1)^i$ and Rodemich's bounds $K_q(n, 1) \ge q^{n-1}/(n-1)$ and $K_q(n, n-2) \ge q^2/(n-1)$. [2] The covering problem is closely related to the packing problem in $Q^n$, i.e. the determination of the maximal size of a q-ary e-error-correcting code of length n.
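
For instance, the sphere-covering bound can be evaluated directly; the sketch below (our illustrative function names, using exact integer arithmetic for the ceiling division) reproduces the values relevant to the examples in this article:

    from math import comb

    def ball_volume(q, n, R):
        # Number of q-ary words within Hamming distance R of a fixed word.
        return sum(comb(n, i) * (q - 1) ** i for i in range(R + 1))

    def sphere_covering_bound(q, n, R):
        # K_q(n, R) >= q^n / V_q(n, R): the balls of radius R around
        # the codewords must together cover all q**n words.
        vol = ball_volume(q, n, R)
        return (q ** n + vol - 1) // vol

    print(sphere_covering_bound(5, 4, 2))   # prints 6 (cf. |C| = 11 above)
    print(sphere_covering_bound(3, 13, 1))  # prints 59049, met with equality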

Football pools problem

A particular case is the football pools problem, based on football pool betting, where the aim is to come up with a betting system over n football matches that, regardless of the outcome, has at most R 'misses'. Thus, for n matches with at most one 'miss', a ternary 1-covering code of minimal size $K_3(n, 1)$ is sought.

If $n = (3^k - 1)/2$ for an integer k, then $3^{n-k}$ bets are needed, so for n = 4, k = 2, 9 bets are needed; for n = 13, k = 3, 59049 bets are needed. [3] These are the lengths of the perfect ternary Hamming codes (see the sketch after the table below). The best bounds known as of 2011 [4] are

 n   K3(n,1)          K3(n,2)        K3(n,3)
 1   1                1              1
 2   3                1              1
 3   5                3              1
 4   9                3              3
 5   27               8              3
 6   71-73            15-17          6
 7   156-186          26-34          11-12
 8   402-486          54-81          14-27
 9   1060-1269        130-219        27-54
10   2854-3645        323-555        57-105
11   7832-9477        729            117-243
12   21531-27702      1919-2187      282-657
13   59049            5062-6561      612-1215
14   166610-177147    12204-19683    1553-2187
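
The perfect cases in the first column of bounds can be checked with a few lines of Python (a sketch of ours, not from the source):

    # For n = (3**k - 1) // 2 the ternary Hamming code is perfect:
    # 3**(n - k) bets cover every outcome with at most one miss (R = 1).
    for k in range(2, 4):
        n = (3 ** k - 1) // 2
        bets = 3 ** (n - k)
        # Each bet covers 1 + 2*n outcomes: itself, plus n positions
        # that can each be wrong in 2 ways. "Perfect" means exact cover.
        assert bets * (1 + 2 * n) == 3 ** n
        print(f"n = {n:2d}: {bets} bets")  # n = 4: 9, n = 13: 59049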

Applications

The standard work [5] on covering codes lists a number of applications, among them the football pools problem described above. [6]

Related Research Articles

Hamming distance

In information theory, the Hamming distance between two strings of equal length is the number of positions at which the corresponding symbols are different. In other words, it measures the minimum number of substitutions required to change one string into the other, or the minimum number of errors that could have transformed one string into the other. In a more general context, the Hamming distance is one of several string metrics for measuring the edit distance between two sequences. It is named after the American mathematician Richard Hamming.
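
As a concrete illustration, the Hamming distance of two equal-length strings is a one-liner in Python (a minimal sketch with arbitrary example strings):

    def hamming_distance(s, t):
        # Number of positions at which equal-length strings s and t differ.
        if len(s) != len(t):
            raise ValueError("strings must have equal length")
        return sum(a != b for a, b in zip(s, t))

    print(hamming_distance("karolin", "kathrin"))  # prints 3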

Reed–Solomon codes are a group of error-correcting codes that were introduced by Irving S. Reed and Gustave Solomon in 1960. They have many applications, the most prominent of which include consumer technologies such as MiniDiscs, CDs, DVDs, Blu-ray discs, and QR codes; data transmission technologies such as DSL and WiMAX; broadcast systems such as satellite communications, DVB and ATSC; and storage systems such as RAID 6.

In coding theory, block codes are a large and important family of error-correcting codes that encode data in blocks. There is a vast number of examples for block codes, many of which have a wide range of practical applications. The abstract definition of block codes is conceptually useful because it allows coding theorists, mathematicians, and computer scientists to study the limitations of all block codes in a unified way. Such limitations often take the form of bounds that relate different parameters of the block code to each other, such as its rate and its ability to detect and correct errors.

In coding theory, a linear code is an error-correcting code for which any linear combination of codewords is also a codeword. Linear codes are traditionally partitioned into block codes and convolutional codes, although turbo codes can be seen as a hybrid of these two types. Linear codes allow for more efficient encoding and decoding algorithms than other codes.

In coding theory, the Singleton bound, named after Richard Collom Singleton, is a relatively crude upper bound on the size of an arbitrary block code with block length n, size M, and minimum distance d. It is also known as the Joshi bound, proved by Joshi (1958) and even earlier by Komamiya (1953).

In mathematics and computer science, in the field of coding theory, the Hamming bound is a limit on the parameters of an arbitrary block code: it is also known as the sphere-packing bound or the volume bound from an interpretation in terms of packing balls in the Hamming metric into the space of all possible words. It gives an important limitation on the efficiency with which any error-correcting code can utilize the space in which its code words are embedded. A code that attains the Hamming bound is said to be a perfect code.

In coding theory, the Gilbert–Varshamov bound is a limit on the parameters of a code. It is occasionally known as the Gilbert–Shannon–Varshamov bound, but the name "Gilbert–Varshamov bound" is by far the most popular. Varshamov proved this bound by using the probabilistic method for linear codes. For more about that proof, see Gilbert–Varshamov bound for linear codes.

In information theory, the noisy-channel coding theorem establishes that for any given degree of noise contamination of a communication channel, it is possible to communicate discrete data nearly error-free up to a computable maximum rate through the channel. This result was presented by Claude Shannon in 1948 and was based in part on earlier work and ideas of Harry Nyquist and Ralph Hartley.

Hamming space

In statistics and coding theory, a Hamming space is usually the set of all binary strings of length N. It is used in the theory of coding signals and transmission.

Cyclic code

In coding theory, a cyclic code is a block code in which the circular shifts of each codeword give another word that belongs to the code. They are error-correcting codes that have algebraic properties that are convenient for efficient error detection and correction.

In coding theory, concatenated codes form a class of error-correcting codes that are derived by combining an inner code and an outer code. They were conceived in 1966 by Dave Forney as a solution to the problem of finding a code that has both exponentially decreasing error probability with increasing block length and polynomial-time decoding complexity. Concatenated codes became widely used in space communications in the 1970s.

In coding theory, list decoding is an alternative to unique decoding of error-correcting codes for large error rates. The notion was proposed by Elias in the 1950s. The main idea behind list decoding is that the decoding algorithm, instead of outputting a single possible message, outputs a list of possibilities, one of which is correct. This allows for handling a greater number of errors than that allowed by unique decoding.

A locally testable code is a type of error-correcting code for which it can be determined if a string is a word in that code by looking at a small number of bits of the string. In some situations, it is useful to know if the data is corrupted without decoding all of it so that appropriate action can be taken in response. For example, in communication, if the receiver encounters a corrupted code, it can request the data be re-sent, which could increase the accuracy of said data. Similarly, in data storage, these codes can allow for damaged data to be recovered and rewritten properly.

A locally decodable code (LDC) is an error-correcting code that allows a single bit of the original message to be decoded with high probability by only examining a small number of bits of a possibly corrupted codeword. This property could be useful, say, in a context where information is being transmitted over a noisy channel, and only a small subset of the data is required at a particular time and there is no need to decode the entire message at once. Note that locally decodable codes are not a subset of locally testable codes, though there is some overlap between the two.

DNA code construction refers to the application of coding theory to the design of nucleic acid systems for the field of DNA-based computation.

The Elias–Bassalygo bound is a mathematical limit used in coding theory for error correction during data transmission or communications.

In coding theory, the Wozencraft ensemble is a set of linear codes in which most of the codes satisfy the Gilbert–Varshamov bound. It is named after John Wozencraft, who proved its existence. The ensemble is described by Massey (1963), who attributes it to Wozencraft. Justesen (1972) used the Wozencraft ensemble as the inner codes in his construction of strongly explicit asymptotically good codes.

The Gilbert–Varshamov bound for linear codes is related to the general Gilbert–Varshamov bound, which gives a lower bound on the maximal number of elements in an error-correcting code of a given block length and minimum Hamming weight over a field $\mathbb{F}_q$. This may be translated into a statement about the maximum rate of a code with given length and minimum distance. The Gilbert–Varshamov bound for linear codes asserts the existence of q-ary linear codes for any relative minimum distance less than the given bound that simultaneously have high rate. The existence proof uses the probabilistic method, and thus is not constructive. The Gilbert–Varshamov bound is the best known in terms of relative distance for codes over alphabets of size less than 49. For larger alphabets, Goppa codes sometimes achieve an asymptotically better rate vs. distance tradeoff than is given by the Gilbert–Varshamov bound.

In coding theory, the Zyablov bound is a lower bound on the rate and relative distance that are achievable by concatenated codes.

In coding theory, folded Reed–Solomon codes are like Reed–Solomon codes, obtained by mapping Reed–Solomon codewords over a larger alphabet through careful bundling of codeword symbols.

References

  1. P.R.J. Östergård, Upper bounds for q-ary covering codes, IEEE Transactions on Information Theory, 37 (1991), 660-664.
  2. E.R. Rodemich, Covering by rook-domains, Journal of Combinatorial Theory, 9 (1970), 117-128.
  3. http://alexandria.tue.nl/repository/freearticles/593454.pdf
  4. http://www.sztaki.hu/~keri/codes/3_tables.pdf
  5. G. Cohen, I. Honkala, S. Litsyn, A. Lobstein, Covering Codes, Elsevier (1997), ISBN 0-444-82511-8.
  6. H. Hämäläinen, I. Honkala, S. Litsyn, P.R.J. Östergård, Football pools - a game for mathematicians, American Mathematical Monthly, 102 (1995), 579-588.