Disjunct matrix

In mathematics, a logical matrix may be described as d-disjunct and/or d-separable. These concepts play a pivotal role in the mathematical area of non-adaptive group testing.

In the mathematical literature, d-disjunct matrices may also be called superimposed codes [1] or d-cover-free families. [2]

According to Chen and Hwang (2006), [3]

- A matrix is said to be d-separable if no two distinct sets of d columns have the same boolean sum.
- A matrix is said to be d̄-separable if no two distinct sets of d-or-fewer columns have the same boolean sum.
- A matrix is said to be d-disjunct if no set of d columns has a boolean sum which is a superset of any other single column.

The following relationships are "well-known": [3]

- Every d̄-separable matrix is also d-separable.
- Every d-disjunct matrix is also d̄-separable.
- Every d̄-separable matrix is also (d−1)-disjunct.
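
These definitions can be checked directly by brute force. The following sketch (Python; the function names are illustrative, and the search is exponential in d, so it is practical only for small examples) implements the three properties exactly as stated above, with a matrix stored as a list of 0/1 rows:

    from itertools import combinations

    def boolean_sum(matrix, cols):
        """Bitwise OR of the selected columns."""
        t = len(matrix)  # number of rows (tests)
        out = [0] * t
        for j in cols:
            for i in range(t):
                out[i] |= matrix[i][j]
        return tuple(out)

    def is_d_separable(matrix, d):
        """No two distinct sets of exactly d columns share a boolean sum."""
        n = len(matrix[0])
        seen = set()
        for cols in combinations(range(n), d):
            s = boolean_sum(matrix, cols)
            if s in seen:
                return False
            seen.add(s)
        return True

    def is_d_bar_separable(matrix, d):
        """No two distinct sets of d-or-fewer columns share a boolean sum.

        k = 0 includes the empty set, whose sum is all-zero."""
        n = len(matrix[0])
        seen = set()
        for k in range(d + 1):
            for cols in combinations(range(n), k):
                s = boolean_sum(matrix, cols)
                if s in seen:
                    return False
                seen.add(s)
        return True

    def is_d_disjunct(matrix, d):
        """No boolean sum of d columns is a superset of another column."""
        n = len(matrix[0])
        for cols in combinations(range(n), d):
            s = boolean_sum(matrix, cols)
            for c in range(n):
                if c in cols:
                    continue
                col = boolean_sum(matrix, (c,))
                if all(s[i] >= col[i] for i in range(len(s))):
                    return False
        return True

For example, is_d_bar_separable(matrix, 3) performs the same enumeration of 3-or-fewer column sums that is carried out by hand in the second example below.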

Concrete examples

The following matrix is 2-separable, because each pair of columns has a distinct boolean sum. For example, the boolean sum (that is, the bitwise OR) of the first two columns is not attainable as the sum of any other pair of columns in the matrix.

However, this matrix is not 3-separable, because the sum of columns 1, 2, and 3 equals the sum of columns 1, 4, and 5.

This matrix is also not 2̄-separable, because the sum of columns 1 and 8 equals the sum of column 1 alone. In fact, no matrix with an all-zero column can possibly be d̄-separable for any d.

The following matrix is 3̄-separable (and thus 2-disjunct) but not 3-disjunct:

1 0 0 1
1 0 1 0
0 1 1 0
0 1 0 0
0 0 1 0
0 0 0 1

There are 15 possible ways to choose 3-or-fewer columns from this matrix, and each choice leads to a different boolean sum:

columns    boolean sum    columns    boolean sum
none       000000         2,3        011110
1          110000         2,4        101101
2          001100         3,4        111011
3          011010         1,2,3      111110
4          100001         1,2,4      111101
1,2        111100         1,3,4      111011
1,3        111010         2,3,4      111111
1,4        110001

However, the sum of columns 2, 3, and 4 (namely 111111) is a superset of column 1 (namely 110000), which means that this matrix is not 3-disjunct.
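
The table above can be reproduced mechanically. A small illustrative sketch (Python), with the columns of the example matrix written as 6-bit strings, prints every boolean sum of 3-or-fewer columns (in a slightly different order than the table):

    from itertools import combinations

    # Columns of the example matrix, as 6-bit strings (top row first).
    cols = ["110000", "001100", "011010", "100001"]

    for k in range(4):  # subset sizes 0, 1, 2, 3
        for subset in combinations(range(4), k):
            bits = ["0"] * 6
            for j in subset:
                for i, b in enumerate(cols[j]):
                    if b == "1":
                        bits[i] = "1"
            label = ",".join(str(j + 1) for j in subset) or "none"
            print(label, "".join(bits))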

Application of d-separability to group testing

The non-adaptive group testing problem postulates that we have a test which can tell us, for any set of items, whether that set contains a defective item. We are asked to come up with a series of groupings that can exactly identify all the defective items in a batch of n total items, some d of which are defective.

A d-separable matrix with t rows and n columns concisely describes how to use t tests to find the defective items in a batch of n, where the number of defective items is known to be exactly d.

A d-disjunct matrix (or, more generally, any d̄-separable matrix) with t rows and n columns concisely describes how to use t tests to find the defective items in a batch of n, where the number of defective items is known to be no more than d.
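
Concretely, row i of the matrix specifies which items are pooled into test i, and a test is positive exactly when its pool contains at least one defective item, so the vector of test outcomes is the boolean sum of the defective items' columns. A minimal sketch of this correspondence (Python; the function name is illustrative):

    def run_tests(matrix, defectives):
        """Simulate one non-adaptive round of group testing.

        Row i of the 0/1 matrix lists which items are pooled into
        test i; the outcome vector is the boolean sum (bitwise OR)
        of the defective items' columns."""
        return [int(any(row[j] for j in defectives)) for row in matrix]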

Practical concerns and published results

For a given n and d, the number of rows t in the smallest d-separable matrix may (according to current knowledge) be smaller than the number of rows t in the smallest d-disjunct matrix, but asymptotically the two are within a constant factor of each other. [3] Additionally, if the matrix is to be used for practical testing, some algorithm is needed that can "decode" a test result (that is, a boolean sum) back into the indices of the defective items (that is, the unique set of columns that produce that boolean sum). For arbitrary d-disjunct matrices, polynomial-time decoding algorithms are known; the naïve algorithm runs in O(nt) time. [4] For arbitrary d-separable but non-d-disjunct matrices, the best known decoding algorithms take exponential time. [3]
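
For d-disjunct matrices the naïve decoder is especially simple: report item j as defective exactly when every test containing j is positive. A sketch of this O(nt) procedure (Python; names illustrative):

    def naive_decode(matrix, outcomes):
        """Naive O(n*t) decoding for a d-disjunct test matrix.

        Item j is reported defective iff no negative test contains it."""
        n = len(matrix[0])
        return [j for j in range(n)
                if all(outcomes[i] for i, row in enumerate(matrix) if row[j])]

Correctness rests on d-disjunctness: a non-defective column is, by definition, not covered by the boolean sum of the at-most-d defective columns, so it appears in at least one negative test and is eliminated.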

Porat and Rothschild (2008) present a deterministic O(nt)-time algorithm for constructing a d-disjunct matrix with n columns and t = O(d² log n) rows. [5]
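
For intuition about that row count, here is a standard randomized sketch, and emphatically not the deterministic Porat–Rothschild construction: if every entry is 1 independently with probability roughly 1/(d+1), a union-bound argument shows that O(d² log n) rows suffice for the matrix to be d-disjunct with high probability. The constant factor c below is an illustrative assumption:

    import random
    from math import ceil, log

    def random_candidate_matrix(n, d, c=4, seed=None):
        """Randomized sketch only: each entry is 1 with probability
        1/(d+1); with t = c*d*d*ln(n) rows, a union bound over
        (column, d-set) pairs makes the matrix d-disjunct with high
        probability. The construction in [5] is deterministic."""
        rng = random.Random(seed)
        t = ceil(c * d * d * log(n))
        p = 1.0 / (d + 1)
        return [[int(rng.random() < p) for _ in range(n)] for _ in range(t)]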

References

  1. De Bonis, Annalisa; Vaccaro, Ugo (2003). "Constructions of generalized superimposed codes with applications to group testing and conflict resolution in multiple access channels". Theoretical Computer Science. 306 (1–3): 223–243. doi:10.1016/S0304-3975(03)00281-0. MR 2000175.
  2. Erdős, Paul; Frankl, Péter; Füredi, Zoltán (1985). "Families of finite sets in which no set is covered by the union of r others". Israel Journal of Mathematics. 51 (1–2): 79–89. doi:10.1007/BF02772959. ISSN 0021-2172.
  3. Chen, Hong-Bin; Hwang, Frank (2006-12-21). "Exploring the missing link among d-separable, d̄-separable and d-disjunct matrices". Discrete Applied Mathematics. 155 (5): 662–664. CiteSeerX 10.1.1.848.5161. doi:10.1016/j.dam.2006.10.009. MR 2303978.
  4. Indyk, Piotr; Ngo, Hung Q.; Rudra, Atri (2010). "Efficiently Decodable Non-adaptive Group Testing". Proceedings of the 21st ACM-SIAM Symposium on Discrete Algorithms (SODA). hdl:1721.1/63167. ISSN 1071-9040.
  5. Porat, Ely; Rothschild, Amir (2008). "Explicit Non-Adaptive Combinatorial Group Testing Schemes". Proceedings of the 35th International Colloquium on Automata, Languages and Programming (ICALP): 748–759. arXiv:0712.3876. Bibcode:2007arXiv0712.3876P.
