3-partition problem

Last updated July 15, 2024

The 3-partition problem is a strongly NP-complete problem in computer science. The problem is to decide whether a given multiset of integers can be partitioned into triplets that all have the same sum. More precisely:

Input: a multiset S containing n positive integer elements.
Conditions: S must be partitionable into m triplets, S₁, S₂, …, S_m, where n = 3m. These triplets partition S in the sense that they are disjoint and they cover S. The target value T is computed by taking the sum of all elements in S, then divided by m.
Output: whether or not there exists a partition of S such that, for all triplets, the sum of the elements in each triplet equals T.

The 3-partition problem remains strongly NP-complete under the restriction that every integer in S is strictly between T/4 and T/2.

Example

The set $S=\{20,23,25,30,49,45,27,30,30,40,22,19\}$ can be partitioned into the four sets $\{20,25,45\},\{23,27,40\},\{49,22,19\},\{30,30,30\}$ , each of which sums to T = 90.
The set $S=\{1,2,5,6,7,9\}$ can be partitioned into the two sets $\{1,5,9\},\{2,6,7\}$ each of which sum to T = 15.
(every integer in S is strictly between T/4 and T/2): $S=\{4,5,5,5,5,6\}$ , thus m=2, and T=15. There is feasible 3-partition $\{4,5,6\},\{5,5,5\}$ .
(every integer in S is strictly between T/4 and T/2): $S=\{4,4,4,6,6,6\}$ , thus m=2, and T=15. There is no feasible solution.

Strong NP-completeness

The 3-partition problem remains NP-complete even when the integers in S are bounded above by a polynomial in n. In other words, the problem remains NP-complete even when representing the numbers in the input instance in unary. i.e., 3-partition is NP-complete in the strong sense or strongly NP-complete. This property, and 3-partition in general, is useful in many reductions where numbers are naturally represented in unary.

3-Partition vs Partition

The 3-partition problem is similar to the partition problem, in which the goal is to partition S into two subsets with equal sum, and the multiway number partitioning, in which the goal is to partition S into k subsets with equal sum, where k is a fixed parameter. In 3-Partition the goal is to partition S into m = n/3 subsets, not just a fixed number of subsets, with equal sum. Partition is "easier" than 3-Partition: while 3-Partition is strongly NP-hard, Partition is only weakly NP-hard - it is hard only when the numbers are encoded in non-unary system, and have value exponential in n. When the values are polynomial in n, Partition can be solved in polynomial time using the pseudopolynomial time number partitioning algorithm.

Variants

In the unrestricted-input variant, the inputs can be arbitrary integers; in the restricted-input variant, the inputs must be in (T/4, T/2). The restricted version is as hard as the unrestricted version: given an instance S_u of the unrestricted variant, construct a new instance of the restricted version $S r ≔ {s + 2 T | s \in S u$ }. Every solution of S_u corresponds to a solution of S_r but with a sum of 7 T instead of T, and every element of S_r is in [2 T , 3 T ] which is contained in ( T /4, 7 T /2).

In the distinct-input variant, the inputs must be in (T/4, T/2), and in addition, they must all be distinct integers. It, too, is as hard as the unrestricted version.^[1]

In the unrestricted-output variant, the m output subsets can be of arbitrary size - not necessarily 3 (but they still need to have the same sum T). The restricted-output variant can be reduced to the unrestricted-variant: given an instance S_r of the restricted variant, with 3m items summing up to mT, construct a new instance of the unrestricted variant $S u ≔ {s + 2 T | s \in S r$ }, with 3m items summing up to 7mT, and with target sum 7 T . Every solution of S_r naturally corresponds to a solution of S_u. Conversely, in every solution of S_u, since the target sum is 7 T and each element is in ( T /4, 7 T /2), there must be exactly 3 elements per set, so it corresponds to a solution of S_r.

The ABC-partition problem (also called numerical 3-d matching) is a variant in which, instead of a set S with 3 m integers, there are three sets A, B, C with m integers in each. The sum of numbers in all sets is ⁠ $mT$ ⁠. The goal is to construct m triplets, each of which contains one element from A, one from B and one from C, such that the sum of each triplet is T.^[2]

The 4-partition problem is a variant in which S contains n = 4 m integers, the sum of all integers is ⁠ $mT$ ⁠, and the goal is to partition it into m quadruplets, all with a sum of T. It can be assumed that each integer is strictly between T/5 and T/3. Similarly, ABCD-parititon is a variant of 4-partition in which each there are 4 input sets and each quadruplet should contain one element from each set.

Proofs

Garey and Johnson (1975) originally proved 3-Partition to be NP-complete, by a reduction from 3-dimensional matching.^[3] The classic reference by Garey and Johnson (1979) describes an NP-completeness proof, reducing from 3-dimensional matching to 4-partition to 3-partition.^[4] Logically, the reduction can be partitioned into several steps.

Reduction from 3d-matching to ABCD-partition

We are given an instance of E of 3d-matching, containing some m triplets {w_i,x_j,y_k}, where the vertices are w₁,...,w_q and x₁,...,x_q and y₁,...,y_q. We construct an instance of ABCD-partition with 4*m elements, as follows (where r := 32q):

For each triplet t = {w_i,x_j,y_k} in E, the set A contains an element u_t = 10r⁴-kr³-jr²-ir.
For each triplet t = {w_i,x_j,y_k} in E, the set B contains w_it, C contains x_jt, and D contains y_kt. So for each of w_i, x_j, y_k, there may be many corresponding elements in B, C, D - one for each triplet in which they appear. We consider one of these elements (denoted by "1") as the "real" one, and the others as "dummy" ones. The element sizes are as follows:
- w_i[1] = 10r⁴+ir; w_i[2..] = 11r⁴+ir.
- x_j[1] = 10r⁴+jr²; x_j[2..] = 11r⁴+jr².
- y_k[1] = 10r⁴+kr³; y_k[2..] = 8r⁴+kr³.
The sum of every three "real" elements or every three "dummy" elements is 30r⁴+ir+jr²+kr³; and if the triplet element is added, the sum is 40r⁴.
The threshold for the ABCD-partition instance is T=40r⁴. Note that the size of each element is in (T/3,T/5).

Given a perfect matching in E, we construct a 4-partition of ABCD as follows:

For each triplet t= {w_i,x_j,y_k} in the matching, we construct a 4-set {u_t, w_i[1], x_j[1], y_k[1]}.
For each triplet not in the matching, we construct a similar 4-set, but with the corresponding dummy elements.

In both cases, the sum of the 4-set is 40r⁴ as needed.

Given a partition of ABCD, the sum of each 4-set is 40r⁴. Therefore, the terms with r, r² and r³ must cancel out, and the terms with r⁴ must sum up to 40r⁴; so the 4-set must contain a triplet and 3 matching "real" elements, or a triplet and 3 matching "dummy" elements. From the triplets with the 3 matching "real" elements, we construct a valid perfect matching in E.

Note that, in the above reduction, the size of each element is polynomial in the input size; hence, this reduction shows that ABCD-partition is strongly NP-hard.

Reduction from ABCD-partition to 4-partition

Given an instance of ABCD-partition with m elements per set, threshold T, and sum mT, we construct an instance of 4-partition with 4m elements:

For each element a in A, the corresponding element has size 16a+1;
For each element b in B, the corresponding element has size 16b+2;
For each element c in C, the corresponding element has size 16c+4;
For each element d in D, the corresponding element has size 16d+8.

All in all, the sum is 16mT+15m, and the new threshold is 16T+15.

Every ABCD-partition corresponds naturally to a 4-partition. Conversely, in every 4-partition, the sum modulo 16 is 15, and therefore it must contain exactly one item with size modulo 16 = 1, 2, 4, 8; this corresponds to exactly one item from A, B, C, D, from which we can construct an ABCD-partition.

Using a similar reduction, ABC-partition can be reduced to 3-partition.

Reduction from 4-partition to 3-partition

We are given an instance A of 4-partition: 4m integers, a₁,...,a_4m, each of which in the range (T/3,T/5), summing up to mT. We construct an instance B of 3-partition as follows:

For each a_i in A, B contains a "regular" element w_i = 4*(5T+a_i)+1. All in all there are 4m regular elements, summing up to 81mT + 4m.
For each pair of elements a_i,a_j in A, B contains two "pairing" elements: u_ij = 4*(6T - a_i - a_j)+2 and u_ij' = 4*(5T + a_i + a_j)+2. All in all there are 4m*(4m-1) pairing elements, summing up to (88mT+16m)*(4m-1).
Additionally, B contains 8m²-3m "filler" elements, with size 20T, and total sum (8m²-3m)*20T.
All in all, B contains 24m²-3m = 3(8m²-m) elements, with sum (64T+4)*(8m²-m).
The threshold for the 3-partition instance is 64T+4; note that the sizes of all elements in B are in (16T+1,32T+2).

Given a 4-partition of A, we construct a 3-partition for B as follows:

For each 4-set {a₁,a₂,a₃,a₄} with sum T, we construct a 3-set {w₁,w₂,u₁₂} with sum 4*(5T+a₁+5T+a₂+6T-a₁-a₂)+1+1+2=64T+4 and another 3-set {w₃,w₄,u₁₂'} with sum 4*(5T+a₃+5T+a₄+5T+a₁+a₂)+1+1+2=64T+4. These sets contain all 4m regular elements and 2m matching pairs of pairing elements.
From the remaining elements, we construct 3-sets {u_ij,u_ij',filler} with sum 4*(6T-a_i-a_j+5T+a_i+a_j+5T)+2+2=64T+4.

Conversely, given a 3-partition of B, the sum of each 3-set is a multiple of 4, so it must contain either two regular items and one pairing item, or two pairing items and one filler item:

If a 3-set contains two pairing items u_ij, u_kl and one filler item, then the sum of the two pairing items must be 44T+4 = 4*(5T+6T)+2+2, so they must have matching sizes (a_i+a_j=a_k+a_l). Therefore, by replacing as needed, we can assume that the two pairing items are in fact u_ij and u_ij'. Therefore, the remaining pairing items also consist of n matching pairs.
Therefore, the remaining 3-sets can be partitioned into two groups: n 3-sets containing the items u_ij, and n 3-sets containing the items u_ij'. In each matching pair of 3-sets, the sum of the two pairing items u_ij+u_ij' is 44T+4, so the sum of the four regular items is 84T+4. Therefore, from the four regular items, we construct a 4-set in A, with sum T.

Applications

The NP-hardness of 3-partition was used to prove the NP-hardness rectangle packing, as well as of Tetris ^[5]^[6] and some other puzzles,^[7] and some job scheduling problems.^[8]

Related Research Articles

In combinatorial mathematics, a Steiner system is a type of block design, specifically a t-design with λ = 1 and t = 2 or (recently) t ≥ 2.

The subset sum problem (SSP) is a decision problem in computer science. In its most general formulation, there is a multiset $of integers and a target-sum, and the question is to decide whether any subset of the integers sum to precisely . The problem is known to be NP-complete. Moreover, some restricted variants of it are NP-complete too, for example:$

In computational complexity theory, the complexity class NP-equivalent is the set of function problems that are both NP-easy and NP-hard. NP-equivalent is the analogue of NP-complete for function problems.

In combinatorial mathematics, the Bell numbers count the possible partitions of a set. These numbers have been studied by mathematicians since the 19th century, and their roots go back to medieval Japan. In an example of Stigler's law of eponymy, they are named after Eric Temple Bell, who wrote about them in the 1930s.

<span class="mw-page-title-main">Set cover problem</span> Classical problem in combinatorics

The set cover problem is a classical question in combinatorics, computer science, operations research, and complexity theory.

In mathematics, particularly in combinatorics, given a family of sets, here called a collection C, a transversal (also called a cross-section) is a set containing exactly one element from each member of the collection. When the sets of the collection are mutually disjoint, each element of the transversal corresponds to exactly one member of C (the set it is a member of). If the original sets are not disjoint, there are two possibilities for the definition of a transversal:

Set packing is a classical NP-complete problem in computational complexity theory and combinatorics, and was one of Karp's 21 NP-complete problems. Suppose one has a finite set S and a list of subsets of S. Then, the set packing problem asks if some k subsets in the list are pairwise disjoint.

In number theory and computer science, the partition problem, or number partitioning, is the task of deciding whether a given multiset S of positive integers can be partitioned into two subsets S₁ and S₂ such that the sum of the numbers in S₁ equals the sum of the numbers in S₂. Although the partition problem is NP-complete, there is a pseudo-polynomial time dynamic programming solution, and there are heuristics that solve the problem in many instances, either optimally or approximately. For this reason, it has been called "the easiest hard problem".

In mathematics, the relaxation of a (mixed) integer linear program is the problem that arises by removing the integrality constraint of each variable.

<span class="mw-page-title-main">3-dimensional matching</span>

In the mathematical discipline of graph theory, a 3-dimensional matching is a generalization of bipartite matching to 3-partite hypergraphs, which consist of hyperedges each of which contains 3 vertices.

In combinatorial optimization, the matroid intersection problem is to find a largest common independent set in two matroids over the same ground set. If the elements of the matroid are assigned real weights, the weighted matroid intersection problem is to find a common independent set with the maximum possible weight. These problems generalize many problems in combinatorial optimization including finding maximum matchings and maximum weight matchings in bipartite graphs and finding arborescences in directed graphs.

Numerical 3-dimensional matching is an NP-complete decision problem. It is given by three multisets of integers $, and, each containing elements, and a bound . The goal is to select a subset of such that every integer in, and occurs exactly once and that for every triple in the subset holds. This problem is labeled as [SP16] in.$

<span class="mw-page-title-main">Bell triangle</span>

In mathematics, the Bell triangle is a triangle of numbers analogous to Pascal's triangle, whose values count partitions of a set in which a given element is the largest singleton. It is named for its close connection to the Bell numbers, which may be found on both sides of the triangle, and which are in turn named after Eric Temple Bell. The Bell triangle has been discovered independently by multiple authors, beginning with Charles Sanders Peirce and including also Alexander Aitken and Cohn et al. (1962), and for that reason has also been called Aitken's array or the Peirce triangle.

In graph theory, a rainbow-independent set (ISR) is an independent set in a graph, in which each vertex has a different color.

Rectangle packing is a packing problem where the objective is to determine whether a given set of small rectangles can be placed inside a given large polygon, such that no two small rectangles overlap. Several variants of this problem have been studied.

In computer science, multiway number partitioning is the problem of partitioning a multiset of numbers into a fixed number of subsets, such that the sums of the subsets are as similar as possible. It was first presented by Ronald Graham in 1969 in the context of the identical-machines scheduling problem. The problem is parametrized by a positive integer k, and called k-way number partitioning. The input to the problem is a multiset S of numbers, whose sum is k*T.

The multiple subset sum problem is an optimization problem in computer science and operations research. It is a generalization of the subset sum problem. The input to the problem is a multiset $of n integers and a positive integer m representing the number of subsets. The goal is to construct, from the input integers, some m subsets. The problem has several variants:$

Balanced number partitioning is a variant of multiway number partitioning in which there are constraints on the number of items allocated to each set. The input to the problem is a set of n items of different sizes, and two integers m, k. The output is a partition of the items into m subsets, such that the number of items in each subset is at most k. Subject to this, it is required that the sums of sizes in the m subsets are as similar as possible.

Matroid-constrained number partitioning is a variant of the multiway number partitioning problem, in which the subsets in the partition should be independent sets of a matroid. The input to this problem is a set S of items, a positive integer m, and some m matroids over the same set S. The goal is to partition S into m subsets, such that each subset i is an independent set in matroid i. Subject to this constraint, some objective function should be minimized, for example, minimizing the largest sum item sizes in a subset. In a more general variant, each of the m matroids has a weight function, which assigns a weight to each element of the ground-set. Various objective functions have been considered. For each of the three operators max,min,sum, one can use this operator on the weights of items in each subset, and on the subsets themselves. All in all, there are 9 possible objective functions, each of which can be maximized or minimized.

References

↑ Hulett, Heather; Will, Todd G.; Woeginger, Gerhard J. (2008-09-01). "Multigraph realizations of degree sequences: Maximization is easy, minimization is hard". Operations Research Letters. 36 (5): 594–596. doi:10.1016/j.orl.2008.05.004. ISSN 0167-6377.
↑ Demaine, Erik (2015). "MIT OpenCourseWare - Hardness made Easy 2 - 3-Partition I". Youtube. Archived from the original on 2021-12-14.
↑ Garey, Michael R. and David S. Johnson (1975). "Complexity results for multiprocessor scheduling under resource constraints". SIAM Journal on Computing. 4 (4): 397–411. doi:10.1137/0204035.
↑ Garey, Michael R. and David S. Johnson (1979), Computers and Intractability; A Guide to the Theory of NP-Completeness. ISBN 0-7167-1045-5. Pages 96–105 and 224.
↑ "Tetris is hard, even to approximate". Nature. 2002-10-28. doi:10.1038/news021021-9. ISSN 0028-0836.
↑ BREUKELAAR, RON; DEMAINE, ERIK D.; HOHENBERGER, SUSAN; HOOGEBOOM, HENDRIK JAN; KOSTERS, WALTER A.; LIBEN-NOWELL, DAVID (2004-04-01). "Tetris is Hard, Even to Approximate". International Journal of Computational Geometry & Applications. 14 (1n02): 41–68. arXiv: cs/0210020 . doi:10.1142/s0218195904001354. ISSN 0218-1959. S2CID 1177.
↑ Demaine, Erik D.; Demaine, Martin L. (2007-06-01). "Jigsaw Puzzles, Edge Matching, and Polyomino Packing: Connections and Complexity". Graphs and Combinatorics. 23 (S1): 195–208. doi:10.1007/s00373-007-0713-4. ISSN 0911-0119. S2CID 17190810.
↑ Bernstein, D.; Rodeh, M.; Gertner, I. (1989). "On the complexity of scheduling problems for parallel/pipelined machines". IEEE Transactions on Computers. 38 (9): 1308–1313. doi:10.1109/12.29469. ISSN 0018-9340.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] Hulett, Heather; Will, Todd G.; Woeginger, Gerhard J. (2008-09-01). "Multigraph realizations of degree sequences: Maximization is easy, minimization is hard". Operations Research Letters. 36 (5): 594–596. doi:10.1016/j.orl.2008.05.004. ISSN 0167-6377.

[2] Demaine, Erik (2015). "MIT OpenCourseWare - Hardness made Easy 2 - 3-Partition I". Youtube. Archived from the original on 2021-12-14.

[3] Garey, Michael R. and David S. Johnson (1975). "Complexity results for multiprocessor scheduling under resource constraints". SIAM Journal on Computing. 4 (4): 397–411. doi:10.1137/0204035.

[4] Garey, Michael R. and David S. Johnson (1979), Computers and Intractability; A Guide to the Theory of NP-Completeness. ISBN 0-7167-1045-5. Pages 96–105 and 224.

[5] "Tetris is hard, even to approximate". Nature. 2002-10-28. doi:10.1038/news021021-9. ISSN 0028-0836.

[6] BREUKELAAR, RON; DEMAINE, ERIK D.; HOHENBERGER, SUSAN; HOOGEBOOM, HENDRIK JAN; KOSTERS, WALTER A.; LIBEN-NOWELL, DAVID (2004-04-01). "Tetris is Hard, Even to Approximate". International Journal of Computational Geometry & Applications. 14 (1n02): 41–68. arXiv: cs/0210020 . doi:10.1142/s0218195904001354. ISSN 0218-1959. S2CID 1177.

[7] Demaine, Erik D.; Demaine, Martin L. (2007-06-01). "Jigsaw Puzzles, Edge Matching, and Polyomino Packing: Connections and Complexity". Graphs and Combinatorics. 23 (S1): 195–208. doi:10.1007/s00373-007-0713-4. ISSN 0911-0119. S2CID 17190810.

[8] Bernstein, D.; Rodeh, M.; Gertner, I. (1989). "On the complexity of scheduling problems for parallel/pipelined machines". IEEE Transactions on Computers. 38 (9): 1308–1313. doi:10.1109/12.29469. ISSN 0018-9340.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]