David Shmoys

David Shmoys
David Shmoys
	David Shmoys
Born	1959 (age 64–65)
Alma mater	Princeton, ; University of California, Berkeley
Awards	Frederick W. Lanchester Prize (2013); Daniel H. Wagner Prize (2018); Khachiyan Prize (2022)
	Scientific career
Fields	Computer Science, Computational complexity theory
Institutions	Cornell
Thesis	Approximation Algorithms for Problems in Sequencing, Scheduling, and Communication Network Design (1984)
Doctoral advisor	Eugene Lawler
Doctoral students	Clifford Stein
Website	people.orie.cornell.edu/shmoys/

Last updated April 06, 2024

David Bernard Shmoys (born 1959) is a Professor in the School of Operations Research and Information Engineering and the Department of Computer Science at Cornell University. He obtained his Ph.D. from the University of California, Berkeley in 1984. His major focus has been in the design and analysis of algorithms for discrete optimization problems.

In particular, his work has highlighted the role of linear programming in the design of approximation algorithms for NP-hard problems. He is known for his pioneering research on providing first constant factor performance guarantee for several scheduling and clustering problems including the k-center and k-median problems and the generalized assignment problem. Polynomial-time approximation schemes that he developed for scheduling problems have found applications in many subsequent works. His current research includes stochastic optimization for data-driven models in a broad cross-section of areas, including COVID epidemiological modeling, congressional districting, transportation, and IoT network design. Shmoys is married to Éva Tardos, who is the Jacob Gould Schurman Professor of Computer Science at Cornell University.

Key contributions

Two of his key contributions are

Constant factor approximation algorithm for the Generalized Assignment Problem and Unrelated Parallel Machine Scheduling.
Constant factor approximation algorithm for k-Medians and Facility location problem.

These contributions are described briefly below:

Generalized Assignment Problem & Unrelated Parallel Machine Scheduling

The paper^[1] is a joint work by David Shmoys and Eva Tardos.

The generalized assignment problem can be viewed as the following problem of scheduling unrelated parallel machine with costs. Each of $n$ independent jobs (denoted $J$ ) have to be processed by exactly one of $m$ unrelated parallel machines (denoted $M$ ). Unrelated implies same job might take different amount of processing time on different machines. Job $j$ takes $p_{i,j}$ time units when processed by machine $i$ and incurs a cost $c_{i,j},i=1,2,..,m;j=1,2,..,n;n\geq m$ . Given $C$ and $T_{i},i=1,2,..,m$ , we wish to decide if there exists a schedule with total cost at most $C$ such that for each machine $i$ its load, the total processing time required for the jobs assigned to it is at most $T_{i},i=1,2,..,m$ . By scaling the processing times, we can assume, without loss of generality, that the machine load bounds satisfy $T_{1}=T_{2}=..=T_{m}=T$ . ``In other words, generalized assignment problem is to find a schedule of minimum cost subject to the constraint that the makespan, that the maximum machine load is at most $T$ ".

The work of Shmoys with Lenstra and Tardos cited here^[2] gives a 2 approximation algorithm for the unit cost case. The algorithm is based on a clever design of linear program using parametric pruning and then rounding an extreme point solution of the linear program deterministically. Algorithm for the generalized assignment problem is based on a similar LP through parametric pruning and then using a new rounding technique on a carefully designed bipartite graph. We now state the LP formulation and briefly describe the rounding technique.

We guess the optimum value of makespan $T$ and write the following LP. This technique is known as parametric pruning.

$LP(T)::\sum _{i=1}^{m}\sum _{j=1}^{n}c_{ij}x_{ij}\leq C$ ;

\sum _{i=1}^{m}x_{ij}=1\qquad j=1,\ldots ,n

;

\sum _{i=1}^{m}p_{ij}x_{ij}\leq T\qquad i=1,\ldots ,m

;

x_{ij}\geq 0\qquad i=1,\ldots ,m,\quad j=1,\ldots ,n

;

x_{ij}=0\qquad {\text{if}}\qquad p_{ij}\geq T,\qquad i=1,\ldots ,m,\quad j=1,\ldots ,n

;

The obtained LP solution is then rounded to an integral solution as follows. A weighted bipartite graph $G=(W\cup V,E)$ is constructed. One side of the bipartite graph contains the job nodes, $W=\{w_{j}|j\in J\}$ , and the other side contains several copies of each machine node, $V=\{v_{i,s}|i=1,2,..,m;s=1,2,..,k_{i}\}$ , where $k_{i}=\lceil \sum _{j}x_{ij}\rceil$ . To construct the edges to machine nodes corresponding to say machine $i$ , first jobs are arranged in decreasing order of processing time $p_{ij}$ . For simplicity, suppose, $p_{i1}\geq p_{i2}\geq \ldots \geq p_{in}$ . Now find the minimum index $j_{1}$ , such that $\sum _{j=1}^{j_{1}}x_{ij}\geq 1$ . Include in $E$ all the edges $(w_{j},v_{i1},j=1,2,..,j_{1}-1)$ with nonzero $x_{ij}$ and set their weights to $x'_{v_{i1}j}=x_{ij}$ . Create the edge $(w_{j_{1}},v_{i1})$ and set its weight to $x'_{v_{i1}j_{1}}=1-\sum _{i=1}^{j_{1}-1}x'_{v_{i1}j}$ . This ensures that the total weight of the edges incident on the vertex $v_{i1}$ is at most 1. If $x'_{v_{i1}j_{1}}<x_{ij_{1}}$ , then create an edge $(w_{j_{1}},v_{i2})$ with weight $x'_{v_{i2}j_{1}}=x_{ij}-x'_{v_{i1}j_{1}}$ . Continue with assigning edges to $v_{i2}$ in a similar way.

In the bipartite graph thus created, each job node in $W$ has a total edge weight of 1 incident on it, and each machine node in $V$ has edges with total weight at most 1 incident on it. Thus the vector $x'$ is an instance of a fractional matching on $G$ and thus it can be rounded to obtain an integral matching of same cost.

Now considering the ordering of processing times of the jobs on the machines nodes during construction of $G$ and using an easy charging argument, the following theorem can be proved:

Theorem: If $LP(T)$ has a feasible solution then a schedule can be constructed with makespan $T+max_{i,j}p_{i,j}$ and cost $C$ .

Since $max_{i,j}p_{i,j}\leq T$ , a 2 approximation is obtained.

K-medians and Facility Location Problem

The paper^[3] is a joint work by Moses Charikar, Sudipto Guha, Éva Tardos and David Shmoys. They obtain a $6{\frac {2}{3}}$ approximation to the metric k medians problem. This was the first paper to break the previously best known $O(\log {k}\ \log {\log {k}})$ approximation.

Shmoys has also worked extensively on the facility location problem. His recent results include obtaining a $3$ approximation algorithm for the capacitated facility location problem. The joint work with Fabian Chudak,^[4] has resulted in improving the previous known $5.69$ approximation for the same problem. Their algorithm is based on a variant of randomized rounding called the randomized rounding with a backup, since a backup solution is incorporated to correct for the fact that the ordinary randomized rounding rarely generates a feasible solution to the associated set covering problem.

For the incapacitated version of the facility location problem, again in a joint work with Chudak^[5] he obtained a $(1+2/e)\approx 1.736$ -approximation algorithm which was a significant improvement on the previously known approximation guarantees. The improved algorithm works by rounding an optimal fractional solution of a linear programming relaxation and using the properties of the optimal solutions of the linear program and a generalization of a decomposition technique.

Awards & honors

David Shmoys is an ACM Fellow, a SIAM Fellow, and a Fellow of the Institute for Operations Research and the Management Sciences (INFORMS) (2013). He has received the Sonny Yau Excellence in Teaching Award three times from Cornell's College of Engineering, and has been awarded a NSF Presidential Young Investigator Award, the Frederick W. Lanchester Prize (2013), the Daniel H. Wagner Prize for Excellence in the Practice of Advanced Analytics and Operations Research (2018), and the Khachiyan Prize of the INFORMS Optimization Society (2022), which is awarded annually for life-time achievements in the area of optimization.

Related Research Articles

The knapsack problem is the following problem in combinatorial optimization:

In statistics, the term linear model refers to any model which assumes linearity in the system. The most common occurrence is in connection with regression models and the term is often taken as synonymous with linear regression model. However, the term is also used in time series analysis with a different meaning. In each case, the designation "linear" is used to identify a subclass of models for which substantial reduction in the complexity of the related statistical theory is possible.

The travelling salesman problem, also known as the travelling salesperson problem (TSP), asks the following question: "Given a list of cities and the distances between each pair of cities, what is the shortest possible route that visits each city exactly once and returns to the origin city?" It is an NP-hard problem in combinatorial optimization, important in theoretical computer science and operations research.

The assignment problem is a fundamental combinatorial optimization problem. In its most general form, the problem is as follows:

In computer science and operations research, approximation algorithms are efficient algorithms that find approximate solutions to optimization problems with provable guarantees on the distance of the returned solution to the optimal one. Approximation algorithms naturally arise in the field of theoretical computer science as a consequence of the widely believed P ≠ NP conjecture. Under this conjecture, a wide class of optimization problems cannot be solved exactly in polynomial time. The field of approximation algorithms, therefore, tries to understand how closely it is possible to approximate optimal solutions to such problems in polynomial time. In an overwhelming majority of the cases, the guarantee of such algorithms is a multiplicative one expressed as an approximation ratio or approximation factor i.e., the optimal solution is always guaranteed to be within a (predetermined) multiplicative factor of the returned solution. However, there are also many approximation algorithms that provide an additive guarantee on the quality of the returned solution. A notable example of an approximation algorithm that provides both is the classic approximation algorithm of Lenstra, Shmoys and Tardos for scheduling on unrelated parallel machines.

In mathematics, a totally positive matrix is a square matrix in which all the minors are positive: that is, the determinant of every square submatrix is a positive number. A totally positive matrix has all entries positive, so it is also a positive matrix; and it has all principal minors positive. A symmetric totally positive matrix is therefore also positive-definite. A totally non-negative matrix is defined similarly, except that all the minors must be non-negative. Some authors use "totally positive" to include all totally non-negative matrices.

In queueing theory, a discipline within the mathematical theory of probability, a Jackson network is a class of queueing network where the equilibrium distribution is particularly simple to compute as the network has a product-form solution. It was the first significant development in the theory of networks of queues, and generalising and applying the ideas of the theorem to search for similar product-form solutions in other networks has been the subject of much research, including ideas used in the development of the Internet. The networks were first identified by James R. Jackson and his paper was re-printed in the journal Management Science’s ‘Ten Most Influential Titles of Management Sciences First Fifty Years.’

An eikonal equation is a non-linear first-order partial differential equation that is encountered in problems of wave propagation.

Semidefinite programming (SDP) is a subfield of mathematical programming concerned with the optimization of a linear objective function over the intersection of the cone of positive semidefinite matrices with an affine space, i.e., a spectrahedron.

In applied mathematics, the maximum generalized assignment problem is a problem in combinatorial optimization. This problem is a generalization of the assignment problem in which both tasks and agents have a size. Moreover, the size of each task might vary from one agent to the other.

In mathematics, the Grothendieck inequality states that there is a universal constant $with the following property. If M ij is an n \times n matrix with$

In classical mechanics, holonomic constraints are relations between the position variables that can be expressed in the following form:

Job-shop scheduling, the job-shop problem (JSP) or job-shop scheduling problem (JSSP) is an optimization problem in computer science and operations research. It is a variant of optimal job scheduling. In a general job scheduling problem, we are given n jobs J₁, J₂, ..., J_n of varying processing times, which need to be scheduled on m machines with varying processing power, while trying to minimize the makespan – the total length of the schedule. In the specific variant known as job-shop scheduling, each job consists of a set of operationsO₁, O₂, ..., O_n which need to be processed in a specific order. Each operation has a specific machine that it needs to be processed on and only one operation in a job can be processed at a given time. A common relaxation is the flexible job shop, where each operation can be processed on any machine of a given set.

In multilinear algebra, the tensor rank decomposition or the $decomposition of a tensor is the decomposition of a tensor in terms of a sum of minimum tensors. This is an open problem.$

In statistics, an additive model (AM) is a nonparametric regression method. It was suggested by Jerome H. Friedman and Werner Stuetzle (1981) and is an essential part of the ACE algorithm. The AM uses a one-dimensional smoother to build a restricted class of nonparametric regression models. Because of this, it is less affected by the curse of dimensionality than e.g. a p-dimensional smoother. Furthermore, the AM is more flexible than a standard linear model, while being more interpretable than a general regression surface at the cost of approximation errors. Problems with AM, like many other machine-learning methods, include model selection, overfitting, and multicollinearity.

The maximum coverage problem is a classical question in computer science, computational complexity theory, and operations research. It is a problem that is widely taught in approximation algorithms.

The weapon target assignment problem (WTA) is a class of combinatorial optimization problems present in the fields of optimization and operations research. It consists of finding an optimal assignment of a set of weapons of various types to a set of targets in order to maximize the total expected damage done to the opponent.

The Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) is a multi-criteria decision analysis method, which was originally developed by Ching-Lai Hwang and Yoon in 1981 with further developments by Yoon in 1987, and Hwang, Lai and Liu in 1993. TOPSIS is based on the concept that the chosen alternative should have the shortest geometric distance from the positive ideal solution (PIS) and the longest geometric distance from the negative ideal solution (NIS). A dedicated book in the fuzzy context was published in 2021

The multiplicative weights update method is an algorithmic technique most commonly used for decision making and prediction, and also widely deployed in game theory and algorithm design. The simplest use case is the problem of prediction from expert advice, in which a decision maker needs to iteratively decide on an expert whose advice to follow. The method assigns initial weights to the experts, and updates these weights multiplicatively and iteratively according to the feedback of how well an expert performed: reducing it in case of poor performance, and increasing it otherwise. It was discovered repeatedly in very diverse fields such as machine learning, optimization, theoretical computer science, and game theory.

Unrelated-machines scheduling is an optimization problem in computer science and operations research. It is a variant of optimal job scheduling. We need to schedule n jobs J₁, J₂, ..., J_n on m different machines, such that a certain objective function is optimized. The time that machine i needs in order to process job j is denoted by p_i,j. The term unrelated emphasizes that there is no relation between values of p_i,j for different i and j. This is in contrast to two special cases of this problem: uniform-machines scheduling - in which p_i,j = p_i / s_j, and identical-machines scheduling - in which p_i,j = p_i.

References

↑ Shmoys, D. B.; Tardos, É. (1993). "An approximation algorithm for the generalized assignment problem". Mathematical Programming. 62 (1–3): 461–474. doi:10.1007/BF01585178. S2CID 9027754.
↑ Lenstra, J. K.; Shmoys, D. B.; Tardos, É. (1990). "Approximation algorithms for scheduling unrelated parallel machines". Mathematical Programming. 46 (1–3): 259–271. doi:10.1007/BF01585745. S2CID 206799898.
↑ Charikar, M.; Guha, S.; Tardos, É.; Shmoys, D. B. (2002). "A Constant-Factor Approximation Algorithm for the k-Median Problem". Journal of Computer and System Sciences. 65: 129–149. doi: 10.1006/jcss.2002.1882 .
↑ Chudak, F. N. A.; Williamson, D. P. (2004). "Improved approximation algorithms for capacitated facility location problems". Mathematical Programming. 102 (2): 207. CiteSeerX 10.1.1.53.5219 . doi:10.1007/s10107-004-0524-9. S2CID 40133143.
↑ Chudak, F. N. A.; Shmoys, D. B. (2003). "Improved Approximation Algorithms for the Uncapacitated Facility Location Problem". SIAM Journal on Computing. 33: 1–25. doi:10.1137/S0097539703405754.

External links

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[MATHPROG93-1] Shmoys, D. B.; Tardos, É. (1993). "An approximation algorithm for the generalized assignment problem". Mathematical Programming. 62 (1–3): 461–474. doi:10.1007/BF01585178. S2CID 9027754.

[MATHPROG193-2] Lenstra, J. K.; Shmoys, D. B.; Tardos, É. (1990). "Approximation algorithms for scheduling unrelated parallel machines". Mathematical Programming. 46 (1–3): 259–271. doi:10.1007/BF01585745. S2CID 206799898.

[JCSS02-3] Charikar, M.; Guha, S.; Tardos, É.; Shmoys, D. B. (2002). "A Constant-Factor Approximation Algorithm for the k-Median Problem". Journal of Computer and System Sciences. 65: 129–149. doi: 10.1006/jcss.2002.1882 .

[S99-4] Chudak, F. N. A.; Williamson, D. P. (2004). "Improved approximation algorithms for capacitated facility location problems". Mathematical Programming. 102 (2): 207. CiteSeerX 10.1.1.53.5219 . doi:10.1007/s10107-004-0524-9. S2CID 40133143.

[S03-5] Chudak, F. N. A.; Shmoys, D. B. (2003). "Improved Approximation Algorithms for the Uncapacitated Facility Location Problem". SIAM Journal on Computing. 33: 1–25. doi:10.1137/S0097539703405754.

[1]

[2]

[3]

[4]

[5]

Authority control databases
International	ISNI VIAF
National	Germany Israel United States Netherlands
Academics	DBLP Google Scholar MathSciNet Mathematics Genealogy Project zbMATH
Other	IdRef