Submodular set function

In mathematics, a submodular set function (also known as a submodular function) is a set function that, informally, describes the relationship between a set of inputs and an output, where adding another input has a decreasing additional benefit (diminishing returns). This natural diminishing-returns property makes them suitable for many applications, including approximation algorithms, game theory (as functions modeling user preferences) and electrical networks. Recently, submodular functions have also found utility in several real-world problems in machine learning and artificial intelligence, including automatic summarization, multi-document summarization, feature selection, active learning, sensor placement, image collection summarization and many other domains. [1] [2] [3] [4]

Definition

If Ω is a finite set, a submodular function is a set function f : 2^Ω → ℝ, where 2^Ω denotes the power set of Ω, which satisfies one of the following equivalent conditions. [5]

  1. For every X, Y ⊆ Ω with X ⊆ Y and every x ∈ Ω \ Y we have that f(X ∪ {x}) − f(X) ≥ f(Y ∪ {x}) − f(Y).
  2. For every S, T ⊆ Ω we have that f(S) + f(T) ≥ f(S ∪ T) + f(S ∩ T).
  3. For every X ⊆ Ω and x1, x2 ∈ Ω \ X such that x1 ≠ x2 we have that f(X ∪ {x1}) + f(X ∪ {x2}) ≥ f(X ∪ {x1, x2}) + f(X).

A nonnegative submodular function is also a subadditive function, but a subadditive function need not be submodular. If Ω is not assumed finite, then the above conditions are not equivalent. In particular a function f defined by f(S) = 1 if S is finite and f(S) = 0 if S is infinite satisfies the first condition above, but the second condition fails when S and T are infinite sets with finite intersection.
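For small ground sets, the equivalent conditions above can be verified by brute force. The sketch below (function names are my own, for illustration) checks condition 2 over all pairs of subsets, with a small tolerance for floating-point comparisons:

```python
import math
from itertools import chain, combinations

def powerset(omega):
    """All subsets of omega as frozensets."""
    s = list(omega)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(s, r) for r in range(len(s) + 1))]

def is_submodular(f, omega, tol=1e-9):
    """Brute-force check of condition 2:
    f(S) + f(T) >= f(S | T) + f(S & T) for all S, T."""
    subsets = powerset(omega)
    return all(f(S) + f(T) >= f(S | T) + f(S & T) - tol
               for S in subsets for T in subsets)

# f(S) = sqrt(|S|) is submodular: a concave function of a modular one.
assert is_submodular(lambda S: math.sqrt(len(S)), {1, 2, 3, 4})
# f(S) = |S|^2 is supermodular, hence not submodular.
assert not is_submodular(lambda S: len(S) ** 2, {1, 2, 3, 4})
```

The check is exponential in |Ω|, so it is only a debugging aid, not something to run on large ground sets.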

Types and examples of submodular functions

Monotone

A set function f is monotone if for every T ⊆ S we have that f(T) ≤ f(S). Examples of monotone submodular functions include:

Linear (Modular) functions
Any function of the form f(S) = Σ_{i∈S} w_i is called a linear function. Additionally if w_i ≥ 0 for all i, then f is monotone.
Budget-additive functions
Any function of the form f(S) = min{B, Σ_{i∈S} w_i} for w_i ≥ 0 and B ≥ 0 is called budget additive. [6]
Coverage functions
Let Ω = {E_1, E_2, …, E_n} be a collection of subsets of some ground set Ω′. The function f(S) = |⋃_{E_i ∈ S} E_i| for S ⊆ Ω is called a coverage function. This can be generalized by adding non-negative weights to the elements.
Entropy
Let Ω = {X_1, X_2, …, X_n} be a set of random variables. Then for any S ⊆ Ω we have that H(S) is a submodular function, where H(S) is the entropy of the set of random variables in S, a fact known as Shannon's inequality. [7] Further inequalities for the entropy function are known to hold, see entropic vector.
Matroid rank functions
Let Ω = {e_1, e_2, …, e_n} be the ground set on which a matroid is defined. Then the rank function of the matroid is a submodular function. [8]
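Of the classes above, coverage functions are particularly easy to experiment with directly. The sketch below (names and the toy collection are my own) builds a coverage function from a collection of subsets and checks the diminishing-returns inequality on one triple:

```python
def coverage(collection):
    """Build the coverage function f(S) = |union of E_i for i in S|."""
    def f(S):
        covered = set()
        for i in S:
            covered |= collection[i]
        return len(covered)
    return f

# Hypothetical ground set {a, b, c, d} covered by three candidate subsets.
E = {0: {"a", "b"}, 1: {"b", "c"}, 2: {"c", "d"}}
f = coverage(E)
assert f({0}) == 2 and f({0, 1}) == 3 and f({0, 1, 2}) == 4
# Diminishing returns: adding E_1 helps less once E_2 is already present.
assert f({0, 1}) - f({0}) >= f({0, 1, 2}) - f({0, 2})
```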

Non-monotone

A submodular function that is not monotone is called non-monotone.

Symmetric

A non-monotone submodular function f is called symmetric if for every S ⊆ Ω we have that f(S) = f(Ω \ S). Examples of symmetric non-monotone submodular functions include:

Graph cuts
Let Ω = {v_1, v_2, …, v_n} be the vertices of a graph. For any set of vertices S ⊆ Ω let f(S) denote the number of edges e = (u, v) such that u ∈ S and v ∈ Ω \ S. This can be generalized by adding non-negative weights to the edges.
Mutual information
Let Ω = {X_1, X_2, …, X_n} be a set of random variables. Then for any S ⊆ Ω we have that f(S) = I(S; Ω \ S) is a submodular function, where I(S; Ω \ S) is the mutual information.
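The graph-cut example above can be sketched in a few lines (function names are my own); the assertions illustrate both symmetry and non-monotonicity on a 4-cycle:

```python
def cut_function(edges):
    """f(S) = number of edges with exactly one endpoint in S (undirected cut)."""
    def f(S):
        return sum(1 for (u, v) in edges if (u in S) != (v in S))
    return f

# A 4-cycle on vertices {1, 2, 3, 4}.
V = {1, 2, 3, 4}
f = cut_function([(1, 2), (2, 3), (3, 4), (4, 1)])
assert f({1}) == 2 and f({1, 2}) == 2
assert f(set()) == 0 and f(V) == 0                           # non-monotone
assert all(f(S) == f(V - S) for S in [{1}, {1, 2}, {1, 3}])  # symmetric
```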

Asymmetric

A non-monotone submodular function which is not symmetric is called asymmetric.

Directed cuts
Let Ω = {v_1, v_2, …, v_n} be the vertices of a directed graph. For any set of vertices S ⊆ Ω let f(S) denote the number of edges e = (u, v) such that u ∈ S and v ∈ Ω \ S. This can be generalized by adding non-negative weights to the directed edges.

Continuous extensions of submodular set functions

Often, given a submodular set function that describes the values of various sets, we need to compute the values of fractional sets. For example: we know that the value of receiving house A and house B is V, and we want to know the value of receiving 40% of house A and 60% of house B. To this end, we need a continuous extension of the submodular set function.

Formally, a set function f : 2^Ω → ℝ with |Ω| = n can be represented as a function on {0, 1}^n, by associating each S ⊆ Ω with a binary vector x^S ∈ {0, 1}^n such that x^S_i = 1 when i ∈ S, and x^S_i = 0 otherwise. A continuous extension of f is a continuous function F : [0, 1]^n → ℝ that matches the value of f on {0, 1}^n, i.e. F(x^S) = f(S).

Several kinds of continuous extensions of submodular functions are commonly used, which are described below.

Lovász extension

This extension is named after mathematician László Lovász. [9] Consider any vector x = (x_1, x_2, …, x_n) such that each 0 ≤ x_i ≤ 1. Then the Lovász extension is defined as

f^L(x) = E[f({i : x_i ≥ λ})],

where the expectation is over λ chosen from the uniform distribution on the interval [0, 1]. The Lovász extension is a convex function if and only if f is a submodular function.
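Because the thresholded set {i : x_i ≥ λ} changes only at the coordinate values, the expectation can be evaluated exactly by sorting the coordinates in decreasing order; the sketch below (helper names are my own) uses this standard closed form:

```python
def lovasz_extension(f, x):
    """Lovász extension f^L(x) = E_{λ ~ U[0,1]} [ f({i : x_i >= λ}) ],
    evaluated exactly by sorting coordinates in decreasing order.
    x maps each element of the ground set to a value in [0, 1]."""
    order = sorted(x, key=x.get, reverse=True)
    vals = [x[i] for i in order]
    total = (1 - vals[0]) * f(frozenset())  # λ above every coordinate: empty set
    prefix = []
    for j, i in enumerate(order):
        prefix.append(i)
        lo = vals[j + 1] if j + 1 < len(vals) else 0.0
        total += (vals[j] - lo) * f(frozenset(prefix))
    return total

f = lambda S: min(len(S), 2)   # budget-additive, hence submodular
# At an integral point, the extension agrees with f.
assert lovasz_extension(f, {1: 1.0, 2: 1.0, 3: 0.0}) == f(frozenset({1, 2}))
assert abs(lovasz_extension(f, {1: 0.5, 2: 0.5, 3: 0.5}) - 1.0) < 1e-12
```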

Multilinear extension

Consider any vector x = (x_1, x_2, …, x_n) such that each 0 ≤ x_i ≤ 1. Then the multilinear extension is defined as [10] [11]

F(x) = Σ_{S ⊆ Ω} f(S) ∏_{i∈S} x_i ∏_{i∉S} (1 − x_i).

Intuitively, x_i represents the probability that item i is chosen for the set. For every set S, the two products represent the probability that the chosen set is exactly S. Therefore the sum represents the expected value of f for the set formed by choosing each item i at random with probability x_i, independently of the other items.
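This expectation can be computed exactly by enumerating all subsets, which is exponential in n but fine for small examples (in practice the multilinear extension is usually estimated by sampling). A small sketch, with names of my own choosing:

```python
from itertools import combinations

def multilinear_extension(f, x):
    """Exact multilinear extension F(x) = sum_S f(S) * prod_{i in S} x_i
    * prod_{i not in S} (1 - x_i), by enumerating all subsets."""
    items = list(x)
    total = 0.0
    for r in range(len(items) + 1):
        for S in combinations(items, r):
            S = frozenset(S)
            p = 1.0
            for i in items:
                p *= x[i] if i in S else (1 - x[i])
            total += p * f(S)
    return total

f = lambda S: len(S)        # modular, so F is linear: F(x) = sum of x_i
x = {1: 0.4, 2: 0.6}
assert abs(multilinear_extension(f, x) - 1.0) < 1e-12
```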

Convex closure

Consider any vector x = (x_1, x_2, …, x_n) such that each 0 ≤ x_i ≤ 1. Then the convex closure is defined as

f⁻(x) = min{ Σ_S α_S f(S) : Σ_S α_S 1_S = x, Σ_S α_S = 1, α_S ≥ 0 }.

The convex closure of any set function is convex over [0, 1]^n.

Concave closure

Consider any vector x = (x_1, x_2, …, x_n) such that each 0 ≤ x_i ≤ 1. Then the concave closure is defined as

f⁺(x) = max{ Σ_S α_S f(S) : Σ_S α_S 1_S = x, Σ_S α_S = 1, α_S ≥ 0 }.

Relations between continuous extensions

For the extensions discussed above, it can be shown that f⁺(x) ≥ F(x) ≥ f^L(x) = f⁻(x) when f is submodular. [12]

Properties

  1. The class of submodular functions is closed under non-negative linear combinations. Consider any submodular functions f_1, f_2, …, f_k and non-negative numbers α_1, α_2, …, α_k. Then the function g defined by g(S) = Σ_{i=1}^{k} α_i f_i(S) is submodular.
  2. For any submodular function f, the function defined by g(S) = f(Ω \ S) is submodular.
  3. The function g(S) = min(f(S), c), where c is a real number, is submodular whenever f is monotone submodular. More generally, g(S) = h(f(S)) is submodular, for any non-decreasing concave function h.
  4. Consider a random process where a set T is chosen with each element in Ω being included in T independently with probability p. Then the following inequality is true: E[f(T)] ≥ p f(Ω) + (1 − p) f(∅). More generally, consider the following random process where a set S is constructed as follows. For each of 1 ≤ i ≤ l, A_i ⊆ Ω, construct S_i by including each element of A_i independently into S_i with probability p_i. Furthermore let S = S_1 ∪ S_2 ∪ … ∪ S_l. Then the following inequality is true: E[f(S)] ≥ Σ_{R ⊆ [l]} ∏_{i∈R} p_i ∏_{i∉R} (1 − p_i) f(⋃_{i∈R} A_i).[ citation needed ]
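The first inequality in property 4 can be checked numerically by computing the expectation exactly over all subsets. A small sketch (the ground set, function, and probability are arbitrary choices of my own):

```python
from itertools import combinations

# Exact check of E[f(T)] >= p*f(Omega) + (1-p)*f({}) for T formed by
# keeping each element of Omega independently with probability p.
omega = frozenset({1, 2, 3})
f = lambda S: len(S) ** 0.5      # monotone submodular
p = 0.3
expected_value = 0.0
for r in range(len(omega) + 1):
    for T in combinations(omega, r):
        T = frozenset(T)
        prob = p ** len(T) * (1 - p) ** (len(omega) - len(T))
        expected_value += prob * f(T)
assert expected_value >= p * f(omega) + (1 - p) * f(frozenset()) - 1e-12
```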

Optimization problems

Submodular functions have properties which are very similar to those of convex and concave functions. For this reason, many optimization problems can be cast as maximizing or minimizing a submodular function subject to some constraints, in close analogy with convex and concave optimization.

Submodular set function minimization

The hardness of minimizing a submodular set function depends on constraints imposed on the problem.

  1. The unconstrained problem of minimizing a submodular function is computable in polynomial time, [13] [14] and even in strongly-polynomial time. [15] [16] Computing the minimum cut in a graph is a special case of this minimization problem.
  2. The problem of minimizing a submodular function with a cardinality lower bound is NP-hard, with polynomial factor lower bounds on the approximation factor. [17] [18]

Submodular set function maximization

Unlike the case of minimization, maximizing a generic submodular function is NP-hard even in the unconstrained setting. Consequently, most work in this field concerns polynomial-time approximation algorithms, including greedy algorithms and local search algorithms.

  1. The problem of maximizing a non-negative submodular function admits a 1/2 approximation algorithm. [19] [20] Computing the maximum cut of a graph is a special case of this problem.
  2. The problem of maximizing a monotone submodular function subject to a cardinality constraint admits a (1 − 1/e)-approximation algorithm. [21] [ page needed ] [22] The maximum coverage problem is a special case of this problem.
  3. The problem of maximizing a monotone submodular function subject to a matroid constraint (which subsumes the case above) also admits a (1 − 1/e)-approximation algorithm. [23] [24] [25]
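The (1 − 1/e) guarantee for the cardinality-constrained case is achieved by the classical greedy algorithm of Nemhauser, Wolsey and Fisher [21]: repeatedly add the element with the largest marginal gain. A sketch (function names and the toy instance are my own):

```python
def greedy_max(f, omega, k):
    """Greedy (1 - 1/e)-approximation for maximizing a monotone submodular f
    subject to the cardinality constraint |S| <= k."""
    S = set()
    for _ in range(k):
        # Pick the element with the largest marginal gain f(S + e) - f(S).
        best = max((e for e in omega - S),
                   key=lambda e: f(S | {e}) - f(S), default=None)
        if best is None or f(S | {best}) - f(S) <= 0:
            break                     # no remaining element improves the value
        S.add(best)
    return S

# Max coverage with k = 2: greedy picks the two most complementary sets.
E = {0: {"a", "b", "c"}, 1: {"b", "c"}, 2: {"d", "e"}}
cover = lambda S: len(set().union(*(E[i] for i in S))) if S else 0
assert greedy_max(cover, set(E), 2) == {0, 2}
```

Each iteration evaluates f once per remaining element, so the algorithm uses O(nk) value-oracle calls.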

Many of these algorithms can be unified within a semi-differential based framework of algorithms. [18]

Apart from submodular minimization and maximization, there are several other natural optimization problems related to submodular functions.

  1. Minimizing the difference between two submodular functions [26] is not only NP-hard, but also inapproximable. [27]
  2. Minimization/maximization of a submodular function subject to a submodular level set constraint (also known as submodular optimization subject to submodular cover or submodular knapsack constraint) admits bounded approximation guarantees. [28]
  3. Partitioning data based on a submodular function to maximize the average welfare is known as the submodular welfare problem, which also admits bounded approximation guarantees (see welfare maximization).

Applications

Submodular functions naturally occur in several real-world applications, in economics, game theory, machine learning and computer vision. [4] [29] Owing to the diminishing-returns property, submodular functions naturally model costs of items, since there is often a larger discount as one buys more items. Submodular functions model notions of complexity, similarity and cooperation when they appear in minimization problems. In maximization problems, on the other hand, they model notions of diversity, information and coverage.

Citations

  1. H. Lin and J. Bilmes, A Class of Submodular Functions for Document Summarization, ACL-2011.
  2. S. Tschiatschek, R. Iyer, H. Wei and J. Bilmes, Learning Mixtures of Submodular Functions for Image Collection Summarization, NIPS-2014.
  3. A. Krause and C. Guestrin, Near-optimal nonmyopic value of information in graphical models, UAI-2005.
  4. A. Krause and C. Guestrin, Beyond Convexity: Submodularity in Machine Learning, Tutorial at ICML-2008.
  5. Schrijver (2003), §44, p. 766.
  6. Buchbinder, Niv; Feldman, Moran (2018). "Submodular Functions Maximization Problems". In Gonzalez, Teofilo F. (ed.). Handbook of Approximation Algorithms and Metaheuristics, Second Edition: Methodologies and Traditional Applications. Chapman and Hall/CRC. doi:10.1201/9781351236423. ISBN   9781351236423.
  7. "Information Processing and Learning" (PDF). cmu.
  8. Fujishige (2005) p.22
  9. Lovász, L. (1983). "Submodular functions and convexity". Mathematical Programming the State of the Art. pp. 235–257. doi:10.1007/978-3-642-68874-4_10. ISBN   978-3-642-68876-8. S2CID   117358746.
  10. Vondrak, Jan (2008-05-17). "Optimal approximation for the submodular welfare problem in the value oracle model". Proceedings of the fortieth annual ACM symposium on Theory of computing. STOC '08. New York, NY, USA: Association for Computing Machinery. pp. 67–74. doi:10.1145/1374376.1374389. ISBN   978-1-60558-047-0. S2CID   170510.
  11. Calinescu, Gruia; Chekuri, Chandra; Pál, Martin; Vondrák, Jan (January 2011). "Maximizing a Monotone Submodular Function Subject to a Matroid Constraint". SIAM Journal on Computing. 40 (6): 1740–1766. doi:10.1137/080733991. ISSN   0097-5397.
  12. Vondrák, Jan. "Polyhedral techniques in combinatorial optimization: Lecture 17" (PDF).
  13. Grötschel, M.; Lovasz, L.; Schrijver, A. (1981). "The ellipsoid method and its consequences in combinatorial optimization". Combinatorica. 1 (2): 169–197. doi:10.1007/BF02579273. hdl: 10068/182482 . S2CID   43787103.
  14. Cunningham, W. H. (1985). "On submodular function minimization". Combinatorica. 5 (3): 185–192. doi:10.1007/BF02579361. S2CID   33192360.
  15. Iwata, S.; Fleischer, L.; Fujishige, S. (2001). "A combinatorial strongly polynomial algorithm for minimizing submodular functions". J. ACM. 48 (4): 761–777. doi:10.1145/502090.502096. S2CID   888513.
  16. Schrijver, A. (2000). "A combinatorial algorithm minimizing submodular functions in strongly polynomial time". J. Combin. Theory Ser. B. 80 (2): 346–355. doi: 10.1006/jctb.2000.1989 .
  17. Z. Svitkina and L. Fleischer, Submodular approximation: Sampling-based algorithms and lower bounds, SIAM Journal on Computing (2011).
  18. R. Iyer, S. Jegelka and J. Bilmes, Fast Semidifferential based submodular function optimization, Proc. ICML (2013).
  19. U. Feige, V. Mirrokni and J. Vondrák, Maximizing non-monotone submodular functions, Proc. of 48th FOCS (2007), pp. 461–471.
  20. N. Buchbinder, M. Feldman, J. Naor and R. Schwartz, A tight linear time (1/2)-approximation for unconstrained submodular maximization, Proc. of 53rd FOCS (2012), pp. 649-658.
  21. Nemhauser, George; Wolsey, L. A.; Fisher, M. L. (1978). "An analysis of approximations for maximizing submodular set functions I". Mathematical Programming. 14 (14): 265–294. doi:10.1007/BF01588971. S2CID   206800425.
  22. Williamson, David P. "Bridging Continuous and Discrete Optimization: Lecture 23" (PDF).
  23. G. Calinescu, C. Chekuri, M. Pál and J. Vondrák, Maximizing a submodular set function subject to a matroid constraint, SIAM J. Comp. 40:6 (2011), 1740-1766.
  24. M. Feldman, J. Naor and R. Schwartz, A unified continuous greedy algorithm for submodular maximization, Proc. of 52nd FOCS (2011).
  25. Y. Filmus, J. Ward, A tight combinatorial algorithm for submodular maximization subject to a matroid constraint, Proc. of 53rd FOCS (2012), pp. 659-668.
  26. M. Narasimhan and J. Bilmes, A submodular-supermodular procedure with applications to discriminative structure learning, In Proc. UAI (2005).
  27. R. Iyer and J. Bilmes, Algorithms for Approximate Minimization of the Difference between Submodular Functions, In Proc. UAI (2012).
  28. R. Iyer and J. Bilmes, Submodular Optimization Subject to Submodular Cover and Submodular Knapsack Constraints, In Advances of NIPS (2013).
  29. J. Bilmes, Submodularity in Machine Learning Applications, Tutorial at AAAI-2015.
