Smith set

Last updated February 15, 2025

The Smithset,^{[note 1]} sometimes called the top-cycle or Condorcet winning set,^[1] generalizes the idea of a Condorcet winner to cases where no such winner exists. It does so by allowing cycles of candidates to be treated jointly, as if they were a single Condorcet winner.^[2] Voting systems that always elect a candidate from the Smith set pass the Smith criterion. The Smith set and Smith criterion are both named for mathematician John H Smith.

Definition

The Smith set is formally defined as the smallest set such that every candidate inside the set S is pairwise unbeaten by every candidate outside S.

Alternatively, it can be defined as the set of all candidates with a (non-strict) beatpath to any candidate who defeats them.

A set of candidates each of whose members pairwise defeats every candidate outside the set is known as a dominating set. Thus the Smith set is also called the smallest dominating set.

Strict top-cycle (Schwartz set)

The Schwartz set is equivalent to the Smith set, except it ignores tied votes. Formally, the Schwartz set is the set such that any candidate inside the set has a strict beatpath to any candidate who defeats them.

The Smith set can be constructed from the Schwartz set by repeatedly adding two types of candidates until no more such candidates exist outside the set:

candidates that are pairwise-tied with candidates in the set,
candidates that defeat a candidate in the set.

Note that candidates of the second type can only exist after candidates of the first type have been added.

Properties

The Smith set always exists and is non-empty. It is also well-defined (see next section).
The Smith set can have more than one candidate, either because of pairwise ties or because of cycles, such as in Condorcet's paradox.
The Condorcet winner, if one exists, is the sole member of the Smith set. If weak Condorcet winners exist they are in the Smith set.
The Smith set is always a subset of the smallest mutual majority-preferred set of candidates.^[3]

Properties of dominating sets

Theorem: Dominating sets are nested; that is, of any two dominating sets in an election, one is a subset of the other.

Proof: Suppose on the contrary that there exist two dominating sets, D and E, neither of which is a subset of the other. Then there must exist candidates d ∈ D,e ∈ E such that d ∉ E and e ∉ D. But by hypothesis d defeats every candidate not in D (including e) while e defeats every candidate not in E (including d), a contradiction. ∎

Corollary: It follows that the Smith set is the smallest non-empty dominating set, and that it is well defined.

Theorem: If D is a dominating set, then there is some threshold θ_D such that the elements of D are precisely the candidates whose Copeland scores are at least θ_D. (A candidate's Copeland score is the number of other candidates whom he or she defeats plus half the number of other candidates with whom he or she is tied.)

Proof: Choose d as an element of D with minimum Copeland score, and identify this score with θ_D. Now suppose that some candidate e ∉ D has a Copeland score not less than θ_D. Then since d belongs to D and e doesn't, it follows that d defeats e; and in order for e's Copeland score to be at least equal to d's, there must be some third candidate f against whom e gets a better score than does d. If f ∈ D, then we have an element of D who does not defeat e, and if f ∉ D then we have a candidate outside of D whom d does not defeat, leading to a contradiction either way. ∎

The Smith criterion

The Smith criterion is a voting system criterion that formalizes a stronger idea of majority rule than the Condorcet criterion. A voting system satisfies the Smith criterion if it always picks a candidate from the Smith set.

Though less common, the term Smith-efficient has also been used for methods that elect from the Smith set.^[4]

Here is an example of an electorate in which there is no Condorcet winner: There are four candidates: A, B, C and D. 40% of the voters rank D>A>B>C. 35% of the voters rank B>C>A>D. 25% of the voters rank C>A>B>D. The Smith set is {A,B,C}. All three candidates in the Smith set are majority-preferred over D (since 60% rank each of them over D). The Smith set is not {A,B,C,D} because the definition calls for the smallest subset that meets the other conditions. The Smith set is not {B,C} because B is not majority-preferred over A; 65% rank A over B. (Etc.)

con pro	A	B	C	D
A	—	65	40	60
B	35	—	75	60
C	60	25	—	60
D	40	40	40	—
max opp	60	65	75	60
minimax	60			60

In this example, under minimax, A and D tie; under Smith//Minimax, A wins.

In the example above, the three candidates in the Smith set are in a "rock/paper/scissors" majority cycle: A is ranked over B by a 65% majority, B is ranked over C by a 75% majority, and C is ranked over A by a 60% majority.

Other criteria

Any election method that complies with the Smith criterion also complies with the Condorcet winner criterion, since if there is a Condorcet winner, then it is the only candidate in the Smith set. Smith methods also comply with the Condorcet loser criterion, because a Condorcet loser will never fall in the Smith set. It also implies the mutual majority criterion, since the Smith set is a subset of the MMC set.^[3] Conversely, any method that fails any of those three majoritarian criteria (Mutual majority, Condorcet loser or Condorcet winner) will also fail the Smith criterion.

Complying methods

The Smith criterion is satisfied by Ranked Pairs, Schulze's method, Nanson's method, and several other methods. Moreover, any voting method can be modified to satisfy the Smith criterion, by finding the Smith set and eliminating any candidates outside of it.

For example, the voting method Smith//Minimax applies Minimax to the candidates in the Smith set. Another example is the Tideman alternative method, which alternates between eliminating candidates outside of the Smith set, and eliminating the candidate who was the plurality loser (similar to instant-runoff), until a Condorcet winner is found. A different approach is to elect the member of the Smith set that is highest in the voting method's order of finish.

Methods failing the Condorcet criterion also fail the Smith criterion. However, some Condorcet methods (such as Minimax) can fail the Smith criterion.

Relation to other tournament sets

The Smith set contains the Copeland set and Landau set as subsets.

It also contains the Banks set and the Bipartisan set. A number of other subsets of the Smith set have been defined as well.^[5]

Computing the Smith set

The Smith set can be calculated with the Floyd–Warshall algorithm in time Θ (n³) or Kosaraju's algorithm in time Θ(n²).

Detailed algorithm

The algorithm can be presented in detail through an example. Suppose that the results matrix is as follows:

2nd 1st	A	B	C	D	E	F	G	score
A	–	1	1	1	1	1	0	5
B	0	–	0	0	1	0	0	1
C	0	1	–	0	1	⁠1/2⁠	1	3⁠1/2⁠
D	0	1	1	–	1	1	1	5
E	0	0	0	0	–	0	0	0
F	0	1	⁠1/2⁠	0	1	–	0	2⁠1/2⁠
G	1	1	0	0	1	1	–	4

Here an entry in the main table is 1 if the first candidate was preferred to the second by more voters than preferred the second to the first; 0 if the opposite relation holds; and ⁠1/2⁠ if there is a tie. The final column gives the Copeland score of the first candidate.

The algorithm to compute the Smith set is agglomerative: it starts with the Copeland set, which is guaranteed to be a subset of it but will often be smaller, and adds items until no more are needed. The first step is to sort the candidates according to score:

2nd 1st	A	D	G	C	F	B	E	score
A	–	1	0	1	1	1	1	5
D	0	–	1	1	1	1	1	5
G	1	0	–	0	1	1	1	4
C	0	0	1	–	⁠1/2⁠	1	1	3⁠1/2⁠
F	0	0	0	⁠1/2⁠	–	1	1	2⁠1/2⁠
B	0	0	0	0	0	–	1	1
E	0	0	0	0	0	0	–	0

We look at the highest score (5) and consider the candidates (Copeland winners) whose score is at least this high, i.e. {A,D}. These certainly belong to the Smith set, and any candidates whom they do not defeat will need to be added. To find undefeated candidates we look at the cells in the table below the top-left 2×2 square containing {A,D} (this square is shown with a broken border): the cells in question are shaded yellow in the table. We need to find the lowest (positionally) non-zero entry among these cells, which is the cell in the G row. All candidates as far down as this row, and any lower rows with the same score, need to be added to the set, which expands to {A,D,G}.

Now we look at any new cells which need to be considered, which are those below the top-left square containing {A,D,G}, but excluding those in the first two columns which we have already accounted for. The cells which need attention are shaded pale blue. As before we locate the positionally lowest non-zero entry among the new cells, adding all rows down to it, and all rows with the same score as it, to the expanded set, which now comprises {A,D,G,C}.

We repeat the operation for the new cells below the four members which are known to belong to the Smith set. These are shaded pink, and allow us to find any candidates not defeated by any of {A,D,G,C}. Again there is just one, F, whom we add to the set.

The cells which come into consideration are shaded pale green, and since all their entries are zero we do not need to add any new candidates to the set, which is therefore fixed as {A,D,G,C,F}. And by noticing that all the entries in the black box are zero, we have confirmation that all the candidates above it defeat all the candidates within it.

The following C function illustrates the algorithm by returning the cardinality of the Smith set for a given doubled results matrix r and array s of doubled Copeland scores. There are n candidates; r_i j is 2 if more voters prefer i to j than prefer j to i, 1 if the numbers are equal, and 0 if more voters prefer j to i than prefer i to j ; s_i is the sum over j of the r_i j. The candidates are assumed to be sorted in decreasing order of Copeland score.

intsmithset(int**r,int*s,intn){introw,col,lhs,rhs;for(rhs=1,lhs=0;lhs<rhs;lhs=rhs,rhs=row+1){for(;rhs<n&&s[rhs]==s[rhs-1];rhs++);/* this line optional */for(col=rhs,row=n;col==rhs&&row>=rhs;row--)for(col=lhs;col<rhs&&r[row-1][col]==0;col++);}returnlhs;}

Notes

↑ Many authors reserve the term "Schwartz set" for the strict Smith set described below.

References

↑ Elkind, Edith; Lang, Jérôme; Saffidine, Abdallah (2015-03-01). "Condorcet winning sets". Social Choice and Welfare. 44 (3): 493–517. doi:10.1007/s00355-014-0853-4. ISSN 1432-217X.
↑ Soh, Leen-Kiat (2017-10-04). "Voting: Preference Aggregating & Social Choice [CSCE475/875 class handout]" (PDF).
1 2 Brandt, Felix (2009-07-17). "Some Remarks on Dodgson's Voting Rule". Mathematical Logic Quarterly. 55 (4). Wiley: 460–463. doi:10.1002/malq.200810017. ISSN 0942-5616.
↑ Boudet, Samuel (2023-09-06), Bipartisan/Range Voting in Two Rounds Reaches a Promising Balance between Efficiency and Strategy-Resistance, MDPI AG, doi: 10.20944/preprints202309.0388.v1
↑ Brandt, Felix; Brill, Markus; Harrenstein, Paul (2018-01-16). "Extending tournament solutions" (PDF). Social Choice and Welfare. 51 (2). Springer Science and Business Media LLC. doi:10.1007/s00355-018-1112-x. ISSN 0176-1714. For many tournament solutions, generalizations or extensions to weak tournaments have been proposed in the literature