Smith set

Last updated

The Smith set, also known as the top cycle, is a concept from the theory of electoral systems that generalizes the Condorcet winner to cases where no such winner exists, by allowing cycles of candidates to be treated jointly as if they were a single Condorcet winner. Named after John H. Smith, the Smith set is the smallest non-empty set of candidates in a particular election, such that each member defeats every candidate outside the set in a pairwise election. The Smith set provides one standard of optimal choice for an election outcome. Voting systems that always elect a candidate from the Smith set pass the Smith criterion.

Contents

A set of candidates each of whose members pairwise defeats every candidate outside the set is known as a dominating set; thus, the Smith set is also called the smallest dominating set.

An alternative generalization of the Condorcet criterion (which is stricter than the Smith criterion) is given by the Landau set.

Properties of Smith sets

Properties of dominating sets

Theorem: Dominating sets are nested; that is, of any two dominating sets in an election, one is a subset of the other.

Proof: Suppose on the contrary that there exist two dominating sets, D and E, neither of which is a subset of the other. Then there must exist candidates dD,eE such that dE and eD. But by hypothesis d defeats every candidate not in D (including e) while e defeats every candidate not in E (including d), which is a contradiction. ∎

Corollary: It follows that the Smith set is the smallest non-empty dominating set, and that it is well defined.

Theorem: If D is a dominating set, then there is some threshold θD such that the elements of D are precisely the candidates whose Copeland scores are at least θD. (A candidate's Copeland score is the number of other candidates whom he or she defeats plus half the number of other candidates with whom he or she is tied.)

Proof: Choose d as an element of D with minimum Copeland score, and identify this score with θD. Now suppose that some candidate eD has a Copeland score not less than θD. Then since d belongs to D and e doesn't, it follows that d defeats e; and in order for e's Copeland score to be at least equal to d's, there must be some third candidate f against whom e gets a better score than does d. If fD, then we have an element of D who does not defeat e, and if fD then we have a candidate outside of D whom d does not defeat, leading to a contradiction either way. ∎

Schwartz set comparison

The Schwartz set is closely related to and is always a subset of the Smith set. The Smith set is larger if and only if a candidate in the Schwartz set has a pair-wise tie with a candidate that is not in the Schwartz set.

The Smith set can be constructed from the Schwartz set by repeatedly adding two types of candidates until no more such candidates exist outside the set:

Note that candidates of the second type can only exist after candidates of the first type have been added.

The Smith criterion

The Smith criterion is satisfied by any voting method where the winner always belongs to the Smith set. Any method which satisfies the Smith criterion must also satisfy the Condorcet criterion; hence any method (such as IRV) which is not Condorcet-consistent must also fail the Smith criterion. Minimax is the most well-known Condorcet method that fails the Smith criterion.

Elimination ties

The IRV component of both Smith//IRV and Tideman's Alternative will occasionally encounter a tie for bottom place amongst first preferences (the probability of this becomes arbitrarily small as the number of voters increases). When such a tie arises it may be broken in various ways. For Smith//IRV, the set of all candidates with the fewest first-order votes whose votes together total less than any other candidate's can be eliminated without changing the outcome; Tideman's Alternative recalculates the Smith set after each single elimination, and cannot be optimized in this manner.

Instant-runoff voting cannot accept ballots with two candidates at the same rank, but reducing the field to the Smith set is possible even if voters have indicated ties between candidates. Ballots with equal first-choice rankings after eliminating non-Smith candidates must then be discarded.

Computing the Smith set

The logical properties of dominating sets stated above can be used to construct an efficient algorithm for computing the Smith set. We have seen that the dominating sets are nested by Copeland score. It follows that by adjusting the Copeland threshold it is possible to work through the nested sets in increasing order of size until a dominating set is reached; and this set is necessarily the Smith set. Darlington sketches this method. [2]

Testing whether a set is a dominating set at each stage might repeat some calculations, but this can fairly easily be avoided leading to an algorithm whose work factor is quadratic in the number of candidates.

Detailed algorithm

The algorithm can be presented in detail through an example. Suppose that the results matrix is as follows:

2nd
1st
ABCDEFGscore
A1111105
B0001001
C01011/2131/2
D0111115
E0000000
F011/201021/2
G1100114

Here an entry in the main table is 1 if the first candidate was preferred to the second by more voters than preferred the second to the first; 0 if the opposite relation holds; and 1/2 if there is a tie. The final column gives the Copeland score of the first candidate.

The algorithm to compute the Smith set is agglomerative: it starts with the Copeland set, which is guaranteed to be a subset of it but will often be smaller, and adds items until no more are needed. The first step is to sort the candidates according to score:

2nd
1st
ADGCFBEscore
A1011115
D0111115
G1001114
C0011/21131/2
F0001/21121/2
B0000011
E0000000

We look at the highest score (5) and consider the candidates (Copeland winners) whose score is at least this high, i.e. {A,D}. These certainly belong to the Smith set, and any candidates whom they do not defeat will need to be added. To find undefeated candidates we look at the cells in the table below the top-left 2×2 square containing {A,D} (this square is shown with a broken border): the cells in question are shaded yellow in the table. We need to find the lowest (positionally) non-zero entry among these cells, which is the cell in the G row. All candidates as far down as this row, and any lower rows with the same score, need to be added to the set, which expands to {A,D,G}.

Now we look at any new cells which need to be considered, which are those below the top-left square containing {A,D,G}, but excluding those in the first two columns which we have already accounted for. The cells which need attention are shaded pale blue. As before we locate the positionally lowest non-zero entry among the new cells, adding all rows down to it, and all rows with the same score as it, to the expanded set, which now comprises {A,D,G,C}.

We repeat the operation for the new cells below the four members which are known to belong to the Smith set. These are shaded pink, and allow us to find any candidates not defeated by any of {A,D,G,C}. Again there is just one, F, whom we add to the set.

The cells which come into consideration are shaded pale green, and since all their entries are zero we do not need to add any new candidates to the set, which is therefore fixed as {A,D,G,C,F}. And by noticing that all the entries in the black box are zero, we have confirmation that all the candidates above it defeat all the candidates within it.

The following C function illustrates the algorithm by returning the cardinality of the Smith set for a given doubled results matrix r and array s of doubled Copeland scores. There are n candidates; ri j is 2 if more voters prefer i to j than prefer j to i, 1 if the numbers are equal, and 0 if more voters prefer j to i than prefer i to j ; si is the sum over j of the ri j . The candidates are assumed to be sorted in decreasing order of Copeland score.

intsmithset(int**r,int*s,intn){introw,col,lhs,rhs;for(rhs=1,lhs=0;lhs<rhs;lhs=rhs,rhs=row+1){for(;rhs<n&&s[rhs]==s[rhs-1];rhs++);/* this line optional */for(col=rhs,row=n;col==rhs&&row>=rhs;row--)for(col=lhs;col<rhs&&r[row-1][col]==0;col++);}returnlhs;}

See also

Related Research Articles

In social choice theory and politics, the spoiler effect refers to a situation where the entry of a losing candidate affects the results of an election. Spoiler effects can happen in one of two ways: that sincere voters who are faced with multiple ways to express their opinion change how they do so as a new candidate enters the race, or that the voting system itself fails the independence of irrelevant alternatives criterion, making the winner change due to additional information provided to the algorithm itself.

<span class="mw-page-title-main">Condorcet method</span> Pairwise-comparison electoral system

A Condorcet method is an election method that elects the candidate who wins a majority of the vote in every head-to-head election against each of the other candidates, whenever there is such a candidate. A candidate with this property, the pairwise champion or beats-all winner, is formally called the Condorcet winner. The head-to-head elections need not be done separately; a voter's choice within any given pair can be determined from the ranking.

<span class="mw-page-title-main">Copeland's method</span> Single-winner ranked vote system

Copeland's method, also called Llull's method or round-robin voting, is a ranked-choice voting system based on scoring pairwise wins and losses.

In voting systems, the Schwartz set is the union of all Schwartz set components. A Schwartz set component is any non-empty set S of candidates such that

  1. Every candidate inside the set S is pairwise unbeaten by every candidate outside S; and
  2. No non-empty proper subset of S fulfills the first property.

Ranked pairs, sometimes called the Tideman method, is a tournament-style system of ranked-choice voting first proposed by Nicolaus Tideman in 1987.

The Schulze method is an electoral system developed in 1997 by Markus Schulze that selects a single winner using votes that express preferences. The method can also be used to create a sorted list of winners. The Schulze method is also known as Schwartz Sequential dropping (SSD), cloneproof Schwartz sequential dropping (CSSD), the beatpath method, beatpath winner, path voting, and path winner. The Schulze method is a Condorcet method, which means that if there is a candidate who is preferred by a majority over every other candidate in pairwise comparisons, then this candidate will be the winner when the Schulze method is applied.

In an election, a candidate is called a Condorcet, beats-all, or majority-rule winner if more than half of voters would support them in any one-on-one matchup with another candidate. Such a candidate is also called an undefeated, or tournament champion, by analogy with round-robin tournaments. Voting systems where a majority-rule winner will always win the election are said to satisfy the Condorcetcriterion. Condorcet voting methods extend majority rule to elections with more than one candidate.

The Smith criterion is a voting system criterion that formalizes the concept of a majority rule. A voting system satisfies the Smith criterion if it always elects a candidate from the Smith set, which generalizes the idea of a "Condorcet winner" to cases where there may be cycles or ties, by allowing for several who together can be thought of as being "Condorcet winners." A Smith method will always elect a candidate from the Smith set.

The participation criterion, also called vote or population monotonicity, is a voting system criterion that says that a candidate should never lose an election because they have "too much support." It says that adding voters who support A over B should not cause A to lose the election to B.

A voting system satisfies join-consistency if combining two sets of votes, both electing A over B, always results in a combined electorate that ranks A over B. It is a stronger form of the participation criterion, which only requires join-consistency when one of the sets of votes unanimously prefers A over B.

In single-winner voting system theory, the Condorcet loser criterion (CLC) is a measure for differentiating voting systems. It implies the majority loser criterion but does not imply the Condorcet winner criterion.

In voting systems, the Minimax Condorcet method is a single-winner ranked-choice voting method that always elects the majority (Condorcet) winner. Minimax compares all candidates against each other in a round-robin tournament, then ranks candidates by their worst election result. The candidate with the largest (maximum) margin of victory in their worst (minimum) matchup is declared the winner.

Reversal symmetry is a voting system criterion which requires that if candidate A is the unique winner, and each voter's individual preferences are inverted, then A must not be elected. Methods that satisfy reversal symmetry include Borda count, ranked pairs, Kemeny–Young method, and Schulze method. Methods that fail include Bucklin voting, instant-runoff voting and Condorcet methods that fail the Condorcet loser criterion such as Minimax.

The Kemeny–Young method is an electoral system that uses ranked ballots and pairwise comparison counts to identify the most popular choices in an election. It is a Condorcet method because if there is a Condorcet winner, it will always be ranked as the most popular choice.

In voting systems theory, the independence of clones criterion measures an election method's robustness to strategic nomination. Nicolaus Tideman was the first to formulate this criterion, which states that the winner must not change due to the addition of a non-winning candidate who is similar to a candidate already present. It is a relative criterion: it states how changing an election should or shouldn't affect the outcome.

<span class="mw-page-title-main">Ranked voting</span> Family of electoral systems

The term ranked voting, also known as preferential voting or ranked-choice voting, pertains to any voting system where voters indicate a rank to order candidates or options—in a sequence from first, second, third, and onwards—on their ballots. Ranked voting systems vary based on the ballot marking process, how preferences are tabulated and counted, the number of seats available for election, and whether voters are allowed to rank candidates equally.

The later-no-help criterion is a voting system criterion formulated by Douglas Woodall. The criterion is satisfied if, in any election, a voter giving an additional ranking or positive rating to a less-preferred candidate can not cause a more-preferred candidate to win. Voting systems that fail the later-no-help criterion are vulnerable to the tactical voting strategy called mischief voting, which can deny victory to a sincere Condorcet winner.

A major branch of social choice theory is devoted to the comparison of electoral systems, otherwise known as social choice functions. Viewed from the perspective of political science, electoral systems are rules for conducting elections and determining winners from the ballots cast. From the perspective of economics, mathematics, and philosophy, a social choice function is a mathematical function that determines how a society should make choices, given a collection of individual preferences.

Tideman's Alternative Method, also called Alternative Smith or Alternative Schwartz, is an electoral system developed by Nicolaus Tideman which selects a single winner using votes that express preferences.

Round-robin voting refers to a set of ranked voting systems that elect winners by comparing all candidates in a round-robin tournament. Every candidate is matched up against every other candidate, where their point total is equal to the number of votes they receive; the method then selects a winner based on the results of these paired matchups.

References

  1. http://dss.in.tum.de/files/brandt-research/dodgson.pdf [ bare URL PDF ]
  2. R. B. Darlington, 'Are Condorcet and minimax voting systems the best?' (v8, 2021).

Further reading