This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\hideLIPIcs

Département d’informatique et de recherche opérationnelle, Université de Montréal, Montréal, Québec, H3T 1J4, [email protected]://orcid.org/0000-0002-4347-8931 Département d’informatique et de recherche opérationnelle, Université de Montréal, Montréal, Québec, H3T 1J4, [email protected] https://orcid.org/ 0000-0002-8941-2284Supported by NSERC through an Individual Discovery Grant RGPIN-2016-04576 \CopyrightX. K. Phung and S. Hamel \ccsdesc[100]Theory of computation \rightarrow Theory and algorithms for application domains; Applied computing \rightarrow Law, social and behavioral sciences; Mathematics of computing \rightarrow discrete mathematics \ccsdesc[100]Theory of computation \rightarrow Theory and algorithms for application domains; Applied computing \rightarrow Law, social and behavioral sciences; Mathematics of computing \rightarrow discrete mathematics

Space reduction techniques for the 33-wise Kemeny problem

Xuan Kien Phung    Sylvie Hamel
Abstract

Kemeny’s rule is one of the most studied and well-known voting schemes with various important applications in computational social choice and biology [3, 4, 6, 8]. Recently, Kemeny’s rule was generalized via a set-wise approach by Gilbert et. al. in [13, 14]. This paradigm presents interesting advantages in comparison with Kemeny’s rule since not only pairwise comparisons but also the discordance between the winners of subsets of three alternatives are also taken into account in the definition of the 33-wise Kendall-tau distance between two rankings. In spite of the NP-hardness of the 3-wise Kemeny problem which consists of computing the set of 33-wise consensus rankings, namely rankings whose total 33-wise Kendall-tau distance to a given voting profile is minimized, we establish in this paper several generalizations of the Major Order Theorems, as obtained by Milosz and Hamel [19] for Kemeny’s rule, for the 33-wise Kemeny voting schemes to achieve a substantial search space reduction by efficiently determining in polynomial time the relative orders of pairs of alternatives. Essentially, our theorems quantify precisely the nontrivial property that if the preference for an alternative over another one in an election is strong enough, not only in the head-to-head competition but even when taking into account one or two more alternatives, then the relative order of these two alternatives in all 33-wise consensus rankings must be as expected. As an application, we also obtain an improvement of the Major Order Theorems for Kememy’s rule. Moreover, we show that the well-known 3/43/4-majority rule of Betzler et al. [6] for Kemeny’s rule is only valid in general for elections with no more than 55 alternatives with respect to the 33-wise Kemeny scheme. Several simulations and tests of our algorithms on real-world and uniform data are provided.

keywords:
Kemeny problem, Kendall-tau distance, Kemeny rule, median permutation, computational social theory
category:
\relatedversion
keywords:
Kemeny problem, Kendall-tau distance, Kemeny rule, Median permutation, Computational social choice, Major Order Theorems, Consensus ranking, Generalized Kemeny rule
category:
\relatedversion

1 Introduction

In this article, an election is a finite collection C={c1,,cn}C=\{c_{1},\dots,c_{n}\} of alternatives together with a voting profile consisting of a finite number of (not necessarily distinct) votes. A ranking or a vote is simply a complete and strict total ordering π:cπ(1)>cπ(2)>>cπ(n)\pi\colon c_{\pi(1)}>c_{\pi(2)}>\dots>c_{\pi(n)} which we identify with a permutation of {1,2,,n}\{1,2,\dots,n\} also denoted by π\pi. Here, x>yx>y means that the alternative xx is ranked before the alternative yy. The space of all rankings, for fixed nn, can be equipped with several natural distances, for example, the Kendall-tau distance which counts the number of order disagreements between pairs of elements in two permutations, namely, the bubble-sort distance between two permutations, or more generally the kk-wise Kendall-tau distance [14] (see Equation (2.1)). The important Kemeny problem (cf. [15], [16], [26]) consists of finding the set of kk-wise medians of a given election, i.e., permutations whose total distance to the voting profile is minimized with respect to the kk-wise Kendall-tau distance. In other words, a median is a ranking that agrees the most with the voting profile.

By taking into consideration not only pairwise comparisons but also the discordance between the winners of subsets of three alternatives, the 33-wise Kemeny voting scheme seems to be more resistant to coalitional manipulation than the classical 22-wise Kemeny rule: it is much more difficult for an alternative to win an election or even to simply win another specific alternative in an election under the 33-wise Kemeny voting scheme. In fact, most of the best-known space reduction results for Kemeny’s rule fail in the 33-wise setting (see [21, Table 1]), including the powerful Major Order Theorems discovered in [19] (see Example 5.1) and the Condorcet criterion. It was shown in [21] that even the 2/32/3 majority in every duel is not enough to guarantee that an alternative will win an election according to the 33-wise Kemeny voting scheme. This phenomenon is in stark contrast to the Condorcet criterion where a Condorcet winner, namely an alternative which is preferred by more voters than any others, must be the unique winner of the election. Nevertheless, we know that an alternative obtaining a 3/43/4 majority in every duel must be the unique winner in the 33-wise Kemeny voting scheme [21].

In many situations, the 33-wise Kemeny rule is also more suitable than Kemeny’s rule since it puts more weight on alternatives which are more frequently ranked in top positions in the votes. Indeed, Kemeny’s rule puts equal weight on the head-to-head competition of two alternatives x,yx,y when in a particular vote, xx is the winner followed by yy and when in another vote, xx and yy occupy the last two position in that order. However, it is reasonable to assume that typical voters only pay attention to a shortlist of their favorite alternatives and put a rather random order for the rest of the alternatives. Such undesirable behavior creates noises that can alter the Kemeny median ranking while the problem can be solved effectively using the 33-wise Kemeny voting scheme in specific instances (see Example 5.1 and Appendix B). The above limitation of the Kemeny rule leads to the notion of weighted Kendall tau distances introduced by Kumar and Vassilvitskii [17] as well as the notion of set-wise Kemeny distance of Gilbert et. al. [14].

Motivated by the above potential and interesting features of the 33-wise Kemeny rule as well as the NP-hardness of the various Kemeny problems (see [arrow], [12], [14]), our main goal is to formulate new quantitative results concerning the majority rules in 33-wise Kemeny voting schemes associated with the 33-wise Kendall-tau distance introduced recently in [14], which provide much more refined space reductions to the Kemeny problem in comparison to existing techniques in the literature. Our first result (cf. Theorem 3.1) shows that the fundamental 3/43/4-majority rule of Betzler et al. [6] for the classical Kemeny rule is also applicable for all elections with no more than 55 alternatives with respect to the 33-wise Kemeny scheme. However, the 3/43/4-majority rule fails in general as soon as there are at least 66 alternatives. Note that without restriction on the number of alternatives, the 5/65/6-majority rule obtained in [21] serves as the 33-wise counterpart of the 3/43/4-majority rule.

The second and central result of the paper is the Major Order Theorem for the 33-wise voting scheme, denoted 3MOT (Theorem 5.4), and its improved version, denoted Iterated 3MOT (Theorem 6.2), which, to the limit of our knowledge, are the most efficient space reduction techniques for the 33-wise Kemeny rule in the literature. In essence, our Major Order Theorems show that if the preference for an alternative xx over another alternative yy in an election is strong enough, as measured quantitatively not only in the head-to-head competition but also when taking into account the interactions with one or two more other alternatives, then xx must be ranked before yy in all 33-wise medians of the election. The corresponding algorithms not only function efficiently in polynomial time O(n4m)O(n^{4}m), where nn is the number of alternatives and mm is size of the voting profile, but also drastically reduce the search space of 33-wise medians. To get an idea on the efficiency and interests of our results, let 0p<10\leq p<1 be the proportion of pairs of alternatives solved by 3MOT or Iterated 3MOT out of the total of n(n1)/2n(n-1)/2 pairs. Then the original search space consisting of all n!n! possible rankings would be reduced by a factor (reduction rate) at least equal to (cf. [linear-extension-2018, Lemma 2, Lemma 4], see also Table 2)

max(n!(1+0.5n(1p))n,ep2n/32)Ω(c(p)n), where c(p)=max(2e(1p),ep2/32).\max\left(n!\left(1+0.5n(1-p)\right)^{-n},\,e^{p^{2}n/32}\right)\in\,\Omega(c(p)^{n}),\quad\text{ where }\,c(p)=\max\left(\frac{2}{e(1-p)},\,e^{p^{2}/32}\right). (1.1)

On real-world data, especially for political elections and competitions where there exists a high similarity between the votes, our algorithms prove to be particularly useful as pp usually ranges from 0.60.6 to 0.90.9 after only a few iterations of Iterated 3MOT (see Table 2 and also Appendix C.1 for some concrete examples). The performance is also very encouraging even on the hard case of uniformly generated data where, for example when m=15m=15, the 33-wise Major Order Theorems can determine the relative rankings of approximately 47%47\% pairs of alternatives on average when n=10n=10 and approximately 31%31\% when n=15n=15.

Our results not only extend and improve the important 22-wise Major Order Theorem of [19] (see Section 2.4 and Theorem 7.1) but also provide a unified approach and technique which should pave the way for the research of more refined algorithms and quantitative properties of kk-wise Kemeny voting schemes for k2k\geq 2. It is worth comparing our method to the space reduction method based on a 33-wise majority digraph introduced in [14, Theorem 3] whose vertices are the alternatives. While we can obtain, under some assumptions on the 33-wise digraph of an election, a set of rankings which contains some 33-wise median using [14, Theorem 3], our 33-wise Major Order Theorems provide, for all pairs of alternatives (x,y)(x,y), easy-to-compute and mild sufficient conditions so that x>yx>y in all 33-wise medians. In fact, by relaxing the conditions in our Major Order Theorems, we can determine more relative orderings of a pair of alternatives in some 3-wise median (see Theorem 8.1). Another major difference is that [14, Theorem 3] only considers the strength of the preference for xx over yy in the presence of one other alternative zx,yz\neq x,y while the 33-wise Major Order Theorems quantify the preference for xx over yy not only in the head-to-head competition but even when taking into account one or two more alternatives, which should provide a more refined space reduction method (see Example 6.4). In any case, the constraints on all 33-wise medians found by our 3-wise Major Order Theorems should greatly accelerate other complementary space reduction methods and vice versa, nontrivial constraints obtained by other methods can serve as the inputs to boost our algorithms, especially Iterated 3MOT. Table 1 below summarizes our results and some well-known space reduction criteria for the classical and 33-wise Kemeny voting schemes.

Table 1: Some efficient space reduction techniques for set-wise Kemeny voting schemes
Criterion Kemeny rule 33-wise Kemeny rule
Extended Always theorem (Pareto efficiency, unanimity) Phung & Hamel [21, Theorem 3] Phung & Hamel [21] (Theorem C.4)
3/43/4-majority rule (Section 2.3) Betzler et al. [6] Valid only for elections of 55 alternatives or less, Theorem 3.1
Extended ss-majority rule Phung & Hamel [21, Section 4] Valid if s5/6s\geq 5/6[21, Section 8]
Major Order Theorems Milosz & Hamel [19] (Section 2.4) Improved Iterated MOT (Theorem 7.1) Not valid, see Example 5.1
33-wise Major Order Theorems Theorems 5.4, 6.2, 8.1.

To illustrate the utility of our obtained algorithms, we performed several simulations and tests on real-world and uniform data. Table 3 and Table 4 in Appendix C record the approximate average proportion of pairs with relative order solved by the 33-wise Extended Always Theorem obtained in [21] (see Theorem C.4) and the 3-wise Major Order Theorems over 100 000 uniformly generated instances. Several concrete examples with real-world data (elections with a dozen and up to 250 alternatives) taken from PREFLIB [18] are described in Appendix C.1 (see also Table 2). A direct implementation shows that for voting profiles with n,m40n,m\leq 40, the list of all pairs of alternatives with relative order in all 33-wise medians determined by 3MOT, resp. Iterated 3MOT, can be obtained in approximately less than 2.5 seconds, resp. 1 minute, with an M1 MacBook Air 16GB RAM. More optimized implementation can definitely improve the running time of the algorithms.

Finally, we explain how to apply our results and the set-wise Kemeny rule to deal with incomplete votes in Section 9 and propose the usage of the proportion of pairs whose relative order are determined by the 33-wise Major Order Theorems as a meaningful measure of the consensus level of the electorates (see Appendix A).

2 Preliminaries

We recall in this section the classical 2-wise Major Order Theorems, the 3/43/4-majority rule as well as the definition of kk-wise medians and kk-wise Kendall-tau distance.

2.1 The kk-wise Kemeny rule

Let k2k\geq 2 be an integer and let CC be a finite set of alternatives. Let S(C)S(C) be the set of all rankings of CC. Let Δk(C)2C\Delta^{k}(C)\subset 2^{C} be the collection of all subsets of CC which contain no more than kk elements. By counting the number of winner disagreements on all subsets taken from Δk(C)\Delta^{k}(C), the kk-wise Kendall-tau distance dKTk(π,σ)d^{k}_{KT}(\pi,\sigma) between two rankings π,σ\pi,\sigma of CC is defined by (cf. [14]):

dKTk(π,σ)=SΔk(C)𝟙B(π,σ)(S).d^{k}_{KT}(\pi,\sigma)=\sum_{S\in\Delta^{k}(C)}\mathbbm{1}_{B(\pi,\sigma)}(S). (2.1)

where B(π,σ)={SΔk(C):topS(π)topS(σ)}B(\pi,\sigma)=\{S\in\Delta^{k}(C)\colon\mathrm{top}_{S}(\pi)\neq\mathrm{top}_{S}(\sigma)\} and topS(π)S\mathrm{top}_{S}(\pi)\in S denotes the highest ranked element of the restriction π|S\pi|_{S} of π\pi to SS. The kk-wise Kendall-tau distance between a ranking π\pi of CC and a collection of rankings AA of CC is dKTk(π,A)=σAdKTk(π,σ)d^{k}_{KT}(\pi,A)=\sum_{\sigma\in A}d^{k}_{KT}(\pi,\sigma). Given a voting profile VV over CC, we say that a ranking π\pi^{*} of CC is a median with respect to the kk-wise Kemeny rule or a kk-wise median if dKTk(π,V)=minπS(C)dKTk(π,V)d^{k}_{KT}(\pi^{*},V)=\min_{\pi\in S(C)}d^{k}_{KT}(\pi,V). For k=2k=2, we recover the Kendall-tau distance dKT2=dKTd^{2}_{KT}=d_{KT}. It was shown in [14] that the decision variant of the kk-wise Kemeny problem is NP-complete for every k3k\geq 3.

2.2 Transitivity lemma

Definition 2.1.

Let x,yx,y be alternatives in an election with voting profile VV and let s[0,1]s\in[0,1]. When xx is ranked before yy in a vote vVv\in V, we denote x>vyx>^{v}y or simply x>yx>y when there is no possible confusion. We write xsyx\geq_{s}y, resp. x>syx>_{s}y, if x>yx>y in at least, resp. in more than, s|V|s|V| votes.

The following simple but very useful observation serves as a weak transitivity property.

Lemma 2.2.

Let x,y,zx,y,z be three distinct alternatives in an election with voting profile VV. If xsyx\geq_{s}y and xszx\geq_{s}z for some s[0,1]s\in[0,1], then in at least (2s1)|V|(2s-1)|V| votes, we have x>y,zx>y,z, i.e., x>yx>y and x>zx>z.

Proof 2.3.

Let A,BVA,B\subset V be the sets of votes in which x>yx>y and x>zx>z respectively. Since xsyx\geq_{s}y and xszx\geq_{s}z, we deduce that |A|s|V||A|\geq s|V| and |B|s|V||B|\geq s|V|. Consequently, from the set equality |AB|=|A|+|B||AB||A\cup B|=|A|+|B|-|A\cap B|, we obtain the following estimation |AB|=|A|+|B||AB|s|V|+s|V||V|=(2s1)|V||A\cap B|=|A|+|B|-|A\cup B|\geq s|V|+s|V|-|V|=(2s-1)|V|. Since in every vote vABv\in A\cap B, we have x>y,zx>y,z, the conclusion thus follows.

2.3 The 3/43/4-majority rule

Following [6], a non-dirty pair of alternatives in an election with respect to a threshold s[0,1]s\in[0,1] is a pair (x,y)(x,y) such that either xx is ranked before yy in at least s|V|s|V| votes, or yy ranked before xx in at least s|V|s|V| votes. Then we say that an alternative is non-dirty if (x,y)(x,y) is a non-dirty pair with respect to the threshold ss for every other alternative yxy\neq x. An election with a certain voting rule satisfies the ss-majority rule if for every non-dirty candidate xx with respect to the threshold ss and every other candidate yxy\neq x, the relative positions of the pair (x,y)(x,y) in every median is determined by the head-to-head majority rule. By pioneering results in [6], every election satisfies the 3/43/4-majority rule with respect to the 22-wise Kemeny voting scheme. We shall prove in Section 3 that with respect to the 33-wise Kemeny voting scheme, the 3/43/4-majority rule holds for all elections with no more than 55 alternatives but fails in general otherwise.

2.4 The classical 22-wise Major Order Theorems

Since the scattered foundational works of Condorcet [11] in 1785, of Truchon [24] in 1990, of Kemeny [15] in 1959, of Young-Levenglick [25] in 1978, and of Young [26] in 1988, only the last two decades have witnessed the most important discoveries in the NP-hardness complexity (Bachmeier et al. [arrow], Dwork et al. [12]), in heuristic and approximation algorithms (Ailon et al. [1], Ali-Meilă [2], Nishimura-Simjour [20], Schalekamp-van Zuylen [22]), in lower bounds (Cotnizer et al. [9], Davenport-Kalagnanam [10]), and in exact space reduction techniques (Betzler et al. [6], Blin et al. [7], Milosz-Hamel [19], Phung-Hamel [21]) of the Kemeny problem. Among the space reduction techniques, the Major Order Theorems obtained in [19] provide some of the most efficient known algorithms. Such theorems are based on the observation that if two alternatives are close enough in all votes they will have the tendency to be placed in their major order in any consensus ranking. To state the theorems, let x,yx,y be two alternatives of an election with voting profile VV over a set of alternatives CC. Let δxy\delta_{xy} denotes the difference between the number of votes in which x>yx>y and respectively in which x<yx<y. Let ExyE_{xy} (called the interference set) be the multiset containing all alternatives zz such that x>z>yx>z>y in some vote where the multiplicity of zExyz\in E_{xy} is the number of votes in which x>z>yx>z>y.

Theorem 2.4 (MOT, [19]).

Suppose that x>yx>y in more than half of the votes and δxy>|EyxExy|\delta_{xy}>|E_{yx}\setminus E_{xy}|. Then xx is ranked before yy in every 22-wise median of the election.

If we iterate Theorem 2.4 by eliminating from the interference set ExyE_{xy} all alternatives which have been found to rank before (or after) both xx and yy by previous ii iterations to obtain the trimmed interference set Exy(i+1)E^{(i+1)}_{xy}, we obtain an iterated version of Theorem MOT (see Corollary 7.3 for another formulation).

Theorem 2.5 (Iterated MOT, [19]).

If x>yx>y in more than half of the votes and δxy>|Eyx(i)Exy(i)|\delta_{xy}>|E^{(i)}_{yx}\setminus E^{(i)}_{xy}| for some i0i\geq 0, then xx is ranked before yy in every 22-wise median of the election.

Over uniformly distributed elections with 1515 alternatives and 1515 votes, Iterated MOT can solve on average more than 50%\% of possible relative orders of all pairs of alternatives [19].

3 The 33-wise 3/43/4-majority rule for elections with no more than 5 alternatives

In this section, we shall prove that for the 33-wise Kemeny voting scheme, the 3/43/4-majority rule holds in general only for elections of at most 55 alternatives.

Theorem 3.1.

Let n1n\geq 1 be an integer. The 3/43/4-majority rule holds for all elections of nn alternatives with respect to the 33-wise Kemeny voting scheme if and only if n5n\leq 5.

Theorem 3.1 results directly from Lemma 3.2 and Lemma 3.4 below which prove each of the two implications in Theorem 3.1.

Lemma 3.2.

For every n6n\geq 6, the 3/43/4-majority rule fails for some elections consisting of nn alternatives with respect to the 33-wise Kemeny voting scheme.

Proof 3.3.

Let n6n\geq 6 be an integer. Let WW denote the ranking w1>>wn6w_{1}>\dots>w_{n-6}. Let us consider the following voting profile VV consisting of 44 votes over nn alternatives x,y,z,t,u,v,w1,,wn6x,y,z,t,u,v,w_{1},\dots,w_{n-6}

r1:W>x>y>z>t>u>v (5 votes),\displaystyle r_{1}:W>x>y>z>t>u>v\,\,\text{ (5 votes)},\quad r2:W>u>y>z>x>t>v (5 votes),\displaystyle r_{2}:W>u>y>z>x>t>v\,\,\text{ (5 votes)},
r3:W>v>z>y>x>t>u (5 votes),\displaystyle r_{3}:W>v>z>y>x>t>u\,\,\text{ (5 votes)},\quad r4:W>t>u>x>v>y>z (4 votes),\displaystyle r_{4}:W>t>u>x>v>y>z\,\,\text{ (4 votes)},
r5:W>t>u>x>v>y>z (1 vote).\displaystyle r_{5}:W>t>u>x>v>y>z\,\,\text{ (1 vote)}.\quad

Let π\pi^{*} be a 33-wise median of the above election. Then by the 33-wise Extended Always theorem [21], the first n8n-8 positions in π\pi^{*} is given by W:w1>>wn6W\colon w_{1}>\dots>w_{n-6}. Hence, we can write π:W>σ\pi^{*}\colon W>\sigma^{*} where σ\sigma^{*} is a ranking of the alternatives x,y,z,t,u,vx,y,z,t,u,v. It is then clear from the definition of dKT3d^{3}_{KT} that π\pi^{*} is a 33-wise median if and only if σ\sigma^{*} is a 33-wise median of the following induced election VV^{\prime}:

r1:x>y>z>t>u>v (5 votes),\displaystyle r^{\prime}_{1}:x>y>z>t>u>v\,\,\text{ (5 votes)},\quad r2:u>y>z>x>t>v (5 votes),\displaystyle r^{\prime}_{2}:u>y>z>x>t>v\,\,\text{ (5 votes)},
r3:v>z>y>x>t>u (5 votes),\displaystyle r^{\prime}_{3}:v>z>y>x>t>u\,\,\text{ (5 votes)},\quad r4:t>u>x>v>y>z (4 votes),\displaystyle r^{\prime}_{4}:t>u>x>v>y>z\,\,\text{ (4 votes)},
r5:t>u>x>v>y>z (1 vote).\displaystyle r^{\prime}_{5}:t>u>x>v>y>z\,\,\text{ (1 vote)}.\quad

The only 33-wise medians of VV^{\prime} are σ1:u>x>y>z>t>v\sigma^{*}_{1}\colon u>x>y>z>t>v and σ2:u>y>z>x>t>v\sigma^{*}_{2}\colon u>y>z>x>t>v. Consequently, π=π1\pi^{*}=\pi^{*}_{1} or π=π2\pi^{*}=\pi^{*}_{2} where π1:W>u>x>y>z>t>v\pi^{*}_{1}\colon W>u>x>y>z>t>v and π2:W>u>y>z>x>t>v\pi^{*}_{2}\colon W>u>y>z>x>t>v. Therefore, the election VV admits tt as a non-dirty alternative according to the 3/43/4 threshold since t3/4ut\geq_{3/4}u, t3/4vt\geq_{3/4}v, and w3/4tw\geq_{3/4}t for all w{x,y,z}{wi:1in6}w\in\{x,y,z\}\cup\{w_{i}\colon 1\leq i\leq n-6\}. However, tt is ranked after uu in both π1\pi^{*}_{1} and π2\pi^{*}_{2}. Thus, the 3/43/4-majority rule fails for all 33-wise medians of VV.

Lemma 3.4.

The 3/43/4-majority rule holds for all elections of at most 55 alternatives with respect to the 33-wise Kemeny voting scheme.

For the proof of Lemma 3.4 (and Theorem 4.3), we shall need the following auxiliary result.

Lemma 3.5.

Let C={x,y,z,t}C=\{x,y,z,t\} be a set of 4 alternatives in an election such that z3/4xz\geq_{3/4}x, and x3/4yx\geq_{3/4}y, and x3/4tx\geq_{3/4}t. Then the contribution of the pair of subsets {x,y,t}\{x,y,t\} and {y,z,t}\{y,z,t\} to Δ=dKT3(zxyt,V)dKT3(yzxt,V)\Delta=d^{3}_{KT}(zxyt,V)-d^{3}_{KT}(yzxt,V) is at most 0.

Proof 3.6.

Let mm be the number of votes. Then by the definition of dKT3d_{KT}^{3}, the maximal contribution (divided by mm) of the pair of subsets {x,y,t}\{x,y,t\} and {y,z,t}\{y,z,t\} to Δ\Delta is the optimal objective value of following maximization problem over 2424 variables cπc_{\pi} where π\pi denotes a strict ranking of the alternatives x,y,z,tx,y,z,t:

maximize: π[y]<π[x],π[t]cπ\displaystyle\sum\limits_{\pi[y]<\pi[x],\pi[t]}c_{\pi}- π[x]<π[y],π[t]cπ+π[y]<π[z],π[t]cππ[z]<π[y],π[t]cπ\displaystyle\sum\limits_{\pi[x]<\pi[y],\pi[t]}c_{\pi}+\sum\limits_{\pi[y]<\pi[z],\pi[t]}c_{\pi}-\sum\limits_{\pi[z]<\pi[y],\pi[t]}c_{\pi}
subject to:
πcπ\displaystyle\sum\limits_{\pi}c_{\pi} =1\displaystyle=1
π[z]<π[x]cπ\displaystyle\sum\limits_{\pi[z]<\pi[x]}c_{\pi} 34\displaystyle\geq\frac{3}{4}
π[x]<π[y]cπ\displaystyle\sum\limits_{\pi[x]<\pi[y]}c_{\pi} 34\displaystyle\geq\frac{3}{4}
π[x]<π[t]cπ\displaystyle\sum\limits_{\pi[x]<\pi[t]}c_{\pi} 34\displaystyle\geq\frac{3}{4}
cπ\displaystyle c_{\pi} 0 for all π.\displaystyle\geq 0\quad\text{ for all }\pi.

Here, cπc_{\pi} represents the number of votes in VV with ranking π\pi and π[x]\pi[x], π[y]\pi[y], π[z]\pi[z], π[t]{1,2,3,4}\pi[t]\in\{1,2,3,4\} represent respectively the positions of x,y,z,tx,y,z,t in π\pi. Since the optimal objective value of the above optimization problem is 0 (a Python code is available at https://github.com/XKPhung), the proof is complete.

Generalizing the usage of linear programming in the above lemma 3.5, we obtain the following proof of Lemma 3.4 which shows that the 33-wise 3/43/4-majority rule holds for all elections with no more than 55 alternatives.

Proof 3.7 (Proof of Lemma 3.4).

Let us fix a positive integer n5n\leq 5 and a set A={a1,a2,,an}A=\{a_{1},a_{2},\dots,a_{n}\} of nn alternatives. Let SnS_{n} denotes the set of all rankings rr over AA with the convention that r[i]{1,,n}r[i]\in\{1,\dots,n\} denotes the position of the alternative aia_{i} in the ranking rr.

Then observe that the 33-wise 3/43/4-majority rule fails for some election with exactly nn alternatives if and only if there exist {romanenumerate}

a voting profile VV over AA consisting of crc_{r} votes with ranking rr for each rSnr\in S_{n};

a non-dirty alternative akAa_{k}\in A with respect to VV and the 3/43/4 threshold where, up to renumbering the alternatives, we can assume that ai3/4aka_{i}\geq_{3/4}a_{k} for all i<ki<k and ak3/4aia_{k}\geq_{3/4}a_{i} for all i>ki>k;

a "bad" ranking pSnp\in S_{n} with respect to the 3/43/4-majority rule, i.e., (ik)(p[i]p[k])<0(i-k)(p[i]-p[k])<0 for some aiAa_{i}\in A; such that pp is at least as good as a 33-wise consensus as any "good" ranking qSnq\in S_{n}, i.e., (ik)(q[i]q[k])>0(i-k)(q[i]-q[k])>0 for all iki\neq k. In other words, for every good ranking qSnq\in S_{n}, we have:

Δ(p,q)=dKT3(p,V)dKT3(q,V)0.\displaystyle\Delta(p,q)=d^{3}_{KT}(p,V)-d^{3}_{KT}(q,V)\leq 0. (3.1)

Let Pk,QkSnP_{k},Q_{k}\subset S_{n} be respectively the set of all bad rankings and the set of all good rankings. By homogeneity, we can furthermore suppose that 0cr10\leq c_{r}\leq 1 for each rSnr\in S_{n} and rSncr=1\sum_{r\in S_{n}}c_{r}=1. Consequently, the existence of the data in (i), (ii), and (iii) satisfying the relation (3.1) is equivalence to the existence of k{1,2,,n}k\in\{1,2,\dots,n\}, pPkp\in P_{k} and the satisfiability of the following system ()(\star) of linear constraints on the variables {cr:rSn}\{c_{r}\colon r\in S_{n}\}:

{rSncr=1,cr0for allrSn,rSnαi,k,rcr3/4for alli{1,2,,n},rSnβp,q,rcr0for allqQk,\begin{cases}\sum_{r\in S_{n}}c_{r}=1,\quad c_{r}\geq 0&\text{for all}\quad r\in S_{n},\\ \sum_{r\in S_{n}}\alpha_{i,k,r}\,c_{r}\geq 3/4&\text{for all}\quad i\in\{1,2,\dots,n\},\\ \sum_{r\in S_{n}}\beta_{p,q,r}\,c_{r}\leq 0&\text{for all}\quad q\in Q_{k},\end{cases}

where the constants αi,k,r,βp,q,r\alpha_{i,k,r},\beta_{p,q,r} are defined as follows {alphaenumerate}

αi,k,r=1\alpha_{i,k,r}=1 if (ik)(r[i]r[k])>0(i-k)(r[i]-r[k])>0 and αi,k,r=0\alpha_{i,k,r}=0 otherwise;

βp,q,r=dKT3(p,r)dKT3(q,r)\beta_{p,q,r}=d^{3}_{KT}(p,r)-d^{3}_{KT}(q,r) .

It is immediate that the conditions rSnαi,k,rcr3/4\sum_{r\in S_{n}}\alpha_{i,k,r}\,c_{r}\geq 3/4 for all i{1,2,,n}i\in\{1,2,\dots,n\} translate the property that aka_{k} is a non-dirty alternative satisfying (ii). Moreover, we have:

Δ(p,q)=dKT3(p,V)dKT3(q,V)\displaystyle\Delta(p,q)=d^{3}_{KT}(p,V)-d^{3}_{KT}(q,V) =rSncrdKT3(p,r)rSncrdKT3(q,r)\displaystyle=\sum_{r\in S_{n}}c_{r}\,d^{3}_{KT}(p,r)-\sum_{r\in S_{n}}c_{r}\,d^{3}_{KT}(q,r)
=rSn(dKT3(p,r)dKT3(q,r))cr\displaystyle=\sum_{r\in S_{n}}\left(d^{3}_{KT}(p,r)-d^{3}_{KT}(q,r)\right)c_{r}
=rSnβp,q,rcr.\displaystyle=\sum_{r\in S_{n}}\beta_{p,q,r}\,c_{r}.

Thus, the conditions rSnβp,q,rcr0\sum_{r\in S_{n}}\beta_{p,q,r}\,c_{r}\leq 0 for all qQkq\in Q_{k} altogether translate the property (3.1) for all qQkq\in Q_{k}. We can check by a direct implementation with linear programming (see, e.g., a python code at https://github.com/XKPhung) that the system ()(\star) does not admit a solution for any value of k{1,2,,n}k\in\{1,2,\dots,n\} and pPkp\in P_{k} as long as n5n\leq 5. Therefore, we can conclude that the 3-wise 3/43/4-majority rule holds for all elections with no more than 55 alternatives.

4 Some weak versions of the 3-wise 3/4-majority rule

For elections with 66 alternatives, we have the following weak form of the 33-wise 3/43/4-majority rule.

Theorem 4.1.

Let CC be a set of 66 alternatives. Suppose that in an election over CC, we have a partition C=A{x}BC=A\cup\{x\}\cup B where |A|2|A|\leq 2 such that y3/4xy\geq_{3/4}x for all yAy\in A and x3/4zx\geq_{3/4}z for all zBz\in B. Then the election satisfies the 33-wise 3/43/4-majority rule.

Proof 4.2.

The proof follows immediately by using a similar translation procedure to a problem in linear programming as described in detail in the above proof of Lemma 3.4.

By results in [21], the 33-wise 3/43/4-majority holds for all non-dirty alternatives which win every duel by the ratio 3/43/4. For non-dirty alternatives which lose the head-to-head competition to exactly one other alternative, we have the following useful results. We drop the symbol >> in a ranking for simplicity.

Theorem 4.3.

Let C={x,z}JC=\{x,z\}\cup J be a partition of the set of alternatives of an election VV such that z3/4xz\geq_{3/4}x and x3/4yx\geq_{3/4}y for all yJy\in J. Then the following properties hold:

  1. (a)

    For all partitions J=ABCJ=A\cup B\cup C where A,B,CA,B,C are ordered sets with BB\neq\varnothing, we have:

    dKT3(AzBxC,V)>dKT3(AzxBC,V).d^{3}_{KT}(AzBxC,V)>d^{3}_{KT}(AzxBC,V).
  2. (b)

    For all partitions J=ABJ=A\cup B where A,BA,B are ordered sets with AA\neq\varnothing, we have:

    dKT3(AzxB,V)>dKT3(zxAB,V).d^{3}_{KT}(AzxB,V)>d^{3}_{KT}(zxAB,V).
  3. (c)

    For all partitions J=ABCJ=A\cup B\cup C where A,B,CA,B,C are ordered sets with |BC|=5|B\cup C|=5, |B|2|B|\leq 2, we have:

    dKT3(AxBzC,V)>dKT3(AzxBC,V)d^{3}_{KT}(AxBzC,V)>d^{3}_{KT}(AzxBC,V)
Proof 4.4.

For (a), let us consider the rankings AzByxCAzByxC and AzBxyCAzBxyC where J=ABCJ=A\cup B\cup C is a partition with A,B,CA,B,C ordered sets. Then the only subsets that can contribute to

Δa=dKT3(AzBxyC,V)dKT3(AzByxC,V)\Delta_{a}=d^{3}_{KT}(AzBxyC,V)-d^{3}_{KT}(AzByxC,V)

are {x,y}\{x,y\} and {x,y,t}\{x,y,t\} where tCt\in C. Since x3/4yx\geq_{3/4}y and x3/4tx\geq_{3/4}t, we have x>y,tx>y,t in at least half of the votes (Lemma 2.2). It follows that the contribution of every subset {x,y,t}\{x,y,t\} to Δa\Delta_{a} is at most 0. Likewise, the contribution of the subset {x,y}\{x,y\} to Δa\Delta_{a} is at most 14|V|34|V|=12|V|<0\frac{1}{4}|V|-\frac{3}{4}|V|=-\frac{1}{2}|V|<0. Therefore, Δa<0\Delta_{a}<0 and (a) follows by an immediate induction.

For (b), let yJy\in J and let us consider two rankings AyzxBAyzxB and AzxyBAzxyB where J{y}=ABJ\setminus\{y\}=A\cup B is a partition with A,BA,B ordered sets. Then the only subsets that can contribute to

Δb=dKT3(AyzxB,V)dKT3(AzxyB,V)\Delta_{b}=d^{3}_{KT}(AyzxB,V)-d^{3}_{KT}(AzxyB,V)

are {x,y}\{x,y\}, {y,z}\{y,z\}, and {x,y,t}\{x,y,t\}, {y,z,t}\{y,z,t\} for all tBt\in B. Since z3/4xz\geq_{3/4}x and x3/4yx\geq_{3/4}y, we have z1/2yz\geq_{1/2}y (Lemma 2.2). Hence, the pair (x,z)(x,z) contributes at most 0 to Δb\Delta_{b} and the pair (x,y)(x,y) contributes at most 14|V|34|V|=12|V|<0\frac{1}{4}|V|-\frac{3}{4}|V|=-\frac{1}{2}|V|<0 to Δ\Delta.

Lemma 3.5 shows that for every tBt\in B, the pair of subsets {x,y,t}\{x,y,t\} and {y,z,t}\{y,z,t\} contributes at most 0 to Δb\Delta_{b}. To summarize, we have proved that Δb<0\Delta_{b}<0. It is then clear that (b) follows by an immediate induction.

For (c), the only subsets that can contribute to

Δc=dKT3(AzxBC,V)dKT3(AxBzC,V)\Delta_{c}=d^{3}_{KT}(AzxBC,V)-d^{3}_{KT}(AxBzC,V)

are {x,z}\{x,z\}, {y,z}\{y,z\}, {x,z,t}\{x,z,t\}, and {y,z,t}\{y,z,t\} for all yBy\in B and tBCt\in B\cup C. Since z3/4xz\geq_{3/4}x, the contribution of the pair (x,z)(x,z) to Δc\Delta_{c} is at most

14|V|34|V|=12|V|.\frac{1}{4}|V|-\frac{3}{4}|V|=-\frac{1}{2}|V|.

Since z3/4xz\geq_{3/4}x and x3/4yx\geq_{3/4}y for all yBy\in B, we have z1/2yz\geq_{1/2}y by Lemma 2.2 and thus the contribution of all such pairs (y,z)(y,z) is at most 0.

Let tBCt\in B\cup C. Under the assumption that x3/4tx\geq_{3/4}t and z3/4xz\geq_{3/4}x, linear programming (see, e.g., https://github.com/XKPhung) shows that the contribution of the subset {x,z,t}\{x,z,t\} to Δc\Delta_{c} is at most 14|V|-\frac{1}{4}|V|. Hence, the total contribution of such subsets {x,z,t}\{x,z,t\} (tBCt\in B\cup C) to Δc\Delta_{c} is at most

14|V|(|B|+|C|).-\frac{1}{4}|V|(|B|+|C|).

Similarly, under the assumption that x3/4y,tx\geq_{3/4}y,t and z3/4xz\geq_{3/4}x, linear programming (a Python code is available at https://github.com/XKPhung) shows that for all yBy\in B and tBCt\in B\cup C, the contribution of the subset {y,z,t}\{y,z,t\} to Δc\Delta_{c} is at most 14|V|\frac{1}{4}|V|. Hence, the total contribution of such subsets is at most

14|V|(|B|(|B|1)2+|B||C|).\frac{1}{4}|V|\left(\frac{|B|(|B|-1)}{2}+|B||C|\right).

To summarize, we deduce the following estimation:

Δc|V|1214(|B|+|C|)+14(|B|(|B|1)2+|B||C|).\displaystyle\frac{\Delta_{c}}{|V|}\leq-\frac{1}{2}-\frac{1}{4}(|B|+|C|)+\frac{1}{4}\left(\frac{|B|(|B|-1)}{2}+|B||C|\right).

Since |B|+|C|=5|B|+|C|=5 and |B|2|B|\leq 2, by substituting all possible values of (|B|,|C|)(|B|,|C|), we deduce that Δc<0\Delta_{c}<0 and the proof of (c) is complete.

5 33-wise Major Order Theorem and applications

5.1 Major Order Theorems fail for the 33-wise Kemeny voting scheme

The example below shows that all the Major Order Theorems established in [19] fail for the 33-wise Kemeny voting scheme. The discussion following Example 5.1 also explains why the 33-wise Kemeny voting scheme is more suitable than the Kemeny rule to avoid manipulations.

Example 5.1.

Let us consider the example taken from [21]. Let C={x,y,z,t}C=\{x,y,z,t\} be a set of 4 alternatives and consider the voting profile VV:

r1:z>t>x>y (5 votes),\displaystyle r_{1}:z>t>x>y\,\,\text{ (5 votes)},\quad r2:y>t>x>z (2 votes),\displaystyle r_{2}:y>t>x>z\,\,\text{ (2 votes)},
r3:x>y>t>z (2 votes),\displaystyle r_{3}:x>y>t>z\,\,\text{ (2 votes)},\quad r4:t>x>y>z (2 votes).\displaystyle r_{4}:t>x>y>z\,\,\text{ (2 votes)}.

Then r1r_{1} is the unique 33-wise median r1r_{1} of the election. Note that the Major Order Theorems in [19] do not hold for the pair of alternatives (z,t)(z,t) in the above election with respect to the 33-wise Kemeny voting scheme. Indeed, t>zt>z in a majority of the votes and the multiset consisting of alternatives cc with z>c>tz>c>t in a vote is empty so that the conditions in the Major Order Theorems are satisfied. However, we have z>tz>t in the unique 33-wise median r1r_{1} of the election.

5.2 33-wise Major Order Theorem

Definition 5.2.

Let VV be a voting profile. Let x1,,xkx_{1},\dots,x_{k} be some alternatives of the election. Then we denote by nx1xk(V)n_{x_{1}\dots x_{k}}(V) or simply nx1xkn_{x_{1}\dots x_{k}} the number of votes in VV in which the relative orders of x1,,xkx_{1},\dots,x_{k} is x1>x2>>xkx_{1}>x_{2}>\dots>x_{k}. For two alternatives x,yx,y, we also define δxy=nxynyx\delta_{xy}=n_{xy}-n_{yx}. If δxy>0\delta_{xy}>0, we say that the major order and the minor order of the pair (x,y)(x,y) are respectively x>yx>y and x<yx<y.

Definition 5.3.

For alternatives x,y,z,tx,y,z,t of an election with voting profile VV, we define

Px,y,z\displaystyle P_{x,y,z} =3nxzy+nxyz+nzxy+nzyx4nyzx2nyxz\displaystyle=3n_{xzy}+n_{xyz}+n_{zxy}+n_{zyx}-4n_{yzx}-2n_{yxz}
Qx,y,z\displaystyle Q_{x,y,z} =nxyz+nxzynyxznyzx=Qy,x,z\displaystyle=n_{xyz}+n_{xzy}-n_{yxz}-n_{yzx}=-Q_{y,x,z}
Ry,x,z,t\displaystyle R_{y,x,z,t} =2nyztx+2nyzxt+nytzx+nytxz\displaystyle=2n_{yztx}+2n_{yzxt}+n_{ytzx}+n_{ytxz}
Sy,x,z,t\displaystyle S_{y,x,z,t} =Ry,x,z,tRx,y,z,t.\displaystyle=R_{y,x,z,t}-R_{x,y,z,t}.

The underlying idea is that the quantity Sy,x,z,tS_{y,x,z,t} captures the preference of the alternative yy over the alternative xx in the presence of the alternatives zz, tt. On the other hand, Px,y,zP_{x,y,z} and Qx,y,z=Qy,x,zQ_{x,y,z}=-Q_{y,x,z} are certain measures of the preference of xx over yy when zz comes into play. Note that Sy,x,z,t=(Qz,y,t+Qx,z,t)S_{y,x,z,t}=-(Q_{z,y,t}+Q_{x,z,t}) (see Lemma 5.8 below). Hence, the intuition is that if the preference for xx over yy in the election is strong enough, not only in the head-to-head competition but even when taking into consideration one or two more alternatives, then Px,y,zP_{x,y,z} and Qx,y,zQ_{x,y,z} tend to be positive while Sy,x,z,tS_{y,x,z,t} should be negative. To quantify the above intuition and the interplay between the quantities Qx,y,zQ_{x,y,z}, Px,y,zP_{x,y,z}, and Sy,x,z,tS_{y,x,z,t}, we establish the following 33-wise Major Order Theorem in the vein of the various Major Order Theorems [19] for the classical Kemeny rule.

Theorem 5.4 (3MOT).

Let x,yx,y be two alternatives of an election with voting profile VV over a set of alternatives CC. Suppose that x>yx>y is the major order and

  1. (i)

    Qx,y,z0Q_{x,y,z}\geq 0 for all zC{x,y}z\in C\setminus\{x,y\},

  2. (ii)

    2δxy>zx,ymax(0,Px,y,z+tx,y,zmax(0,Sy,x,z,t))2\delta_{xy}>\sum_{z\neq x,y}\max\left(0,-P_{x,y,z}+\sum_{t\neq x,y,z}\max\left(0,S_{y,x,z,t}\right)\right).

Then xx is ranked before yy in every 33-wise median of the election.

The following observation is clear.

Lemma 5.5.

The theoretical time complexity of the corresponding algorithm of 3MOT to determine the relative order of pairs of alternatives is 𝒪(n4m)\mathcal{O}(n^{4}m) where nn is the number of alternatives and mm is the number of votes. ∎

For the proof of 3MOT, we need to establish a series of auxiliary results. The first lemma is the following simple but useful observation.

Lemma 5.6.

Given distinct alternatives x,y,z,tx,y,z,t in an election, the following relations hold:

nxy\displaystyle n_{xy} =nzxy+nxzy+nxyz,\displaystyle=n_{zxy}+n_{xzy}+n_{xyz},
nxyz\displaystyle n_{xyz} =ntxyz+nxtyz+nxytz+nxyzt,\displaystyle=n_{txyz}+n_{xtyz}+n_{xytz}+n_{xyzt},

where the quantities nxyn_{xy}, nxyzn_{xyz}, nxyztn_{xyzt}, etc. are defined in Definition 5.2.

Proof 5.7.

For the relation nxy=nzxy+nxzy+nxyzn_{xy}=n_{zxy}+n_{xzy}+n_{xyz}, it suffices to observe that for each vote where x>yx>y, we have exactly three mutually exclusive possibilities according to three possible relative positions of zz with respect to xx and yy, namely, z>x>yz>x>y or x>z>yx>z>y or x>y>zx>y>z. A similar argument shows that nxyz=ntxyz+nxtyz+nxytz+nxyztn_{xyz}=n_{txyz}+n_{xtyz}+n_{xytz}+n_{xyzt}.

The next lemma describes the relation between the quantities Sy,x,z,tS_{y,x,z,t} and Qz,y,tQ_{z,y,t}, Qx,z,tQ_{x,z,t}.

Lemma 5.8.

For all distinct alternatives x,y,z,tx,y,z,t in an election, we have

Sy,x,z,t=Qz,y,t+Qx,z,t.-S_{y,x,z,t}=Q_{z,y,t}+Q_{x,z,t}.
Proof 5.9.

By direct computation, we infer from Lemma 5.6 and the definition of Qz,y,tQ_{z,y,t}, Qx,z,tQ_{x,z,t}, Rx,y,z,tR_{x,y,z,t}, Ry,x,z,tR_{y,x,z,t}, and Sy,x,z,tS_{y,x,z,t} that:

Qz,y,t+Qx,z,t\displaystyle Q_{z,y,t}+Q_{x,z,t}
=\displaystyle=\,\, nzyt+nztynyztnytz+nxzt+nxtznzxtnztx\displaystyle n_{zyt}+n_{zty}-n_{yzt}-n_{ytz}+n_{xzt}+n_{xtz}-n_{zxt}-n_{ztx}
=\displaystyle=\,\, nxzyt+nzxyt+nzyxt+nzytx\displaystyle n_{xzyt}+n_{zxyt}+n_{zyxt}+n_{zytx}
+\displaystyle+\,\, nxzty+nzxty+nztxy+nztyx\displaystyle n_{xzty}+n_{zxty}+n_{ztxy}+n_{ztyx}
\displaystyle-\,\, nxyztnyxztnyzxtnyztx\displaystyle n_{xyzt}-n_{yxzt}-n_{yzxt}-n_{yztx}
\displaystyle-\,\, nxytznyxtznytxznytzx\displaystyle n_{xytz}-n_{yxtz}-n_{ytxz}-n_{ytzx}
+\displaystyle+\,\, nyxzt+nxyzt+nxzyt+nxzty\displaystyle n_{yxzt}+n_{xyzt}+n_{xzyt}+n_{xzty}
+\displaystyle+\,\, nyxtz+nxytz+nxtyz+nxtzy\displaystyle n_{yxtz}+n_{xytz}+n_{xtyz}+n_{xtzy}
\displaystyle-\,\, nyzxtnzyxtnzxytnzxty\displaystyle n_{yzxt}-n_{zyxt}-n_{zxyt}-n_{zxty}
\displaystyle-\,\, nyztxnzytxnztyxnztxy\displaystyle n_{yztx}-n_{zytx}-n_{ztyx}-n_{ztxy}
=\displaystyle=\,\, 2nxzyt+2nxzty2nyzxt2nyztxnytxznytzx+nxtyz+nxtzy\displaystyle 2n_{xzyt}+2n_{xzty}-2n_{yzxt}-2n_{yztx}-n_{ytxz}-n_{ytzx}+n_{xtyz}+n_{xtzy}
=\displaystyle=\,\, Rx,y,z,tRy,x,z,t\displaystyle R_{x,y,z,t}-R_{y,x,z,t}
=\displaystyle=\,\, Sy,x,z,t.\displaystyle-S_{y,x,z,t}.

Therefore, Qz,y,t+Qx,z,t=Sy,x,z,tQ_{z,y,t}+Q_{x,z,t}=-S_{y,x,z,t} and the proof is complete.

The next lemma allows us to express the quantity Px,y,zP_{x,y,z} as a sum of Qz,y,xQ_{z,y,x}, Qx,y,zQ_{x,y,z}, and δxz\delta_{xz}, δzy\delta_{zy}.

Lemma 5.10.

For all distinct alternatives x,y,zx,y,z in an election, we have:

Px,y,z=δxz+δzy+Qx,y,z+Qz,y,x.P_{x,y,z}=\delta_{xz}+\delta_{zy}+Q_{x,y,z}+Q_{z,y,x}.
Proof 5.11.

For strict rankings, we deduce from Lemma 5.6 that:

δxz+δzy=nzy+nxznyznzx\displaystyle\delta_{xz}+\delta_{zy}=n_{zy}+n_{xz}-n_{yz}-n_{zx}
=\displaystyle=\,\, nxzy+nzxy+nzyx+nyxz+nxyz+nxzynxyznyxznyzxnyzxnzyxnzxy\displaystyle n_{xzy}+n_{zxy}+n_{zyx}+n_{yxz}+n_{xyz}+n_{xzy}-n_{xyz}-n_{yxz}-n_{yzx}-n_{yzx}-n_{zyx}-n_{zxy}
=\displaystyle=\,\, 2(nxzynyzx),\displaystyle 2(n_{xzy}-n_{yzx}),

and

Qx,y,z+Qz,y,x\displaystyle Q_{x,y,z}+Q_{z,y,x}
=\displaystyle= nxyz+nxzynyxznyzx+nzxy+nzyxnyxznyzx\displaystyle\,\,n_{xyz}+n_{xzy}-n_{yxz}-n_{yzx}+n_{zxy}+n_{zyx}-n_{yxz}-n_{yzx}
=\displaystyle= nxyz+nxzy+nzxy+nzyx2nyxz2nyzx.\displaystyle\,\,n_{xyz}+n_{xzy}+n_{zxy}+n_{zyx}-2n_{yxz}-2n_{yzx}.

Therefore, we obtain

δxz+δzy+Qx,y,z+Qz,y,x=Px,y,z.\displaystyle\delta_{xz}+\delta_{zy}+Q_{x,y,z}+Q_{z,y,x}=P_{x,y,z}. (5.1)

and the proof is thus complete.

We can now state the following key technical lemma in the proof of 3MOT. The main point is that Lemma 5.12, using only the quantities of the form δx,y\delta_{x,y}, Px,y,zP_{x,y,z}, Qx,y,zQ_{x,y,z}, Ss,y,zS_{s,y,z}, allows us to compare simultaneously two potential consensus rankings with a given ranking where one would like to switch the relative order of a certain pair of alternatives.

Lemma 5.12.

Let π:L>y>Z>x>R\pi\colon L>y>Z>x>R, σ1:L>Z>x>y>R\sigma_{1}^{*}\colon L>Z>x>y>R, and σ2:L>x>y>Z>R\sigma^{*}_{2}\colon L>x>y>Z>R be three rankings of an election where L,Z,RL,Z,R are linearly ordered sets of alternatives. For i{1,2}i\in\{1,2\}, let Δi=dKT3(π,V)dKT3(σi,V)\Delta_{i}=d_{KT}^{3}(\pi,V)-d_{KT}^{3}(\sigma_{i}^{*},V). Then we have:

Δ1+Δ2=2tRQx,y,t+2δxy+zZPx,y,zzZ,tZR,z>πtSy,x,z,t.\displaystyle\displaystyle\Delta_{1}+\Delta_{2}=2\sum_{t\in R}Q_{x,y,t}+2\delta_{xy}+\sum_{z\in Z}P_{x,y,z}-\sum_{z\in Z,\,t\in Z\cup R,\,z>^{\pi}t}S_{y,x,z,t}. (5.2)
Proof 5.13.

By the definition of δxy\delta_{xy}, Qu,v,wQ_{u,v,w}, and the 33-wise Kendall-tau distance dKT3d_{KT}^{3}, we find that:

Δ1=\displaystyle\Delta_{1}= nxynyx+zZ(nzynyz)+zZ,tZR,z>πt(nzyt+nztynyztnytz)\displaystyle\,\,n_{xy}-n_{yx}+\sum_{z\in Z}(n_{zy}-n_{yz})+\sum_{z\in Z,\,t\in Z\cup R,\,z>^{\pi}t}(n_{zyt}+n_{zty}-n_{yzt}-n_{ytz})
+zZ(nzxy+nzyxnyxznyzx)+tR(nxyt+nxtynyxtnytx)\displaystyle+\sum_{z\in Z}(n_{zxy}+n_{zyx}-n_{yxz}-n_{yzx})+\sum_{t\in R}(n_{xyt}+n_{xty}-n_{yxt}-n_{ytx})
=\displaystyle= δxy+zZ(nzynyz)+zZ,tZR,z>πtQz,y,t+zZQz,y,x+tRQx,y,t\displaystyle\,\,\delta_{xy}+\sum_{z\in Z}(n_{zy}-n_{yz})+\sum_{z\in Z,\,t\in Z\cup R,\,z>^{\pi}t}Q_{z,y,t}+\sum_{z\in Z}Q_{z,y,x}+\sum_{t\in R}Q_{x,y,t}

and similarly

Δ2=\displaystyle\Delta_{2}= nxynyx+zZ(nxznzx)+zZ,tZR,z>πt(nxzt+nxtznzxtnztx)\displaystyle\,\,n_{xy}-n_{yx}+\sum_{z\in Z}(n_{xz}-n_{zx})+\sum_{z\in Z,\,t\in Z\cup R,\,z>^{\pi}t}(n_{xzt}+n_{xtz}-n_{zxt}-n_{ztx})
+zZ(nxyz+nxzynyxznyzx)+tR(nxyt+nxtynyxtnytx)\displaystyle+\sum_{z\in Z}(n_{xyz}+n_{xzy}-n_{yxz}-n_{yzx})+\sum_{t\in R}(n_{xyt}+n_{xty}-n_{yxt}-n_{ytx})
=\displaystyle= δxy+zZ(nxznzx)+zZ,tZR,z>πtQx,z,t+zZQx,y,z+tRQx,y,t.\displaystyle\,\,\delta_{xy}+\sum_{z\in Z}(n_{xz}-n_{zx})+\sum_{z\in Z,\,t\in Z\cup R,\,z>^{\pi}t}Q_{x,z,t}+\sum_{z\in Z}Q_{x,y,z}+\sum_{t\in R}Q_{x,y,t}.

Hence, we deduce that:

Δ1+Δ2=\displaystyle\Delta_{1}+\Delta_{2}=   2δxy+2tTQx,y,t+zZ,tZT,z>πt(Qz,y,t+Qx,z,t)\displaystyle\,\,2\delta_{xy}+2\sum_{t\in T}Q_{x,y,t}+\sum_{z\in Z,\,t\in Z\cup T,\,z>^{\pi}t}(Q_{z,y,t}+Q_{x,z,t})
+zZ(nzy+nxznyznzx+Qx,y,z+Qz,y,x).\displaystyle+\sum_{z\in Z}(n_{zy}+n_{xz}-n_{yz}-n_{zx}+Q_{x,y,z}+Q_{z,y,x}). (5.3)

On the one hand, we have δab=nabnba\delta_{ab}=n_{ab}-n_{ba} and Pa,b,c=δac+δcb+Qa,b,c+Qc,b,xP_{a,b,c}=\delta_{ac}+\delta_{cb}+Q_{a,b,c}+Q_{c,b,x} for all distinct alternatives a,b,ca,b,c by Lemma 5.10. Therefore,

zZ(nzy+nxznyznzx+Qx,y,z+Qz,y,x)=zZPx,y,z.\displaystyle\sum_{z\in Z}(n_{zy}+n_{xz}-n_{yz}-n_{zx}+Q_{x,y,z}+Q_{z,y,x})=\sum_{z\in Z}P_{x,y,z}. (5.4)

On the other hand, since Sb,a,c,d=Qc,b,d+Qa,c,d-S_{b,a,c,d}=Q_{c,b,d}+Q_{a,c,d} for all distinct alternatives a,b,c,da,b,c,d by Lemma 5.8, we deduce that:

zZ,tZR,z>πt(Qz,y,t+Qx,z,t)=zZ,tZR,z>πtSy,x,z,t.\displaystyle\sum_{z\in Z,\,t\in Z\cup R,\,z>^{\pi}t}(Q_{z,y,t}+Q_{x,z,t})=-\sum_{z\in Z,\,t\in Z\cup R,\,z>^{\pi}t}S_{y,x,z,t}. (5.5)

By combining the equalities (5.13), (5.4), (5.5), we obtain the desired relation (5.2).

We can finally give the proof of 3MOT.

Proof 5.14 (Proof of Theorem 5.4).

Suppose on the contrary that π:L>y>Z>x>R\pi\colon L>y>Z>x>R is a 33-wise median of the election where L,Z,RL,Z,R are ordered sets of alternatives. We will show that for the rankings σ1:L>Z>x>y>R\sigma_{1}^{*}\colon L>Z>x>y>R and σ2:L>x>y>Z>R\sigma^{*}_{2}\colon L>x>y>Z>R, there exists i{1,2}i\in\{1,2\} such that

Δi=dKT3(π,V)dKT3(σi,V)>0.\Delta_{i}=d_{KT}^{3}(\pi,V)-d_{KT}^{3}(\sigma_{i}^{*},V)>0.

Indeed, by Lemma 5.12 on the differences Δ1\Delta_{1} and Δ2\Delta_{2}, we find that:

Δ1+Δ2=2tRQx,y,t+2δxy+zZPx,y,zzZ,tZR,z>πtSy,x,z,t.\displaystyle\displaystyle\Delta_{1}+\Delta_{2}=2\sum_{t\in R}Q_{x,y,t}+2\delta_{xy}+\sum_{z\in Z}P_{x,y,z}-\sum_{z\in Z,\,t\in Z\cup R,\,z>^{\pi}t}S_{y,x,z,t}. (5.6)

We deduce that Δ1+Δ2>0\Delta_{1}+\Delta_{2}>0 by conditions (i) and (ii). Note that the inequality is strict by condition (ii). Therefore, there exists i{1,2}i\in\{1,2\} such that Δi>0\Delta_{i}>0. In other words, σi\sigma^{*}_{i} would then be a strictly better 33-wise Kemeny consensus than π\pi, which contradicts the hypothesis that π\pi is a 33-wise median of the election. Therefore, the alternative xx must be ranked before yy in every 33-wise median of the election.

Example 5.15.

In Example 5.1 where the Major Order Theorems [19] fail with respect to the 33-wise Kemeny voting scheme, the 33-wise Major Order Theorem 5.4 tells us that in every 33-wise median, we have t>xt>x and x>yx>y. Therefore, t>x>yt>x>y in every 33-wise median and we only need to determine the position of the alternative zz, i.e., to check which of the following 4 rankings is the 33-wise median:

r1:z>t>x>y,\displaystyle r_{1}:z>t>x>y,\quad\quad r2:t>z>x>y,\displaystyle r_{2}:t>z>x>y,
r3:t>x>z>y,\displaystyle r_{3}:t>x>z>y,\quad\quad r4:t>x>y>z.\displaystyle r_{4}:t>x>y>z.

The research space is thus reduced by 4!/4=64!/4=6 times. Note that r1r_{1} is the unique 33-wise median.

Example 5.16.

In Example 5.1, we have δtz=1\delta_{tz}=1, Pt,z,x=4P_{t,z,x}=4, and Pt,z,y=0P_{t,z,y}=0 but Qt,z,x=1Q_{t,z,x}=-1, Qt,z,y=3Q_{t,z,y}=-3, and moreover Sz,t,y,x=2S_{z,t,y,x}=2, Sz,t,x,y=4S_{z,t,x,y}=4. Thus, the preference for tt over zz is not strong enough to guarantee that tt is ranked before zz in a 33-wise median: all the conditions (i), (ii) in the 33-wise Major Order Theorem (Theorem 5.4) fail for the pair (t,z)(t,z) in the majority order.

5.3 Applications on consecutive alternatives in a median

As an immediate consequence of the above proof of Theorem 5.4, we obtain the following criteria for the ranking of two consecutive alternatives in a 33-wise median which extends the corresponding criteria for the classical Kemeny rule [7].

Theorem 5.17.

Let x,yx,y be two consecutive alternatives in a 33-wise median ranking π\pi^{*} of an election. Suppose that δxy>zx,ymax(0,Qy,x,z)\delta_{xy}>\sum_{z\neq x,y}\max(0,Q_{y,x,z}). Then xx must be ranked before yy in π\pi^{*}.

Proof 5.18.

Suppose on the contrary that π:L>y>x>R\pi^{*}\colon L>y>x>R where L,RL,R are ordered sets of alternatives. Let σ:L>x>y>R\sigma^{*}\colon L>x>y>R, we denote

Δ=dKT3(π,V)dKT3(σ,V).\Delta=d_{KT}^{3}(\pi^{*},V)-d_{KT}^{3}(\sigma^{*},V).

By the relation (5.6) in the proof of Theorem 5.4 (cf. Lemma 5.12), we deduce that

2Δ\displaystyle\displaystyle 2\Delta =2δxy+2tRQx,y,t=2δxy2tRQy,x,t2δxy2zx,ymax(0,Qy,x,z)\displaystyle=2\delta_{xy}+2\sum_{t\in R}Q_{x,y,t}=2\delta_{xy}-2\sum_{t\in R}Q_{y,x,t}\geq 2\delta_{xy}-2\sum_{z\neq x,y}\max(0,Q_{y,x,z})

which is strictly positive by the hypothesis. Hence, σ\sigma^{*} must be a strictly better 33-wise Kemeny consensus than π\pi^{*}, which is a contradiction. The theorem thus follows.

Corollary 5.19.

Let x,yx,y be two consecutive alternatives in a 33-wise median π\pi^{*} of an election. If x>yx>y is the major order and Qx,y,z0Q_{x,y,z}\geq 0 for every alternative zx,yz\neq x,y, then xx must be ranked before yy in π\pi^{*}.

Proof 5.20.

Since for every alternative zx,yz\neq x,y, we have Qx,y,z0Q_{x,y,z}\geq 0 thus Qy,x,z0Q_{y,x,z}\leq 0 and max(0,Qy,x,z)=0\max(0,Q_{y,x,z})=0. Hence, as x>yx>y is the major order, we deduce that

δxy>0=zx,ymax(0,Qy,x,z)\delta_{xy}>0=\sum_{z\neq x,y}\max(0,Q_{y,x,z})

and the result follows from Theorem 5.17.

6 Iterated 33-wise Major Order Theorem

Developing the ideas of the classical Iterated MOT (Theorem  2.5), we can strengthen even further the 33-wise Major Order Theorem 5.4 based on the following two observations. The first one is the transitivity of preferences in a single ranking as used in the proof of Theorem  2.5. Suppose that (x,y)(x,y) and (y,z)(y,z) are ordered pairs of alternatives found by the 33-wise Major Order Theorem 5.4 such that x>yx>y and y>zy>z in every 33-wise median. Then by transitivity, we also have x>zx>z in every 33-wise median.

The second observation is similar but slightly more sophisticated. Suppose that we know z>xz>x and z>yz>y in every 33-wise median. Then it is clear that a 33-wise median of the election must be of the form L>x>Z>y>RL>x>Z>y>R or L>y>Z>x>RL>y>Z>x>R where L,Z,RL,Z,R are ordered set of alternatives such that zLz\in L. Then we can completely ignore such an alternative zz in conditions (i) and (ii) in the 33-wise Major Order Theorem 5.4. Similarly, if we know that x>zx>z and y>zy>z in every 33-wise median, then we can ignore the alternative zz in condition (ii) in the outermost summation. In fact, Theorem 6.2 shows that we can even further relax conditions (i) and (ii) by a more careful analysis.

We can obviously iterate in parallel the above two processes corresponding to the two observations to obtain larger and larger sets consisting of constraints determined by ordered pairs of alternatives until these sets stabilize. Hence, we obtain the following iterated version of the 33-wise Major Order Theorem.

Definition 6.1.

Let VV be an election. Let UU be an anti-symmetric set consisting of ordered pairs of distinct alternatives (x,y)(x,y) in the election with no two pairs violating the transitivity property, i.e.,

  1. (a)

    (y,x)U(y,x)\notin U whenever (x,y)U(x,y)\in U,

  2. (b)

    (z,x)U(z,x)\notin U whenever (x,y),(y,z)U(x,y),(y,z)\in U.

We denote by U¯U\overline{U}\supset U the transitive closure of UU: if (x,y),(y,z)U(x,y),(y,z)\in U then (x,z)U¯(x,z)\in\overline{U}. We define also

Lx(U)\displaystyle L_{x}(U) ={z:(z,x)U¯},\displaystyle=\{z\colon(z,x)\in\overline{U}\}, Lx,y(U)\displaystyle L_{x,y}(U) ={z:(z,x),(z,y)U¯},\displaystyle=\{z\colon(z,x),(z,y)\in\overline{U}\},
Rx(U)\displaystyle R_{x}(U) ={z:(x,z)U¯},\displaystyle=\{z\colon(x,z)\in\overline{U}\}, Rx,y(U)\displaystyle R_{x,y}(U) ={z:(x,z),(y,z)U¯}.\displaystyle=\{z\colon(x,z),(y,z)\in\overline{U}\}.
Theorem 6.2 (Iterated 3MOT).

Let VV be an election over a set of alternatives CC. Let U1=U_{-1}=\varnothing and for k0k\geq 0, we define inductively UkU_{k} as the set of ordered pairs (x,y)(x,y) of distinct alternatives such that

  1. (i)

    Qx,y,z0Q_{x,y,z}\geq 0 for all zC(Ly(U¯k1){x,y})z\in C\setminus\left(L_{y}\left(\overline{U}_{k-1}\right)\cup\{x,y\}\right),

  2. (ii)

    2δxy>zXkmax(0,Px,y,z+tYk{z}max(0,Sy,x,z,t))2\delta_{xy}>\sum_{z\in X_{k}}\max\left(0,-P_{x,y,z}+\sum_{t\in Y_{k}\setminus\{z\}}\max\left(0,S_{y,x,z,t}\right)\right)

where Xk=C(Ly(U¯k1)Rx(U¯k1){x,y})X_{k}=C\setminus\left(L_{y}\left(\overline{U}_{k-1}\right)\cup R_{x}\left(\overline{U}_{k-1}\right)\cup\{x,y\}\right) and Yk=C(Ly(U¯k1){x,y})Y_{k}=C\setminus\left(L_{y}\left(\overline{U}_{k-1}\right)\cup\{x,y\}\right). Then for every integer k1k\geq-1, we have UkUk+1{U}_{k}\subset U_{k+1}. Moreover, if (x,y)U¯k(x,y)\in\overline{U}_{k} for some k0k\geq 0, the alternative xx is ranked before yy in every 33-wise median of the election.

Proof 6.3.

Since U1=U_{-1}=\varnothing, we have U1U0U_{-1}\subset U_{0} trivially. Note that U0U_{0} is exactly the set consisting of ordered pairs (x,y)(x,y) of distinct alternatives satisfying Theorem 5.4. The first assertion UkUk+1U_{k}\subset U_{k+1} for all k1k\geq-1 is then clear by an immediate induction on kk using the definition of conditions (i) and (ii) and the obvious properties A¯B¯\overline{A}\subset\overline{B}, Ly(A)Ly(B)L_{y}(A)\subset L_{y}(B), and Rx(A)Rx(B)R_{x}(A)\subset R_{x}(B) for all ABA\subset B.

Consider the second assertion that xx is ranked before yy in every 33-wise median of the election whenever (x,y)U¯k(x,y)\in\overline{U}_{k} for some k0k\geq 0. The case k=0k=0 results directly from Theorem 5.4. Assume that the second assertion of Theorem 6.2 holds for some k0k\geq 0 and let (x,y)Uk+1(x,y)\in U_{k+1}. Suppose on the contrary that π:L>y>Z>x>R\pi\colon L>y>Z>x>R is a 33-wise median of the election VV where L,Z,RL,Z,R are ordered sets of alternatives. Consider the rankings σ1:L>Z>x>y>R\sigma_{1}^{*}\colon L>Z>x>y>R and σ2:L>x>y>Z>R\sigma^{*}_{2}\colon L>x>y>Z>R as in the proof of Theorem 5.4. Then for

Δi=dKT3(π,V)dKT3(σi,V),i{1,2},\Delta_{i}=d_{KT}^{3}(\pi,V)-d_{KT}^{3}(\sigma_{i}^{*},V),\quad i\in\{1,2\},

we also have (cf. Lemma 5.12):

Δ1+Δ2=2tRQx,y,t+2δxy+zZPx,y,zzZ,tZR,z>πtSy,x,z,t.\displaystyle\displaystyle\Delta_{1}+\Delta_{2}=2\sum_{t\in R}Q_{x,y,t}+2\delta_{xy}+\sum_{z\in Z}P_{x,y,z}-\sum_{z\in Z,\,t\in Z\cup R,\,z>^{\pi}t}S_{y,x,z,t}. (6.1)

By our induction hypothesis, we have Ly(U¯k)LL_{y}\left(\overline{U}_{k}\right)\subset L and Rx(U¯k)RR_{x}\left(\overline{U}_{k}\right)\subset R. Consequently, we find that

Z\displaystyle Z Xk=C(Ly(U¯k)Rx(U¯k){x,y}) and\displaystyle\subset X_{k}=C\setminus\left(L_{y}\left(\overline{U}_{k}\right)\cup R_{x}\left(\overline{U}_{k}\right)\cup\{x,y\}\right)\text{ and} (6.2)
R\displaystyle R Yk{z}=C(Ly(U¯k){x,y,z}).\displaystyle\subset Y_{k}\setminus\{z\}=C\setminus\left(L_{y}\left(\overline{U}_{k}\right)\cup\{x,y,z\}\right).

Therefore, we infer from (6.2), (6.1) and conditions (i) and (ii) satisfied by (x,y)(x,y) that

Δ1+Δ2\displaystyle\Delta_{1}+\Delta_{2} =2tRQx,y,t+2δxy+zZPx,y,zzZ,tZR,z>πtSy,x,z,t>0.\displaystyle=2\sum_{t\in R}Q_{x,y,t}+2\delta_{xy}+\sum_{z\in Z}P_{x,y,z}-\sum_{z\in Z,\,t\in Z\cup R,\,z>^{\pi}t}S_{y,x,z,t}>0.

We deduce that either Δ1>0\Delta_{1}>0 or Δ2>0\Delta_{2}>0. Thus, either σ1\sigma_{1}^{*} or σ2\sigma_{2}^{*} is a strictly better 33-wise consensus than π\pi, which is a contradiction to the choice of π\pi. Hence, xx must be ranked before yy in every 33-wise median whenever (x,y)Uk+1(x,y)\in U_{k+1}. As every 33-wise median of VV is a linear, strict, and complete ranking of the set CC of alternatives, it follows from the transitivity that xx must be ranked before yy in every 33-wise median whenever (x,y)U¯k+1(x,y)\in\overline{U}_{k+1}. The proof is complete.

In practice, Theorem 6.2 requires 3 to 6 iterations. We now compare Theorem 6.2 with the 3-wise majority digraph method of Gilbert et al. [14, Theorem 3] on the voting profile taken from [14, Example 5].

Example 6.4.

Consider the following voting profile VV over a set of 6 alternatives:

r1:c1>c2>c4>c3>c5>c6 (4 votes),\displaystyle r_{1}:c_{1}>c_{2}>c_{4}>c_{3}>c_{5}>c_{6}\,\,\text{ (4 votes)},\quad r3:c6>c1>c2>c4>c3>c5 (1 vote),\displaystyle r_{3}:c_{6}>c_{1}>c_{2}>c_{4}>c_{3}>c_{5}\,\,\text{ (1 vote)},
r2:c1>c3>c2>c4>c5>c6 (4 votes),\displaystyle r_{2}:c_{1}>c_{3}>c_{2}>c_{4}>c_{5}>c_{6}\,\,\text{ (4 votes)},\quad r4:c6>c1>c4>c3>c2>c5 (1 vote).\displaystyle r_{4}:c_{6}>c_{1}>c_{4}>c_{3}>c_{2}>c_{5}\,\,\text{ (1 vote)}.

By [14, Theorem 3], there is some 3-wise median of VV among the 4 rankings c1>c2>c3>c4>c5>c6c_{1}>c_{2}>c_{3}>c_{4}>c_{5}>c_{6}, c1>c2>c3>c4>c6>c5c_{1}>c_{2}>c_{3}>c_{4}>c_{6}>c_{5}, c1>c2>c4>c3>c5>c6c_{1}>c_{2}>c_{4}>c_{3}>c_{5}>c_{6}, and c1>c2>c4>c3>c6>c5c_{1}>c_{2}>c_{4}>c_{3}>c_{6}>c_{5}. On the other hand, Iterated 3MOT (Theorem 6.2) obtains (after 4 iterations) the following constraints c1>c2>c4>c5>c6c_{1}>c_{2}>c_{4}>c_{5}>c_{6} and c1>c3>c5c_{1}>c_{3}>c_{5} in all 3-wise medians of VV. Consequently, Iterated 3MOT implies that every 3-wise median must be one of the following 3 rankings c1>c3>c2>c4>c5>c6c_{1}>c_{3}>c_{2}>c_{4}>c_{5}>c_{6}, c1>c2>c3>c4>c5>c6c_{1}>c_{2}>c_{3}>c_{4}>c_{5}>c_{6}, and c1>c2>c4>c3>c5>c6c_{1}>c_{2}>c_{4}>c_{3}>c_{5}>c_{6}. Note that the only 3-wise median of VV is c1>c2>c4>c3>c5>c6c_{1}>c_{2}>c_{4}>c_{3}>c_{5}>c_{6}.

7 Improved Iterated MOT

As a byproduct of our technique, we also obtain the following net improvement of Iterated MOT of Milosz and Hamel (Theorem 2.5) for classical Kemeny voting scheme.

Theorem 7.1.

Let VV be an election over a set of alternatives CC. Let W1=W_{-1}=\varnothing and for every k0k\geq 0, we define inductively WkW_{k} as the set of ordered pairs (x,y)(x,y) of distinct alternatives such that 2δxy>zZkmax(0,δyz+δzx)2\delta_{xy}>\sum_{z\in Z_{k}}\max\left(0,\delta_{yz}+\delta_{zx}\right) where Zk=C(Ly(W¯k1)Rx(W¯k1){x,y})Z_{k}=C\setminus\left(L_{y}\left(\overline{W}_{k-1}\right)\cup R_{x}\left(\overline{W}_{k-1}\right)\cup\{x,y\}\right). Suppose that (x,y)W¯k(x,y)\in\overline{W}_{k} for some k0k\geq 0. Then xx is ranked before yy in every 22-wise median of the election.

Indeed, in Theorem 7.1, if we replace ZkZ_{k} by a larger set

Zk=C(Lx,y(Wk1)Rx,y(Wk1){x,y})Z^{\prime}_{k}=C\setminus\left(L_{x,y}\left({W}_{k-1}\right)\cup R_{x,y}\left({W}_{k-1}\right)\cup\{x,y\}\right)

for all k0k\geq 0 then we obtain as a consequence Iterated MOT (see Corollary 7.3 and Lemma 7.5). Therefore, in comparison with Iterated MOT, Theorem 7.1 will either find more constraints on the relative orders of pairs of alternatives in all 2-wise medians or reduce the number of required iterations otherwise.

Proof 7.2 (Proof of Theorem 7.1).

We proceed by induction on k1k\geq-1 as in the proof of Theorem 6.2. The case k=1k=-1 is trivial since W1=W_{-1}=\varnothing. Assume that the conclusion holds for some integer k1k\geq-1 and let (x,y)Wk+1(x,y)\in W_{k+1}. Suppose on the contrary that π:L>y>Z>x>R\pi\colon L>y>Z>x>R is a 22-wise median of the election where L,Z,RL,Z,R are ordered sets of alternatives. Let σ1:L>Z>x>y>R\sigma_{1}^{*}\colon L>Z>x>y>R and σ2:L>x>y>Z>R\sigma^{*}_{2}\colon L>x>y>Z>R. Let Δi=dKT2(π,V)dKT2(σi,V)\Delta_{i}=d_{KT}^{2}(\pi,V)-d_{KT}^{2}(\sigma_{i}^{*},V) where i{1,2}i\in\{1,2\}. We find by a direct inspection that:

Δ1+Δ2=(δxyzZδyz)+(δxyzZδzx)=2δxyzZ(δyz+δzx).\displaystyle\displaystyle\Delta_{1}+\Delta_{2}=\left(\delta_{xy}-\sum_{z\in Z}\delta_{yz}\right)+\left(\delta_{xy}-\sum_{z\in Z}\delta_{zx}\right)=2\delta_{xy}-\sum_{z\in Z}\left(\delta_{yz}+\delta_{zx}\right).

By our induction hypothesis, we have Ly(W¯k)LL_{y}\left(\overline{W}_{k}\right)\subset L and Rx(W¯k)RR_{x}\left(\overline{W}_{k}\right)\subset R. Therefore,

ZZk=C(Ly(W¯k)Rx(W¯k){x,y}).\displaystyle Z\subset Z_{k}=C\setminus\left(L_{y}\left(\overline{W}_{k}\right)\cup R_{x}\left(\overline{W}_{k}\right)\cup\{x,y\}\right).

Consequently, since (x,y)Wk+1(x,y)\in W_{k+1}, we deduce that

Δ1+Δ2\displaystyle\Delta_{1}+\Delta_{2} =2δxyzZ(δyz+δzx)2δxyzZkmax(0,δyz+δzx)>0.\displaystyle=2\delta_{xy}-\sum_{z\in Z}\left(\delta_{yz}+\delta_{zx}\right)\geq 2\delta_{xy}-\sum_{z\in Z_{k}}\max\left(0,\delta_{yz}+\delta_{zx}\right)>0.

Hence, Δi>0\Delta_{i}>0 for some i{1,2}i\in\{1,2\} and σi\sigma_{i}^{*} is a strictly better 22-wise consensus than π\pi, which contradicts the choice of π\pi. Thus, xx is ranked before yy in every 22-wise median whenever (x,y)Wk+1(x,y)\in W_{k+1}. By transitivity, xx must be ranked before yy in every 22-wise median whenever (x,y)W¯k+1(x,y)\in\overline{W}_{k+1}. The proof is complete.

As a direct consequence of Theorem 7.1, we obtain the following weaker result which turns out to be equivalent to Iterated MOT (Theorem 2.5).

Corollary 7.3.

Let VV be an election over a set of alternatives CC. Let W1=W^{\prime}_{-1}=\varnothing and for every k0k\geq 0, we define inductively WkW^{\prime}_{k} as the set of ordered pairs (x,y)(x,y) of distinct alternatives such that 2δxy>zZkmax(0,δyz+δzx)2\delta_{xy}>\sum_{z\in Z^{\prime}_{k}}\max\left(0,\delta_{yz}+\delta_{zx}\right) where Zk=C(Lx,y(Wk1)Rx,y(Wk1){x,y})Z^{\prime}_{k}=C\setminus\left(L_{x,y}\left({W}^{\prime}_{k-1}\right)\cup R_{x,y}\left({W}^{\prime}_{k-1}\right)\cup\{x,y\}\right). Suppose that (x,y)Wk¯(x,y)\in\overline{W^{\prime}_{k}} for some k0k\geq 0. Then xx is ranked before yy in every 22-wise median of the election.

Proof 7.4.

By an immediate induction on kk, it is clear that WkWkW^{\prime}_{k}\subset W_{k} and ZkZkZ_{k}\subset Z^{\prime}_{k} for all integers k1k\geq-1. Indeed, the case k=1k=-1 is trivial. For the induction step, it suffices to use the the induction hypothesis WkWkW^{\prime}_{k}\subset W_{k} and observe that (cf. Definition 6.1)

Lx,y(Wk)Lx,y(Wk)Ly(Wk)Ly(W¯k),Rx,y(Wk)Rx,y(Wk)Rx(Wk)Rx(W¯k),\displaystyle L_{x,y}(W^{\prime}_{k})\subset L_{x,y}(W_{k})\subset L_{y}(W_{k})\subset L_{y}(\overline{W}_{k}),\quad\quad R_{x,y}(W^{\prime}_{k})\subset R_{x,y}(W_{k})\subset R_{x}(W_{k})\subset R_{x}(\overline{W}_{k}),

which subsequently imply Zk+1Zk+1Z_{k+1}\subset Z^{\prime}_{k+1} and thus Wk+1Wk+1W^{\prime}_{k+1}\subset W_{k+1}.

Lemma 7.5.

Theorem 2.5 is equivalent to Corollary 7.3. In particular, Theorem 2.5 is a consequence of Theorem 7.1.

Proof 7.6.

Let VV be an election over a finite set of alternatives CC and let the notations be as in Theorem 2.5 and Corollary 7.3. Then for all distinct alternatives x,yx,y, it is not hard to see that |Exy|=zx,ynxzy|E_{xy}|=\sum_{z\neq x,y}n_{xzy} and |EyxExy|=zx,ymax(0,nyzxnxzy)|E_{yx}\setminus E_{xy}|=\sum_{z\neq x,y}\max\left(0,n_{yzx}-n_{xzy}\right). Similarly, let Zk=C(Lx,y(Wk1)Rx,y(Wk1){x,y})Z^{\prime}_{k}=C\setminus\left(L_{x,y}\left({W}_{k-1}\right)\cup R_{x,y}\left({W}_{k-1}\right)\cup\{x,y\}\right) then for all k0k\geq 0, we have |Exy(k)|=zZknxzy|E^{(k)}_{xy}|=\sum_{z\in Z_{k}^{\prime}}n_{xzy} and |Exy(k)Eyx(k)|=zZkmax(0,nyzxnxzy)|E^{(k)}_{xy}\setminus E^{(k)}_{yx}|=\sum_{z\in Z^{\prime}_{k}}\max\left(0,n_{yzx}-n_{xzy}\right). Moreover, we infer from Lemma 5.6 that

δyz+δzx\displaystyle\delta_{yz}+\delta_{zx}
=\displaystyle=\,\, nyznzy+nzxnxz\displaystyle n_{yz}-n_{zy}+n_{zx}-n_{xz}
=\displaystyle=\,\, (nxyz+nyxz+nyzx)(nxzy+nzxy+nzyx)+(nyzx+nzyx+nzxy)(nyxz+nxyz+nxzy)\displaystyle(n_{xyz}+n_{yxz}+n_{yzx})-(n_{xzy}+n_{zxy}+n_{zyx})+(n_{yzx}+n_{zyx}+n_{zxy})-(n_{yxz}+n_{xyz}+n_{xzy})
=\displaystyle=\,\, 2(nyzxnxzy).\displaystyle 2(n_{yzx}-n_{xzy}).

Therefore, for every k0k\geq 0, the condition δxy>|Exy(k)Eyx(k)|\delta_{xy}>|E^{(k)}_{xy}\setminus E^{(k)}_{yx}| in Theorem 2.5 is equivalent to the condition 2δxy>zZkmax(0,δyz+δzx)2\delta_{xy}>\sum_{z\in Z^{\prime}_{k}}\max\left(0,\delta_{yz}+\delta_{zx}\right) in Corollary 7.3. The proof is thus complete.

8 33-wise Major Order Theorem with equality

In the case when we are only interested in obtaining the relative ordering of a pair of alternatives in some 33-wise median (and thus not necessarily in every 33-median), we can apply the following augmented version of the 33-wise Major Order Theorem 5.4.

Theorem 8.1 (3MOTe).

Let x,yx,y be two alternatives of an election with voting profile VV over a set of alternatives CC. Suppose that δxy0\delta_{xy}\geq 0 and

  1. (i)

    Qx,y,z0Q_{x,y,z}\geq 0 for all zC{x,y}z\in C\setminus\{x,y\},

  2. (ii)

    2δxyzx,ymax(0,Px,y,z+tx,y,zmax(0,Sy,x,z,t))2\delta_{xy}\geq\sum_{z\neq x,y}\max\left(0,-P_{x,y,z}+\sum_{t\neq x,y,z}\max\left(0,S_{y,x,z,t}\right)\right).

Then xx is ranked before yy in some 33-wise median of the election.

Proof 8.2.

The proof is the same, mutatis mutandis, as the proof of Theorem 5.4 where we notice that the inequality Δ1+Δ20\Delta_{1}+\Delta_{2}\geq 0 (cf. (5.6)) is no longer strict but it is enough to conclude that either the ranking σ1\sigma_{1}^{*} or σ2\sigma_{2}^{*} is at least as good as π\pi as a 33-wise consensus of the election. The conclusion follows since in the rankings σ1\sigma_{1}^{*} and σ2\sigma_{2}^{*}, we have x>yx>y.

9 Concluding remarks with extended set-wise Kemeny schemes

In real-life data, we can occasionally encounter incomplete rankings of alternatives, notably in surveys and political elections using for example the Single transferable vote method. These particular situations correspond very well to the reasonable assumption that typical voters only pay attention to the ranking of a shortlist of their favorite alternatives and put a rather random order for the rest of the alternatives in complete votes or simply do not indicate any preferences on such alternatives. To deal with such a situation using the 33-wise Kemeny voting scheme, we shall suppose that in an incomplete vote, all the alternatives which are not ranked will equally occupy the last position. The 33-wise Kendall-tau distance can still apply for such incomplete rankings as well as the notion of the 33-wise median, or more generally, kk-wise median for every k2k\geq 2. The above method clearly increases the range of applicability of set-wise Kemeny voting schemes. See Appendix C.1 and Table 2 for some examples with real elections. We also observe that 33-wise Kemeny voting scheme can be applied for committee elections where we would like to select a shortlist consisting of an arbitrarily fixed number of best alternatives according to certain criteria.

Appendix A 33-wise Major Order Theorem as a measure of consensus

We can observe that in an election, the proportion of pairs whose relative order are determined by the 33-wise Major Order Theorems (Theorem 5.4 and Theorem 6.2) can be used as a good measure of the consensus level of the electorates. Hence, a high percentage of such pairs means that the electorates, as a whole, agree on the relative rankings of a high percentage of pairs of alternatives, since every 33-wise median admits the same relative rankings on such pairs. We thus arrive as the following definition.

Definition A.1.

Let VV be an election over nn alternatives and let MM be the number of pairs (x,y)(x,y) which satisfy conditions (i) and (ii) in Theorem 5.4, and let MM^{*} be the number of pairs (x,y)(x,y) in k0Uk\cup_{k\geq 0}U_{k} in Theorem 6.2. Then the 3MOT\mathrm{3MOT}-consensus level of the election VV is defined by:

L3MOT(V)Mn(n1)2=2Mn(n1).L_{\mathrm{3MOT}}(V)\coloneqq\frac{M}{\frac{n(n-1)}{2}}=\frac{2M}{n(n-1)}. (A.1)

Similarly, the 3MOT\mathrm{3MOT^{*}}-consensus level of the election VV is defined by:

L3MOT(V)Mn(n1)2=2Mn(n1)L3MOT(V).L_{\mathrm{3MOT^{*}}}(V)\coloneqq\frac{M^{*}}{\frac{n(n-1)}{2}}=\frac{2M^{*}}{n(n-1)}\geq L_{\mathrm{3MOT}}(V). (A.2)

In particular, if the consensus levels L3MOT(V)L_{\mathrm{3MOT}}(V), L3MOT(V)L_{\mathrm{3MOT^{*}}}(V) of an election VV are high, then we have a lot of restrictions on the possibilities of 33-wise medians and thus the total number of possible 33-wise medians is small. Clearly, a small number of 33-wise medians of an election is a good indicator that the voters in the electorate tend to agree with each other. See Table 2 for the consensus level L3MOTL_{\mathrm{3MOT}} and L3MOTL_{\mathrm{3MOT^{*}}} of several real-world elections recorded in PREFLIB [18] and Table 3, Table 4 for the average consensus level of random elections for several different values of the numbers of alternatives and votes.

Appendix B 33-wise scheme can be more resistant to manipulation

Note that the unique 22-wise median of the election VV in Example 5.1 is t>z>x>yt>z>x>y. Therefore, the unique 22-wise winner is tt while arguably, the more convincing winner should be zz instead. Indeed, consider the following imaginary election over the same 4 alternatives x,y,z,tx,y,z,t representing 4 different political parties. Assume that the electorate consists of 11 million voters and the preferences of 10 million voters are fixed as follows.

r1:z>t>x>y (5 million votes),\displaystyle r_{1}:z>t>x>y\,\,\text{ (5 million votes)},\quad\quad r2:y>t>x>z (2 million votes),\displaystyle r_{2}:y>t>x>z\,\,\text{ (2 million votes)},
r3:x>y>t>z (0 vote),\displaystyle r_{3}:x>y>t>z\,\,\text{ (0 vote)},\quad\quad r3:x>y>z>t (1 million votes),\displaystyle r^{\prime}_{3}:x>y>z>t\,\,\text{ (1 million votes)},
r4:t>x>y>z (2 million votes).\displaystyle r_{4}:t>x>y>z\,\,\text{ (2 million votes)}.\quad\quad

Suppose that the remaining group GG of 1 million voters prefer xx and yy in that order and simply ignore the alternatives z,tz,t. Hence, we can reasonably assume that among these voters, roughly half of them chose t>zt>z (see r3r_{3}) and the other half chose z>tz>t (see r3r^{\prime}_{3}). Consider the following voting profile VV^{\prime} where exactly half of the group GG prefer zz over tt:

r1:z>t>x>y (5 million votes),\displaystyle r_{1}:z>t>x>y\,\,\text{ (5 million votes)},\quad\quad r2:y>t>x>z (2 million votes),\displaystyle r_{2}:y>t>x>z\,\,\text{ (2 million votes)},
r3:x>y>t>z (0.5 million votes),\displaystyle r_{3}:x>y>t>z\,\,\text{ (0.5 million votes)},\quad\quad r3:x>y>z>t (1.5 million votes),\displaystyle r^{\prime}_{3}:x>y>z>t\,\,\text{ (1.5 million votes)},
r4:t>x>y>z (2 million votes).\displaystyle r_{4}:t>x>y>z\,\,\text{ (2 million votes)}.\quad\quad

Then the only 22-wise medians of the election VV^{\prime} are r4r_{4} and r1r_{1}. Moreover, if more than half of the group GG prefer zz over tt then r1r_{1} is the unique 22-wise median. Similarly, if more than half of the group GG prefer tt over zz then r4r_{4} is the unique 22-wise median. In other words, the group GG ultimately decides zz or tt by the majority rule as the unique 22-wise winner. Consequently, if more than half of the group GG chose x>y>t>zx>y>t>z, for example by strategic voting or under the influence of manipulative forces and even bribery of the political party of tt, the ranking r4r_{4} will be the unique 22-wise median where tt surpasses zz and emerges as the unique 22-wise winner. On the other hand, the ranking r1r_{1} remains the unique 33-wise median and thus zz is the indisputable 33-wise winner of the election regardless of the preferences between zz and tt of voters in the group GG (by the computation with the voting profile VV), even if all the voters in GG chose x>y>t>zx>y>t>z. Therefore, we conclude that in such situations, the 33-wise Kemeny voting scheme should be preferred over the classical Kemeny rule to avoid undesirable outcomes and manipulations.

Appendix C Simulations on real-world and uniform data and applications

C.1 Examples with real-world data from PREFLIB

Example C.1.

To illustrate, consider the 2007 Glasgow City Council elections which were held by Ward (cf. [18, Dataset 00008] and the data available on Wikipedia). Out of the total 21 Wards, the EastCentre Ward was allocated 4 seats in the city council which were chosen using the Single transferable vote method among 13 alternatives by 9078 valid votes. The alternatives are:

  1. 1)

    Jim Adams

  2. 2)

    Patricia Chalmers

  3. 3)

    Elaine Cooper

  4. 4)

    Drew Dickie

  5. 5)

    Frank Docherty

  6. 6)

    Jennifer Dunn

  7. 7)

    Stuart Grieve

  8. 8)

    David Johnston

  9. 9)

    John Kerr

  10. 10)

    Elaine Mcdougall

  11. 11)

    William Mclachlan

  12. 12)

    Daniel O’Donnell

  13. 13)

    Randle Wilson

We shall identify each alternative with the order given above. Under the above described extended 33-wise Kemeny voting scheme, the 33-wise Major Order Theorem 5.4 determines 67 pairs (out of 78 possible pairs) of the relative order of the form (x,y)(x,y), which means that x>yx>y in every 33-wise median:

(1,4),(1,8),(1,9),(1,11),(1,12),(1,13),(2,1),(2,3),(2,4),(2,7),(2,8),(2,9),\displaystyle(1,4),(1,8),(1,9),(1,11),(1,12),(1,13),(2,1),(2,3),(2,4),(2,7),(2,8),(2,9),
(2,11),(2,12),(2,13),(3,4),(3,8),(3,9),(3,11),(3,12),(3,13),(4,8),(4,11),\displaystyle(2,11),(2,12),(2,13),(3,4),(3,8),(3,9),(3,11),(3,12),(3,13),(4,8),(4,11),
(5,1),(5,3),(5,4),(5,7),(5,8),(5,9),(5,10),(5,11),(5,12),(5,13),\displaystyle(5,1),(5,3),(5,4),(5,7),(5,8),(5,9),(5,10),(5,11),(5,12),(5,13),
(6,1),(6,3),(6,4),(6,7),(6,8),(6,9),(6,11),(6,12),(6,13),\displaystyle(6,1),(6,3),(6,4),(6,7),(6,8),(6,9),(6,11),(6,12),(6,13),
(7,4),(7,8),(7,9),(7,11),(7,12),(7,13),(9,8),(9,11),\displaystyle(7,4),(7,8),(7,9),(7,11),(7,12),(7,13),(9,8),(9,11),
(10,1),(10,3),(10,4),(10,7),(10,8),(10,9),(10,11),(10,12),(10,13),\displaystyle(10,1),(10,3),(10,4),(10,7),(10,8),(10,9),(10,11),(10,12),(10,13),
(12,4),(12,8),(12,9),(12,11),(13,4),(13,8),(13,9),(13,11).\displaystyle(12,4),(12,8),(12,9),(12,11),(13,4),(13,8),(13,9),(13,11).

Therefore, in every 33-wise median, we have

5\displaystyle 5 >1,3,4,7,8,9,10,11,12,13\displaystyle>1,3,4,7,8,9,10,11,12,13
2,6,10\displaystyle 2,6,10 >1,3,4,7,8,9,11,12,13.\displaystyle>1,3,4,7,8,9,11,12,13.

Hence, according to the 33-wise Kemeny scheme, the alternatives 2,5,6,10, namely, Patricia Chalmers, Frank Docherty, Jennifer Dunn, and Elaine Mcdougall, must win the Ward election. It turns out that the above 33-wise election result coincides with the official result using the Single transferable vote method used in the 2007 Glasgow City Council elections.

Similarly, for the Shettleston Ward, there were 8803 valid votes to determine 4 seats from 11 alternatives:

  1. 1)

    Mick Eyre

  2. 2)

    Walter Hamilton

  3. 3)

    Wheatley Harris

  4. 4)

    David Jackson

  5. 5)

    Gordon Kirker

  6. 6)

    Catherine Maguire

  7. 7)

    Tom Mckeown

  8. 8)

    John F. Mclaughlin

  9. 9)

    Euan Mcleod

  10. 10)

    George Ryan

  11. 11)

    Alex White

The 33-wise Major Order Theorem 5.4 determines the relative order of 45 pairs (out of 55 possible pairs) from which we deduce that:

7,8,9,10\displaystyle 7,8,9,10 >1,2,3,4,5,6,11,\displaystyle>1,2,3,4,5,6,11,

in every 33-wise median. Hence, if we were to follow the 33-wise Kemeny voting scheme, the 4 seats would belong to Tom Mckeown, John F. Mclaughlin, Euan Mcleod, and George Ryan, which is exactly the same official result of the election using the Single transferable vote method.

The Newlands Ward has 3 seats to be determined from 8654 valid votes among 9 alternatives:

  1. 1)

    Kay Allan

  2. 2)

    Charlie Baillie

  3. 3)

    Eamonn Coyle

  4. 4)

    Stephen Curran

  5. 5)

    Colin Deans

  6. 6)

    Robert Mcelroy

  7. 7)

    Jim Mcnally

  8. 8)

    Gordon Morgan

  9. 9)

    Robert W Stewart

The 33-wise Major Order Theorem 5.4 implies that 4,5,7>1,2,3,6,7,84,5,7>1,2,3,6,7,8 in every 33-wise Kemeny. Hence, Stephen Curran, Colin Deans, and Jim Mcnally win according to the 33-wise Kemeny rule. Again, the same official result holds using the Single transferable vote method.

Our next example concerns one of the most important elections conducted with the Instant Runoff Voting (IRV) method.

Example C.2.

Consider the UK Labor Party Leadership election in 2010 (see [18, Dataset 00030] and rangevoting.org/LabourUK2010\mathrm{rangevoting.org/LabourUK2010}) whose valid electorate consists of 266 voters to vote for the following 5 alternatives identified with their number:

  1. 1)

    Diane Abbott

  2. 2)

    Ed Balls

  3. 3)

    Andy Burnham

  4. 4)

    David Miliband

  5. 5)

    Ed Miliband.

While the number of pairs of the form (x,y)(x,y), which means that x>yx>y in every median, determined by the Unanimity property is zero, the Extended 22-wise Always Theorem [21] applies and finds 6 such pairs out of 10 pairs in total, namely, (2,1)(2,1), (3,1)(3,1), (4,1)(4,1), (5,1)(5,1), (4,3)(4,3), (5,3)(5,3). Hence, we deduce that 4,5>3>14,5>3>1 and 2>12>1 in every 22-wise median of the UK Labor Leadership election in 2010. In fact, by the 22-wise Major Order Theorem (Theorem 2.4), the unique 22-wise median of the election is 4>5>2>3>14>5>2>3>1.

With respect to the 33-wise Kemeny rule, the 33-wise Extended Always Theorem [21] (Theorem C.4) determines the preferences of each pair (in the given order) (2,1)(2,1), (3,1)(3,1), (4,1)(4,1), (5,1)(5,1). Therefore, we have in every 33-wise median that 2,3,4,5>12,3,4,5>1.

On the other hand, the 3MOT (Theorem 5.4) determines 9 out of the total 10 pairs:

(2,1),(2,3),(3,1),(4,1),(4,2),(4,3),(5,1),(5,2),(5,3)(2,1),(2,3),(3,1),(4,1),(4,2),(4,3),(5,1),(5,2),(5,3)

The remaining pair (4,5)(4,5) is found by Iterated 3MOT (Theorem 6.2) after two iterations and thus the unique 33-wise median of the election is given by

4>5>2>3>14>5>2>3>1

which coincides with the unique 22-wise median. Consequently, the unique 33-wise winner is David Miliband, who is also the winner with plain plurality (first-past-the-post), a method used in a preceding election in 1994. Note that the official winner (under the IRV method) is Ed Miliband, who won only by a small margin against David Miliband.

Example C.3.

Table 2 describes the efficiency of the 33-wise Major Order Theorems on real-world examples taken from PREFLIB of elections with a medium or large number of alternatives in various domains ranging from sport competitions, web search, and university rankings to food and music preferences. The Python codes for Table 2 are available at https://github.com/XKPhung.

Table 2: Proportion of pairs (out of all possible pairs) with relative order solved by the 3-wise Major Order Theorems 5.4 and 6.2 for real-world elections taken from PREFLIB [18] where nn is the total number of alternatives, mm is the total number of votes, and n(n1)/2n(n-1)/2 is the total number of pairs of alternatives. In the column Iterated 3MOT, (kk iter.) signifies that Iterated 3MOT does not generate new information starting from the (k+1)(k+1)-th iteration. The symbol \geq in the column Iterated 3MOT means that, due to lack of time, we did not run Iterated 3MOT on the corresponding instance. However, the result must be at least as good as 3MOT. The last column gives a lower bound on the reduction rate of the search space by Iterated 3MOT (cf. (1.1)). Thus, the size of the search space for 3-wise medians is bounded by n!/(reduction rate)n!/(\text{reduction rate}).
Dataset number n m n(n - 1)/2 3MOT Iterated 3MOT Reduction rate \geq
00056-00003990.soc 256 17 32640 60.82%\% \geq 60.82%\% 2.94×10692.94\times 10^{69}
00015-00000002.soc 240 5 28680 56.86%\% \geq 56.86%\% 1.74×10551.74\times 10^{55}
00015-00000023.soc 142 4 10011 45.71%\% \geq 45.71%\% 4.39×10184.39\times 10^{18}
00056-00003986.soc 130 5 8385 87.67%\% \geq 87.67%\% 4.66×10954.66\times 10^{95}
00015-00000033.soc 128 4 8128 45.14%\% \geq 45.14%\% 1.62×10161.62\times 10^{16}
00015-00000018.soc 115 4 6555 46.90%\% 48.88%\% (3 iter.) 8.82×10178.82\times 10^{17}
00014-00000002.soi 100 5000 4950 85.17%\% \geq 85.17%\% 2.91×10652.91\times 10^{65}
00015-00000069.soc 81 4 3240 52.65%\% 59.14%\% (10 iter.) 9.55×10199.55\times 10^{19}
00049-00000010.soi 64 11 2016 72.47%\% 93.80%\% (19 iter.) 5.20×10585.20\times 10^{58}
00049-00000049.soi 55 27 1485 40.60%\% 43.43%\% (4 iter.) 1.15×1061.15\times 10^{6}
00049-000000114.soi 40 40 780 64.49%\% 73.97%\% (4 iter.) 1.58×10161.58\times 10^{16}
00049-00000083.soi 38 38 703 57.75%\% 64.01%\% (4 iter.) 5.48×10105.48\times 10^{10}
00049-00000116.soi 29 45 406 64.29%\% 70.44%\% (3 iter.) 9.45×1099.45\times 10^{9}
00006-00000028.soc 24 9 276 89.86%\% 97.46%\% (3 iter.) 1.05×10211.05\times 10^{21}
00052-00000070.soc 20 21 190 62.63%\% 94.74%\% (5 iter.) 5.19×10145.19\times 10^{14}
00053-00000362.soi 19 68 171 78.36%\% 96.49%\% (4 iter.) 5.13×10145.13\times 10^{14}
00053-00000383.soi 17 50 136 67.65%\% 92.65%\% (4 iter.) 9.28×10109.28\times 10^{10}
00035-00000002.soc 15 42 105 50.48%\% 68.57%\% (4 iter.) 1.69×1041.69\times 10^{4}
00035-00000003.soc 15 42 105 71.43%\% 90.48%\% (4 iter.) 4.04×1084.04\times 10^{8}
00035-00000004.soc 15 42 105 41.90%\% 79.05%\% (5 iter.) 9.21×1059.21\times 10^{5}
00035-00000005.soc 15 42 105 35.24%\% 72.38%\% (5 iter.) 6.40×1046.40\times 10^{4}
00035-00000006.soc 15 42 105 48.57%\% 88.57%\% (5 iter.) 1.21×1081.21\times 10^{8}
00035-00000007.soc 15 42 105 66.67%\% 91.43%\% (4 iter.) 7.64×1087.64\times 10^{8}
00001-00000003.soi 14 6481 91 79.12%\% 92.31%\% (2 iter.) 2.10×1082.10\times 10^{8}

C.2 Tests on uniform data

For the hard case of uniformly generated data, we shall compare the performance of our 33-wise Major Order Theorems with the following two theoretical space reduction results recently found in [21]. The first one is the 33-wise Extended Always theorem which generalizes the unanimity property [14, Proposition 5] also known as the Pareto efficiency and the second one is the 5/65/6-majority rule which is the 33-wise counterpart of the well-known 3/43/4-majority rule of Betzler et al. [6].

Theorem C.4 (3AT, [21]).

Let x,yx,y be candidates in an election with n2n\geq 2 candidates. Suppose that xαyx\geq_{\alpha}y for some α[0,1]\alpha\in[0,1] such that

α>g(n)=11n23n+4.\alpha>g(n)=1-\frac{1}{n^{2}-3n+4}.

Then x>yx>y in every 33-wise median of the election.

Theorem C.5 (5/6-majority rule, [21]).

Let xx be a non-dirty candidate in an election with respect to the 5/65/6-majority rule and let I={zx:z5/6x}I=\{z\neq x\colon z\geq_{5/6}x\}. Let rr be a 33-wise median such that z>xz>x in rr for all zIz\in I. Suppose that |I|(|I|4)3|{z:x>z in r}||I|(|I|-4)\leq 3|\{z\colon x>z\text{ in }r\}|. Then for every candidate yxy\neq x with x5/6yx\geq_{5/6}y, we have x>yx>y in rr.

The simulation results are given Table 3 and Table 4 below. Observe that in general, the performance of the 33-wise Major Order Theorems is better when the number mm of votes is odd than when the number of votes (with a comparable size) is even. This phenomenon is explained by the property that unlike the case when mm is odd, the probability that δxy=0\delta_{xy}=0 is strictly higher than 0 when mm is even, which reduces the applicability of the 33-wise Major Order Theorems. Moreover, as for the classical 3/43/4-majority rule of Betzler et al. [6], the applicability of the 33-wise 5/65/6 majority rule drops quickly to almost 0%0\% on average for elections with at least 1010 alternatives and at least 1515 votes. Hence, for ease of reading, we only report the simulation results for the 33-wise Extended Always Theorem [21] and the 3-wise Major Order Theorems 5.4, 6.2, and 8.1. The Python codes for Table 3, Table 4 are available at https://github.com/XKPhung. Note also that for each pair (n,m)(n,m) with 3n,m303\leq n,m\leq 30, the approximate average maximal number of iterations of Iterated 3MOT ranges from 2 to 6. For Kemeny’s rule, due to lack of time, we do not provide similar complete comparison between the performance of Theorem 7.1 and Theorem 2.5 on uniformly generated data. However, by realizing several quick simulations, we find that for n,m30n,m\leq 30, Theorem 7.1 can solve approximately from 2%\% to 7%\% more pairs than Theorem 2.5.

Table 3: Proportion of pairs with relative order solved by the 33-wise Extended Always Theorem [21] and the 3-wise Major Order Theorems 5.4, 6.2, and 8.1 over 100 000 uniformly generated random instances where nn is the total number of alternatives and mm is the total number of votes.
n m 3AT 3MOT 3MOTe Iterated 3MOT
3 3 25.11% 86.14% 94.50% 94.46%
- 4 12.44% 61.39% 98.30% 64.48%
- 5 37.57% 86.16% 86.16% 94.63%
- 6 21.76% 65.42% 97.56% 68.55%
- 7 12.51% 83.17% 86.05% 93.75%
- 8 7.00% 68.10% 93.36% 71.76%
- 9 18.03% 81.91% 84.82% 91.90%
- 10 11.04% 68.79% 92.30% 73.15%
- 15 3.51% 81.13% 82.85% 90.92%
4 3 24.93% 75.62% 81.44% 89.63%
- 4 12.44% 59.35% 80.53% 64.42%
- 5 6.26% 72.59% 75.12% 88.37%
- 6 3.10% 60.55% 78.52% 65.75%
- 7 1.52% 70.10% 73.96% 86.20%
- 8 0.78% 61.55% 75.45% 68.14%
- 9 3.87% 69.40% 72.11% 84.19%
- 10 2.12% 61.75% 74.95% 68.99%
- 15 0.10% 68.02% 69.03% 82.21%
5 3 25.05% 66.46% 71.34% 85.20%
- 4 12.52% 54.90% 68.96% 60.90%
- 5 6.26% 62.58% 65.63% 81.97%
- 6 3.15% 55.43% 66.53% 62.12%
- 7 1.59% 60.46% 64.13% 78.60%
- 8 0.80% 55.34% 64.00% 63.77%
- 9 0.40% 59.85% 62.29% 76.85%
- 10 0.21% 55.23% 63.52% 64.17%
- 15 0.10% 58.57% 60.35% 74.49%
Table 4: Proportion of pairs with relative order solved by the 33-wise Extended Always Theorem [21] and the 3-wise Major Order Theorems 5.4, 6.2, and 8.1 over 100 000 random instances where nn is the total number of alternatives and mm is the total number of votes. For 3MOT and 3MOTe with n=15n=15, the simulation is realized with random 10 000 instances. For n=8n=8, resp. n{10,15}n\in\{10,15\}, the simulation for Iterated 3MOT is realized with random 10 000, resp. 1000, instances. For n=20n=20, all the simulations are realized with random 1000 instances. All instances are uniformly generated.
n m 3AT 3MOT 3MOTe Iterated 3MOT
8 3 24.93% 48.97% 52.10% 72.12%
- 4 12.52% 43.22% 48.86% 50.57%
- 5 6.25% 43.97% 46.71% 64.33%
- 6 3.12% 41.71% 46.00% 50.82%
- 7 1.56% 42.45% 44.93% 59.95%
- 8 0.79% 40.91% 44.35% 51.15%
- 9 0.39% 41.94% 43.76% 58.32%
- 10 0.19% 40.38% 43.57% 50.83%
- 15 0.01% 40.57% 41.92% 55.65%
10 3 24.97% 42.83% 45.14% 65.00%
- 4 12.51% 37.37% 41.14% 45.24%
- 5 6.23% 36.64% 38.85% 54.17%
- 6 3.13% 35.14% 38.01% 41.44%
- 7 1.55% 35.21% 37.17% 50.60%
- 8 0.79% 34.18% 36.52% 44.02%
- 9 0.39% 34.59% 36.08% 50.07 %
- 10 0.20% 33.62% 35.75% 43.48%
- 15 0.01% 33.22% 34.34% 46.74%
15 3 24.99% 35.20% 36.43% 51.36%
- 4 12.49% 28.14% 30.01% 35.45%
- 5 6.27% 25.92% 27.32% 38.54%
- 6 3.12% 24.72% 26.22% 32.62%
- 7 1.57% 24.19% 25.41% 34.56%
- 8 0.78% 23.76% 25.01% 31.90%
- 9 0.39% 23.5% 24.51% 33.16%
- 10 0.20% 23.17% 24.28% 30.82%
- 15 0.01% 22.25% 22.98% 30.91%
20 3 25.38% 32.07% 32.81% 42.90%
- 4 12.40% 23.04% 24.13% 28.65%
- 5 6.06% 19.77% 20.74% 28.11%
- 6 3.16% 18.89% 19.88% 25.16%
- 7 1.56% 18.33% 19.16% 25.68%
- 8 0.77% 17.69% 18.50% 23.76%
- 9 0.39% 17.14% 17.87% 24.27%
- 10 0.19% 17.12% 17.87% 23.30%
- 15 0.01% 16.29% 16.82% 22.44%

References

  • [1] N. Ailon, M. Charikar, and A. Newman. Aggregating inconsistent information: Ranking and clustering. J. ACM, 55(5):1–27, 2008.
  • [2] A. Ali and M. Meilă. Experiments with Kemeny ranking: What works when? Mathematical Social Science, 64:28–40, 2012.
  • [3] P. Andrieu, B. Brancotte, L. Bulteau, S. Cohen-Boulakia, A. Denise, A. Pierrot, and S. Vialette. Efficient, robust and effective rank aggregation for massive biological datasets. Future Generation Computer Systems, 124:406–421, 2021.
  • [4] K. Arrow, A. Sen, and K. Suzumura. Handbook of Social Choice and Welfare, volume 1. Elsevier, 2002.
  • [5] K.J. Arrow. k-majority digraphs and the hardness of voting with a constant number of voters. Journal of Computer and System Sciences, 105:130–157, 2019.
  • [6] N. Betzler, R. Bredereck, and R. Niedermeier. Theoretical and empirical evaluation of data reduction for exact Kemeny Rank Aggregation. Auton Agent Multi-Agent Syst, 28:721–748, 2014. doi:https://doi.org/10.1007/s10458-013-9236-y.
  • [7] G. Blin, M. Crochemore, S. Hamel, and S. Vialette. Median of an odd number of permutations. Pure Mathematics and Applications, 21(2):161–175, 2010.
  • [8] B. Brancotte, B. Rance, A. Denise, and S. Cohen-Boulakia. ConQuR-Bio: Consensus ranking with query reformulation for biological data. Data Integration in the Life Sciences, pages 128–142, 2014.
  • [9] V. Conitzer, A. Davenport, and J. Kalagnanam. Improved bounds for computing Kemeny rankings. Proceedings of the 21st conference on Artificial intelligence, AAAI’06, 1:620–626, 2006.
  • [10] A. Davenport and J. Kalagnanam. A Computational Study of the Kemeny Rule for Preference Aggregation. AAAI’04: Proceedings of the 19th national conference on Artifical intelligence, pages 697–702, 2004.
  • [11] Marquis de Condorcet. Essai sur l’application de l’analyse à la probabilité des décisions rendues à la pluralité des voix. Imprimerie Royale, Paris (1785). Translated in English in I. McLean and E. Hewitt, eds., Condorcet: Foundations of Social Choice and Political Theory (Edward Elgar, Aldershot, England) pp. 120–158 (1994).
  • [12] C. Dwork, R. Kumar, M. Naor, and D. Sivakumar. Rank aggregation methods for the web. WWW ’01: Proceedings of the 10th international conference on World Wide Web, May 2001, pages 613–622. doi:https://doi.org/10.1145/371920.372165.
  • [13] H. Gilbert, T. Portoleau, and O. Spanjaard. Beyond Pairwise Comparisons in Social Choice: A Setwise Kemeny Aggregation Problem. Proceedings of the AAAI Conference on Artificial Intelligence, 34(02):1982–1989, 2020. doi:https://doi.org/10.1609/aaai.v34i02.5569.
  • [14] H. Gilbert, T. Portoleau, and O. Spanjaard. Beyond pairwise comparisons in social choice: A setwise Kemeny aggregation problem. Theoretical Computer Science, 904:27–47, 2022. doi:https://doi.org/10.1016/j.tcs.2021.07.004.
  • [15] J. Kemeny. Mathematics without numbers. Daedalus, 88:577–591, 1959.
  • [16] J. Kemeny and L. Snell. Mathematical Models in the Social Sciences. Ginn, Boston, 1960.
  • [17] R. Kumar and S. Vassilvitskii. Generalized distances between rankings. in: Proceedings of the 19th International Conference on World Wide Web, WWW 2010, Raleigh, North Carolina, USA, April 26-30, 2010, pages 571–580, 2010.
  • [18] N. Mattei and T. Walsh. PREFLIB: A Library for Preferences http://www.preflib.org. In: Perny, P., Pirlot, M., Tsoukiàs, A. (eds) Algorithmic Decision Theory. ADT 2013. Lecture Notes in Computer Science, 8176. doi:https://doi.org/10.1007/978-3-642-41575-3_20.
  • [19] R. Milosz and S. Hamel. Space reduction constraints for the median of permutations problem. Discrete Applied Mathematics, 280:201–213, 2020. doi:https://doi.org/10.1016/j.dam.2018.03.076.
  • [20] N. Nishimura and N. Simjour. Parameterized Enumeration of (Locally-) Optimal Aggregations. In: Dehne, F., Solis-Oba, R., Sack, JR. (eds) Algorithms and Data Structures. WADS 2013. Lecture Notes in Computer Science, Springer, Berlin, Heidelberg, 8037, 2013. doi:https://doi.org/10.1007/978-3-642-40104-6_44.
  • [21] X.K. Phung and S. Hamel. Optimal majority rules and quantitative Condorcet properties of setwise Kemeny voting schemes. Preprint, available on arXiv from May 1st 2023.
  • [22] F. Schalekamp and A. van Zuylen. Rank aggregation: Together we’re strong. pages 38–51. doi:10.1137/1.9781611972894.4.
  • [23] J. H. Smith. Aggregation of preferences with variable electorate. Econometrica, 41:1027–1041, 1973.
  • [24] M. Truchon. An Extension of the Condorcet Criterion and Kemeny orders. Technical report, cahier 98–15 du Centre de Recherche en Économie et Finance Appliquées, Université Laval, Québec, Canada.
  • [25] H. P. Young and A. Levenglick. A Consistent Extension of Condorcet’s Election Principle. SIAM Journal on Applied Mathematics, 35(2):285–300, 1978.
  • [26] H.P. Young. Condorcet’s Theory of Voting. American Political Science Rev., 82(4):1231–1244, 1988.