Rank Aggregation Methods for the Web CS728 Lecture 11.

Slides:



Advertisements
Similar presentations
A Simple Proof "There is no consistent method by which a democratic society can make a choice (when voting) that is always fair when that choice must be.
Advertisements

6.896: Topics in Algorithmic Game Theory Lecture 18 Constantinos Daskalakis.
Web Information Retrieval
Approximation, Chance and Networks Lecture Notes BISS 2005, Bertinoro March Alessandro Panconesi University La Sapienza of Rome.
Common Voting Rules as Maximum Likelihood Estimators Vincent Conitzer (Joint work with Tuomas Sandholm) Early version of this work appeared in UAI-05.
Presented by: Katherine Goulde
Voting and social choice Vincent Conitzer
Algorithmic Game Theory Uri Feige Robi Krauthgamer Moni Naor Lecture 9: Social Choice Lecturer: Moni Naor.
Voting and social choice Looking at a problem from the designers point of view.
The Voting Problem: A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor SIUC.
NOTE: To change the image on this slide, select the picture and delete it. Then click the Pictures icon in the placeholder to insert your own image. CHOOSING.
How “impossible” is it to design a Voting rule? Angelina Vidali University of Athens.
IMPOSSIBILITY AND MANIPULABILITY Section 9.3 and Chapter 10.
CS 886: Electronic Market Design Social Choice (Preference Aggregation) September 20.
Voting Theory.
Computing Kemeny and Slater Rankings Vincent Conitzer (Joint work with Andrew Davenport and Jayant Kalagnanam at IBM Research.)
Theory of Computing Lecture 3 MAS 714 Hartmut Klauck.
Social Choice: The Impossible Dream Michelle Blessing February 23, 2010 Michelle Blessing February 23, 2010.
Excursions in Modern Mathematics Sixth Edition
Using a Modified Borda Count to Predict the Outcome of a Condorcet Tally on a Graphical Model 11/19/05 Galen Pickard, MIT Advisor: Dr. Whitman Richards,
1 By Gil Kalai Institute of Mathematics and Center for Rationality, Hebrew University, Jerusalem, Israel presented by: Yair Cymbalista.
CPS Voting and social choice
Social choice theory = preference aggregation = voting assuming agents tell the truth about their preferences Tuomas Sandholm Professor Computer Science.
Rank Aggregation Methods II Experiments CS728 Lecture 12.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 April 20, 2005
CRITERIA FOR A FAIR ELECTION
Social Choice Theory By Shiyan Li. History The theory of social choice and voting has had a long history in the social sciences, dating back to early.
Common Voting Rules as Maximum Likelihood Estimators Vincent Conitzer and Tuomas Sandholm Carnegie Mellon University, Computer Science Department.
Preference Analysis Joachim Giesen and Eva Schuberth May 24, 2006.
Social choice theory = preference aggregation = truthful voting Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.
CS522: Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian
1 The Process of Computing Election Victories Computational Sociology: Social Choice and Voting Methods CS110: Introduction to Computer Science – Lab Module.
Social choice (voting) Vincent Conitzer > > > >
Social Choice Lecture 19 John Hey.
CPS Voting and social choice Vincent Conitzer
Group Recommendations with Rank Aggregation and Collaborative Filtering Linas Baltrunas, Tadas Makcinskas, Francesco Ricci Free University of Bozen-Bolzano.
Information Networks Rank Aggregation Lecture 10.
Math for Liberal Studies.  We have seen many methods, all of them flawed in some way  Which method should we use?  Maybe we shouldn’t use any of them,
Great Theoretical Ideas in Computer Science.
Let’s take a class vote. How many of you are registered to vote?
Social choice theory = preference aggregation = voting assuming agents tell the truth about their preferences Tuomas Sandholm Professor Computer Science.
1 Learning to Rank --A Brief Review Yunpeng Xu. 2 Ranking and sorting Rank: only has K structured categories Sorting: each sample has a distinct rank.
Arrow’s Impossibility Theorem. Question: Is there a public decision making process, voting method, or “Social Welfare Function” (SWF) that will tell us.
Chapter 9: Social Choice: The Impossible Dream Lesson Plan Voting and Social Choice Majority Rule and Condorcet’s Method Other Voting Systems for Three.
Excursions in Modern Mathematics, 7e: 1.Conclusion - 2Copyright © 2010 Pearson Education, Inc. 1 The Mathematics of Voting CONCLUSION Elections, Fairness,
Random Sampling Algorithms with Applications Kyomin Jung KAIST Aug ERC Workshop.
Voting System Review Borda – Sequential Run-Off – Run-Off –
Arrow’s Impossibility Theorem
Ashish Das Indian Institute of Technology Bombay India.
1 The Process of Computing Election Victories Computational Sociology: Social Choice and Voting Methods CS110: Introduction to Computer Science – Lab Module.
Algorithms for Large Data Sets
Impossibility and Other Alternative Voting Methods
1.
Social choice theory = preference aggregation = voting assuming agents tell the truth about their preferences Tuomas Sandholm Professor Computer Science.
Applied Mechanism Design For Social Good
8.2 Voting Possibilities and Fairness Criteria
Impossibility and Other Alternative Voting Methods
Introduction If we assume
Rank Aggregation.
Alex Tabarrok Arrow’s Theorem.
5-2 Election Theory Flaws of Voting.
Models and Algorithms for Complex Networks
Voting systems Chi-Kwong Li.
MULTIDIMENSIONAL RANKING
Voting and social choice
CPS 173 Voting and social choice
Quiz – 1/24 - Friday How many people voted in the election?
Flaws of the Voting Methods
CPS Voting and social choice
Outline Rank Aggregation Computing aggregate scores
Presentation transcript:

Rank Aggregation Methods for the Web CS728 Lecture 11

Web Page Ranking Methods Reviewed PageRank – global link analysis Indegree – local link analysis HITS- topic-based link analysis Voting –NNN and Correlation Graph distance from seed URL length and depth Text-based methods (e.g., tf*idf)

Rank Aggregation BDCABDCA ABDCFEABDCFE BCDAFEBCDAFE “Consensus” ranking of all BDCAFEBDCAFE

Notations for Ranking Given a universe U, and ordered list τ of a subset of S of U τ=[x 1 ≥ x 2 ≥… ≥x d ], x i in S τ(i) : position of rank of i |τ|: number of elements full list : τ which contains all the elements in U partial list : rank only some of elements in U top d list : all d ranked elements are above all unranked elements Question: when are two orderings similar? Can you give a distance measure?

Measuring Distance Between Orderings Spearman’s Footrule Distance –σ, τ : two full list. –σ( i ) :rank of candidate i Kendall tau distance –Count the number of pairwise disagreements between the two lists

Example of Ordered-List Distance Example –S = {A,B,C,D,E} –σ, τ : two full list Spearman’s Footrule Distance –F(σ, τ ) = = 6 Kendall tau distance –K(σ, τ ) = |{(A,C), (B.D), (B,E), (D,E)}| = 4 ACEDBACEDB CABDECABDE τσ

Optimal ranking aggregation Optimality depends on the distance measure we use. Optimizing with Kendall tau distance, we obtain Kemeny optimal aggregation Can show satisfies neutrality and consistency –important properties of rank aggregation functions. Useful but computationally hard. Kemeny optimal aggregation is NP-hard. Will show that footrule-optimal is in P.

Two properties relate K and F For any full lists σ,τ K(σ,τ) ≤ F(σ,τ) ≤ 2 K(σ,τ) So we get a 2-approximation to Kemeny-optimality Since, if σ is the Kemeny optimal aggregation of full lists τ 1,…, τ k and σ’ optimizes the footrule aggregation then, K(σ’, τ 1,…, τ k ) ≤ 2 K(σ, τ 1,…, τ k )

Condorcet Criterion –An element of S which wins every other in pairwise simple majority voting should be ranked first. Extended Condorcet Criterion (XCC): –If most voters prefer candidate a to candidate b (i.e., # of i s.t.  i (a) <  i (b) is at least n/2), then also  should prefer a to b (i.e.,  (a) <  (b)). XCC is effective in ‘spam-fighting’ and thus good to use in meta-search. Condorcet Criteria and SPAM Filters

XCC: Not always realizable cba aab bcc  a) <  (b) <  (c) cba acb bac Not realizable

Voting Theory: Desired Properties Given set of candidates and voter preferences: seek an algorithm that ranks candidates which satisfies a set of desired properties Which combination of properties are realizable? 1) Independence from Irrelevant Alternatives: Relative order of a and b in  should depend only on relative order of a and b in  1,…,  n. –Ex: if  i = (a b c) changes to (a c b), relative order of a,b in  should not change.

Desired Properties: 2) Neutrality No candidate should be favored to others. –If two candidates switch positions in  1,…,  n, they should switch positions also in . 3) Anonymity No voter should be favored to others. –If two voters switch their orderings,  should remain the same.

Desired Properties: 4) Monotonicity If the ranking of a candidate is improved by a voter, its ranking in  can only improve. 5) Consistency If voters are split into two disjoint sets, S and T, and both the aggregation of voters in S and the aggregation of voters in T prefer a to b, then also the aggregation of all voters should prefer a to b.

Desired Properties 6) No Dictatorship: f(  1,…,  n ) !=  I 7) Unanimity (a.k.a. Pareto optimality): If all voters prefer candidate a to candidate b (i.e.,  i (a) <  i (b) for all i), then also  should prefer a to b (i.e.,  (a) <  (b)).

Desired Properties 8) Democracy: satisfies extended Condorcet Criterion XCC. –Always works for m = 2. –Not always realizable for m ≥ 3. Theorem [May, 1952]: For m = 2, Democracy is the only rank aggregation function which is monotone, neutral, and anonymous.

Arrow’s Impossibility Theorem [Arrow, 1951] Theorem: If m ≥ 3, then the only rank aggregation function that is unanimous and independent from irrelevant alternatives is dictatorship. –Won Nobel prize (1972)

Borda’s method Easy and intuitive - Several “score-based”variants; 1781 Violates independence from irrelevant alternatives C3 C1. C7 C8 C10 C7 C1. C8 C3 C10 C3 C2. C7 C10 C9 C3 C8. C1 C15 C10  1  2  3  4 B i (c)=the number of candidates ranked below c in  i B (c)=  i B i (c) Sorted in decreasing order B i (C8) =

Partial lists Handle partial lists by giving all the excess scores equally among all unranked candidates, Example: Candidates number = 100 Ranked candidates number =70 (score: 31~100) =>Assign score 31/30 to each 30 unranked candidates

Footrule optimal aggregation Footrule optimal aggregation can be computed in polynomial time. is a good approximation of Kemeny optimal aggregation. Proof : Via minimum cost perfect matching

Markov Chain method for rank aggregation. States=candidates Transitions depend on the preference orders given by voters Basic idea: probabilistically switch to a “better candidate” Rank candidates based on stationary probabilities!

Markov chain advantages Handling partial list and top d list by using available comparisons to infer new ones Handling uneven comparison and list length Computation efficiency –O(NK) preprocessing,O(K) per step for about O(N) steps

Four ways to build transition Matrix Current state is candidate a. MC1: Choose uniformly from multiset of all candidates that were ranked at least as high as a by some voter. – Probability to stay at a: ~ average rank of a. MC2: Choose a voter i uniformly at random and pick uniformly at random from among the candidates that the i-th voter ranked at least as high as a. MC3: Choose a voter i uniformly at random and pick uniformly at random a candidate b. If i-th voter ranked b higher than a, go to b. Otherwise, stay in a. MC4: Choose a candidate b uniformly at random If most voters ranked b higher than a, go to b. Otherwise, stay in a. – Rank of a ~ # of “pairwise contests” a wins.

A locally Kemeny optimal aggregation is a relaxation of Kemeny Optimality A locally Kemeny optimal aggregation satisfies the extended Condorcet property and can be computed in “kO(nlogn)” worst case, O(n 2 ) Many of existing aggregation methods do not satisfy ECC. =>Given τ 1, …,τ k use your favorite aggregation method to obtain a full list μ. And Apply local kemenization to μ with respect to τ 1, …,τ k.

A local Kemenization of a full list with respect to Compute a locally Kemeny optimal aggregation of that is maximally consistent with This approach: (1) preserves the strengths of the initial aggregation. (2) ranks non-spam above spam. (3) gives a result that disagrees with on any pair ( i, j ) only if a majority of the τ’s endorse this disagreement. (4) for every d, 1 ≤ d ≤ | μ |, the restriction of the output is a local Kemenization of the top d elements of μ Local Kemenization is a procedure to get locally Kemeny optimal aggregation.

How do we perform local kemenization? ABFECDABFECD BCAEFDBCAEFD ACFDEBACFDEB BFDCAEBFDCAE CABFEDCABFED BADCEFBADCEF BBABA ABAB ABDABD ABDCABDC ABCDABCD ABCFEDABCFED Local Kemenization Example! disagree A>B: 3 A<B: 2 B>D: 4 B<D: 1

Experiments: meta-search K = Kendall distance SF = scaled footrule distance IF = induced footrule distance LK = Local Kemenization