Presentation is loading. Please wait.

Presentation is loading. Please wait.

Ashish Das Indian Institute of Technology Bombay India.

Similar presentations


Presentation on theme: "Ashish Das Indian Institute of Technology Bombay India."— Presentation transcript:

1 Ashish Das Indian Institute of Technology Bombay India

2 2  The Multidimensional Ranking Problem  Applications  Desirable Properties of the Ranking Solution  Ranking Methods with Illustrations  Extensions and Modifications

3 3  m candidates (or “alternatives”) ◦ M = {1,…,m}: set of candidates  n voters (or “agents” or “judges”) ◦ N = {1,…,n}: set of voters  Each voter i, has an ranking  i on M ◦  i (a) <  i (b) means i-th voter prefers a to b  The rank aggregation problem: Combine  1,…,  n into a single ranking  on M, which represents the “social choice” of the voters. ◦ Rank aggregation function: f(  1,…,  n ) = 

4  Multidimensional Ranking is an algorithm for comprehensive ranking based on more than one variable or opinion. The aim of multidimensional ranking is to combine many different rank orderings on a set of variables, in order to obtain a better ordering. It provides an aggregate or over-all measure by multi criteria decision making or data integration methods.

5  Meta search: Combine results of different web search engines into a better overall ranking  Rank items in a database according to multiple criteria ◦ Ex: Choose a restaurant by cuisine, distance, price, quality, etc. ◦ Ex: Choose a flight ticket by price, # of stops, date and time, frequent flier bonuses, etc. ◦ Overall ranking of various mid-size cars available in the market ◦ Overall ranking of Banks ◦ Ranking service quality of mobile cell providers ◦ Overall ranking of universities  Compare overall rankings over time

6  Unanimity (or Pareto optimality): If all voters prefer candidate a to candidate b (i.e.,  i (a) <  i (b) for all i), then also  should prefer a to b (i.e.,  (a) <  (b)). aca cab bbc a:b = 3:0

7  Condorcet Criterion [Condorcet, 1785] : ◦ Condorcet winner: a candidate a, which is preferred by most voters to any other candidate b (i.e., for all b, # of i s.t.  i (a) <  i (b) is at least n/2). ◦ Condorcet criterion: If Condorcet winner exists,  should rank it first (i.e.,  (a) = 1). cba aab bcc a:b = 2:1, a:c = 2:1 cba acb bac No Condorcet winner

8  Extended Condorcet Criterion (XCC): ◦ If most voters prefer candidate a to candidate b (i.e., # of i s.t.  i (a) <  i (b) is at least n/2), then also  should prefer a to b (i.e.,  (a) <  (b)).  Not always realizable cba aab bcc  a) <  (b) <  (c) cba acb bac Not realizable

9  Neutrality No candidate should be favored to others. ◦ If two candidates switch positions in  1,…,  n, they should switch positions also in .  Anonymity No voter should be favored to others. ◦ If two voters switch their orderings,  should remain the same.

10  Monotonicity If the ranking of a candidate is improved by a voter, its ranking in  can only improve.  Consistency If voters are split into two disjoint sets, S and T, and both the aggregation of voters in S and the aggregation of voters in T prefer a to b, then also the aggregation of all voters should prefer a to b.

11 For finding the best option from all the feasible alternatives, we use some sound approaches to aggregate or rank with respect to more than one attributes. A. Positional Rank Aggregation Methods B. TOPSIS METHOD (Technique for Order Preference by Similarity to an Ideal Solution) C. KEMENEY’S METHOD

12  Plurality ◦ score(a) = # of voters who chose a as #1 ◦  : order candidates by decreasing scores  Top-k approval ◦ score(a) = # of voters who chose a as one of the top k ◦  : order candidates by decreasing scores  Borda’s rule [Borda, 1781] ◦ score(a) =  i  i (a) ◦  : order candidates by increasing scores

13 bcaa dbdb cdcc aabd BordaTop-2 ApprovalPlurality 1+1+4+4=1022a 2+4+2+1=931b 3+3+1+3=1011c 4+2+3+2=1120d

14  TOPSIS [Hwang, C.L. and Yoon, K., 1981], compares each state or candidate with respect to positive and negative ideal solution  Composite ranking ranks that alternative the best that has shortest distance from positive ideal solution & longest from the negative ideal solution  The method induces an ordering of the solutions based on similarity to the ideal point, guiding the search towards the zone of interest

15  The TOPSIS distances dz-dz- dj-dj- dj+dj+ dz+dz+ I+I+ I-I- f 2 min f 1 min ajaj azaz

16 Steps in TOPSIS computation Suppose S 1, S 2,…, S m are m possible alternatives among which decision makers have to choose based on n criteria. Let, C 1, C 2,…, C n are the criteria with which alternative performance are measured, X ij is the rating of alternative S i with respect to the criterion C j. Thus, the data matrix is,

17

18

19  Average Monthly Per Capita Expenditure  Average per person consumption of non-cereals in total food consumption (%)  Average expenditure per person per 30 days on consumption of non-food articles (Rs.)  Per 1000 number of hh who use LPG as main source for cooking  Per 1000 number of households who use electricity as primary source of energy for lighting  Per 1000 number of persons aged 7 & above who are literate  Per 1000 number of persons aged 7 & above whose level of education is Higher Secondary or above  Average covered area of dwelling unit (sq. m) per household  Per 1000 no. of hh who do not stay in self owned dwelling unit

20

21

22

23

24

25  d: distance measure among rankings  Definition: The optimal rank aggregation for  1,…,  n w.r.t. d is the ranking  which minimizes  i d( ,  i ).  22 11 nn

26  Kendall tau distance (or “bubble sort distance”) ◦ K( ,  ) = # of pairs of candidates (a,b) on which  and  disagree ◦ Ex: K( (a b c d), (a d c b)) = 0 + 2 + 1 = 3  Spearman footrule distance ◦ F( ,  ) =  a |  (a) -  (a)| ◦ Ex: F((a b c d), (a d c b)) = 0 + 2 + 0 + 2 = 4

27  Optimal aggregation w.r.t. Kendall-tau distance (NP-hard even for small n)  K-distance. Let π and σ be two partial lists of {1,…, m}. The K-distance of π and σ, denoted K(π, σ), is the number of pairs i,j ε {1, …, m} such that π(i) σ(j). Note that if it is not the case that both i and j appear in both lists π and σ, then the pair (i,j) contributes nothing to the K-distance between the two lists. For any two partial lists K(π, σ) = K(σ, π).  SK, Kemeny optimal. For a collection of partial lists τ 1, τ 2,…, τ n and a full list π we denote SK(π, τ 1, τ 2,…, τ n ) = Σ K(π, τ i ). We say that a permutation σ is a Kemeny optimal aggregation of a collection of partial lists τ 1, τ 2,…, τ n if it minimizes SK(π, τ 1, τ 2,…, τ n ) over all permutations π.  Theorem [Young & Levenglick, 1978] [Truchon 1998]: Kemeny optimal aggregation is the only rank aggregation function, which is neutral, consistent, and satisfies the Extended Condorcet principle.

28 In Kemeny’s method one finds an optimal ranking on the basis of some given condition (variables). We illustrate through the same data set used in TOPSIS method. We apply KEMENY’S method in two different ways:  Direct application of KEMENY’S method  Stepwise application of KEMENY’S method

29 Step1: We first decide on the number of states for applying KEMENY. Suppose we choose m=4 rows (states A=1, B=2, C=3, D=4). We work out the 4! = 24 permutations of 1,2,3,4. A permutation 2314 means that the ranking is BCAD. Step2: We incorporate our data matrix as 4 rows and n = 9 (say) columns. Then we construct the rank (position) matrix involving ranking of the states with respect to each of the 9 variables. Step3: We calculate the number of non-matching pairs between each one of the permutations of 1,2,3,4 and position orderings for each of the 9 variables. Step4: We next calculate the K-distance for each of the permutations. K- distances are defined as the number of non-matching pairs between a permutations and position orderings for each of the 9 variables.

30 Variables ->123456789 A=1Bihar541.30.57218.31411352239386 B=2Orissa458.60.56193.3253026004047.321 C=3West Bengal629.90.62258.8313737104038.325 D=4Madhya Pradesh514.90.67251.1216606114047.519 Rank Matrix 3 2 2 1 1 1 1 1 1 1 1 1 3 2 2 4 3 3 4 3 4 4 3 4 4 2 4 2 4 3 2 4 3 4 4 2 Position Matrix 2 2 2 1 1 1 1 1 1 4 1 1 4 2 2 4 3 4 1 3 4 2 3 4 4 2 2 3 4 3 3 4 3 4 4 3

31 S.No. Permutations 1 4 3 2 1 2 4 3 1 2 3 4 2 3 1 4 4 2 1 3 5 4 1 2 3 6 4 1 3 2 7 3 4 2 1 8 3 4 1 2 9 3 2 4 1 10 3 2 1 4 11 3 1 2 4 12 3 1 4 2 13 2 3 4 1 14 2 3 1 4 15 2 4 3 1 16 2 4 1 3 17 2 1 4 3 18 2 1 3 4 19 1 3 2 4 20 1 3 4 2 21 1 2 3 4 22 1 2 4 3 23 1 4 2 3 24 1 4 3 2 CountingK-distance 3 5 4 4 6 5 6 5 442 4 6 5 3 5 4 6 4 340 2 4 3 3 5 4 6 6 336 1 3 2 2 4 3 6 5 228 2 4 3 1 3 2 6 4 126 3 5 4 2 4 3 6 3 232 4 4 5 5 5 6 6 4 544 5 5 6 4 4 5 6 3 442 3 3 4 6 4 5 6 3 640 4 2 3 5 3 4 5 2 533 5 3 4 4 2 3 5 1 431 6 4 5 3 3 4 5 2 335 2 2 3 5 3 4 6 4 534 3 1 2 4 2 3 5 3 427 1 3 2 4 4 3 6 5 432 0 2 1 3 3 2 6 4 324 1 1 0 2 2 1 5 3 217 2 0 1 3 1 2 5 2 319 4 2 3 3 1 2 5 0 323 5 3 4 2 2 3 5 1 227 3 1 2 2 0 1 5 1 217 2 2 1 1 1 0 5 2 115 3 3 2 0 2 1 5 3 019 4 4 3 1 3 2 5 2 125 Position Matrix 2413 2134 2143 1423 1234 1243 1444 1324 1423

32  Stepwise application of KEMENEY’S method is a new concept of applying the basic KEMENEY’S method. It is very useful for large data set, say for example m=26. Here the direct application of Kemeny is not feasible since there are 26! Combinations to look into. We introduce the concept of stepwise Kemeny.  Step1: We have to first fix the number of row (= k) to be taken at a time (say, 3, or 4, or 5, …). After fixing this, we proceed as in Step1 (of Kemeny) taking m = k.  Step2: We then incorporate our data matrix as 26 rows and 9 columns (say). Then we recursively choose our inputs from original data matrix. We choose first k (say 6) rows, and then construct rank matrix and position rank matrix as in Step 2 of Kemeny.  Step3: We then proceed to Steps 3 and 4 of Kemeny based on these first 6 rows. The optimal Kemeny, thus obtained, is used to identify the Rank 1 state. In the next iteration we remove the Rank 1 state and add the 7 th state in order to make a new set of 6 row. This process continues to identify Rank 2 state, Rank 3 state and so on Rank 20 state. The ranking in the final iteration gives Rank 21 through 26.  Step4: Repeat the process till the rankings stabilize.

33 In Kemeny-2 it took 2 iterations to stabilize 12345671234567

34  We have described 2 different approaches for TOPSIS method and KEMENY method.  We compare the optimum results for each of the above methods.  As direct application of KEMENY method is not feasible for a large data set, for illustration, we take a small data set.

35 Result based on original TOPSIS Result based on Ranked TOPSIS Result based on KEMENY

36


Download ppt "Ashish Das Indian Institute of Technology Bombay India."

Similar presentations


Ads by Google