The Page Rank Axioms Based on Ranking Systems: The PageRank Axioms, by Alon Altman and Moshe Tennenholtz. Presented by Aron Matskin.

Slides:



Advertisements
Similar presentations
Ranking, Trust, and Recommendation Systems: An Axiomatic Approach Moshe Tennenholtz TechnionIsrael Institute of Technology and Microsoft Israel R&D Center.
Advertisements

Great Theoretical Ideas in Computer Science
1 Decomposing Hypergraphs with Hypertrees Raphael Yuster University of Haifa - Oranim.
22C:19 Discrete Structures Induction and Recursion Spring 2014 Sukumar Ghosh.
Midwestern State University Department of Computer Science Dr. Ranette Halverson CMPS 2433 – CHAPTER 4 GRAPHS 1.
Markov Models.
1 The PageRank Citation Ranking: Bring Order to the web Lawrence Page, Sergey Brin, Rajeev Motwani and Terry Winograd Presented by Fei Li.
Multicut Lower Bounds via Network Coding Anna Blasiak Cornell University.
Link Analysis: PageRank
Trust and Profit Sensitive Ranking for Web Databases and On-line Advertisements Raju Balakrishnan (Arizona State University)
Approximation Algorithms for Unique Games Luca Trevisan Slides by Avi Eyal.
4/5/05Tucker, Sec Applied Combinatorics, 4rth Ed. Alan Tucker Section 4.3 Graph Models Prepared by Jo Ellis-Monaghan.
Entropy Rates of a Stochastic Process
6.896: Probability and Computation Spring 2011 Constantinos (Costis) Daskalakis lecture 2.
DATA MINING LECTURE 12 Link Analysis Ranking Random walks.
Great Theoretical Ideas in Computer Science.
Algorithmic and Economic Aspects of Networks Nicole Immorlica.
Estimating the Global PageRank of Web Communities Paper by Jason V. Davis & Inderjit S. Dhillon Dept. of Computer Sciences University of Texas at Austin.
Tirgul 9 Amortized analysis Graph representation.
15-853Page :Algorithms in the Real World Indexing and Searching III (well actually II) – Link Analysis – Near duplicate removal.
Expanders Eliyahu Kiperwasser. What is it? Expanders are graphs with no small cuts. The later gives several unique traits to such graph, such as: – High.
Lecture 10: Search Structures and Hashing
CS522: Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian
1 COMP4332 Web Data Thanks for Raymond Wong’s slides.
1 Uniform Sampling from the Web via Random Walks Ziv Bar-Yossef Alexander Berg Steve Chien Jittat Fakcharoenphol Dror Weitz University of California at.
Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.
Relations Chapter 9.
C o n f i d e n t i a l HOME NEXT Subject Name: Data Structure Using C Unit Title: Graphs.
The effect of New Links on Google Pagerank By Hui Xie Apr, 07.
The PageRank Citation Ranking: Bringing Order to the Web Larry Page etc. Stanford University, Technical Report 1998 Presented by: Ratiya Komalarachun.
Presented By: - Chandrika B N
On the approximability of the link building problem Author - MartinOlsena,AnastasiosViglasb, ∗ Speaker - Wayne Yang.
The PageRank Citation Ranking: Bringing Order to the Web Presented by Aishwarya Rengamannan Instructor: Dr. Gautam Das.
Entropy Rate of a Markov Chain
Piyush Kumar (Lecture 2: PageRank) Welcome to COT5405.
Google’s Billion Dollar Eigenvector Gerald Kruse, PhD. John ‘54 and Irene ‘58 Dale Professor of MA, CS and I T Interim Assistant Provost Juniata.
Theory of Computing Lecture 10 MAS 714 Hartmut Klauck.
Reputations Based On Transitive Trust Slides by Josh Albrecht.
Chapter 9. Chapter Summary Relations and Their Properties n-ary Relations and Their Applications (not currently included in overheads) Representing Relations.
Lectures 6 & 7 Centrality Measures Lectures 6 & 7 Centrality Measures February 2, 2009 Monojit Choudhury
DATA MINING LECTURE 13 Pagerank, Absorbing Random Walks Coverage Problems.
Discrete Math for CS Binary Relation: A binary relation between sets A and B is a subset of the Cartesian Product A x B. If A = B we say that the relation.
The PageRank Citation Ranking: Bringing Order to the Web Lawrence Page, Sergey Brin, Rajeev Motwani, Terry Winograd Presented by Anca Leuca, Antonis Makropoulos.
Center for E-Business Technology Seoul National University Seoul, Korea BrowseRank: letting the web users vote for page importance Yuting Liu, Bin Gao,
Chapter 9. Section 9.1 Binary Relations Definition: A binary relation R from a set A to a set B is a subset R ⊆ A × B. Example: Let A = { 0, 1,2 } and.
Greedy Algorithms and Matroids Andreas Klappenecker.
Week 11 - Monday.  What did we talk about last time?  Binomial theorem and Pascal's triangle  Conditional probability  Bayes’ theorem.
TRUST-BASED RECOMMENDATION SYSTEMS : an axiomatic approach Microsoft Research, Redmond WA Jennifer Chayes Christian Borgs Reid Anderson Uri Feige Abie.
Chapter 10 Graph Theory Eulerian Cycle and the property of graph theory 10.3 The important property of graph theory and its representation 10.4.
Beyond selfish routing: Network Games. Network Games NGs model the various ways in which selfish agents strategically interact in using a network They.
Understanding Google’s PageRank™ 1. Review: The Search Engine 2.
1 1 COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified based on the slides provided by Lawrence Page, Sergey Brin, Rajeev Motwani.
Chapter 9. Chapter Summary Relations and Their Properties n-ary Relations and Their Applications (not currently included in overheads) Representing Relations.
CompSci 102 Discrete Math for Computer Science March 13, 2012 Prof. Rodger Slides modified from Rosen.
Link Analysis Algorithms Page Rank Slides from Stanford CS345, slightly modified.
Ljiljana Rajačić. Page Rank Web as a directed graph  Nodes: Web pages  Edges: Hyperlinks 2 / 25 Ljiljana Rajačić.
Theory of Computational Complexity Probability and Computing Lee Minseon Iwama and Ito lab M1 1.
Random Sampling Algorithms with Applications Kyomin Jung KAIST Aug ERC Workshop.
Computation on Graphs. Graphs and Sparse Matrices Sparse matrix is a representation of.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
Dynamic Network Analysis Case study of PageRank-based Rewiring Narjès Bellamine-BenSaoud Galen Wilkerson 2 nd Second Annual French Complex Systems Summer.
Topics In Social Computing (67810) Module 1 (Structure) Centrality Measures, Graph Clustering Random Walks on Graphs.
Code: BCA302 Data Structures with C Prof. (Dr.) Monalisa Banerjee By.
Jeffrey D. Ullman Stanford University.  Web pages are important if people visit them a lot.  But we can’t watch everybody using the Web.  A good surrogate.
Binary Relation: A binary relation between sets A and B is a subset of the Cartesian Product A x B. If A = B we say that the relation is a relation on.
Relations Chapter 9.
PageRank and Markov Chains
DTMC Applications Ranking Web Pages & Slotted ALOHA
Piyush Kumar (Lecture 2: PageRank)
Locality In Distributed Graph Algorithms
Presentation transcript:

The Page Rank Axioms Based on Ranking Systems: The PageRank Axioms, by Alon Altman and Moshe Tennenholtz. Presented by Aron Matskin

Judge and be prepared to be judged. Ayn Rand רבי שמעון אומר שלשה כתרים הם : כתר תורה, וכתר כהונה, וכתר מלכות ; וכתר שם טוב עולה על גביהן. פירקי אבות

Talking Points Ranking and reputation in general Connections to the Internet world PageRank web ranking system PageRank representation theorem

Ranking: What Abilities Choices Reputation Quality Quality of information Popularity Good looks What not?

Ranking: How Voting Reputation systems Peer review Performance reviews Sporting competition Intuitive or ad-hoc

Ranking Systems ’ Properties Ad-hoc or systematic Centralized or distributed Feedback or indicator-based Peer, “ second-party ”, or third-party Update period Volatility Other?

Agents Ranking Themselves Community reputation Professional associations Peer review Performance reviews (in part) Web page ranking

Ranking: Problems and Issues Eliciting information Information aggregation Information distribution Truthfulness Strategic considerations Fear of retribution / expectation of kick-backs Coalition formation Agent identification (pseudonym problem) Need analysis!

Ranking Systems: Analysis Empirical Because theories often lack Theoretical Because theoreticians need to eat, too Provides valuable insight

Social Choice Theory Two approaches: Normative – from properties to implementations. Example: Arrow ’ s Impossibility Theorem Descriptive – from implementation to properties. The Holy Grail: representation theorems (uniqueness results)

PageRank Method A method for computing a popularity (or importance) ranking for every web page based on the graph of the web. Has applications in search, browsing, and traffic estimation.

PageRank: Intuition Internet pages form a directed graph Node ’ s popularity measure is a positive real number. The higher number represents higher popularity. Let ’ s call it weight Node ’ s weight is distributed equally among nodes it links to We look for a stationary solution: the sum of weights a page receives from its backlinks is equal to its weight b=2 c=1 a=

PageRank as Random Walk Suppose you land on a random page and proceed by clicking on hyper-links uniformly randomly Then the (normalized) rank of a page is the probability of visiting it

PageRank: Some Math Represent the graph as a matrix: b c a 010 ½ 01 ½ 00 abc a b c

PageRank: Some Math Find a solution of the equation: A G r = r Under the assumption that the graph is strongly connected there is only one normalized solution The assumption is not used by the real PageRank algorithm which uses workarounds to overcome it The solution r is the rank vector.

Calculating PageRank Take any non-zero vector r 0 Let r i+1 = A G r i Then the sequence r k converges to r Since the Internet graph is an expander, the convergence is very fast: O(log n) steps to reach given precision

PageRank: The Good News Intuitive Relatively easy to calculate Hard to manipulate Great for common case searches May be used to assess quality of information (assuming popularity ≈ trust)

PageRank: The Bad News PageRank is proprietary to Webmasters can ’ t manipulate it, but can Every change in the algorithm is good for someone and is bad for someone else Popular become more popular Popularity ≠ quality of information

The Representation Theorem We next present a set of axioms (i.e. properties) for ranking procedures Some of the axioms are more intuitive then others, but all are satisfied by PageRank We then show that PageRank is the only ranking algorithm that satisfies the axioms We try to be informal, but convincing

Ranking Systems Defined A ranking system F is a functional that maps every finite strongly connected directed graph (SCDG) G=(V,E) into a reflexive, transitive, complete, and anti- symmetric binary relation ≤ on V

Ranking Systems: Example MyRank ranks vertices in G in ascending order of the number of incoming links b c a MyRank(G): c = a < b PageRank(G): c < a = b

Axiom 1: Isomorphism (ISO) F satisfies ISO iff it is independent of vertex names Consequence: symmetric vertices have the same rank b e a g f j i h e = f = g = h = i = j a = b

Axiom 2: Self Edge (SE) Node v has a self-edge (v,v) in G ’, but does not in G. Otherwise G and G ’ are identical. F satisfies SE iff for all u,w ≠ v: (u ≤ v  u < ’ v) and (u ≤ w  u ≤ ’ w) PageRank satisfies SE: Suppose v has k outgoing edges in G. Let (r 1, …,r v, …,r N ) be the rank vector of G, then (r 1, …,r v + 1/k, …,r N ) is the rank vector of G ’

Axiom 3: Vote by Committee (VBC) a c b a c b 1.In the example page a links only to b and c, but there may be more successors of a 2.Incoming links of a and all other links of the successors of a remain the same

Axiom 4: Collapsing (COL) b a b 1.The sets of predecessors of a and b are disjoint 2.Pages a and b must not link to each other or have self-links 3.The sets of successors of a and b coincide

Axiom 5: Proxy (PRO) 1.All predecessors of x have the same rank 2.|P(x)| = |S(x)| 3. x is the only successor of each of its predecessors x = =

Useful Properties: DEL 1.|P(b)|=|S(b)|=1 2.There is no direct edge between a and c 3. a and c are otherwise unrestricted a c b d a c d

DEL: Proof a c b d c b d a VBC

DEL: Proof c b d a VBC c b d a

DEL: Proof ISO,PRO c b d a c b d a

DEL: Proof PRO c d a c b d a

DEL: Proof PRO c d a c d a

DEL: Proof VBC c d a c d a

DEL: Proof VBC c d a a c d

DEL for Self-Edge It can also be shown that DEL holds for self-edges: aa

Useful Properties: DELETE 1.Nodes in P(x) have no other outgoing edges 2. x has no other edges x = = = =

DELETE: Proof x = = = = COL x y

DELETE: Proof PRO x y

Useful Properties: DUPLICATE 1.All successors of a are duplicated the same number of times 2.There are no edges from S(a) to S(a) c b d a c b d a

DUPLICATE: Proof c b d a c b d a VBC

DUPLICATE: Proof c b d a VBC c b d a

DUPLICATE: Proof c b d a COL c b d a

DUPLICATE: Proof c b d a ISO,PRO c b d a

DUPLICATE: Proof c b d a COL -1 c b d a

DUPLICATE: Proof VBC -1 c b d a c b d a

The Representation Theorem Proof Given a SCDG G=(V,E) and a,b in V, we eliminate all other nodes in G while preserving the relative ranking of a and b In the resulting graph G ’ the relative ranking of a and b given by the axioms can be uniquely determined. Therefore the axioms rank any SCDG uniquely It follows that all ranking systems satisfying the axioms coincide

Proof by Example on b and d b c a abc a b c d ⅓ 00 ½ ⅓ 00 ½ ⅓ d d a b c d

Step 1: Insert Nodes b c a d b c a d By DEL the relative ranking is preserved

Step 2: Choose Node to Remove b c a d

Step 3: Remove “ self-edges ” b c a d

Step 4: Duplicate Predecessors b c a d

Step 5: DELETE the Node b c d

Step 5: DELETE the Extras There still are nodes to delete: back to Step 2 b c d

Step 2: Choose Node to Remove Steps 3,4 - no changes b c d

Step 5: DELETE the Node b d

Step 6: DELETE the Extras No original nodes to remove: proceed to Step 7 b d

Step 7: Balance by Duplication b d This is our G ’

Step 8: Equalize by Reverse DEL b d By ISO b=d. By DEL and SE: in G ’ b<d.

Example for a and d b c a d b c a d

After Removal of c b a d

Duplicate Predecessors of b b a d

DELETE b a d

DELETE Extras a d

Before Balancing a d

After Balancing a d Conclusion: a<d.

What about a and b? b a d

b a d

b a

b a

b a

b a Conclusion: a=b.

Concluding Remarks ‘ Representation theorems isolate the “ essence ” of particular ranking systems, and provide means for the evaluation (and potential comparison) of such systems ’ – Alon & Tennenholtz

The End c b d a ½ 00 ½ a bc a b c