CSE 5243 (AU 14) Graph Basics and a Gentle Introduction to PageRank 1.

Slides:



Advertisements
Similar presentations
Social network partition Presenter: Xiaofei Cao Partick Berg.
Advertisements

Analysis and Modeling of Social Networks Foudalis Ilias.
1 The PageRank Citation Ranking: Bring Order to the web Lawrence Page, Sergey Brin, Rajeev Motwani and Terry Winograd Presented by Fei Li.
 Copyright 2011 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Enabling Networked Knowledge.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
Hierarchy in networks Peter Náther, Mária Markošová, Boris Rudolf Vyjde : Physica A, dec
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
DATA MINING LECTURE 12 Link Analysis Ranking Random walks.
Social Networks 101 P ROF. J ASON H ARTLINE AND P ROF. N ICOLE I MMORLICA.
Lecture 9 Measures and Metrics. Structural Metrics Degree distribution Average path length Centrality Degree, Eigenvector, Katz, Pagerank, Closeness,
Introduction to PageRank Algorithm and Programming Assignment 1 CSC4170 Web Intelligence and Social Computing Tutorial 4 Tutor: Tom Chao Zhou
CS 728 Lecture 4 It’s a Small World on the Web. Small World Networks It is a ‘small world’ after all –Billions of people on Earth, yet every pair separated.
How Google Relies on Discrete Mathematics Gerald Kruse Juniata College Huntingdon, PA
Cloud Computing Lecture #5 Graph Algorithms with MapReduce Jimmy Lin The iSchool University of Maryland Wednesday, October 1, 2008 This work is licensed.
Zdravko Markov and Daniel T. Larose, Data Mining the Web: Uncovering Patterns in Web Content, Structure, and Usage, Wiley, Slides for Chapter 1:
Global topological properties of biological networks.
Design Patterns for Efficient Graph Algorithms in MapReduce Jimmy Lin and Michael Schatz University of Maryland Tuesday, June 29, 2010 This work is licensed.
CSE 321 Discrete Structures Winter 2008 Lecture 25 Graph Theory.
1 COMP4332 Web Data Thanks for Raymond Wong’s slides.
Cloud Computing Lecture #4 Graph Algorithms with MapReduce Jimmy Lin The iSchool University of Maryland Wednesday, February 6, 2008 This work is licensed.
The Very Small World of the Well-connected. (19 june 2008 ) Lada Adamic School of Information University of Michigan Ann Arbor, MI
Chapter 8 Web Structure Mining Part-1 1. Web Structure Mining Deals mainly with discovering the model underlying the link structure of the web Deals with.
More Algorithms for Trees and Graphs Eric Roberts CS 106B March 11, 2013.
Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.
The PageRank Citation Ranking: Bringing Order to the Web Larry Page etc. Stanford University, Technical Report 1998 Presented by: Ratiya Komalarachun.
Models of Influence in Online Social Networks
The PageRank Citation Ranking: Bringing Order to the Web Presented by Aishwarya Rengamannan Instructor: Dr. Gautam Das.
Graph Algorithms Ch. 5 Lin and Dyer. Graphs Are everywhere Manifest in the flow of s Connections on social network Bus or flight routes Social graphs:
Google’s Billion Dollar Eigenvector Gerald Kruse, PhD. John ‘54 and Irene ‘58 Dale Professor of MA, CS and I T Interim Assistant Provost Juniata.
1 Applications of Relative Importance  Why is relative importance interesting? Web Social Networks Citation Graphs Biological Data  Graphs become too.
MapReduce and Graph Data Chapter 5 Based on slides from Jimmy Lin’s lecture slides ( (licensed.
Class 2: Graph theory and basic terminology Learning the language Network Science: Graph Theory 2012 Prof. Albert-László Barabási Dr. Baruch Barzel, Dr.
Lectures 6 & 7 Centrality Measures Lectures 6 & 7 Centrality Measures February 2, 2009 Monojit Choudhury
COM1721: Freshman Honors Seminar A Random Walk Through Computing Lecture 2: Structure of the Web October 1, 2002.
The PageRank Citation Ranking: Bringing Order to the Web Lawrence Page, Sergey Brin, Rajeev Motwani, Terry Winograd Presented by Anca Leuca, Antonis Makropoulos.
Networks Igor Segota Statistical physics presentation.
Graph Algorithms. Graph Algorithms: Topics  Introduction to graph algorithms and graph represent ations  Single Source Shortest Path (SSSP) problem.
Neural Network of C. elegans is a Small-World Network Masroor Hossain Wednesday, February 29 th, 2012 Introduction to Complex Systems.
L – Modelling and Simulating Social Systems with MATLAB Lesson 6 – Graphs (Networks) Anders Johansson and Wenjian Yu (with S. Lozano.
Slides are modified from Lada Adamic
1 1 COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified based on the slides provided by Lawrence Page, Sergey Brin, Rajeev Motwani.
CS 590 Term Project Epidemic model on Facebook
Class 2: Graph Theory IST402. Can one walk across the seven bridges and never cross the same bridge twice? Network Science: Graph Theory THE BRIDGES OF.
Informatics tools in network science
Importance Measures on Nodes Lecture 2 Srinivasan Parthasarathy 1.
Topics In Social Computing (67810) Module 1 (Structure) Centrality Measures, Graph Clustering Random Walks on Graphs.
1 CS 430 / INFO 430: Information Retrieval Lecture 20 Web Search 2.
Lecture 23: Structure of Networks
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
The PageRank Citation Ranking: Bringing Order to the Web
The PageRank Citation Ranking: Bringing Order to the Web
Topics In Social Computing (67810)
Link-Based Ranking Seminar Social Media Mining University UC3M
Network analysis.
Community detection in graphs
Lecture 23: Structure of Networks
Network Science: A Short Introduction i3 Workshop
Section 8.6 of Newman’s book: Clustering Coefficients
Centrality in Social Networks
Cloud Computing Lecture #4 Graph Algorithms with MapReduce
Graph Algorithms Ch. 5 Lin and Dyer.
Sarthak Ahuja ( ) Saumya jain ( )
Why Social Graphs Are Different Communities Finding Triangles
CS 440 Database Management Systems
PageRank algorithm based on Eigenvectors
Clustering Coefficients
Lecture 23: Structure of Networks
Adjacency Matrices and PageRank
Graph Algorithms Ch. 5 Lin and Dyer.
Presentation transcript:

CSE 5243 (AU 14) Graph Basics and a Gentle Introduction to PageRank 1

Graphs from the Real World Königsberg's Bridges Ref:

Primitives and Notations  G = (V, E)  E can also be represented as an adjacency matrix  Undirected vs. directed graph  Degree  (Shortest) distance between two vertices 3

Properties of Nodes  Centrality: how “central” a node is in the graph  How close the node is to all other nodes?  How much is a node a “choke point”? st is the number of shortest paths between s and t 4

Properties of Nodes  Clustering coefficient: how much does a node cluster with neighbors  Local clustering coefficient  Global clustering coefficient 5

Background  Besides the keywords, what other evidence can one use to rate the importance of a webpage ?  Solution: Use the hyperlink structure  E.g. a webpage linked by many webpages is probably important.  but this method is not global (comprehensive).  PageRank is developed by Larry Page in

Idea  A graph representing WWW  Node: webpage  Directed edge: hyperlink  A user randomly clicks the hyperlink to surf WWW.  The probability a user stop in a particular webpage is the PageRank value.  A node that is linked by many nodes with high PageRank value receives a high rank itself; If there are no links to a node, then there is no support for that page. 7

A simple version  u: a webpage  B u : the set of u’s backlinks  N v : the number of forward links of page v  Initially, R(u) is 1/N for every webpage  Iteratively update each webpage’s PR value until convergence. 8

Example 1 9 PageRank Calculation: first iteration

Example 1 10 PageRank Calculation: second iteration

Example 1 11 Convergence after some iterations

A little more advanced version  Adding a damping factor d  Imagine that a surfer would stop clicking a hyperlink with probability 1-d  R(u) is at least (1-d)/(N-1)  N is total num. of nodes. 12

Other applications  Social network (Facebook, Twitter, etc)  Node: Person; Edge: Follower / Followee / Friend  Higher PR value: Celebrity  Citation network  Node: Paper; Edge: Citation  Higher PR values: Important Papers.  Protein-protein interaction network  Node: Protein; Edge: Two proteins bind together  Higher PR values: Essential proteins. 13