CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.

Slides:



Advertisements
Similar presentations
Routing in Poisson small-world networks A. J. Ganesh Microsoft Research, Cambridge Joint work with Moez Draief.
Advertisements

Complex Network Theory
1 Analyzing Kleinberg’s Small-world Model Chip Martel and Van Nguyen Computer Science Department; University of California at Davis.
Small-world networks.
Algorithmic and Economic Aspects of Networks Nicole Immorlica.
Analysis and Modeling of Social Networks Foudalis Ilias.
Web as Network: A Case Study Networked Life CIS 112 Spring 2010 Prof. Michael Kearns.
Online Social Networks and Media Navigation in a small world.
Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.
Power Laws: Rich-Get-Richer Phenomena
Advanced Topics in Data Mining Special focus: Social Networks.
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα Network Measurements.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
CS728 Lecture 5 Generative Graph Models and the Web.
Network Models Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Models Why should I use network models? In may 2011, Facebook.
Social Networks 101 P ROF. J ASON H ARTLINE AND P ROF. N ICOLE I MMORLICA.
1 Complex systems Made of many non-identical elements connected by diverse interactions. NETWORK New York Times Slides: thanks to A-L Barabasi.
CS 728 Lecture 4 It’s a Small World on the Web. Small World Networks It is a ‘small world’ after all –Billions of people on Earth, yet every pair separated.
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Advanced Topics in Data Mining Special focus: Social Networks.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
Summary from Previous Lecture Real networks: –AS-level N= 12709, M=27384 (Jan 02 data) route-views.oregon-ix.net, hhtp://abroude.ripe.net/ris/rawdata –
CS347 Lecture 12 May 21, 2001 ©Prabhakar Raghavan.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
Peer-to-Peer and Social Networks Random Graphs. Random graphs E RDÖS -R ENYI MODEL One of several models … Presents a theory of how social webs are formed.
Information Networks Power Laws and Network Models Lecture 3.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Models and Algorithms for Complex Networks Networks and Measurements Lecture 3.
1 HEINZ NIXDORF INSTITUTE University of Paderborn Algorithms and Complexity Christian Schindelhauer Search Algorithms Winter Semester 2004/ Dec.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Small-world networks. What is it? Everyone talks about the small world phenomenon, but truly what is it? There are three landmark papers: Stanley Milgram.
Internet Economics כלכלת האינטרנט Class 10 – it’s a small world 1.
COM1721: Freshman Honors Seminar A Random Walk Through Computing Lecture 2: Structure of the Web October 1, 2002.
COLOR TEST COLOR TEST. Social Networks: Structure and Impact N ICOLE I MMORLICA, N ORTHWESTERN U.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Online Social Networks and Media
(with an application of Web Spam detection) CS315-Web Search and Mining Power Laws and Rich-Get-Richer Phenomena.
Complex Network Theory – An Introduction Niloy Ganguly.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Complex Network Theory – An Introduction Niloy Ganguly.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
Clusters Recognition from Large Small World Graph Igor Kanovsky, Lilach Prego Emek Yezreel College, Israel University of Haifa, Israel.
How Do “Real” Networks Look?
CS:4980:0005 Peer-to-Peer and Social Networks Fall 2015 Introduction.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
The Structure of Scientific Collaboration Networks by M. E. J. Newman CMSC 601 Paper Summary Marie desJardins January 27, 2009.
Models of Web-Like Graphs: Integrated Approach
Topics In Social Computing (67810) Module 1 Introduction & The Structure of Social Networks.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
1 HEINZ NIXDORF INSTITUTE University of Paderborn Algorithms and Complexity Christian Schindelhauer Search Algorithms Winter Semester 2004/ Dec.
Lecture 23: Structure of Networks
Graph Models Class Algorithmic Methods of Data Mining
Social Networks Analysis
Topics In Social Computing (67810)
Identity and Search in Social Networks
How Do “Real” Networks Look?
Lecture 23: Structure of Networks
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Lecture 13 Network evolution
Peer-to-Peer and Social Networks Fall 2017
How Do “Real” Networks Look?
Network Science: A Short Introduction i3 Workshop
Lecture 23: Structure of Networks
Modelling and Searching Networks Lecture 2 – Complex Networks
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian

Topics covered in the course Structure and modeling of social networks Power law graphs; Small world phenomenon; High clustering coefficient; Probabilistic and game theoretic models Algorithms for link analysis Crawling the web; HITS; Page Rank; Webspam; Rank aggregation; Spectral clustering Economic aspects of the Internet Peering relations; Alternative mechanisms for routing; P2P networks Topics motivated by e-commerce Reputation mechanisms; Recommendation systems; Ad auctions

Logistics Course web page: Course work:  reading papers (1/week on avg)  possibly a few problem sets How to contact us:

Social Networks A social network is a graph that represents relationships between independent entities.  Graph of friendships (or in the virtual world, networks like orkut)  Web of sexual contact  Graph of scientific collaborations  Cross-posts in newsgroups  Web graph (links between webpages)  Internet: Inter/Intra-domain graph

Scientific Collaboration Network 400,000 nodes, authors in Mathematical Reviews database An edge between two authors if they have a joint paper Just 676,000 edges Picture from orgnet.com

Scientific Collaboration Network Average degree 3.36 A few high-degrees:  Paul Erdös, 509  Frank Harary, 268  Yuri Alekseevich Mitropolskii, 244 Many low-degrees: (100,000 of degree 1) Picture from orgnet.com

Scientific Collaboration Network Short paths  Max Erdös # is 13  Any two authors connected by path of length at most 23  Average distance between two authors is 7.64  e.g.: John Nash → Shapley → Fulkerson → Hoffman → Paul Erdös Many triangles … Picture from orgnet.com

9/11 Terrorist Network Picture from orgnet.com

Newsgroup Cross-Post Graph Nodes are newsgroups, essentially archived lists Edges are cross-posts, i.e. there is an edge between two newsgroups to which an identical is posted alt.microsoft.sucksalt.linux.sucks

Internet Graphs Inter-domain graphs  Nodes are autonomous systems or domains  Edges are inter-domain connections SPRINTAOL

Inter-domain graph Picture from caida.org

Internet Graphs Intra-domain graphs  Nodes are routers  Edges are links between routers

Intra-domain graph

Colored by AS number Picture from lumeta.com

World Wide Web Nodes are webpages Arcs (i.e., directed edges) are hyperlinks

Web graph, Chicago Tribune Page Picture generated by Nicheworks

Social Networks

Why Study These Networks Understand the creation of these networks Understand viral epidemics Help design crawling strategies for the web Analyze behavior of algorithms (web/internet) Predict evolution of the network and emergence of new phenomena

In this lecture Common properties of social networks  Power law degree distribution  Small world phenomenon  High clustering coefficient Structure of the web graph

Power Laws Two quantities x and y are related by a power law if y is proportional to x (-c) for a constant c y = .x (-c) If x and y are related by a power law, then the graph of log(y) versus log(x) is a straight line log(y) = -c.log(x) + log(  ) The slope of the log-log plot is the power exponent c

Power Law Distributions A random variable X has a power law distribution if Pr[X=k] is proportional to k (-c) for a constant c The cumulative distribution, Pr[X>k], of a power law distribution is proportional to k (-c+1), and is called the Pareto law Similar to a power law, the Zipf law relates the rank r of X to its size: the r’th largest instance of X is proportional to r (-c’)

Example: City Populations 1. New York7,322, Los Angeles 3,485, Chicago2,783, Houston 1,630, Philadelphia 1,585, San Diego 1,110, Detroit 1,027, Dallas 1,006, Phoenix 983, San Antonio 935,933

Example: City Populations 1. New York7,322, Los Angeles 3,485, Chicago2,783, Seattle 516, Spokane, WA 177, Tacoma, WA 176, Little Rock, AR 175, Bakersfield, CA 174, Fremont, CA 173, Fort Wayne, IN 173, Arlington, VA 170,936

Example: City Populations Power law exponent: c = 0.74

Power Laws in Networks Degree distribution often satisfies a power law: fraction of nodes f d of degree d is proportional to d -c Degree dFraction f d = 1/(2d) 11/2 21/4 31/6 4~1/8

Example: Collaboration Graph Power law exp: c = 2.97 With exponential decay factor, c = 2.46

Example: Cross-Post Graph Power law exponent: c = 1.3

Example: Inter-Domain Internet Power law exponent: 2.15 < c < 2.2

Example: Intra-Domain Internet Power law exponent: c = 2.48

Example: Web Graph In-Degree Power law exponent: c = 2.09

Example: Web Graph Out-Degree Power law exponent: c = 2.72

Small World Phenomenon Six degrees of separation: “Everybody on this planet is separated by only six other people. Six degrees of separation between us and everyone else on this planet. The President of the United States, a gondolier in Venice, just fill in the names.”

Small World Phenomenon Milgram’s famous experiment (1960s):  Choose a random person in Nebraska, Bob  Ask Bob to deliver a letter to a random person in Massachusetts, Lashawn  Tell Bob target’s name, address, and occupation  Instruct Bob to only send letter to people he knows on a first-name basis

Small World Phenomenon Bob, a farmer in Nebraska David, mayor of Bob’s town Bernard, David’s cousin who went to college with Maya, who grew up in Boston With Lashawn

Small World Phenomenon in Graphs The diameter of a graph is the maximum distance (number of edges) between any pair of nodes The average distance of a graph is the average distance between any pair of nodes The average connected distance of a graph is the average distance between any pair of connected nodes

Small World Phenomenon in Graphs A graph exhibits a small world phenomenon if it has low diameter or average (connected) distance Typically, the average distance of a small world graph is on the order of log n (where n is the number of nodes)

Examples Collaboration graph  401,000 nodes, 676,000 edges (average degree 3.37)  Diameter: 23, Average distance: 7.64 Cross-post graph, giant component  30,000 nodes, 800,000 edges (average degree 53.3)  Diameter: 13, Average distance: 3.8 Web graph  200 million nodes, 1.5 billion edges (average degree 15)  Average connected distance: 16 Inter-domain Internet  3500 nodes, 6500 edges (average degree 3.71)  95% of pairs of nodes within distance 5

High Clustering Coefficient The clustering coefficient of a graph is the fraction of triangles among connected triples of nodes Intuitively, the clustering coefficient reflects the probability that your friends are themselves friends We expect social networks to have a high clustering coefficient

Examples Collaboration graph  Clustering coefficient is 0.14  Density of edges is Cross-post graph  Clustering coefficient is  Density of edges is

Assignment READ: A. Broder, R. Kumar, F. Maghoul, P. Raghavan, S. Rajagopalan, R. Stata, A. Tomkins, and J. Wiener, Graph structure in the web, WWW, 2000.

Graph Structure of the Web Breadth-first search from randomly chosen start nodes  Follow both forward and backward links  Reveal directed and undirected graph structure Over 90% of nodes reachable if links are treated as undirected Directed graph reveals complex bow-tie structure

Bow-Tie Structure of Web Graph Picture from the Nature journal

Next Time Probabilistic models for social networks