PageRank Identifying key users in social networks Student : Ivan Todorović, 3231/2014 Mentor : Prof. Dr Veljko Milutinović.

Slides:



Advertisements
Similar presentations
Overview of this week Debugging tips for ML algorithms
Advertisements

Graphs (Part II) Shannon Quinn (with thanks to William Cohen and Aapo Kyrola of CMU, and J. Leskovec, A. Rajaraman, and J. Ullman of Stanford University)
CSE 5243 (AU 14) Graph Basics and a Gentle Introduction to PageRank 1.
Analysis and Modeling of Social Networks Foudalis Ilias.
SOCELLBOT: A New Botnet Design to Infect Smartphones via Online Social Networking th IEEE Canadian Conference on Electrical and Computer Engineering(CCECE)
Graphs, Node importance, Link Analysis Ranking, Random walks
Distributed Breadth-First Search with 2-D Partitioning Edmond Chow, Keith Henderson, Andy Yoo Lawrence Livermore National Laboratory LLNL Technical report.
Xiaowei Ying, Xintao Wu, Daniel Barbara Spectrum based Fraud Detection in Social Networks 1.
Qiang Cao Duke University
1 Social Influence Analysis in Large-scale Networks Jie Tang 1, Jimeng Sun 2, Chi Wang 1, and Zi Yang 1 1 Dept. of Computer Science and Technology Tsinghua.
DATA MINING LECTURE 12 Link Analysis Ranking Random walks.
Zdravko Markov and Daniel T. Larose, Data Mining the Web: Uncovering Patterns in Web Content, Structure, and Usage, Wiley, Slides for Chapter 1:
Sampling from Large Graphs. Motivation Our purpose is to analyze and model social networks –An online social network graph is composed of millions of.
Design Patterns for Efficient Graph Algorithms in MapReduce Jimmy Lin and Michael Schatz University of Maryland Tuesday, June 29, 2010 This work is licensed.
CSE 321 Discrete Structures Winter 2008 Lecture 25 Graph Theory.
Web Data Management Dr. Daniel Deutch. Web Data The web has revolutionized our world Data is everywhere Constitutes a great potential But also a lot of.
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
1 COMP4332 Web Data Thanks for Raymond Wong’s slides.
PageRank Debapriyo Majumdar Data Mining – Fall 2014 Indian Statistical Institute Kolkata October 27, 2014.
HCC class lecture 22 comments John Canny 4/13/05.
The Very Small World of the Well-connected. (19 june 2008 ) Lada Adamic School of Information University of Michigan Ann Arbor, MI
Midterm Presentation Undergraduate Researchers: Graduate Student Mentor: Faculty Mentor: Jordan Cowart, Katie Allmeroth Krist Culmer Dr. Wenjun (Kevin)
Google’s PageRank: The Math Behind the Search Engine Author:Rebecca S. Wills, 2006 Instructor: Dr. Yuan Presenter: Wayne.
Leveraging Big Data: Lecture 11 Instructors: Edith Cohen Amos Fiat Haim Kaplan Tova Milo.
Design Patterns for Efficient Graph Algorithms in MapReduce Jimmy Lin and Michael Schatz University of Maryland MLG, January, 2014 Jaehwan Lee.
Opinion mining in social networks Student: Aleksandar Ponjavić 3244/2014 Mentor: Profesor dr Veljko Milutinović.
Projects ( ) Ida Mele. Rules Students have to work in teams (max 2 people). The project has to be delivered by the deadline that will be published.
A Distributed and Privacy Preserving Algorithm for Identifying Information Hubs in Social Networks M.U. Ilyas, Z Shafiq, Alex Liu, H Radha Michigan State.
Piyush Kumar (Lecture 2: PageRank) Welcome to COT5405.
Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn.
1 Applications of Relative Importance  Why is relative importance interesting? Web Social Networks Citation Graphs Biological Data  Graphs become too.
Your Friends Have More Friends Than You Do: Identifying Influential Mobile Users Through Random Walks Bo Han, Aravind Srinivasan University of Maryland.
Advanced Software Engineering PROJECT. 1. MapReduce Join (2 students)  Focused on performance analysis on different implementation of join processors.
MapReduce and Graph Data Chapter 5 Based on slides from Jimmy Lin’s lecture slides ( (licensed.
Data Analysis in YouTube. Introduction Social network + a video sharing media – Potential environment to propagate an influence. Friendship network and.
Web Data Management Dr. Daniel Deutch. Web Data The web has revolutionized our world Data is everywhere Constitutes a great potential But also a lot of.
Use of Electronic and Internet advertising options Standard 3.4.
Design Patterns for Efficient Graph Algorithms in MapReduce Jimmy Lin and Michael Schatz (Slides by Tyler S. Randolph)
COM1721: Freshman Honors Seminar A Random Walk Through Computing Lecture 2: Structure of the Web October 1, 2002.
DATA MINING LECTURE 13 Pagerank, Absorbing Random Walks Coverage Problems.
The PageRank Citation Ranking: Bringing Order to the Web Lawrence Page, Sergey Brin, Rajeev Motwani, Terry Winograd Presented by Anca Leuca, Antonis Makropoulos.
The Matrix: Using Intermediate Features to Classify and Predict Friends in a Social Network Michael Matczynski Status Report April 14, 2006.
Graph Algorithms. Graph Algorithms: Topics  Introduction to graph algorithms and graph represent ations  Single Source Shortest Path (SSSP) problem.
Ranking Link-based Ranking (2° generation) Reading 21.
LOGO Identifying Opinion Leaders in the Blogosphere Xiaodan Song, Yun Chi, Koji Hino, Belle L. Tseng CIKM 2007 Advisor : Dr. Koh Jia-Ling Speaker : Tu.
Data Structures and Algorithms in Parallel Computing Lecture 3.
Topic 12 Graphs 1. Graphs Definition: Two types:
Comparison of Tarry’s Algorithm and Awerbuch’s Algorithm CS 6/73201 Advanced Operating System Presentation by: Sanjitkumar Patel.
Multiparty Access Control for Online Social Networks : Model and Mechanisms.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
CS 590 Term Project Epidemic model on Facebook
CSE 450 – Web Mining Seminar Professor Brian D. Davison Fall 2005 A PRESENTATION on What is this Page Known for? Computing Web Page Reputations D. Rafiei.
KAIST TS & IS Lab. CS710 Know your Neighbors: Web Spam Detection using the Web Topology SIGIR 2007, Carlos Castillo et al., Yahoo! 이 승 민.
Ljiljana Rajačić. Page Rank Web as a directed graph  Nodes: Web pages  Edges: Hyperlinks 2 / 25 Ljiljana Rajačić.
Progress Report ekker. Problem Definition In cases such as object recognition, we can not include all possible objects for training. So transfer learning.
Importance Measures on Nodes Lecture 2 Srinivasan Parthasarathy 1.
The Anatomy of a Large-Scale Hypertextual Web Search Engine S. Brin and L. Page, Computer Networks and ISDN Systems, Vol. 30, No. 1-7, pages , April.
Ariel Fuxman, Panayiotis Tsaparas, Kannan Achan, Rakesh Agrawal (2008) - Akanksha Saxena 1.
CS 440 Database Management Systems Web Data Management 1.
Computation on Graphs. Graphs and Sparse Matrices Sparse matrix is a representation of.
CS 540 Database Management Systems Web Data Management some slides are due to Kevin Chang 1.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
Quality of a search engine
Prof. Paolo Ferragina, Algoritmi per "Information Retrieval"
Prof. Paolo Ferragina, Algoritmi per "Information Retrieval"
Binghui Wang, Le Zhang, Neil Zhenqiang Gong
GANG: Detecting Fraudulent Users in OSNs
Graph and Link Mining.
Adjacency Matrices and PageRank
PRSim: Sublinear Time SimRank Computation on Large Power-Law Graphs.
Presentation transcript:

PageRank Identifying key users in social networks Student : Ivan Todorović, 3231/2014 Mentor : Prof. Dr Veljko Milutinović

Introduction Social Networks – Connecting people Sustainable revenues Full advertising potential Key Users Novel PageRank 2/19

What is a Key User ? Large community Affects a large number of persons Unlikely to live OSN Pay for Premium services 3/19

Users’ Connectivity in OSN Structural characteristics of the network Well-connected users Social Graph Centrality measures –Degree –Closeness –Betweenness 4/19

Users’ Communication Activity Exchange of information User interaction Activity Graph Strong/Weak connection 5/19

PageRank An algorithm used by Google PageRank is a link analysis algorithm Outputs a probability distribution Apply to any graph or network Personalized PageRank is used by Twitter 6/19

Novel PageRank Identify key users First step –Derive a weighted activity graph Second step –Determine users’ centrality scores 7/19

Weighted Activity Graph Users who actually communicate Graph Links Informational and Normative influence 8/19

Weighted Activity Graph Graph representation –Symmetric adjacency matrix Weight of an undirected activity link C ij – number of communication activities (i  j) C ji – number of communication activities (j  i) Activity Graph n – Number of users 9/19

Users’ Centrality Scores PageRank used by Google N – Total number of webpages O j – Number of outgoing links from page j B i – Set of web pages pointing to web page i d – dampening factor (usually set to 0.85) Novel PageRank F i – Set of users connected to i 10/19

Demonstration and Evaluation Facebook dataset – New Orleans –Set of users (63,731) –Set of social links (817,090) –Communication activity –832,277 wall posts –BFS Crawler 11/19

Demonstration and Evaluation 12/19

Pros and Cons Great results Complexity O(n²) Social and Activity Graph Offline contacts Direction of posts/messages Privacy risks 13/19

Conclusion Potential to generate sustainable revenues Easy to implement Efficient 14/19

Improvements Text Mining to detect influence Scan user messages Detect positive/negative user response Use it to form directed activity graph 15/19

Improvements 16/19 Hey, check this movie (…) Well, I don’t like comedy moves Okay, maybe we could watch this one (…) That trailer looks really good A B A B Detected negative response Influence confirmed

Improvements Distributed PageRank algorithm Monte Carlo approximation Perform K random walks in parallel –Walk to a random neighbor (probability 1- Ɛ ) –Terminate in current node (probability Ɛ ) After walk termination –Each node computes its PageRank value Complexity O(log n / Ɛ ) 17/19

Literature Antonio Caso, Silvia Rossi, “Users Ranking in Online Social Networks to Support POI Selection in Small Groups”, University of Naples Wikipedia, “PageRank”, December Julia Heidemann, Mathias Klier,Florian Probst, “Identifying Key Users in Online Social Networks – PageRank Based Approach”, Research Paper, University of Augsburg, University of Innsbruck 18/19

Thank you for your attention Questions ? 19/19