Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Slides:



Advertisements
Similar presentations
LEARNING INFLUENCE PROBABILITIES IN SOCIAL NETWORKS Amit Goyal Francesco Bonchi Laks V. S. Lakshmanan University of British Columbia Yahoo! Research University.
Advertisements

TI: An Efficient Indexing Mechanism for Real-Time Search on Tweets Chun Chen 1, Feng Li 2, Beng Chin Ooi 2, and Sai Wu 2 1 Zhejiang University, 2 National.
Finding Topic-sensitive Influential Twitterers Presenter 吴伟涛 TwitterRank:
Towards Twitter Context Summarization with User Influence Models Yi Chang et al. WSDM 2013 Hyewon Lim 21 June 2013.
Presented By: Omofonmwan Nelson. Agenda:  Twitter  Benefits of Twitter  Tweet  Tweeter Services  Geographical Distribution  Conclusion.
Pete Bohman Adam Kunk.  Introduction  Related Work  System Overview  Indexing Scheme  Ranking  Evaluation  Conclusion.
A Probabilistic Model for Road Selection in Mobile Maps Thomas C. van Dijk Jan-Henrik Haunert W2GIS, 5 April 2013.
What means for you Alisa Leonard Vice President, Marketing Strategy iCrossing.
1 Social Influence Analysis in Large-scale Networks Jie Tang 1, Jimeng Sun 2, Chi Wang 1, and Zi Yang 1 1 Dept. of Computer Science and Technology Tsinghua.
Link creation and profile alignment in the aNobii social network Luca Maria Aiello et al. Social Computing Feb 2014 Hyewon Lim.
Graph Data Management Lab School of Computer Science , Bristol, UK.
More on Rankings. Query-independent LAR Have an a-priori ordering of the web pages Q: Set of pages that contain the keywords in the query q Present the.
Web Search – Summer Term 2006 VI. Web Search - Ranking (c) Wolfgang Hürst, Albert-Ludwigs-University.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 3 March 23, 2005
Zdravko Markov and Daniel T. Larose, Data Mining the Web: Uncovering Patterns in Web Content, Structure, and Usage, Wiley, Slides for Chapter 1:
Topic-Sensitive PageRank Taher H. Haveliwala. PageRank Importance is propagated A global ranking vector is pre-computed.
PageRank Debapriyo Majumdar Data Mining – Fall 2014 Indian Statistical Institute Kolkata October 27, 2014.
Web Search – Summer Term 2006 VII. Selected Topics - PageRank (closer look) (c) Wolfgang Hürst, Albert-Ludwigs-University.
Overview of Web Data Mining and Applications Part I
Motivation When searching for information on the WWW, user perform a query to a search engine. The engine return, as the query’s result, a list of Web.
Query session guided multi- document summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.
Leveraging Big Data: Lecture 11 Instructors: Edith Cohen Amos Fiat Haim Kaplan Tova Milo.
Models of Influence in Online Social Networks
Projects ( ) Ida Mele. Rules Students have to work in teams (max 2 people). The project has to be delivered by the deadline that will be published.
1 Announcements Research Paper due today Research Talks –Nov. 29 (Monday) Kayatana and Lance –Dec. 1 (Wednesday) Mark and Jeremy –Dec. 3 (Friday) Joe and.
+ Offline Optimal Ads Allocation in SNS Advertising Hui Miao, Peixin Gao.
Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn.
A Comparison of Microblog Search and Web Search.
Pete Bohman Adam Kunk. What is real-time search? What do you think as a class?
2015/10/111 DBconnect: Mining Research Community on DBLP Data Osmar R. Zaïane, Jiyang Chen, Randy Goebel Web Mining and Social Network Analysis Workshop.
Information Spread and Information Maximization in Social Networks Xie Yiran 5.28.
Pete Bohman Adam Kunk. Real-Time Search  Definition: A search mechanism capable of finding information in an online fashion as it is produced. Technology.
TWITTER What is Twitter, a Social Network or a News Media? Haewoon Kwak Changhyun Lee Hosung Park Sue Moon Department of Computer Science, KAIST, Korea.
Microblogs: Information and Social Network Huang Yuxin.
The PageRank Citation Ranking: Bringing Order to the Web Lawrence Page, Sergey Brin, Rajeev Motwani, Terry Winograd Presented by Anca Leuca, Antonis Makropoulos.
Recommending Twitter Users to Follow Using Content and Collaborative Filtering Approaches John HannonJohn Hannon, Mike Bennett, Barry SmythBarry Smyth.
Date: 2012/4/23 Source: Michael J. Welch. al(WSDM’11) Advisor: Jia-ling, Koh Speaker: Jiun Jia, Chiou Topical semantics of twitter links 1.
Optimal Link Bombs are Uncoordinated Sibel Adali Tina Liu Malik Magdon-Ismail Rensselaer Polytechnic Institute.
Query Suggestion Naama Kraus Slides are based on the papers: Baeza-Yates, Hurtado, Mendoza, Improving search engines by query clustering Boldi, Bonchi,
Finding Top-k Shortest Path Distance Changes in an Evolutionary Network SSTD th August 2011 Manish Gupta UIUC Charu Aggarwal IBM Jiawei Han UIUC.
Detecting Communities Via Simultaneous Clustering of Graphs and Folksonomies Akshay Java Anupam Joshi Tim Finin University of Maryland, Baltimore County.
Probabilistic Models for Discovering E-Communities Ding Zhou, Eren Manavoglu, Jia Li, C. Lee Giles, Hongyuan Zha The Pennsylvania State University WWW.
Finding Experts Using Social Network Analysis 2007 IEEE/WIC/ACM International Conference on Web Intelligence Yupeng Fu, Rongjing Xiang, Yong Wang, Min.
LOGO Identifying Opinion Leaders in the Blogosphere Xiaodan Song, Yun Chi, Koji Hino, Belle L. Tseng CIKM 2007 Advisor : Dr. Koh Jia-Ling Speaker : Tu.
Measuring Behavioral Trust in Social Networks
1 1 COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified based on the slides provided by Lawrence Page, Sergey Brin, Rajeev Motwani.
On TWITTER on TWITTER.   Regular tweets with the added bonus of reaching both current and potential followers  Can Appear in:  User Timelines and.
“In the beginning -- before Google -- a darkness was upon the land.” Joel Achenbach Washington Post.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
11 A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 1, Michael R. Lyu 1, Irwin King 1,2 1 The Chinese.
Semantic Grounding of Tag Relatedness in Social Bookmarking Systems Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme ISWC 2008 Hyewon Lim January.
Analyzing and Predicting Question Quality in Community Question Answering Services Baichuan Li, Tan Jin, Michael R. Lyu, Irwin King, and Barley Mak CQA2012,
Pete Bohman Adam Kunk.  Introduction  Related Work  System Overview  Indexing Scheme  Ranking  Evaluation  Conclusion.
Exploring Social Influence via Posterior Effect of Word-of-Mouth Recommendations Junming Huang, Xue-Qi Cheng, Hua-Wei Shen, Tao Zhou, Xiaolong Jin WSDM.
Measuring User Influence in Twitter: The Million Follower Fallacy Meeyoung Cha Hamed Haddadi Fabricio Benevenuto Krishna P. Gummadi.
TwitterFeedRank Nick Flacco Dalton Huynh Abhishek Jha Phong Lam.
MINING DEEP KNOWLEDGE FROM SCIENTIFIC NETWORKS
Topical Authority Detection and Sentiment Analysis on Top Influencers
Neighborhood - based Tag Prediction
Market Intelligence Analysis
Link-Based Ranking Seminar Social Media Mining University UC3M
E-Commerce Theories & Practices
Information Propagation Speed and Patterns in Social Networks: a Case Study Analysis of German Tweets International Conference of Algorithms, Computing.
Luca Lugini Publication by Yingze Wang, Guang Xiang, and Shi-Kuo Chang
Learning Emoji Embeddings Using Emoji Co-Occurrence Network Graph
Speaker:Xu Bo Time:2011/9/7.
Biased Random Walk based Social Regularization for Word Embeddings
Example: Academic Search
Graph and Link Mining.
--WWW 2010, Hongji Bao, Edward Y. Chang
Presentation transcript:

Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International Conference on Web Search and Data Mining (WSDM 2010)

Outline Problem Dataset Twitter rank Results Recommended paper

Problem Identifying influential users of micro-blogging services How?

Frequently used algorithms NameAlgorithm Measures the influence withProblems InDIn-degree number of followersdoes not accurately capture the notion of influence PRPageRank link structure of the networkignores the interests of twitterers, which affects the way twitterers influence one another TSPR Topic-sensitive pagerank calculating PageRank vector for each topic. propagates a twitterer’s influence in one topic to her friends in different topics with equal probabilities TRTwitterrank Link structure + Topic similarity between users

Problem Identifying influential users of micro-blogging services Homophily

Why the problem important?

Brings order to the real-time web in that it allows the search results to be sorted by the authority/influence of the contributing twitterers giving a timely update of the thoughts of influential twitterers A marketing platform. Targeting those influential users will increase the e ffi ciency of the marketing campaign. For example, a hand phone manufacturer can engage those twitterers influential in topics about IT gadgets to potentially influence more people. There are also applications that utilize Twitter to gather opinions and information on particular topics. Identifying influential twitterers for interesting topics can improve the quality of the opinions gathered. ……

Context _ twitter Friend tweet Follower following friendfollower Tweet (<140words ) following

Context _ dataset Top-1000 Singapore

Context _ dataset ResourceSize Top-1000Core996 Extended twitterers All the followers and friends of each individual twitterer 6,748 TweetsAll the tweets they had published so far1,021,039

Context _ dataset Power-law distribution 5686 >1 tweets Mean= Among the 6745 twitterers, 957 have no friends, while 1782 have no followers

Context _ dataset Reciprocity Too casual Strong indicator of the similarity among users 72.4%80% following 80.5%80% following Homophily? How to prove?

Twitterrank Homophily

Topic distillation LDA(Latent Dirichlet Allocation) p( word | tweet ) = p( word | topic )*p( topic | tweet ) tweet topic tweet word topic DT : the probability that twitterer si is interested in topic tj. Automatically identify the topics that twitterers are interested in based on the tweets they published. WT

Homophily

Most of the twitterers (3785/4050) have less than 30 friends, which is not statistically significant. Therefore, two cases are considered. Why? pairs of twitterers with reciprocal “following” relationship.

Twitterrank

Jump why jump? It is possible that some twitterers would “follow” one another in a looping manner without “following” other twitterers outside the loop. Such loop will accumulate high influence without distribute their influence. To tackle this, a teleportation vector E t is also introduced, which basically captures the probability that the random surfer would “jump”to some twitterers instead of following the edges of the graph D

Twitterrank

Results Influential twitterers identified in the Twitter dataset

Compared algorithms NameAlgorithm Measures the influence withProblems InDIn-degree number of followersdoes not accurately capture the notion of influence PRPageRank link structure of the networkignores the interests of twitterers, which affects the way twitterers influence one another TSPR Topic-sensitive pagerank calculating PageRank vector for each topic. propagates a twitterer’s influence in one topic to her friends in different topics with equal probabilities TRTwitterrank Link structure + Topic similarity between users

Results

Application _ recommendation task Randomly choose | L | existing “following” relationship formed among twitterers A B A B

A recommended paper Java A, Song X, Finin T, et al. Why we twitter: understanding microblogging usage and communities[C]//Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis. ACM, 2007: 引用: 2126 次 Main work: People talk about their daily activities and to seek or share information. Analyze the user intentions associated at a community level and show how users with similar intentions connect with each other.