Hasan T Karaoglu. Introduction Blogs are different! Methods are different! Contents are different! Some methods on Some Content of Some Blogs Discussion.

Slides:



Advertisements
Similar presentations
Chapter 6 Politically Connected Your Vote Doesnt Count If you applied rationality to voting, you probably wouldnt vote. The costs of voting researching.
Advertisements

Measurement and Analysis of Online Social Networks 1 A. Mislove, M. Marcon, K Gummadi, P. Druschel, B. Bhattacharjee Presentation by Shahan Khatchadourian.
Tracking Information Epidemics in Blogspace A paper synopsis Alistair Wright, Ken Tan, Kisan Kansagra, Jenn Houston.
An Online News Recommender System for Social Networks Department of Computer Science University of Illinois at Urbana-Champaign Manish Agrawal, Maryam.
1 KSIDI June 9, 2010 Measuring User Influence in Twitter: The Million Follower Fallacy Meeyoung Cha Max Planck Institute for Software Systems (MPI-SWS)
Stelios Lelis UAegean, FME: Special Lecture Social Media & Social Networks (SM&SN)
Analysis and Modeling of Social Networks Foudalis Ilias.
Advanced Topics in Data Mining Special focus: Social Networks.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
Enabling the Social Web Krishna P. Gummadi Networked Systems Group Max Planck Institute for Software Systems.
Masters Thesis Defense Amit Karandikar Advisor: Dr. Anupam Joshi Committee: Dr. Finin, Dr. Yesha, Dr. Oates Date: 1 st May 2007 Time: 9:30 am Place: ITE.
TDTS21: Advanced Networking Lecture 8: Online Social Networks Based on slides from P. Gill Revised 2015 by N. Carlsson.
Mining and Searching Massive Graphs (Networks)
On the Structure, Properties and Utility of Internal Corporate Blogs Pranam Kolari Tim Finin, Yelena Yesha, Yaacov Yesha Kelly Lyons, Stephen Perelgut,
Hoax or Truth: Pattern Extraction Through Diffusion Analysis Hasan T Karaoglu.
Cascading Behavior in Large Blog Graphs Patterns and a Model Leskovec et al. (SDM 2007)
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
Measurement and Analysis of Online Social Networks By Alan Mislove, Massimiliano Marcon, Krishna P. Gummadi, Peter Druschel, Bobby Bhattacharjee Attacked.
Implicit Structure and Dynamics of BlogSpace Eytan Adar, Li Zhang, Lada Adamic, & Rajan Lukose HP Labs, Palo Alto, CA.
CS 599: Social Media Analysis University of Southern California1 Information Diffusion Kristina Lerman University of Southern California.
Measurement and Analysis of Online Social Networks Alan Mislove,Massimiliano Marcon, Krishna P. Gummadi, Peter Druschel, Bobby Bhattacharjee Presented.
The Social Web: A laboratory for studying s ocial networks, tagging and beyond Kristina Lerman USC Information Sciences Institute.
1 Measurement and Analysis of Online Social Networks A. Mislove, M. Marcon, K Gummadi, P. Druschel, B. Bhattacharjee Presentation by Yong Wang (Defense.
Inclusive Learning Through Technology Damian Gordon.
Overview of Web Data Mining and Applications Part I
Introduction Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Introduction Facebook How does Facebook use your data? Where do you think.
* What is reading? * Challenges for older readers and writers * What can I do to help? * What is available to support me? * Questions * Reading and writers.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
Standards Certification Education & Training Publishing Conferences & Exhibits Human Factors Engineering Why Smart People do Dumb Things.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Consider ways to use social software in your professional learning and school.
Social Networking and On-Line Communities: Classification and Research Trends Maria Ioannidou, Eugenia Raptotasiou, Ioannis Anagnostopoulos.
Authors: Xu Cheng, Haitao Li, Jiangchuan Liu School of Computing Science, Simon Fraser University, British Columbia, Canada. Speaker : 童耀民 MA1G0222.
AVI/Psych 358/IE 340: Human Factors Web 2.0 November
Neural Networks AI – Week 21 Sub-symbolic AI One: Neural Networks Lee McCluskey, room 3/10
Using Transactional Information to Predict Link Strength in Online Social Networks Indika Kahanda and Jennifer Neville Purdue University.
 The nugger was flinp.  The nugger was flinp and wugnet.  The nugger was flinp, wugnet and manple in my waslet.  What was flinp?  How else does the.
Data Analysis in YouTube. Introduction Social network + a video sharing media – Potential environment to propagate an influence. Friendship network and.
Generating and Tracking Communities Based on Implicit Affinities Matthew Smith – BYU Data Mining Lab April 2007.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network author: Meeyoung Cha Alan Mislove Krishna P. Gummadi From Saarbrucken,
COLOR TEST COLOR TEST. Social Networks: Structure and Impact N ICOLE I MMORLICA, N ORTHWESTERN U.
Microblogs: Information and Social Network Huang Yuxin.
Teaching reading.
Technical Reading Presented by Beatrice Moore Luchin NUMBERS Mathematics Professional Development NUMBERSmpd.com.
Predicting Positive and Negative Links in Online Social Networks
NTU Natural Language Processing Lab. 1 Investment and Attention in the Weblog Community Advisor: Hsin-Hsi Chen Speaker: Sheng-Chung Yen.
A measurement-driven Analysis of Information Propagation in the Flickr Social Network Meeyoung Cha Alan Mislove Krisnna P. Gummadi.
Illusions and Other Visual Defects CITA 6016 Food Sensory Analysis University of Puerto Rico Food Science & Technology.
Mining information from social media
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.
© 2004 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Implicit Structure and Dynamics of.
1 Patterns of Cascading Behavior in Large Blog Graphs Jure Leskoves, Mary McGlohon, Christos Faloutsos, Natalie Glance, Matthew Hurst SDM 2007 Date:2008/8/21.
CIS750 – Seminar in Advanced Topics in Computer Science Advanced topics in databases – Multimedia Databases V. Megalooikonomou Link mining ( based on slides.
Measuring User Influence in Twitter: The Million Follower Fallacy Meeyoung Cha Hamed Haddadi Fabricio Benevenuto Krishna P. Gummadi.
Link Prediction Class Data Mining Technology for Business and Society
Social Networks Some content from Ding-Zhu Du, Lada Adamic, and Eytan Adar.
The Spread of Media Content through the Blogosphere
Elements of Effective Literacy Instruction in Grades 5-8
Aoccdrnig to rscheearch at an Elingsh uinervtisy, it deosn’t mttaer in what oredr the ltteers in a wrod are, the olny iprmoetnt tihng is taht the frist.
Link Prediction Seminar Social Media Mining University UC3M
Generative Model To Construct Blog and Post Networks In Blogosphere
NSDL/NSTA Web Seminar: Learning By Doing—Computational Science
The likelihood of linking to a popular website is higher
Even though the next page may look weird, you can still read it!
DEBBIE CHENG * LISA HANKIN * JOHN MARK JOSLING
A Network Science Approach to Fake News Detection on Social Media
How does your brain perceive objects?
Modeling Trust and Influence in the Blogosphere using Link Polarity
Presentation transcript:

Hasan T Karaoglu

Introduction Blogs are different! Methods are different! Contents are different! Some methods on Some Content of Some Blogs Discussion

Blogs are a popular way to share personal journals, discuss matters of public opinion, have collaborative conversations, aggregate content on similar topics. Blogs also disseminate new content novel ideas How does content spread across, what kinds of content spreads, and at what rate?

Epidemics : one way of modeling these aspects Physics of Information Diffusion Disease Propagation Model Susceptible Infected Recovered Mutation? Threshold Model for Social Networks

Youtube, Flickr (Content Sharing ) Amazon CNN, MSNBC (Web) Linkedln (Professional Networking) Orkut, Facebook, Yonja (Social Networking) Twitter (?) Blogger, Blogspot, LiveJournal, Slashdot (Blogspace)

High level of reciprocity Symmetric indegree – outdegree In contrast to Web (high authority sites)

Average Path Length is very short in compared to Web. (Directionality ?)

Joint Degree Distribution (High Degree Nodes Connect to Other High Degree Nodes) Epidemics on Network Core? Youtube Celebrities?

Strongly Connected Core Analysis Slowly Increasing Shortest Path High Clustering

Strong Local Clustering (people tend to be introduced to other people via mutual friends)

Epidemics Gossip Influence Map (Word of Mouth) Recommendation Based Web (Data) Mining Mathematical Modeling (Markov Chains, Information Theory, …) …

Recommendation News (Political, Fun, Paparazzi) Gossip Media (Music, News, Excerpts)

Infection Inference technique introduced by Adamic et al. Link inference Link classification Classifier training Problems and Challenges

Pattern Used for Classifier Training The number of common blogs explicitly linked to by both blogs (indicating whether two blogs are in the same community) The number of non-blog links (i.e. URLs) shared by the two Text similarity Order and frequency of repeated infections. Specifically, the number of times one blog mentions a URL before the other and the number of times They both mention the URL on the same day. In-link and out-link counts for the two blogs

Text Similarity s(A,B) = n AB / √n A / √n B

Timing of Infection

Link Inference Blog URL and Text Similarity Patterns Three-way Classifier (57%) reciprocated links, one way links, unlinked pairs Two-way Classifier (SVM 91.2% Logistic Regression 91.9%) linked unlinked pairs Infection Inference n A-before-B /n A, n A-after-B /n A, n A-same-day-B /n A Timing Patterns (75%) with all 6 timing patterns and text/blog similarity patterns (61 – 75%) link-in / link-out counts

Visualization Heuristics using classifiers Two types of graph Directed Acyclic Graph Most likely tree

Epidemic Propagation Model by Gruhl et al. Topics Individuals Topics Topic = Chatter + Spike + (Resonance)

Epidemic Propagation Model by Gruhl et al. Topics Individuals Topics Topic = Chatter + Spike + (Resonance)

aoccdrnig to rscheearch at an elingsh uinervtisy it deosn’t mttaer in waht oredr the ltteers in a wrod are, the olny iprmoetnt tihng is taht the frist and lsat ltteer is at the rghit pclae

Power-law Characteristic for Individuals Different Posting Behaviors for Individuals

Propagation Model Cascading Model Copy Probability κ(v,w) Noticing Probability r(v,w) For 7K topics, r mean 0.28 and std 0.22, κ quite low, mean 0.04 and std 0.07, Even bloggers who commonly read from another source are selective in the topics they choose to write about.

Could we use these models to extract further pattern or characteristics ? Classification of Hoax, Fake News ? Prediction of Popular songs, videos at their inception …..

Thanks!

D. W. Drezner, and H. Farrell, “Web of Influence,” Foreign Policy, vol. 145, pp , Dec E. Adar and L. A. Adamic, “Tracking Information Epidemics in Blogspace,” Proceedings of the 2005 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 207–214, D. Gruhl, R. Guha, D. Liben-Nowell, and A. Tomkins, “Information diffusion through blogspace,” Proceedings of the 13th international conference on World Wide Web, pp ,2004. A. Mislove, M. Marcon, K. P. Gummadi, P. Druschel, and B. Bhattacharjee, “Measurement and Analysis of Online Social Networks,” Proceedings of the 7th ACM SIGCOMM conference on Internet measurement, pp , 2007 M. Cha, J. A. N. Perez, and H. Haddadi, "Flash Floods and Ripples: The Spread of Media Content through the Blogosphere", 3rd Int'l AAAI Conference on Weblogs and Social Media (ICWSM) Data Challenge Workshop, May , 2009, San Jose, CaliforniaM. Young, The Technical Writer's Handbook. Mill Valley, CA: University Science, Z. Fanzi, Q. Zhengding, L. Dongsheng, and Y. Jianhai, “Shape-based time series similarity measure and pattern discovery algorithm”, Journal of Electronics, vol. 22, pp , Aug. 2007