Twitter Community Discovery & Analysis Using Topologies Andrew McClain Karen Aguar.

Slides:



Advertisements
Similar presentations
Community Detection and Graph-based Clustering
Advertisements

Mining User Similarity Based on Location History Yu Zheng, Quannan Li, Xing Xie Microsoft Research Asia.
An Evaluation of Community Detection Algorithms on Large-Scale Traffic 1 An Evaluation of Community Detection Algorithms on Large-Scale Traffic.
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
1 Complexity of Network Synchronization Raeda Naamnieh.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Clementine Server Clementine Server A data mining software for business solution.
The Very Small World of the Well-connected. (19 june 2008 ) Lada Adamic School of Information University of Michigan Ann Arbor, MI
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Data Mining Techniques
Projects ( ) Ida Mele. Rules Students have to work in teams (max 2 people). The project has to be delivered by the deadline that will be published.
Social Network Analysis via Factor Graph Model
Module 3: Business Information Systems Chapter 11: Knowledge Management.
Data Mining Chun-Hung Chou
SOCIAL NETWORKS AND THEIR IMPACTS ON BRANDS Edwin Dionel Molina Vásquez.
SPDG Day Presentation July 18, 2011 Cynthia Glimpse Technical Assistance Coordination Center.
1/27 Ensemble Visualization for Cyber Situation Awareness of Network Security Data Lihua Hao 1, Christopher G. Healey 1, Steve E. Hutchinson 2 1 North.
Graph Data Management Lab, School of Computer Science Put conference information here: The 12-th International Conference.
The Effects of Ranging Noise on Multihop Localization: An Empirical Study from UC Berkeley Abon.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
1 1 Slide Introduction to Data Mining and Business Intelligence.
CS 765 – Fall 2014 Paulo Alexandre Regis Reddit analysis.
Network Analysis Diffusion Networks. Social Network Philosophy Social structure is visible in an anthill Movements & contacts one sees are not random.
Principles of Social Network Analysis. Definition of Social Networks “A social network is a set of actors that may have relationships with one another”
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Presented By :Ayesha Khan. Content Introduction Everyday Examples of Collaborative Filtering Traditional Collaborative Filtering Socially Collaborative.
A Graph-based Friend Recommendation System Using Genetic Algorithm
INTERACTIVE ANALYSIS OF COMPUTER CRIMES PRESENTED FOR CS-689 ON 10/12/2000 BY NAGAKALYANA ESKALA.
On Finding Fine-Granularity User Communities by Profile Decomposition Seulki Lee, Minsam Ko, Keejun Han, Jae-Gil Lee Department of Knowledge Service Engineering.
A Discrepancy Detector James Abello, CCICADA-DIMACS FACULTY ( Student: Nishchal Devanur CS Dept Rutgers Goal To detect the most influential.
Big Ideas Differentiation Frames with Icons. 1. Number Uses, Classification, and Representation- Numbers can be used for different purposes, and numbers.
+ The Division or Classification Essay Catherine Wishart Senior Adjunct Instructor.
Principles of Data Mining. Introduction: Topics 1. Introduction to Data Mining 2. Nature of Data Sets 3. Types of Structure Models and Patterns 4. Data.
Visual Analytics with Linked Open Data and Social Media for e- Governance Vitaveska Lanfranchi Suvodeep Mazumdar Tomi Kauppinen Anna Lisa Gentile Updated.
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Prediction of Influencers from Word Use Chan Shing Hei.
C. Lawrence Zitnick Microsoft Research, Redmond Devi Parikh Virginia Tech Bringing Semantics Into Focus Using Visual.
Chapter 1 Table of Contents Section 1 What Is Physics?
Mining and Visualizing the Evolution of Subgroups in Social Networks Falkowsky, T., Bartelheimer, J. & Spiliopoulou, M. (2006) IEEE/WIC/ACM International.
Local/Global Term Analysis for Discovering Community Differences in Social Networks David Fuhry, Yiye Ruan, and Srinivasan Parthasarathy Data Mining Research.
Week 2 The lecture for this week is designed to provide students with a general overview of 1) quantitative/qualitative research strategies and 2) 21st.
Measuring Behavioral Trust in Social Networks
Visual Data Analysis By: Anna Samouris, Tyerell Warren, LaVonda Suggs & Ashley Turner.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
Chapter 1 Preview Objectives Physics The Scientific Method Models
An Effective Method to Improve the Resistance to Frangibility in Scale-free Networks Kaihua Xu HuaZhong Normal University.
Big Data Using Big Data for Cultures and Communities Jeremy Reffin Simon Wibberley CASM, University of Sussex Carl Miller CASM, Demos July 2014.
Twitter Community Discovery & Analysis Using Topologies
Visualization of Washing Powder Formulation ———seeking the best ingredients of washing powder.
Visualization of evolutionary cascades of messages using force-directed graphs Master’s thesis defense, 9 may 2011 Artjom Kurapov Supervisor: Helena Kruus.
1 Visualizing Multi-dimensional Clusters, Trends, and Outliers using Star Coordinates Author : Eser Kandogan Reporter : Tze Ho-Lin 2007/5/9 SIGKDD, 2001.
Visualizing Neuron Behavior in Dynamic Mouse Brain Networks Chihua Ma 4/9/2015.
Clustering Microarray Data based on Density and Shared Nearest Neighbor Measure CATA’06, March 23-25, 2006 Seattle, WA, USA Ranapratap Syamala, Taufik.
Location-based Social Networks 6/11/20161 CENG 770.
Department of Computer Science Sir Syed University of Engineering & Technology, Karachi-Pakistan. Presentation Title: DATA MINING Submitted By.
GUILLOU Frederic. Outline Introduction Motivations The basic recommendation system First phase : semantic similarities Second phase : communities Application.
What Is Cluster Analysis?
DATA MINING © Prentice Hall.
Statistical Data Analysis
Empirical analysis of Chinese airport network as a complex weighted network Methodology Section Presented by Di Li.
Dieudo Mulamba November 2017
Data Exploration Of Wikipedia
Data Science introduction.
Statistical Data Analysis
Data Warehousing Data Mining Privacy
Scientific Method Lab Mapping.
Discovering Important Nodes through Graph Entropy
Yingze Wang and Shi-Kuo Chang University of Pittsburgh
Presentation transcript:

Twitter Community Discovery & Analysis Using Topologies Andrew McClain Karen Aguar

Outline Introduction Motivation Project Description o Our Objective o Community Discovery o Analysis & Application Data collection Use of Gephi

Introduction Many people use services like twitter to stay in contact with groups in which they are members or to interact with other people with similar interests These groups are considered “communities”

Community? A network or group of nodes with greater ties internally than to the rest of the network There are various derivations of a community: o Some communities are tightly bound together o Others are loose associations of people Communities can be defined by a quality function o Several quality functions may be used & will vary based on the situations o Experimentally determine the best quality function for our purposes

Motivation We want to classify these communities & find real world implications of their digital associations Project Description: Discovering communities & examining the properties of the graph to give us insight into the community itself. Ex: Find the organizers of a hobby group by the twitter activity

Our Project Our project can be broken into 2 main sections 1.Twitter community discovery 2.Analysis of the community graphs & its correlation to the real world community structure

Community Discovery 1.Select a diverse number of individuals from known real-world communities 2.Apply local graph clustering to isolate the community that they belong to 1.Example: CNN 3.Generate graphs of the communities

Analysis & Application 1.Analyze the relationships in the graphs using a variety of analysis techniques 2.Detect behavior patterns and structures in twitter communities 1.Shape, interconnectivity, how the information flows through it 3.Apply our knowledge to learn about unknown communities based on twitter behavior *time permitting

Data Collection Datasets containing actual tweets are now unavailable due to a change in Twitter’s terms of use. We will collect our own data through the use of: Twitter Rest API Gephi -- open source graph visualization platform o Retweet plugin for Gephi

Gephi for Community Discovery We will use Gephi partitioning methods to set different ways of partitioning the graph & use quality functions to determine what is / is not a community. Gephi gives us very powerful filter functions so that we can reduce data down to what we want very quickly

References Community Discovery in Social Networks: Applications, Methods and Emerging Trends o S. Parthasarathy, Y. Ruan, V. Satuluri [2011] gephi.com dev.twitter.com