Special Topics in Educational Data Mining HUDK5199 Spring 2013 March 25, 2012.

Slides:



Advertisements
Similar presentations
Social Network Analysis (in 10 minutes) Nick Crossley.
Advertisements

Mobile Communication Networks Vahid Mirjalili Department of Mechanical Engineering Department of Biochemistry & Molecular Biology.
Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale Network Theory: Computational Phenomena and Processes Social Network.
Network Matrix and Graph. Network Size Network size – a number of actors (nodes) in a network, usually denoted as k or n Size is critical for the structure.
Analysis and Modeling of Social Networks Foudalis Ilias.
Social Network Analysis and Its Applications By Paul Rossman Indiana University of Pennsylvania.
CS 206 Introduction to Computer Science II 03 / 27 / 2009 Instructor: Michael Eckmann.
Relationship Mining Network Analysis Week 5 Video 5.
Feb 20, Definition of subgroups Definition of sub-groups: “Cohesive subgroups are subsets of actors among whom there are relatively strong, direct,
By: Roma Mohibullah Shahrukh Qureshi
Interactive Data Visualization for Rapid Understanding of Scientific Literature Cody Dunne Dept. of Computer Science and Human-Computer Interaction Lab,
Copyright © hutchinson associates 2005 The Knowledge is in the Network Patti Anklam June Holley Valdis Krebs Using Network Analysis to Understand and Improve.
Centrality and Prestige HCC Spring 2005 Wednesday, April 13, 2005 Aliseya Wright.
CS 206 Introduction to Computer Science II 03 / 25 / 2009 Instructor: Michael Eckmann.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
HCC class lecture 22 comments John Canny 4/13/05.
Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
A Distributed and Privacy Preserving Algorithm for Identifying Information Hubs in Social Networks M.U. Ilyas, Z Shafiq, Alex Liu, H Radha Michigan State.
Supporting Distributed Relationships: A study of relations and media use over time Caroline Haythornthwaite Graduate School of Library and Information.
Infrastructure of MANETs  MANETS are without a fixed infrastructure  Network Graphs in MANETS are rarely or ever connected  MANET routing protocols.
Social Network Analysis: A Non- Technical Introduction José Luis Molina Universitat Autònoma de Barcelona
Alias Detection Using Social Network Analysis Ralf Holzer, Bradley Malin, Latanya Sweeney LinkKDD 2005 Advisor: Dr. Koh Jia-Ling Reporter: Che-Wei,
Principles of Social Network Analysis. Definition of Social Networks “A social network is a set of actors that may have relationships with one another”
COM1721: Freshman Honors Seminar A Random Walk Through Computing Lecture 2: Structure of the Web October 1, 2002.
Social Network Analysis (1) LING 575 Fei Xia 01/04/2011.
FRE 2661 CSCL Conference, Bergen, june 2003C. Reffay, T. Chanier 1 How Social Network Analysis can help to measure cohesion in collaborative distance-learning.
Vertices and Edges Introduction to Graphs and Networks Mills College Spring 2012.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 April 25, 2012.
Social Network Metrics. Types of network metrics Group level – Density – Components Isolates – Cliques – Centralization Degree Closeness Betweenness –
Centrality Spring 2012.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Chapter 5 Social Network Analysis: Techniques to Discover How Work Really Gets Done.
Mathematics of Networks (Cont)
Chapter 3. Community Detection and Evaluation May 2013 Youn-Hee Han
How to Analyse Social Network?
A project from the Social Media Research Foundation: Finding direction in a sea of connection:
Partitioning The Network Copyright © 2012: HyperEdge Pty Ltd 1.
Network Community Behavior to Infer Human Activities.
+ Big Data, Network Analysis Week How is date being used Predict Presidential Election - Nate Silver –
Its a specific set of linkages among a defined set of persons with the additional property that the characteristics of these linkages as a whole may be.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Graphs G = (V,E) V is the vertex set. Vertices are also called nodes and points. E is the edge set. Each edge connects two different vertices. Edges are.
Social network analysis
Selected Topics in Data Networking Explore Social Networks: Center and Periphery.
How to Analyse Social Network? Social networks can be represented by complex networks.
Informatics tools in network science
Selected Topics in Data Networking Explore Social Networks: Cliques and Sub-group (Example Application)
Topical Analysis and Visualization of (Network) Data Using Sci2 Ted Polley Research & Editorial Assistant Cyberinfrastructure for Network Science Center.
Bennington’s Community Health Network. Study Objective Objective Describe the network of organizations that has emerged in each Blueprint HSA to support.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
Springfield’s Community Health Network. Study Objective Objective Describe the network of organizations that has emerged in each Blueprint HSA to support.
CRIM6660 Terrorist Networks Lesson 1: Introduction, Terms and Definitions.
Core Methods in Educational Data Mining
Classroom network analysis
Groups of vertices and Core-periphery structure
Social Networks Analysis
Applications of graph theory in complex systems research
Comparison of Social Networks by Likhitha Ravi
Core Methods in Educational Data Mining
Network analysis.
Core Methods in Educational Data Mining
CS120 Graphs.
Network Science: A Short Introduction i3 Workshop
Centrality in Social Networks
Bikalp Chamola (VAF) Shyam Singh (IRMA)
(Social) Networks Analysis II
Core Methods in Educational Data Mining
Graphs G = (V,E) V is the vertex set.
Presentation transcript:

Special Topics in Educational Data Mining HUDK5199 Spring 2013 March 25, 2012

Today’s Class Social Network Analysis

General Principles of Social Network Analysis

General Postulates of Social Network Analysis

There are many entities, referred to as nodes or vertices Nodes have connections to other notes, referred to as ties or links Nodes can have different types or identities Links can have different types or identities Links can have different strengths

Example (Student work groups – Kay et al., 2006)

nodes

Example (Student work groups – Kay et al., 2006) ties

Example (Student work groups – Kay et al., 2006) Strong ties Weak ties

Which student group works together better?

Which is the most collaborative pair?

Who is the most collaborative student?

Types In a graph of classroom interactions, what different types of nodes could there be?

Types In a graph of classroom interactions, what different types of nodes could there be? – Teacher – TA – Student – Project Leader – Project Scribe

Types In a graph of classroom interactions, what different types of links could there be?

Types In a graph of classroom interactions, what different types of links could there be? – Leadership role (X leads Y) – Working on same learning resource – Helping act – Criticism act – Insult – Note that links can be directed or undirected

Strength In a graph of classroom interactions, what would make links stronger or weaker?

Strength In a graph of classroom interactions, what would make links stronger or weaker? – Intensity of act (Examples?) – Frequency of act (Examples?)

Examples What might be some types of social networks that would be studied in the learning sciences? What might be some relevant research questions?

Social Network Analysis Use social network graphs to study the patterns and regularities of the relationships between the nodes

Density Proportion of possible lines that are actually present in graph What is the density of these graphs?

Reachability A node is “reachable” if a path goes from any other node to it Which nodes are reachable and unreachable?

Geodesic Distance The number of nodes between one node N and another node M

Example (Dawson, 2008)

What is the geodesic distance?

Geodesic Distance What might be a use for geodesic distance in educational research?

Flow How many possible paths are there between node N and node M?

What is the flow?

Flow What might be a use for flow in educational research?

Centrality How important is a node within the graph?

Centrality Four common measures – Degree centrality – Closeness centrality – Betweeness centrality – Eigenvector centrality

Nodal Degree Number of lines that connect to a node

Which node has the highest nodal degree?

Nodal Degree Indegree: number of lines that come into a node – How might this be interpreted for some link types you might see in educational data? Outdegree: number of lines that come out of a node – How might this be interpreted for some link types you might see in educational data?

Closeness A node N’s closeness is defined as the sum of its distance to other nodes The most central node in terms of closeness is the node with the lowest value for this metric Note that strengths can be used as a distance measure for calculating closeness – Higher strength = closer nodes

Which node has highest closeness? (looking solely at number of steps)

Which node has highest closeness? (looking at link strengths)

Betweenness Betweeness centrality for node N is computed as: The percent of cases where For each pair of nodes M and P (which are not N) – The shortest path from M to P passes through N

What is this node’s betweenness

Betweenness How might this be interpreted for some link types you might see in educational data?

Eigenvector Centrality Complex math, but assigns centrality to nodes through recursive process where More and stronger connections are positive Connections to nodes with higher eigenvector centrality contribute more than connections to nodes with lower eigenvector centrality

Eigenvector Centrality What type of applications might this have?

How do these measures differ in meaning?

Reciprocity What percentage of ties are bi-directional? – Can be computed as number of bi-directional ties over total number of connected pairs

What is the reciprocity?

What could reciprocity tell you? For educational data

Clique Sub-set of a network for which all nodes are connected to each other – If there is any node which is connected to all nodes in the clique – Then it is also part of the clique

What are the cliques?

Clique What could cliques tell you in educational research problems?

N-Clique Sub-set of a network for which all nodes are connected to each other with a path of geodesic distance of N or less

What are the 2-cliques?

K-plex Sub-set of a network, of size N, for which all nodes are connected to at least N-K other members of the K-plex

What are the 1-plexes?

Connections between cliques Can represent key conduits for information Example from Haythornthwaite (1998)

Communication in a class (letters indicate groups)

Comments? Questions?

Case Studies in Uses of Social Network Analysis (Haythornthwaite, 2001) (Dawson, 2008)

How? How did Haythornthwaite and Dawson use social network analysis to learn about collaborative learning?

Haythornthwaite Analyzed data from four groups from same class over time Analyzed students’ communication behaviors – Collaborative Work – Exchanging Advice – Socializing – Emotional Support Analyzing students’ use of communication technologies – Webboard – IRC – – NetMeeting – Telephone – Face-to-Face

Dawson Analyzed student perception of being part of a social community and a learning community, in relation to their centrality (multiple measures)

Other uses? What are some other uses of social network analysis for learning beyond those we’ve discussed today?

Comments? Questions?

Assignment 6 Solutions

Assignment 7

Next Class Wednesday, March 27 Correlation Mining and Causal Mining Readings Arroyo, I., Woolf, B. (2005) Inferring learning and attitudes from a Bayesian Network of log file data. Proceedings of the 12th International Conference on Artificial Intelligence in Education, Rai, D., Beck, J.E. (2011) Exploring user data from a game-like math tutor: a case study in causal modeling. Proceedings of the 4th International Conference on Educational Data Mining, Rau, M. A., & Scheines, R. (2012) Searching for Variables and Models to Investigate Mediators of Learning from Multiple Representations. Proceedings of the 5th International Conference on Educational Data Mining, Assignments Due: None

The End