An Ant Colony Optimization Approach Expert Identification in Social Networks Muhammad Aurangzeb Ahmad, Jaideep Srivastava Department of Computer Science.

An Ant Colony Optimization Approach Expert Identification in Social Networks Muhammad Aurangzeb Ahmad, Jaideep Srivastava Department of Computer Science and Engineering, University of Minnesota International Workshop on Social Computing, Behavioral Modeling, and Prediction Phoenix, AZ, April 1-2, 2008

Outline  Part A: Minnesota research program  Data mining @ Minnesota  SBP research @ Minnesota  Part B: Expert identification in social networks  Background  Possible Approaches  Ant Colony Optimization (ACO)  ACO for Expert Identification  Current Work  Results  Future Work University of Minnesota

Data Mining at Minnesota Core Research Models, algorithms tools NSF, ARL, NASA Business Applications Sales & marketing (IBM) Automotive (Daimler/Chrysler) CRM (Intel) e-CRM (Intuit) Claims fraud (Ingenix/UHG) Bio-Medical Applications Medical informatics (Mayo) Bioinformatics (NIH) Behavioral ecology (NSF) Government Applications Cyber security (ARDA, ARL) Transportation (MnDoT, FHWA) Physical security (United Tech) Tax Audit (Dept of Revenue) Driver modeling (Eaton) Science & Engineering Applications Climate modeling (NASA) High energy physics (Fermi) Penetration mechanics (ARL) Simulation (LLNL, DOE) Vehicle Health (NASA)  Technology transfer  Minnesota Intrusion Detection System (MINDS) – being used by Army Research Lab and University of Minnesota  Sales Opportunity Miner – IBM is building a full-fledged tool based on this research  Global Climate Modeling for NASA  MN Dept of Revenue uses our models for selecting tax payers to audit

Books Authored by DM Faculty

SBP Research @ Minnesota  Virtual world exploratorium for computational social sciences  Computational approach to modeling trust  Trust, reputation, social capital, etc.  SNA driven healthcare management incentive design  Structure and resource identification from the Web  Information search, expert identification, community identification, etc.

MMO Games  MMO (Massively Multiplayer Online) Games are computer games that allow hundreds to thousands of players to interact and play together in a persistent online world Popular MMO Games- Everquest 2, World of Warcraft and Second Life

University of Minnesota 7 Sponsorship National Science Foundation Army Research Institute Sony Corporation UMN, UIUC, USC, Northwestern Sociology research questions How do networks within the ecosystem of groups enable and constrain the formation of groups? How do micro-group processes influence group effectiveness and social identity? Psychology research questions What impact does playing video games has people’s real lives? Is online behavior different in MMOs vs. tradition video games? Macroeconomics research questions lots of them Computer Science research questions quantitative metrics algorithms & scalability Marketing questions (Sony) early identification of customer attrition social influence and its impact on up-sell & cross-sell Team training what team structures and communication patterns facilitate and/or inhibit performance how can +ve structures & patterns be incorporated in training programs Enabling research in multiple disciplines Noshir Contractor, Northwestern communication theory, orgn theory Scott Poole, Urbana sociology Dmitri Williams, USC social psychology Jaideep Srivastava, Minnesota computer science Sony Corporation gaming, marketing, scalability Key resource Everquest 2 dataset from Sony 250,000+ players 3+ years complete click-stream 14+ terabytes of data

Building a Web of Trust w/o Trust Ratings 1 A framework for deriving degree of trust The relationship between a review writer and a review rater 1-1: Calculating Quality of a Review and Reputation of a Review Rater 1-2: Calculating Reputation of a Review Writer 1-3: Constructing Users_Category Expertise Matrix E 1. Young Ae Kim, Hady W. Lauw, Ee-Peng Lim, Jaideep Srivastava, Building a Web of Trust without Explicit Trust Ratings, ICDE 2008 Workshop.

University of Minnesota Discovering Referral Networks from Medicare Data Pulmono logist Cardio logist Geriatrics Podiatrist Rheumato logist Medical Problems Patient Doug Wholey, Healthcare Policy & Management Dave Knutson, Minnesota Department of Health Jaideep Srivastava, Computer Science & Engineering

University of Minnesota Referral Networks and Cooperation  Problem  In many cases people visit multiple doctors and specialists for their medical needs  The patients would be served better if there were better coordination between these specialists  Classical approach  Offer incentives individually to specialists  Defects in this approach  Each specialist may want to “optimize” his/her own incentives  In such settings local optimization of services does not lead to global optimization of services  Proposed approach  Identify Referral Networks to encourage specialists to work together to offer better services  provide group incentives

Outline  Part A: Minnesota research program  Data mining @ Minnesota  SBP research @ Minnesota  Part B: Expert identification in social networks  Background  Possible Approaches  Ant Colony Optimization (ACO)  ACO for Expert Identification  Current Work  Results  Future Work University of Minnesota

Problem Background  Problem: Expert Identification in Social Networks  Setting:  A Dynamically Changing Social Network  A Dynamic Distribution of Topics.  No central management.  Reduce response time, Avoid flooding.  Problem Formulation:  Given a graph of E experts, a topic distribution T, devise an algorithm for expert identification that can be incrementally updated. University of Minnesota

Possible Approaches  Have a centralized repository of expertise and experts.  Assumes that one already knows what the 'topics' are and who the corresponding experts are.  Alternatively maintain a topic hierarchy over the network.  Also assumes that the topics and that the topics are stationary. University of Minnesota

ACO (Ant Colony Optimization) ‏  Initial Conditions:  A colony of ants foraging for food.  No central ‘brain’ controlling the ants.  Foraging for Food (Resource)  Initially ants set off in random directions to forage for food.  When an ant finds a food source it retraces its path.  Ants lay chemical trials called pheromones in their path which can evaporate if not reinforced. University of Minnesota

ACO (Ant Colony Optimization) ‏  Frequently used path become reinforced while the less frequently used paths become weak.  Ants follow the paths which have stronger trails. University of Minnesota

An ACO Model for Expert Identification  Queries are represented as ants.  Whenever a query ant finds an answer to a query it retraces its path and lays out a trail  Forward Ant and Backward Ant  Experts are the nodes with strong trails leading to them.  The first c ants are just allowed to traverse the network like a k-random walker.  Time to Live: maximum number of iterations that the ant should explore the network if the answer to the query is not found. University of Minnesota

ACO Approach  Queries are routed based on the scents.  Multiple keywords as different types of scents.  Different pheromones are combined for cases where one encounters an unfamiliar query.  The network as consisting of multiple types of pheromone trails. University of Minnesota

The ACO Approach  Amount of Pheromone Laid Q = The length of the path, j = edge, i = ant Lj = Distance from the origin to the node under consideration  Route Selection (multiple keywords) U = set of neighbors of the current node F Q = set of already visited nodes j = node to be selected University of Minnesota

Experiments and Results  ACO approach vs. K-random walker.  This is analogous to the situation where one does not know who the experts are.  Evaluation Metrics (Adopted from Michalmyr)  Resource Usage: Number of edges traversed for each query within a given period of time.  Hit rate: Number of queries satisfied within a given period of time.  Efficiency: Resource usage / Hit rate. University of Minnesota

Experiments University of Minnesota Size of Network = 10,000

Conclusion  A 'solution' that self-organizes.  'Solution' can be incrementally built.  Graceful degradation of performance.  Can account for changes in the network.  Topics for expertise do not have to be predefined.  An ant colony optimization approach for expert identification.  Topic based and key word based approach. University of Minnesota

Appendix: Related Work (ACO)  The Any Colony Optimization (ACO) Algorithm was developed by Margo Dorigo in 1992.  Main Applications: Assignment Problems, Scheduling Problems, Routing Problems.  ACO is ideal for problems where minimal cost has to be computed.  SemAnt, Query Routing in distributed environment with a predefined taxonomy. (ElkeMichlmayr) University of Minnesota

Appendix: Related Work  Expert Identification  Text on Message Boards. (ContactFinder)  E-mail and text analysis. (Schwartz et al.)  Graph Based Ranking Approaches. (Campell)  Query Routing  Broadcasting, Flooding.  History Based Query Routing. (Cohen et al.) REMINDIN (Tempich et al.)  Kleingberg’s Query Incentive Networks University of Minnesota

An Ant Colony Optimization Approach Expert Identification in Social Networks Muhammad Aurangzeb Ahmad, Jaideep Srivastava Department of Computer Science.

Similar presentations

Presentation on theme: "An Ant Colony Optimization Approach Expert Identification in Social Networks Muhammad Aurangzeb Ahmad, Jaideep Srivastava Department of Computer Science."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

An Ant Colony Optimization Approach Expert Identification in Social Networks Muhammad Aurangzeb Ahmad, Jaideep Srivastava Department of Computer Science.

Similar presentations

Presentation on theme: "An Ant Colony Optimization Approach Expert Identification in Social Networks Muhammad Aurangzeb Ahmad, Jaideep Srivastava Department of Computer Science."— Presentation transcript:

Similar presentations

About project

Feedback