Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Structural.

Slides:



Advertisements
Similar presentations
Prof. Carolina Ruiz Department of Computer Science Worcester Polytechnic Institute INTRODUCTION TO KNOWLEDGE DISCOVERY IN DATABASES AND DATA MINING.
Advertisements

Human Language Technologies. Issue Corporate data stores contain mostly natural language materials. Knowledge Management systems utilize rich semantic.
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
On Community Outliers and their Efficient Detection in Information Networks Jing Gao 1, Feng Liang 1, Wei Fan 2, Chi Wang 1, Yizhou Sun 1, Jiawei Han 1.
Artificial Intelligence and Lisp Lecture 13 Additional Topics in Artificial Intelligence LiU Course TDDC65 Autumn Semester, 2010
Semantic text features from small world graphs Jure Leskovec, IJS + CMU John Shawe-Taylor, Southampton.
CSE 574: Artificial Intelligence II Statistical Relational Learning Instructor: Pedro Domingos.
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
Outline Chapter 1 Hardware, Software, Programming, Web surfing, … Chapter Goals –Describe the layers of a computer system –Describe the concept.
© 2001 Franz J. Kurfess Introduction 1 CPE/CSC 580: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly.
02 -1 Lecture 02 Agent Technology Topics –Introduction –Agent Reasoning –Agent Learning –Ontology Engineering –User Modeling –Mobile Agents –Multi-Agent.
Overview of Web Data Mining and Applications Part I
1 A Topic Modeling Approach and its Integration into the Random Walk Framework for Academic Search 1 Jie Tang, 2 Ruoming Jin, and 1 Jing Zhang 1 Knowledge.
Copyright R. Weber INFO 629 Concepts in Artificial Intelligence Fall 2004 Professor: Dr. Rosina Weber.
Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Structural.
An ontology of computing. What is an ontology? An ontology is a specification of a conceptualization. A specification of a representational vocabulary.
1 Using R for consumer psychological research Research Analytics | Strategy & Insight September 2014.
Some studies on Vietnamese multi-document summarization and semantic relation extraction Laboratory of Data Mining & Knowledge Science 9/4/20151 Laboratory.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Building and Analyzing Social Networks Case Studies of Semantic Social Network Analysis Dr. Bhavani Thuraisingham February 22, 2013.
Deciding Semantic Matching of Stateless Services Duncan Hull †, Evgeny Zolin †, Andrey Bovykin ‡, Ian Horrocks †, Ulrike Sattler † and Robert Stevens †
Adaptive Collaboration Support for the Web Amy Soller Institute for Defense Analyses, Alexandria, Virginia, U.S.A. Jonathan Grady October 12, 2005.
Artificial Intelligence Dr. Paul Wagner Department of Computer Science University of Wisconsin – Eau Claire.
revised CmpE 583 Fall 2006Discussion: OWL- 1 CmpE 583- Web Semantics: Theory and Practice DISCUSSION: OWL Atilla ELÇİ Computer Engineering.
Page 1 WEB MINING by NINI P SURESH PROJECT CO-ORDINATOR Kavitha Murugeshan.
RuleML-2007, Orlando, Florida1 Towards Knowledge Extraction from Weblogs and Rule-based Semantic Querying Xi Bai, Jigui Sun, Haiyan Che, Jin.
Learning Analytics: A short introduction Learning Analytics & Machine Learning March 25, 2014 #LAK14.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
Mehdi Ghayoumi MSB rm 132 Ofc hr: Thur, a Machine Learning.
Page 1 Ming Ji Department of Computer Science University of Illinois at Urbana-Champaign.
AIAI, The University of Edinburgh Jessica Chen-Burger A Few Highlights from IJCAI 2001, Seattle, USA.
Kansas State University Department of Computing and Information Sciences Kansas State University KDD Lab ( cDNA.
updated CmpE 583 Fall 2008 Ontology Integration- 1 CmpE 583- Web Semantics: Theory and Practice ONTOLOGY INTEGRATION Atilla ELÇİ Computer.
Special Topics in Educational Data Mining HUDK5199 Spring 2013 March 25, 2012.
Towards A Context-Based Dialog Management Layer for Expert Systems Victor Hung, Avelino Gonzalez & Ronald DeMara Intelligent Systems Laboratory University.
Computing & Information Sciences Kansas State University Paper Review Guidelines KDD Lab Course Supplement William H. Hsu Kansas State University Department.
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Computational Linguistics. The Subject Computational Linguistics is a branch of linguistics that concerns with the statistical and rule-based natural.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Wednesday, January 24, 2001.
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 13 of 41 Monday, 20 September.
Exploit of Online Social Networks with Community-Based Graph Semi-Supervised Learning Mingzhen Mo and Irwin King Department of Computer Science and Engineering.
Computing & Information Sciences Kansas State University Data Sciences Summer Institute Multimodal Information Access and Synthesis Learning and Reasoning.
Computing & Information Sciences Kansas State University IJCAI HINA 2015: 3 rd Workshop on Heterogeneous Information Network Analysis KSU Laboratory for.
Practical Issues for Automated Categorization of Web Sites John M. Pierre Metacode Technologies, Inc. 139 Townsend Street San Francisco,
Mining the Biomedical Research Literature Ken Baclawski.
Computing & Information Sciences Kansas State University Paper Review Guidelines KDD Lab Course Supplement William H. Hsu Kansas State University Department.
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Friday, 14 November 2003 William.
Computer Science & Informatics Digital Ethnography, & Digital Humanities 1 st Big Data Event Kansas State University Monday 28 April 2014 William H. Hsu.
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying, Eric Hsueh-Chan Lu, Wen-Ning Kuo and Vincent S. Tseng Institute of Computer Science.
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
1 Scalable Probabilistic Databases with Factor Graphs and MCMC Michael Wick, Andrew McCallum, and Gerome Miklau VLDB 2010.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Mining Advisor-Advisee Relationships from Research Publication.
Chapter 6. Inference beyond the index 2007 년 1 월 30 일 부산대학교 인공지능연구실 김민호 Text : FINDING OUT ABOUT Page. 182 ~ 251.
Computing and Information Sciences Kansas State University ANNIE Conference November 10, 2008 Predicting Links and Link Change in Friends Networks: Supervised.
Concept-Based Analysis of Scientific Literature Chen-Tse Tsai, Gourab Kundu, Dan Roth UIUC.
1 Intelligent Information System Lab., Department of Computer and Information Science, Korea University Semantic Social Network Analysis Kyunglag Kwon.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
Warren Shen, Xin Li, AnHai Doan Database & AI Groups University of Illinois, Urbana Constraint-Based Entity Matching.
Introduction to Artificial Intelligence Heshaam Faili University of Tehran.
Learning Bayesian Networks for Complex Relational Data
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Data-Driven Educational Data Mining ---- the Progress of Project
School of Computer Science & Engineering
Artificial Intelligence and Lisp Lecture 13 Additional Topics in Artificial Intelligence LiU Course TDDC65 Autumn Semester,
Knowledge Management Systems
Data Warehousing and Data Mining
Presentation transcript:

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Structural Link Analysis from User Profiles and Friends Networks: A Feature Construction Approach William H. Hsu, Joseph Lancaster, Martin S. R. Paradesi, Tim Weninger Monday, 26 March 2007 Laboratory for Knowledge Discovery in Databases Kansas State University

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Link Analysis in Social Networks: The K-State Corpus

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Outline Background, Related Work and Rationale Technical Objective: Link Mining in Social Networks Methodology: Graph Feature Extraction Experimental Results: K-State LJMiner Corpus Continuing Work: Statistical Relational Models

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Problem Definition  Given: records of users of weblog or social network service  Discover  Features of entities: users, communities  Relationships: friendship, membership, moderatorship  Explanations and predictions for relationships Goals  Boost precision and recall of link existence prediction  Find relevant features Significance: Recommendations (Friendship, Membership) Problem Statement: Link Mining in Social Networks

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Related Work: Link Mining Getoor and Diehl (2005) - Graphical model representations of link structure Ketkar et al. (2005) - Data mining techniques vs graph-based representation Sarkar & Moore (2005) - Change in link structure across discrete time steps Popescul & Ungar (2003) - ER model to predict links Hill (2003), Bhattacharya & Getoor (2004) – Statistical Relational Learning to resolve identity uncertainty Resig et al. (2004) - Predicting IM online times using friends graph degree McCallum et al. (2005) - Inferring roles and topic categories based on link analysis

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Rationale Limitations of Current State of the Art  Do not take graph features into account  Limited ability to select, extract features Novel Contribution: Link Mining System  Extracts, computes features of network model  Towards dependent types for relational link mining Rationale  Desired functionality: infer new links from old  Evaluation: precision, recall for link existence

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Outline Background, Related Work and Rationale Technical Objective: Link Mining in Social Networks Methodology: Graph Feature Extraction Experimental Results: K-State LJMiner Corpus Continuing Work: Statistical Relational Models

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Technical Objectives: Link Mining in Social Networks TBD  TBD TBD

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) K-State Test Bed: LJMiner Corpus User Contact Info User Interest, Schools, Friends Community Membership Info

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) LiveJournal Topology [1]: Tools and Security Model LJMindMap.com © 2004 mcfnord © 2007 Denga, Inc.

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) LiveJournal Topology [2]: Definitions

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Outline Background, Related Work and Rationale Technical Objective: Link Mining in Social Networks Methodology: Graph Feature Extraction Experimental Results: K-State LJMiner Corpus Continuing Work: Statistical Relational Models

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Novel Contributions: Graph Feature Extraction TBD

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Graph Features [1]: Node, Pair, Link-Dependent uv u uv

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Graph Features [2]: Node and Pair Features in LJMiner Graph Features Interest-Related Features

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) LJCrawler TBD

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Outline Background, Related Work and Rationale Technical Objective: Link Mining in Social Networks Methodology: Graph Feature Extraction Experimental Results: K-State LJMiner Corpus Continuing Work: Statistical Relational Models

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Network Statistics: Graph Distance 1000 nodes 4000 nodes

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Interpretation of Results 941-node graph (Hsu et al., 2006): LJCrawler v1 output node graphs: LJCrawler v2 output

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Outline Background, Related Work and Rationale Technical Objective: Link Mining in Social Networks Methodology: Graph Feature Extraction Experimental Results: K-State LJMiner Corpus Continuing Work: Statistical Relational Models

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Results Establishing an Interdisciplinary Research Initiative  K-State / KU / UNL collaboration  Resources: Linguistic Data Consortium  NIST evaluations Involving End Users of Machine Translation  Document users  Machine learning, data mining, info extraction researchers Novel Applications  Social networks and collaborative recommendation  Gisting and beyond

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Information Extraction and Intelligent IR  Learning models for IE: ontologies  Latent semantic analysis Machine Learning  Natural language learning  Time series learning and understanding  Relational and first-order models Automated Reasoning  Probabilistic  Case-based and analogical Data Mining and Warehousing Grid Computing Continuing Work

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) References Knight, K. What’s New in Statistical Machine Translation. Invited Talk, International Joint Conference on Artificial Intelligence (IJCAI-2005), Edinburgh, UK, August, Knight, K. & Graehl, J. (2005). An Overview of Probabilistic Tree Transducers for Natural Language Processing. In Proceedings of CICLing 2005, p Chiang, D. A hierarchical phrase-based model for statistical machine translation. In Proceedings of the Conference of the Association for Computational Linguistics (ACL 2005), p. 263–270. Koehn, P., Och, F. J., & Marcu, D. (2003). Statistical Phrase-Based Translation. In Proceedings of HLT-NAACL 2003, the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, May 27 - June 1, 2003, Edmonton, CANADA.

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Acknowledgements K-State Lab for Knowledge Discovery in Databases  Vikas Bahirwani  Tejaswi Pydimarri  Andrew King Social Networks, Graph Theory, Graph Algorithms  Kirsten Hildrum (IBM T. J. Watson Labs)  Todd Easton (K-State, Industrial and Manufacturing Systems Engineering) Machine Learning  Dan Roth, Cinda Heeren, Jiawei Han (University of Illinois at Urbana-Champaign)  AnHai Doan (University of Wisconsin – Madison)

Computing & Information Sciences Kansas State University Boulder, Colorado First International Conference on Weblogs And Social Media (ICWSM-2007) Questions and Discussion