01/06/15Sergey Chernov 1 Extracting Semantic Relationships between Wikipedia Categories By Sergey Chernov, Tereza Iofciu, Wolfgang Nejdl, Xuan Zhou, Michal.

Slides:



Advertisements
Similar presentations
Historical time series Paris, October Quarterly Mutual Fund Sales &AUM Report | September 2008 | Page 2 Introduction (1) The research department.
Advertisements

What is the capital of the UK? London What is the capital of France? Paris.
Measurement, Evaluation, Assessment and Statistics
TU/e technische universiteit eindhoven Hera: Development of Semantic Web Information Systems Geert-Jan Houben Peter Barna Flavius Frasincar Richard Vdovjak.
Study Project The Countries and Capitals of the European Union.
Site Level Noise Removal for Search Engines André Luiz da Costa Carvalho Federal University of Amazonas, Brazil Paul-Alexandru Chirita L3S and University.
Social Research Methods
1 Entity Ranking Using Wikipedia as a Pivot (CIKM 10’) Rianne Kaptein, Pavel Serdyukov, Arjen de Vries, Jaap Kamps 2010/12/14 Yu-wen,Hsu.
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
Predicting the Semantic Orientation of Adjective Vasileios Hatzivassiloglou and Kathleen R. McKeown Presented By Yash Satsangi.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
14 – day tour of Europe FROM UKRAINE TO … SEE 14 COUNTRIES OF EUROPE IN 14 DAYS! LONDON AND …
European Countries and their capital cities. Zagreb is the capital city of Croatia.
Golder and Huberman, 2006 Journal of Information Science Usage Patterns of Collaborative Tagging System.
Copyright © 2008 Intel Corporation. All rights reserved. Intel, the Intel logo, Intel Education Initiative, and the Intel Teach Program are trademarks.
Building Your Individualized Learning Plan (ILP).
+ The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 1: Exploring Data Introduction Data Analysis: Making Sense of Data.
From Multi-Domain Statistical Data to Complex Decisions and Actions: A Linked Data Based Approach Marta Sabou, Irem Önder, Adrian M.P. Brasoveanu.
Intelligent Database Systems Lab Presenter : WU, MIN-CONG Authors : Jorge Villalon and Rafael A. Calvo 2011, EST Concept Maps as Cognitive Visualizations.
© Paul Buitelaar – November 2007, Busan, South-Korea Evaluating Ontology Search Towards Benchmarking in Ontology Search Paul Buitelaar, Thomas.
Copyright © 2009 Intel Corporation. All rights reserved. Intel, the Intel logo, Intel Education Initiative, and the Intel Teach Program are trademarks.
A Panoramic Approach to Integrated Evaluation of Ontologies in the Semantic Web S. Dasgupta, D. Dinakarpandian, Y. Lee School of Computing and Engineering.
Portfolio v1.0 Products. Benefits Scalable Fast Full interface via web services Fully integrated with Microsoft SharePoint Easy navigation Competence.
A Future for the Library Catalogue T. Hickey ACRL/DVC Bryn Mawr 3 November 2006.
Intelligent Database Systems Lab Presenter : YAN-SHOU SIE Authors Mohamed Ali Hadj Taieb *, Mohamed Ben Aouicha, Abdelmajid Ben Hamadou KBS Computing.
Mining fuzzy domain ontology based on concept Vector from wikipedia category network.
Understanding User’s Query Intent with Wikipedia G 여 승 후.
Directions Directions:  Each group is required to create one PowerPoint presentation based on an assigned reading.  The ideas that must be covered in.
Named Entity Disambiguation on an Ontology Enriched by Wikipedia Hien Thanh Nguyen 1, Tru Hoang Cao 2 1 Ton Duc Thang University, Vietnam 2 Ho Chi Minh.
A Knowledge-Based Search Engine Powered by Wikipedia David Milne, Ian H. Witten, David M. Nichols (CIKM 2007)
Europe – The Continent Learning Objectives:  To explore the continent of Europe  To develop your knowledge of Europe’s physical and Human Features Key.
+ The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 1: Exploring Data Introduction Data Analysis: Making Sense of Data.
RCDL 2007, Pereslavl-Zalessky, Oct 2007 Converting Desktop into a Personal Activity Dataset Sergey Chernov, Enrico Minack, and Pavel Serdyukov.
Internal assessment, Results, Discussion, and Format By Mr Daniel Hansson.
2010 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology (WI-IAT) Hierarchical Cost-sensitive Web Resource Acquisition.
Block-level Link Analysis Presented by Lan Nie 11/08/2005, Lehigh University.
Knowledge Support for Modeling and Simulation Michal Ševčenko Czech Technical University in Prague.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 1 Exploring Data Introduction Data Analysis:
23. Juli deskWeb2.0: Combining Desktop and Social Search Sergej Zerr, Elena Demidova, Sergej Chernov L3S Research Center Hannover, Germany
 Second meeting of the scientific working group in Berlin on 19 March  Preliminary search results for 12 countries or regions were presented and discussed,
On Stability, Clarity, and Co-occurrence of Self-Tagging Aixin Sun and Anwitaman Datta Nanyang Technological University Singapore.
Chapter 1: Exploring Data
Analyzing, synthesizing & writing
Countries and Capitals of Western Europe
Shaun A. Bond University of Cincinnati
Chapter 1: Exploring Data
SEI 300 Education on your terms/tutorialrank.com.
SEI 300 Become Exceptional/ newtonhelp.com. SEI 300 Assignment Academic Vocabulary Assessments (14 Slides) For more course tutorials visit
Chapter 1: Exploring Data
DDI-RDF Discovery Vocabulary _ Use Cases and Vocabularies
Introduction to HTML Simple facts yet crucial to beginning of study in fundamentals of web page design!
DBpedia 2014 Liang Zheng 9.22.
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
“The need for Semantic Desktop Dataset” L3S and University of Hannover, Germany Sergey Chernov, Tereza Iofciu, Wolfgang Nejdl, Xuan Zhou (chernov, iofciu,
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
WEST EUROPE MAP REVIEW.
GNP and per capita GNP Top of the world!?.
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Presentation transcript:

01/06/15Sergey Chernov 1 Extracting Semantic Relationships between Wikipedia Categories By Sergey Chernov, Tereza Iofciu, Wolfgang Nejdl, Xuan Zhou, Michal Kopycki, Przemyslaw Rys

01/06/15Sergey Chernov 2 Preliminaries  WIKIPEDIA: largest knowledge sharing system  Many pages assigned to CATEGORIES  All links are NAVIGATIONAL  Can we extract SEMANTIC links? MOTIVATION

01/06/15Sergey Chernov 3 Wikipedia Categories Example MOTIVATION

01/06/15Sergey Chernov 4 Possible benefits  Semi-structured queries “find Countries which had Democratic Non-Violent Revolutions” rephrased as “find page from category Countries which is connected to some page in Non-Violent Revolutions”  Hints for authors “you edit page from category Countries, do you want to add a link to page in category Capital?”  Raw data for manual semantic markup MOTIVATION

01/06/15Sergey Chernov 5 Countries Heuristics Experiments Denmark Austria Capitals Berlin Stockholm Vienna Germany France Paris  Number of links NL = 3  Connectivity Ratio CR = 3/4 = 0.75

01/06/15Sergey Chernov 6 Dataset  INEX 2006 collection  Sample category rankings Experiments

01/06/15Sergey Chernov 7 Manual assessment methodology  Semantic Connection Strength (SCS) Measure:  2 = strong semantic relationship,  1 = average semantic relationship,  0 = weak or no semantic relationship.  Instruction for Assessors  “category A is strongly related to category B (value 2) if you believe that every page in A should conceptually have at least one semantic link to B;”  “A and B are averagely related (value 1), if you believe 50% of pages in A should have semantic links to B;”  “otherwise, A and B are weakly related (value 0).”

01/06/15Sergey Chernov 8 Experiments with Number of Links Average semantic connections strength for 100 sample categories, extracted using Number of Links. Experiments

01/06/15Sergey Chernov 9 Experiments with Connectivity Ratio Average semantic connections strength for 100 sample categories, extracted using Connectivity Ratio. Experiments

01/06/15Sergey Chernov 10 General Results and Conclusions  Result is skewed toward Countries category  Connectivity Ratio is a better measure than Number of Links  We have observed that inlinks have better performance than outlinks. Summary

01/06/15Sergey Chernov 11 Future Steps More manual exploration, look for additional heuristics Consider more categories SCS composed of  Is this a “part of” relation? W1  Is this a “is a” relation? W2  Is this a “synonym” relation? W3  Is this a “antonym” relation? W4  It is related in a different way? Which one? W5 Summary

01/06/15Sergey Chernov 12 Thank You!