Hermes: News Personalization Using Semantic Web Technologies

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Multilinguality & Semantic Search Eelco Mossel (University of Hamburg) Review Meeting, January 2008, Zürich.
S-Match: an Algorithm and an Implementation of Semantic Matching Pavel Shvaiko 1 st European Semantic Web Symposium, 11 May 2004, Crete, Greece paper with.
Learning Semantic Information Extraction Rules from News The Dutch-Belgian Database Day 2013 (DBDBD 2013) Frederik Hogenboom Erasmus.
Semantic Access to Data from the Web Raquel Trillo *, Laura Po +, Sergio Ilarri *, Sonia Bergamaschi + and E. Mena * 1st International Workshop on Interoperability.
Semantic News Recommendation Using WordNet and Bing Similarities 28th Symposium On Applied Computing 2013 (SAC 2013) March 21, 2013 Michel Capelle
A Linguistic Approach for Semantic Web Service Discovery International Symposium on Management Intelligent Systems 2012 (IS-MiS 2012) July 13, 2012 Jordy.
CHAITALI GUPTA, RAJDEEP BHOWMIK, MICHAEL R. HEAD, MADHUSUDHAN GOVINDARAJU, WEIYI MENG PRESENTED BY: SIDDHARTH PALANISWAMI A Query-based System for Automatic.
Information Retrieval in Practice
Xyleme A Dynamic Warehouse for XML Data of the Web.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
Mobile Web Search Personalization Kapil Goenka. Outline Introduction & Background Methodology Evaluation Future Work Conclusion.
Sentiment Lexicon Creation from Lexical Resources BIS 2011 Bas Heerschop Erasmus School of Economics Erasmus University Rotterdam
March 17, 2008SAC WT Hermes: a Semantic Web-Based News Decision Support System* Flavius Frasincar Erasmus University Rotterdam.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
How can Computer Science contribute to Research Publishing?
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Detecting Economic Events Using a Semantics-Based Pipeline 22nd International Conference on Database and Expert Systems Applications (DEXA 2011) September.
Ontology-Based Free-Form Query Processing for the Semantic Web Mark Vickers Brigham Young University MS Thesis Defense Supported by:
News Personalization using the CF-IDF Semantic Recommender International Conference on Web Intelligence, Mining, and Semantics (WIMS 2011) May 25, 2011.
Memoplex Browser: Searching and Browsing in Semantic Networks CPSC 533C - Project Update Yoel Lanir.
Overview of Search Engines
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Word Sense Disambiguation for Automatic Taxonomy Construction from Text-Based Web Corpora 12th International Conference on Web Information System Engineering.
Result presentation. Search Interface Input and output functionality – helping the user to formulate complex queries – presenting the results in an intelligent.
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Faculty of Informatics and Information Technologies Slovak University of Technology Personalized Navigation in the Semantic Web Michal Tvarožek Mentor:
Erasmus University Rotterdam Introduction Nowadays, emerging news on economic events such as acquisitions has a substantial impact on the financial markets.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
Erasmus University Rotterdam Introduction With the vast amount of information available on the Web, there is an increasing need to structure Web data in.
TOWL Time-determined ontology-based information system for real-time stock market analysis Econometric Institute Erasmus School of Economics Erasmus University.
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
RuleML-2007, Orlando, Florida1 Towards Knowledge Extraction from Weblogs and Rule-based Semantic Querying Xi Bai, Jigui Sun, Haiyan Che, Jin.
Survey of Semantic Annotation Platforms
Copyright © 2013 Curt Hill The Zachman Framework What is it all about?
Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
NLP And The Semantic Web Dainis Kiusals COMS E6125 Spring 2010.
1 A Semantic Web-Based Approach for Personalizing News Flavius Frasincar Erasmus University Rotterdam * Joint work with Kim Schouten,
-1- Philipp Heim, Thomas Ertl, Jürgen Ziegler Facet Graphs: Complex Semantic Querying Made Easy Philipp Heim 1, Thomas Ertl 1 and Jürgen Ziegler 2 1 Visualization.
21/11/2002 The Integration of Lexical Knowledge and External Resources for QA Hui YANG, Tat-Seng Chua Pris, School of Computing.
Alexey Kolosoff, Michael Bogatyrev 1 Tula State University Faculty of Cybernetics Laboratory of Information Systems.
Faculty of Informatics and Information Technologies Slovak University of Technology Personalized Navigation in the Semantic Web Michal Tvarožek Mentor:
Semantics-Based News Recommendation with SF-IDF+ International Conference on Web Intelligence, Mining, and Semantics (WIMS 2013) June 13, 2013 Marnix Moerland.
How Do We Find Information?. Key Questions  What are we looking for?  How do we find it?  Why is it difficult? “A prudent question is one-half of wisdom”
Tool for Ontology Paraphrasing, Querying and Visualization on the Semantic Web Project By Senthil Kumar K III MCA (SS)‏
VLDB2005 CMS-ToPSS: Efficient Dissemination of RSS Documents Milenko Petrovic Haifeng Liu Hans-Arno Jacobsen University of Toronto.
Working with Ontologies Introduction to DOGMA and related research.
Lexico-semantic Patterns for Information Extraction from Text The International Conference on Operations Research 2013 (OR 2013) Frederik Hogenboom
© Copyright 2008 STI INNSBRUCK A Semantic Model of Selective Dissemination of Information for Digital Libraries.
Personalized Recommendation of Related Content Based on Automatic Metadata Extraction Andreas Nauerz 1, Fedor Bakalov 2, Birgitta.
Acquisition of Categorized Named Entities for Web Search Marius Pasca Google Inc. from Conference on Information and Knowledge Management (CIKM) ’04.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Semantics-Based News Recommendation International Conference on Web Intelligence, Mining, and Semantics (WIMS 2012) June 14, 2012 Michel Capelle
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Of 24 lecture 11: ontology – mediation, merging & aligning.
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Information Retrieval in Practice
Search Engine Architecture
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
THE TOWL ONTOLOGY LANGUAGE
Cross-language Information Retrieval
Associative Query Answering via Query Feature Similarity
Information Retrieval
COMPASS: A Geospatial Knowledge Infrastructure Managed with Ontologies
CS246: Information Retrieval
Chaitali Gupta, Madhusudhan Govindaraju
Presentation transcript:

Hermes: News Personalization Using Semantic Web Technologies Flavius Frasincar frasincar@ese.eur.nl Erasmus University Rotterdam

Contents Motivation Hermes Framework: Hermes News Portal: Conclusions News Classification News Querying Results Presentation Hermes News Portal: An example Conclusions Future Work

Motivation Large quantity of news on the Web: Difficult to find the ones of interest News messages have a strong impact on stock prices Limited annotation of RSS feeds: Broad categories (business, cars, entertainment, etc.) Google finance shows direct news which pertain to a certain portfolio: Indirect news (competitors of Google like Microsoft) are not presented Not possible to ask time-related queries about news

Hermes Framework Input: Output: Three phases: News items from RSS feeds Domain ontology linked to a semantic lexicon (e.g., WordNet) User query Output: News items as answers to the user query Three phases: 1. News Classification: Relate news items to ontology concepts 2. News Querying Allow the user to express his concepts of interest and the temporal constraints 3. Results Presentation Present the news items that match user’s query

Hermes Architecture

1. News Classification Concept defined in the ontology (class or individual) Multiple lexical representations for the same concept: Ontology synonyms (e.g., New York → New York, Big Apple) Semantic lexicon synonyms (e.g., buy → acquire) Concepts without subclasses or instances: Semantic lexicon hyponyms (e.g., company → dot-com) Lookup ontology concepts into news items A longer match supersedes a shorter match (European Central Bank supersedes European)

1. News Classification 1.1 Tokenization (words, punctuation signs) 1.2 Sentence splitting (sentences) 1.3 Part-of-speech tagging (e.g., noun, verb, adj., etc.) 1.4 Morphological analysis (e.g., lemma “read” for “reading” as a verb) 1.5 Word sense disambiguation (e.g., Structural Semantic Interconnection (SSI) based on word context) 1.6 Adding “hits” between news items and the domain ontology

2. News Querying 2.1 Query Formulation Present the domain knowledge as directed labeled multi-graph: with the additional constraint that arcs between two nodes are not allowed to share the same label (called conceptual graph) User selects the concepts of interest in the conceptual graph (e.g., Google) User is able to add to its selection concepts related to the concepts of interests using specified relations (e.g., hasCompetitors: Microsoft, eBay, and Yahoo) The selected concepts are presented in a separate graph (called search graph)

2. News Querying News are time stamped User is able to specify that only news in a certain time interval should be retrieved Time constraints: Last hour Last day Last year [2007-03-01T00:00:00.000+00:01, 2007-05-31T00:00:00.000+00:01 ] [Future: order constraints (e.g., order by time)]

2. News Querying 2.2 Query Execution Generate the query in a semantic query language: Map concepts of interest to query restrictions (current: disjunctive queries) Map temporal constraints to query restrictions Execute the semantic query The order of the relevant news items is not important here

3. Results Presentation 3.1 News Sorting Return news items that match a query Sort the news items based on their relevance degree to the query The relevance degree is determined empirically: based on a weighted sum of the number of hits in title (higher weight) and body (lower weight) of the news item News items that have the same relevance degree are sorted in descending timestamp order

3. Results Presentation 3.2 News Presentation Present the concepts involved in the query Per each news items show a summary: Title Source Date Few beginning lines from the news item ([Future: snippet]) Emphasize the hits (found concepts from the ontology) in the retrieved news items Show the icons of the most important query concept found in a news item: based on a weighted sum of the number of hits in title (higher weight) and body (lower weight) of a concept in a news item

Hermes News Portal Hermes News Portal (HNP) is an implementation of the Hermes framework Implementation language: Java Ontology represention language: OWL (e.g., cardinality restrictions, inverses, etc.) Semantic lexicon: WordNet Graph visualization: Prefuse (OWL2Prefuse) Query language: SPARQL SPARQL extended with custom time functions (e.g., currentDate(), currentTime(), etc.) Natural language processing: GATE

An Example Query: Which are the news items about Google or one of its competitors from the past six months?

1. News Classification – Import News

1. News Classification – Conceptual Graph

2. News Querying- Search Graph Individuals Classes Selected concepts Concepts related to the selected node Concepts from keyword search

2. News Querying - Search Graph

2. News Querying- SPARQL SPARQL query: PREFIX hermes: <http://hermes-news.org/news.owl#> SELECT ?title WHERE { ?news hermes:title ?title . ?news hermes:time ?date . ?news hermes:relation ?relation . ?news hermes:relatedTo ?concept . FILTER ( ?concept hermes:relatedTo hermes:Google || ?concept hermes:relatedTo hermes:Micosoft || ?concept hermes:relatedTo hermes:Ebay || ?concept hermes:relatedTo hermes:Yahoo ) ( ?date > "2009-02-01T00:00:00.000+00:01" && ?date < "2009-07-31T00:00:00.000+00:01" }

2. News Querying- tSPARQL Custom time functions: Function name Output type currentDate() xsd:date currentTime() xsd:time now() xsd:dateTime dateTime-add(xsd:dateTime A, xsd:duration B) dateTime-substract(xsd:dateTime A, xsd:duration B)

2. News Querying- tSPARQL tSPARQL query: PREFIX hermes: <http://hermes-news.org/news.owl#> SELECT ?title WHERE { ?news hermes:title ?title . ?news hermes:time ?date . ?news hermes:relation ?relation . ?news hermes:relatedTo ?concept . FILTER ( ?concept hermes:relatedTo hermes:Google || ?concept hermes:relatedTo hermes:Micosoft || ?concept hermes:relatedTo hermes:Ebay || ?concept hermes:relatedTo hermes:Yahoo ) ( ?date > hermes:dateTime-substract(hermes:now(), P0Y6M) && ?date < hermes:now() }

3. Results Presentation

Conclusions Hermes Framework: presents news items that match the user interests Hermes Framework: News Classification News Querying Results Presentation Hermes News Portal (HNP): an implementation of the Hermes framework HNP based on: WordNet semantic lexicon, OWL ontology, (extended) SPARQL queries, Prefuse visualization, GATE natural language processing

Future Work Word Sense Disambiguation: Ontology updates: GAMBL (supervised learning algorithm) Ontology updates: Learning from news items Check if the extracted information obeys the ontology axioms: Faulty extraction Ontology axioms update Simplify the query interface: Allow users to ask English queries based on a limited vocabulary Evaluate the tool outside the university lab