Ontology-Based Free-Form Query Processing for the Semantic Web Mark Vickers Brigham Young University MS Thesis Defense Supported by:

Slides:



Advertisements
Similar presentations
CILC2011 A framework for structured knowledge extraction and representation from natural language via deep sentence analysis Stefania Costantini Niva Florio.
Advertisements

David W. Embley Brigham Young University Provo, Utah, USA WoK: A Web of Knowledge.
Ontologies for multilingual extraction Deryle W. Lonsdale David W. Embley Stephen W. Liddle Supported by the.
Semiautomatic Generation of Data-Extraction Ontologies Master’s Thesis Proposal Yihong Ding.
1 Ontology Based Extraction of RDF Data from the World Wide Web Tim Chartrand A Thesis Proposal Research Supported By NSF.
Ontology-Based Free-Form Query Processing for the Semantic Web by Mark Vickers Supported by:
Crosslingual Ontology-Based Document Retrieval (Search) in an eLearning Environment RANLP, Borovets, 2007 Eelco Mossel University of Hamburg.
Searching the Semantic Web. Introduction  Research Focuses: IE Ontologies (creating, languages, merging, storing, querying)  Next Sep: Using the Semantic.
Domain-Independent Data Extraction: Person Names Carl Christensen and Deryle Lonsdale Brigham Young University
HyKSS: A Multiple Ontology Approach to Hybrid Search Andrew Zitzelberger Brigham Young University MS Thesis Proposal.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
CS652 Spring 2004 Summary. Course Objectives  Learn how to extract, structure, and integrate Web information  Learn what the Semantic Web is  Learn.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
OWL-AA: Enriching OWL with Instance Recognition Semantics for Automated Semantic Annotation 2006 Spring Research Conference Yihong Ding.
Data Frames Version 3 Proposal. Data Frames Version 2 Year matches [2] constant { extract "\d{2}"; context "([^\$\d]|^)\d{2}[^,\dkK]"; } 0.5, { extract.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
Ontology-Based Free-Form Query Processing for the Semantic Web Thesis proposal by Mark Vickers.
Automatic Extraction of Information Behind Web Forms Based on Application Ontologies Automatic Extraction of Information Behind Web Forms Based on Application.
Resolving Under Constrained and Over Constrained Systems of Conjunctive Constraints for Service Requests Muhammed J. Al-Muhammed David W. Embley Brigham.
Thesis Defense Mini-Ontology GeneratOr (MOGO) Mini-Ontology Generation from Canonicalized Tables Stephen Lynn Data Extraction Research Group Department.
From OSM-L to JAVA Cui Tao Yihong Ding. Overview of OSM.
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
By ANDREW ZITZELBERGER A Framework for Extraction Ontology Based Information Management.
1 Extracting RDF Data from Unstructured Sources Based on an RDF Target Schema Tim Chartrand Research Supported By NSF.
Interfaces for Querying Collections. Information Retrieval Activities Selecting a collection –Lists, overviews, wizards, automatic selection Submitting.
Ontology-Based Free-Form Query Processing for the Semantic Web Mark Vickers Brigham Young University MS Thesis Defense Supported by:
Recognition and Satisfaction of Constraints in Free-Form Task Specification Muhammed Al-Muhammed.
Semantic Web Queries by Mark Vickers Funded by NSF.
1 Ontology Based Extraction of RDF Data from the World Wide Web Tim Chartrand Masters Thesis Research Supported By NSF.
Generating Data-Extraction Ontologies By Example Joe Zhou Data Extraction Group Brigham Young University.
Data Frame Augmentation of Free Form Queries for Constraint Based Document Filtering Andrew Zitzelberger.
1 Ontology Generation Based on a User-Specified Ontology Seed Cui Tao Data Extraction Research Group Department of Computer Science Brigham Young University.
1 Cui Tao PhD Dissertation Defense Ontology Generation, Information Harvesting and Semantic Annotation For Machine-Generated Web Pages.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Automatic Creation and Simplified Querying of Semantic Web Content An Approach Based on Information-Extraction Ontologies Yihong Ding, David W. Embley,
Thesis Proposal Mini-Ontology GeneratOr (MOGO) Mini-Ontology Generation from Canonicalized Tables Stephen Lynn Data Extraction Research Group Department.
Cross-Language Hybrid Keyword and Semantic Search David W. Embley, Stephen W. Liddle, Deryle W. Lonsdale, Joseph S. Park, Andrew Zitzelberger Brigham Young.
Personalized Information Retrieval in Context David Vallet Universidad Autónoma de Madrid, Escuela Politécnica Superior,Spain.
CROSSMARC Web Pages Collection: Crawling and Spidering Components Vangelis Karkaletsis Institute of Informatics & Telecommunications NCSR “Demokritos”
NLP And The Semantic Web Dainis Kiusals COMS E6125 Spring 2010.
Chapter 13 Query Processing Melissa Jamili CS 157B November 11, 2004.
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
EU Project proposal. Andrei S. Lopatenko 1 EU Project Proposal CERIF-SW Andrei S. Lopatenko Vienna University of Technology
An Aspect of the NSF CDI InitiativeNSF CDI: Cyber-Enabled Discovery and Innovation.
P2P Concept Search Fausto Giunchiglia Uladzimir Kharkevich S.R.H Noori April 21st, 2009, Madrid, Spain.
Majid Sazvar Knowledge Engineering Research Group Ferdowsi University of Mashhad Semantic Web Reasoning.
OWL Representing Information Using the Web Ontology Language.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Information Retrieval
An Aspect of the NSF CDI Initiative CDI: Cyber-Enabled Discovery and Innovation.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Concepts and Realization of a Diagram Editor Generator Based on Hypergraph Transformation Author: Mark Minas Presenter: Song Gu.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Information Retrieval Quality of a Search Engine.
1 Storing and Maintaining Semistructured Data Efficiently in an Object- Relational Database Mo Yuanying and Ling Tok Wang.
Semantic Wiki: Automating the Read, Write, and Reporting functions Chuck Rehberg, Semantic Insights.
David W. Embley Brigham Young University Provo, Utah, USA.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
SEMANTIC WEB Presented by- Farhana Yasmin – MD.Raihanul Islam – Nohore Jannat –
Food and Agriculture Organization of the UN GILW Library and Documentation Systems Division Food, Nutrition and Agriculture Ontology Portal.
Object-Oriented Software Engineering Using UML, Patterns, and Java,
Cross-language Information Retrieval
David W. Embley Brigham Young University Provo, Utah, USA
Zachary Cleaver Semantic Web.
Introduction to Information Retrieval
An Electronic Borrowing System Using REST
CS246: Information Retrieval
Presentation transcript:

Ontology-Based Free-Form Query Processing for the Semantic Web Mark Vickers Brigham Young University MS Thesis Defense Supported by:

2 Presentation Overview Web Queries Web Queries Explanation of AskOntos Explanation of AskOntos Demo Demo Evaluation Evaluation Future Work and Conclusion Future Work and Conclusion

3 Web Queries: Challenges Example: Searching for a car Cannot specify constraints Cannot specify constraints Documents returned (usually too many) Documents returned (usually too many) Takes time to read through documents Takes time to read through documents Determine relevance Determine relevance Find information (price, year, etc.) Find information (price, year, etc.)

4 Web Queries: Opportunities Semantic web Semantic web Proposed ontology-based framework for making information machine-readable Proposed ontology-based framework for making information machine-readable Uses markup languages to identify information Uses markup languages to identify information “[A] search program can look for only those pages that refer to a precise concept…” “[A] search program can look for only those pages that refer to a precise concept…” -Tim Berners-Lee How should semantic web be searched? How should semantic web be searched?

5 Solution: AskOntos – a Query System for the Semantic Web Allows free-form queries over semantically annotated pages Allows free-form queries over semantically annotated pages Processes queries using information extraction Processes queries using information extraction Returns tables of extracted values Returns tables of extracted values

6 AskOntos Overview

7 Extraction Ontologies Object sets Relationship sets Participation constraints Lexical Non-lexical Primary object set Aggregation Generalization/Specialization

8 Extraction Ontologies Value Expression: [$]\d{1,2}\,\d{3} | … Key Word Phrase Left Context: $ Data Frame: Internal Representation: float Value Phrase Key Word Expression: ([Pp]rice)|([Cc]ost)| … Operation Phrase Operator: > Expression: (more\s*than)|(more\s*costly)|…

9 Annotating Web Pages

10 Annotating Web Pages

11 Step 1. Parse Query “Find me the and of all s – I want a ”pricemileageredNissan1996or newer >= Operator

12 Step 2. Find Related Ontology Similarity value: 5 Similarity value: 2 “Find me the price and mileage of all red Nissans – I want a 1996 or newer”

13 Conjunctive and aggregate queries run over selected ontology’s extracted values Conjunctive and aggregate queries run over selected ontology’s extracted values Value-phrase-matching words determine conditions Value-phrase-matching words determine conditions Conditions: Conditions: Color = “red” Color = “red” Make = “Nissan” Make = “Nissan” Year >= 1996 Year >= 1996 >= Operator Step 3. Formulate XQuery Expression

14 For Let Where Return Step 3. Formulate XQuery Expression

15 Step 4. Run XQuery Expression Over Ontology’s Extracted Data Uses Qexo 1.7, GNU’s XQuery engine for Java Uses Qexo 1.7, GNU’s XQuery engine for Java Orders results according to number of values Orders results according to number of values

16Demo

17 Evaluation of AskOntos Success Measure: ability to translate free- form queries into formal queries Success Measure: ability to translate free- form queries into formal queries Extraction ontologies : car ads, house ads, countries, movies, and diamond ads Extraction ontologies : car ads, house ads, countries, movies, and diamond ads 3 rounds of testing 3 rounds of testing 50 queries each (gathered from other CS students) 50 queries each (gathered from other CS students) 1 st round discarded due to queries 1 st round discarded due to queries Minor improvements on system between rounds Minor improvements on system between rounds

18 Query Translation Metrics “Find me the price and mileage of all red Nissans – I want a 1996 or newer.” Human conversion for $doc in document("file:///.../Car.OWL")/rdf:RDF for $Record in $doc/owl:Thing … where($Color="red" or empty($Color)) and ($Make="Nissan" or empty($Make)) and ($Year="1996" or empty($Year)) return {$Price} {$Color} {$Make} {$Year} Automated conversion PrecisionRecall Return-Clause Names 100%80% Conditions66%66% Return-Clause Names: {Price,Color, Make, Year} Conditions: {(Color,=,“red”), (Make,=,“Nissan”), (Year,=,“1996”)} Return-Clause Names: {Price, Mileage,Color, Make, Year} Conditions: {(Color,=,“red”), (Make,=,“Nissan”), (Year,>=,“1996”)}

19Results

20 Result Analysis Common reasons for errors: 1. Word not in lexicon: “5 Bedrooms, 3 Bath, study, game room, 2 car garage, and < $250,000”

21 Result Analysis “Which countries use the euro?” 2. Mistakes in regular expressions

22 Result Analysis 3. Not enough context: “What are the models from 2005”

23 Conclusion/Contributions AskOntos AskOntos Is a free-form query system for the semantic web Is a free-form query system for the semantic web Applies information extraction for query processing Applies information extraction for query processing Answers questions with extracted data values Answers questions with extracted data values Contributions Contributions Web queries that use semantic annotations Web queries that use semantic annotations Web queries returning answers from extracted data Web queries returning answers from extracted data Processing free-form queries using ontologies Processing free-form queries using ontologies

24 Future Work Disjunction and negation Disjunction and negation Fuzzy queries Fuzzy queries Spellchecker Spellchecker