Chaitali Gupta, Madhusudhan Govindaraju

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

1 Ontolog OOR Use Case Review Todd Schneider 1 April 2010 (v 1.2)
BAH DAML Tools XML To DAML Query Relevance Assessor DAML XSLT Adapter.
OGSA Workshop, March 2002 MetaData Group. What metadata are needed? AIM: Systematic metadata framework for Grid Are these explicitly catered for in OGSA?
Natural Language Interfaces to Ontologies Danica Damljanović
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Hermes: News Personalization Using Semantic Web Technologies
Ewa Deelman, Integrating Existing Scientific Workflow Systems: The Kepler/Pegasus Example Nandita Mangal,
CHAITALI GUPTA, RAJDEEP BHOWMIK, MICHAEL R. HEAD, MADHUSUDHAN GOVINDARAJU, WEIYI MENG PRESENTED BY: SIDDHARTH PALANISWAMI A Query-based System for Automatic.
© Copyright 2012 STI INNSBRUCK Apache Stanbol.
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
CS652 Spring 2004 Summary. Course Objectives  Learn how to extract, structure, and integrate Web information  Learn what the Semantic Web is  Learn.
March 17, 2008SAC WT Hermes: a Semantic Web-Based News Decision Support System* Flavius Frasincar Erasmus University Rotterdam.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Samad Paydar Web Technology Laboratory Computer Engineering Department Ferdowsi University of Mashhad 1389/11/20 An Introduction to the Semantic Web.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
Špindlerův Mlýn, Czech Republic, SOFSEM Semantically-aided Data-aware Service Workflow Composition Ondrej Habala, Marek Paralič,
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Scientific Workflows Scientific workflows describe structured activities arising in scientific problem-solving. Conducting experiments involve complex.
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Metadata Agents and Semantic Mediation Mikhaila Burgess Cardiff University.
RuleML-2007, Orlando, Florida1 Towards Knowledge Extraction from Weblogs and Rule-based Semantic Querying Xi Bai, Jigui Sun, Haiyan Che, Jin.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
Development of Front End Tools for Semantic Grid Services Dr.S.Thamarai Selvi, Professor & Head, Dept. of Information Technology, Madras Institute of Technology,
Agent Model for Interaction with Semantic Web Services Ivo Mihailovic.
Semantic Web Applications GoodRelations BBC Artists BBC World Cup 2010 Website Emma Nherera.
Scalable Metadata Definition Frameworks Raymond Plante NCSA/NVO Toward an International Virtual Observatory How do we encourage a smooth evolution of metadata.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
RDF and triplestores CMSC 461 Michael Wilson. Reasoning  Relational databases allow us to reason about data that is organized in a specific way  Data.
Keyword Searching and Browsing in Databases using BANKS Seoyoung Ahn Mar 3, 2005 The University of Texas at Arlington.
Coastal Atlas Interoperability - Ontologies (Advanced topics that we did not get to in detail) Luis Bermudez Stephanie Watson Marine Metadata Interoperability.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
Quality views: capturing and exploiting the user perspective on data quality Paolo Missier, Suzanne Embury, Mark Greenwood School of Computer Science University.
Data Grid Research Group Dept. of Computer Science and Engineering The Ohio State University Columbus, Ohio 43210, USA David Chiu & Gagan Agrawal Enabling.
© Geodise Project, University of Southampton, Knowledge Management in Geodise Geodise Knowledge Management Team Barry Tao, Colin Puleston, Liming.
An OAI-Compliant Federated Physics Digital Library for the NSDL Department of Computer Science Old Dominion University, Norfolk, VA In Collaboration.
Tool for Ontology Paraphrasing, Querying and Visualization on the Semantic Web Project By Senthil Kumar K III MCA (SS)‏
Digital libraries and web- based information systems Mohsen Kamyar.
Task 1.2 Context: definition and specification. Leuven, 14 oktober 2004 Outline Introduction Work method Context definition Context specification  Overview.
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
OWL Representing Information Using the Web Ontology Language.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
1 Context-Aware Internet Sharma Chakravarthy UT Arlington December 19, 2008.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Scientific Workflow systems: Summary and Opportunities for SEEK and e-Science.
INRIA - Progress report DBGlobe meeting - Athens November 29 th, 2002.
Natural Language Interfaces to Ontologies Danica Damljanović
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Matthew Farrellee Computer Sciences Department University of Wisconsin-Madison Condor and Web Services.
Ontologies for the Semantic Web Prepared By: Tseliso Molukanele Rapelang Rabana Supervisor: Associate Professor Sonia Burman 20 July 2005.
Data Grid Research Group Dept. of Computer Science and Engineering The Ohio State University Columbus, Ohio 43210, USA David Chiu and Gagan Agrawal Enabling.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Developing GRID Applications GRACE Project
Mathematical Service Matching Using Description Logic and OWL Kamelia Asadzadeh Manjili
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
The AstroGrid-D Information Service Stellaris A central grid component to store, manage and transform metadata - and connect to the VO!
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
OPM/S: Semantic Engineering of Web Services
knowledge organization for a food secure world
Web Ontology Language for Service (OWL-S)
Knowledge Based Workflow Building Architecture
Context-Aware Internet
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Ontological Framework for Enabling Free-Form Search in Scientific Discovery Chaitali Gupta, Madhusudhan Govindaraju Grid Computing Research Laboratory SUNY Binghamton 5/5/2019 E-science Microsoft Workshop 2008: Semantics Birds of a Feather Session:

Motivation Most computer users today do not have to write programs most end users of Grid and scientific data sets should be shielded from low-level details Web Search engines search billions of web pages use Natural Language Processing (NLP) and Information Retrieval (IR) technologies return many links for any given search XML based technology and ontologies can be used to categorize and organize information machine-readable and understandable manner retrieve specific information from Grid/scientific services. E-science Microsoft Workshop 2008: Semantics Birds of a Feather Session: 5/5/2019

Project Vision Our vision is that Web semantics can be leveraged to build search engine like interfaces even for Grid/Scientific Application Meta-Data. abstract away the fundamental complexity of XML based services specifications and toolkits Add a search box on portal dashboards Automatically convert queries to Job description specification formats All these related schemes work well for scientists who have a working knowledge of the query system. Our work extends the features provided by these systems with a free-form query based interface that provides ease-of-use for domain scientists without requiring them to learn any specific XML technology or query language details. E-science Microsoft Workshop 2008: Semantics Birds of a Feather Session: 5/5/2019

Related Work MDS. WSRF compliant service to publish/retrieve resource information Condor ClassAds. Combines schema, data, and query in a simple but powerful query specification language. Condor Gangmatching. Overcomes bilateral matching limitations of the ClassAds. E-science Microsoft Workshop 2008: Semantics Birds of a Feather Session: 5/5/2019

Comparing with SPARQL SPARQL Query Query in the Proposed Framework PREFIX dc:<http://example.org/dc/element/1.1/> PREFIX ns:<http:/example.org/ns#> SELECT ?machine-name    ?CPU WHERE { ?x  ns:cpu ?cpu. FILTER (?cpu   >   2.0). ?x dc:machine-name ?machine-name. } “All machine names with CPU speed greater than 2.0 GHz” E-science Microsoft Workshop 2008: Semantics Birds of a Feather Session: 5/5/2019

Scope of Free-Form Queries The problem of processing and acting upon arbitrary English is an extremely challenging actively addressed in the AI community Use many techniques from NLP and semantic web Scope of our work is therefore limited cannot accept any free-form query designed to accept a limited form of English with a vocabulary taken from the ontology. E-science Microsoft Workshop 2008: Semantics Birds of a Feather Session: 5/5/2019

Example queries for New York State Grid (NYSGrid) List all sites of NYSGrid All Sites of NYSGrid with Xeon processors Processor configuration of nodes at Binghamton site of NYSGrid All machine names in NYSGrid with CPU speed greater than 2.0GHz speed Status of job ID 117 running on NYSGrid Names of 16 free nodes on the NYSGrid with at least 4GB of memory List all nodes of NYSGrid having CPU speed greater than 1Ghz and less than 4 Ghz E-science Microsoft Workshop 2008: Semantics Birds of a Feather Session: 5/5/2019

Example ontology model E-science Microsoft Workshop 2008: Semantics Birds of a Feather Session: 5/5/2019

System Components WSDL Processor User Query Interface Query Processor Match Processor Ontology Matcher Dictionary Matcher direct, stripped matching, hypernyms, hyponym Lexicon how people use words etc. Relevance Checker Glossary, input and output parameters of the Web service E-science Microsoft Workshop 2008: Semantics Birds of a Feather Session: 5/5/2019

Example query that lights up the model The Ontology Matcher retrieves the ontologies from the ontology repository and matches them with the user query. Ontologies built in OWL for storing the vocabularies concepts include “CPU”, “memory”, “storage”, “job”, etc. use Jena to process OWL models/statements <subject, object, predicate> E-science Microsoft Workshop 2008: Semantics Birds of a Feather Session: 5/5/2019

System Components Queries that hit Ontology Matcher have an average of 95% - 96% better performance benefit than those requiring both Ontology and Dictionary Matcher. E-science Microsoft Workshop 2008: Semantics Birds of a Feather Session: 5/5/2019

Performance of System Components Execution time taken by the major components E-science Microsoft Workshop 2008: Semantics Birds of a Feather Session: 5/5/2019

System Components Recall and Precision increases when domain dependent ontologies are considered. E-science Microsoft Workshop 2008: Semantics Birds of a Feather Session: 5/5/2019

Research Challenges Design algorithms to automatically infer the context of user queries and map them to an appropriate set of Grid and scientific services. Automatically extend and update domain knowledge using Semantic Web techniques and WordNet. Build a feedback loop for cases that don’t work Enable construction of simple workflows multiple Grid services may be needed for a query merging results from different services E-science Microsoft Workshop 2008: Semantics Birds of a Feather Session: 5/5/2019