Semantics-Empowered Text Exploration for Knowledge Discovery Delroy Cameron, Pablo N. Mendes, Amit P. Sheth Knowledge Enabled Information and Services.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Kino : Making Semantic Annotations Easier Ajith Ranabahu #, Priti Parikh #, Maryam Panahiazar #, Amit Sheth # and Flora Logan- Klumpler* # Ohio Center.
CONTRIBUTIONS Ground-truth dataset Simulated search tasks environment Multiple everyday applications (MS Word, MS PowerPoint, Mozilla Browser) Implicit.
TU/e technische universiteit eindhoven Hera: Development of Semantic Web Information Systems Geert-Jan Houben Peter Barna Flavius Frasincar Richard Vdovjak.
RDB2RDF: Incorporating Domain Semantics in Structured Data Satya S. Sahoo Kno.e.sis CenterKno.e.sis Center, Computer Science and Engineering Department,
Knowledge Enabled Information and Services Science Schema-Driven Relationship Extraction from Unstructured Text Cartic Ramakrishnan Kno.e.sis Center, Wright.
1 Schema-Driven Relationship Extraction from Unstructured Text Cartic Ramakrishnan, Krys Kochut and Amit Sheth LSDIS Lab, University of Georgia, Athens,
Semantic Browser LSDIS (Large Scale Distributed Information Systems) Lab. Bilal Gonen M.Sc. in Computer Science University of Georgia
OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.
OntoBlog: Linking Ontology and Blogs Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of Informatics, Japan 2 Asian.
Linked Sensor Data Harshal Patni, Cory Henson, Amit P. Sheth Ohio Center of Excellence in Knowledge enabled Computing (Kno.e.sis) Wright State University,
A Taxonomy-based Model for Expertise Extrapolation Delroy Cameron, Amit P. Sheth Ohio Center for Excellence in Knowledge-enabled Computing (Kno.e.sis)
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
BTW (“By The Way…”) Information Annotation By Rudd Stevens, Jason Endo University of San Francisco.
A Flexible Workbench for Document Analysis and Text Mining NLDB’2004, Salford, June Gulla, Brasethvik and Kaada A Flexible Workbench for Document.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
Information Retrieval: Human-Computer Interfaces and Information Access Process.
Research Problems in Semantic Web Search Varish Mulwad ____________________________ 1.
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
A Statistical and Schema Independent Approach to Identify Equivalent Properties on Linked Data † Kno.e.sis Center Wright State University Dayton OH, USA.
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
Copyright © 2010 Accenture All Rights Reserved. Accenture, its logo, and High Performance Delivered are trademarks of Accenture. Multiple Ontologies in.
Semantic Analytics on Social Networks: Experiences in Addressing the Problem of Conflict of Interest Detection Boanerges Aleman-Meza, Meenakshi Nagarajan,
Formalizing and Querying Heterogeneous Documents with Tables Krishnaprasad Thirunarayan and Trivikram Immaneni Department of Computer Science and Engineering.
Knowledge Enabled Information and Services Science Ontology supported Knowledge Discovery in the field of Human Performance and Cognition Kno.e.sis Center.
Information Seeking in Electronic Environments Marchionini, G. (1995). Information Seeking in Electronic Environments. Cambridge Press. Kathleen Padova.
Mining the Semantic Web: Requirements for Machine Learning Fabio Ciravegna, Sam Chapman Presented by Steve Hookway 10/20/05.
©2008 Srikanth Kallurkar, Quantum Leap Innovations, Inc. All rights reserved. Apollo – Automated Content Management System Srikanth Kallurkar Quantum Leap.
WEB SEARCH PERSONALIZATION WITH ONTOLOGICAL USER PROFILES Data Mining Lab XUAN MAN.
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
Search Engine Architecture
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
Semantic Enhancement: Key to Massive and Heterogeneous Data Pools Violeta Damjanovic, Thomas Kurz, Rupert Westenthaler, Wernher Behrendt, Andreas Gruber,
CONCLUSIONS & CONTRIBUTIONS Ground-truth dataset, simulated search tasks environment Multiple everyday applications (MS Word, MS PowerPoint, Mozilla Browser)
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
Scalable Hybrid Keyword Search on Distributed Database Jungkee Kim Florida State University Community Grids Laboratory, Indiana University Workshop on.
Named Entity Disambiguation on an Ontology Enriched by Wikipedia Hien Thanh Nguyen 1, Tru Hoang Cao 2 1 Ton Duc Thang University, Vietnam 2 Ho Chi Minh.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
Unsupervised Discovery of Compound Entities for Relationship Extraction Cartic Ramakrishnan, Pablo N. Mendes Shaojun Wang, Amit P. Sheth
A Novel Visualization Model for Web Search Results Nguyen T, and Zhang J IEEE Transactions on Visualization and Computer Graphics PAWS Meeting Presented.
KAnOE: Research Centre for Knowledge Analytics and Ontological Engineering Managing Semantic Data NACLIN-2014, 10 Dec 2014 Dr. Kavi Mahesh Dean of Research,
Multilingual Information Retrieval using GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of Kaohsiung.
An Ontological Approach to Financial Analysis and Monitoring.
Traversing Documents by Using Semantic Relationships Bilal Gonen Assistant Professor Computer Science, University of West Florida Ph.D. from University.
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
Navigation Aided Retrieval Shashank Pandit & Christopher Olston Carnegie Mellon & Yahoo.
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
Discovering and Ranking Semantic Associations over a Large RDF Metabase Chris Halaschek, Boanerges Aleman- Meza, I. Budak Arpinar, Amit P. Sheth 30th International.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Linked Open Data Dataset from Related Documents Petya Osenova and Kiril Simov IICT-BAS LDL-2016, LREC, Portoroz.
CONCLUSIONS & CONTRIBUTIONS Ground-truth dataset, simulated search tasks environment Implicit feedback, semi-explicit feedback (annotations), explicit.
Personalized Ontology for Web Search Personalization S. Sendhilkumar, T.V. Geetha Anna University, Chennai India 1st ACM Bangalore annual Compute conference,
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Search Engine Architecture
Knowledge Discovery in the Semantic Web
Model-Driven Analysis Frameworks for Embedded Systems
Wikitology Wikipedia as an Ontology
Searching and browsing through fragments of TED Talks
Measuring Learning During Search: Differences in Interactions, Eye-Gaze, and Semantic Similarity to Expert Knowledge Florian Groß Mai
Presentation transcript:

Semantics-Empowered Text Exploration for Knowledge Discovery Delroy Cameron, Pablo N. Mendes, Amit P. Sheth Knowledge Enabled Information and Services Science Center (Kno.e.sis) Department of Computer Science and Engineering Wright State University Dayton, OH Victor Chan Division of Biosciences and Performance Human Effectiveness Directorate Air Force Research Lab (AFRL) Wright-Patterson Air Force Base Dayton, OH 48 th ACM Southeast Conference. ACMSE Oxford, Mississippi. April 15-17, 2010.

OUTLINE  Background  Paradigm Shift  Demo  Architecture  Experimental Results  Future Work  Conclusion 3

BACKGROUND  IR Systems - Interaction Paradigm Manually seek information Hyperlinked Documents Document-Centric Model  Basis - Interaction Paradigm Keyword Search Document Browsing 4

S BACKGROUND  Interaction Sequence 1. Assemble Keywords and Search 2. Document Selection 3. Document Inspection 4. Aggregation/Organization 5 Information Need What is the role of Magnesium in relation to Migraine? Magnesium migraine Search

LIMITATIONS  Query Reformulations Impatient users Recognition over Recall  Constrained navigation Hyperlink dependent - apriori  Fuzzy User Interests Haiti Earthquake – Recovery, Relief, Political Climate, Crime  Ineffective for Exploratory Search Search-and-Sift Query: Father of the Web Answer: Sir Tim Berners-Lee Amit P. Sheth, Cartic Ramakrishnan: Relationship Web: Blazing Semantic Trails between Web Resources. IEEE Internet Computing 11(4): (2007)

MOTIVATION  Users are A priori hyperlink dependent  Semantic Web Standards Entity Identification (Semantic Annotations) Relationship and Triple Identification Explore documents/information via relationships information seekers Informationdocumentsis embedded in 7

PARADIGM SHIFT Search Hit > Annotated Hit  Bag of annotated words/phrases  Annotated phrase is known entity  Entity is Subject/Object of Triple Navigation driven by relationships Entity[Document]RelationshipEntity[Document]  Entity[Document]  Relationship  Entity[Document] Contextual Navigation (relationships as context) 8

CONTRIBUTIONS 1. Novel Information Exploration Paradigm Data-Centric Model 2. Demonstrate use of background knowledge Named Entities, Relationships 3. Prototype Implementation Semantic annotations for navigation 4. Aggregation Utilities Saving, bookmarking, publishing etc 9

DEMO 10

Trie-based Spotter for Named Entity Identification used ultimately for document annotation Semantic Browser Controlled Vocabulary 992,281 DBpedia terms 15,742 HPCO terms 5,232 UMLS terms Controlled Vocabulary 992,281 DBpedia terms 15,742 HPCO terms 5,232 UMLS terms Medline (19 million Abstracts) Medline (19 million Abstracts) Spotter Module Document Corpus Linked Open Data SavePublishOrganize Utilities provided for promoting, bookmarking, and saving search results Search Workbench (SERP) Annotated entities provide anchors that serve as entry points to navigation Semantic Trail Log Sequential record of each triple navigated by a user Yahoo (indexed documents accessed as a Web Service using Yahoo Search Boss) Yahoo (indexed documents accessed as a Web Service using Yahoo Search Boss) Articles saved using Lucene. Indexed as of Aug Figure 1: System Components and Architecture ARCHITECTURE Background Knowledge HCPO Ontology UMLS

IMPLEMENTATION Spotter Module Dietary restriction with hypomagnesia is normally associated with diminished urinary excretion. magnesium UMLS Controlled Vocabulary Entity LabelPubMed ID Magnesium Deficiency C Dietary restriction with hypomagnesia C Magnesium EntityID: This process is called Spotting and uses a Trie data structure. 12 magnesium

ARCHITECTURE  Document Corpus Medline Lucene Index - 19 million abstracts Aug REST Endpoint: XML Response (or JSON) Keyword queries, Document IDs  Background Knowledge UMLS (Unified Medical Language System)  5,232 entities and 16,540 triples HPCO (Human Performance & Cognition Ontology)  15,742 entities and 22,298 triples 13

Rank Feature on [1-5] scale Normalized Relative Aggregated Scores EVALUATION Evaluation Metrics Search User Interfaces Semantic Browser (Medline + UMLS) PubMedYahoo Interface Design Useful Features Motivation to Explore Information Novelty Effectiveness of Task outcome Required Cognitive Load Overall Satisfaction

CONCLUSION Novel Information Exploration Paradigm Semantic Browser support Contextual Navigation Identify Named Entities and Relationships Provide Semantic Annotations Utilities for Aggregation Semantic Trails to Knowledge Discovery 15

x Formal Model for Paradigm Shift Improved Spotter – Additional Vocabularies, Context, Rule Based Relationship Ranking Document Re-ranking Trail Logs Analysis FUTURE WORK 16

ACKNOWLEDGEMENTS People Cartic Ramakrishnan Bilal Gonen, Aditya Dhoke Wesley Workman, Rodrigo Gama, Guilherme de Napoli Air Force Research Lab Human Effectiveness Directorate Wright-Patterson Air Force Base National Science Foundation Award SemDis: Discovering Complex Relationships in the Semantic Web. No Wright State University No. IIS to University of Georgia 17

QUESTIONS 18

Semantic Web extension of the current web common vocabulary machine processable Semantic Web – is an extension of the current web in which data is expressed in a common vocabulary making such that the data becomes machine processable. Ontology conceptsrelationships Ontology – is a specification of concepts and relationships between them. Triple subject-predicate-object Triple - a ternary relation containing an entity pair and a relationship that expresses the link between them i.e. subject-predicate-object Entity/Concept thing Entity/Concept – an instance of a thing URI URI – a unique identifier for any resource/entity/thing on the web LOD LOD - a semantic web initiative to provide a repository of semantically connected datasets TERMINOLOGY 19