SWETO: Large-Scale Semantic Web Test-bed Ontology In Action Workshop (Banff Alberta, Canada June 21 st 2004) Boanerges Aleman-MezaBoanerges Aleman-Meza,

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
SemTag and Seeker: Bootstrapping the Semantic Web via Automated Semantic Annotation Presented by: Hussain Sattuwala Stephen Dill, Nadav Eiron, David Gibson,
An Ontological Approach to the Document Access Problem of Insider Threat ISI 2005, (May 20) Boanerges Aleman-Meza 1 Phillip Burns 2 Matthew Eavenson 1.
OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Semantic Web Tools for Authoring and Using Analysis Results Richard Fikes Robert McCool Deborah McGuinness Sheila McIlraith Jessica Jenkins Knowledge Systems.
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
The Semantic Web Week 1 Module Content + Assessment Lee McCluskey, room 2/07 Department of Computing And Mathematical Sciences Module.
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Semantic Web Technology Evaluation Ontology (SWETO): A test bed for evaluating tools and benchmarking semantic applications WWW2004 (New York, May 22,
CSE 428 Semantic Web Topics Introduction Jeff Heflin Lehigh University.
Predicting Missing Provenance Using Semantic Associations in Reservoir Engineering Jing Zhao University of Southern California Sep 19 th,
Semantic Analytics on Social Networks: Experiences in Addressing the Problem of Conflict of Interest Detection Boanerges Aleman-Meza, Meenakshi Nagarajan,
Advances in Technology and CRIS Nikos Houssos National Documentation Centre / National Hellenic Research Foundation, Greece euroCRIS Task Group Leader.
An Ontological Approach to Assessing IC Need to Know Phillip BurnsCTA Inc. Prof. Amit ShethLSDIS Lab, University of Georgia Presented to ARDA PI Meeting,
Formalizing and Querying Heterogeneous Documents with Tables Krishnaprasad Thirunarayan and Trivikram Immaneni Department of Computer Science and Engineering.
Ranking Relationships on the Semantic Web Budak Arpinar This work is funded by NSF-ITR-IDM Award# titled '‘SemDIS: Discovering Complex Relationships.
An Introduction to the Resource Description Framework Eric Miller Online Computer Library Center, Inc. Office of Research Dublin, Ohio 元智資工所 系統實驗室 楊錫謦.
Deploying Trust Policies on the Semantic Web Brian Matthews and Theo Dimitrakos.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Funded by: European Commission – 6th Framework Project Reference: IST WP 2: Learning Web-service Domain Ontologies Miha Grčar Jožef Stefan.
Logics for Data and Knowledge Representation
1 Virtualisation and Validation of Smart City Data Dr Sefki Kolozali Institute for Communication Systems Electronic Engineering Department University of.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
1 Technologies for (semi-) automatic metadata creation Diana Maynard.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
EU Project proposal. Andrei S. Lopatenko 1 EU Project Proposal CERIF-SW Andrei S. Lopatenko Vienna University of Technology
Value Set Resolution: Build generalizable data normalization pipeline using LexEVS infrastructure resources Explore UIMA framework for implementing semantic.
Semantic (Web) Technology in Action - today The Semantic Web – Scientific American article considered harmful? WWW2003 Panel (PN2), Budapest, May 21, 2003.
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
Data Mining for Web Intelligence Presentation by Julia Erdman.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
Semantic Web in Action Ontology-driven information search, integration and analysis NASA Virtual Iron Bird Workshop, NASA Ames, March 31, 2004 Amit Sheth.
Semantic Web: The Future Starts Today “Industrial Ontologies” Group InBCT Project, Agora Center, University of Jyväskylä, 29 April 2003.
Searching and Ranking Documents based on Semantic Relationships PaperPaper presentation ICDE Ph.D. Workshop 2006 April 3rd, 2006, Atlanta, GA, USA This.
The Semantic Logger: Supporting Service Building from Personal Context Mischa M Tuffield et al. Intelligence, Agents, Multimedia Group University of Southampton.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
OWL-based Semantic Conflicts Detection and Resolution for Data Interoperability Changqing Li,Tok Wang Ling Department of Computer Science School of Computing.
CREAM: Semantic annotation system May 24, 2013 Hee-gook Jun.
1 Context-Aware Internet Sharma Chakravarthy UT Arlington December 19, 2008.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Context Aware Semantic Association Ranking SWDB Workshop Berlin, September 7, 2003 Boanerges Aleman-MezaBoanerges Aleman-Meza, Chris Halaschek, I. Budak.
Named Entity Disambiguation on an Ontology Enriched by Wikipedia Hien Thanh Nguyen 1, Tru Hoang Cao 2 1 Ton Duc Thang University, Vietnam 2 Ho Chi Minh.
1 MedAT: Medical Resources Annotation Tool Monika Žáková *, Olga Štěpánková *, Taťána Maříková * Department of Cybernetics, CTU Prague Institute of Biology.
Aim Ability to automate the detection of financial inconsistency and irregularity Problem Need to create a unified and logically rigorous terminology.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
Ontology Quality by Detection of Conflicts in Metadata Budak I. Arpinar Karthikeyan Giriloganathan Boanerges Aleman-Meza LSDIS lab Computer Science University.
Information Architecture The Open Group UDEF Project
References [1] D:\My Documents\SemanticWebWorkshop\kaynak\Ian Horrocks - CS646\intro-2004.pptD:\My Documents\SemanticWebWorkshop\kaynak\Ian Horrocks -
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
An Ontology-based Approach to Context Modeling and Reasoning in Pervasive Computing Dejene Ejigu, Marian Scuturici, Lionel Brunie Laboratoire INSA de Lyon,
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
An Ontological Approach to Financial Analysis and Monitoring.
Introduction to the Semantic Web Jeff Heflin Lehigh University.
University of Maryland Scaling Heterogeneous Information Access for Wide area Environments Michael Franklin and Louiqa Raschid.
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
Discovering and Ranking Semantic Associations over a Large RDF Metabase Chris Halaschek, Boanerges Aleman- Meza, I. Budak Arpinar, Amit P. Sheth 30th International.
Semantic and geographic information system for MCDA: review and user interface building Christophe PAOLI*, Pascal OBERTI**, Marie-Laure NIVET* University.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
By: Chris Halaschek Advisors: Dr. I. Budak Arpinar Dr. Amit P. Sheth
Towards a framework for architectural design decision support
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Social Knowledge Mining
Metadata Construction in Collaborative Research Networks
Amit Sheth, CTO, Semagix Inc
Context-Aware Internet
Presentation transcript:

SWETO: Large-Scale Semantic Web Test-bed Ontology In Action Workshop (Banff Alberta, Canada June 21 st 2004) Boanerges Aleman-MezaBoanerges Aleman-Meza, Chris Halaschek, Amit Sheth, I.Budak Arpinar, Gowtham Sannapareddy Amit ShethI.Budak Arpinar

6/21/2004 Outline Motivation Goals Development Framework Current Status Related Work Conclusion & Future Work

6/21/2004 Motivation for SWETO Many new techniques and software tools from emerging Semantic Web community  Need a common infrastructure for testing Ontologies are a centerpiece of most approaches  Need an open and freely available ontology with a very large knowledge base

6/21/2004 Motivation for SWETO Current ontologies (i.e. TAP KB [3]) have breadth but lack depth Need for a large scale dataset for testing algorithms for knowledge discovery (i.e. Semantic Associations [1])

6/21/2004 The Big Picture

6/21/2004 SWETO Goals Develop a broad and deep ontology populated with real facts/data from real world heterogeneous sources  the instances in the knowledge base should be highly interconnected Serve as a test-bed for advanced semantic applications (i.e. business intelligence, national security, etc.) Address the requirements of a research benchmark for semantic analytics, and the issues of:  ontology creation  semi-automatic extraction  entity disambiguation

6/21/2004 Development Framework Utilized Semagix Freedom [4] for ontology creation and metadata extraction With Freedom, knowledge extractors were created by specifying regular expressions to extract entities from various data sources:  Open and trusted Web sources  (semi-) structured sources allow high scalability in extraction and crawling

6/21/2004 Development Framework Data sources:  Selected sources which were highly reliable Web sites that provide entities in a semi–structured format unstructured data with parse-able structures (e.g.,html pages with tables) dynamic web sites with database back-ends  Considered the types and quantity of implicit/explicit relationships preferred sources in which instances were interconnected  Considered sources whose entities would have rich metadata  Public and open sources were preferred due to the desire to make SWETO openly available

6/21/2004 Development Framework As the sources were ‘scraped’ by the extractors, entities are extracted and stored in appropriate classes in an ontology Due to heterogeneous data sources, entity disambiguation is a crucial step  Freedom’s disambiguation techniques automatically resolved entity ambiguities in 99% of the cases, leaving less than 1% for human disambiguation (about 200 cases)

6/21/2004 Development Framework Utilize Freedom’s API for exporting both the ontology and its instances in either RDF [5] or OWL [2] syntax Extractors are scheduled to rerun for keeping the ontology updated

6/21/2004 (Semagix) Application Architecture

6/21/2004 Current Status Current population includes over 800,000 entities and over 1,500,000 explicit relationships among them Continue to populate the ontology with diverse sources thereby extending it in multiple domains

6/21/2004 Current Status – Classes Subset of classes in the ontology # Instances Cities, countries, and states 2,902 Airports 1,515 Companies, and banks 30,948 Terrorist attacks, and organizations 1,511 Persons and researchers 307,417 Scientific publications 463,270 Journals, conferences, and books 4,256 TOTAL (as of May 2004) 811,819

6/21/2004 Current Status – Relationships Subset of relationships# Explicit relations located in30,809 responsible for (event)1,425 Listed author in1,045,719 (paper) published in467,367

6/21/2004 Current Status – Disambiguation Disambiguation type# Times used Automatic (Freedom)248,151 Manual210 Unresolved (Removed)591

6/21/2004 Browsing of the Schema

6/21/2004 Related Work TAP KB [3] is fairly broad but not very deep knowledge base annotated in RDF

6/21/2004 Conclusions & Future Work Using Semagix Freedom, we have created a very broad and deep Semantic Web Evaluation Ontology (SWETO)  Contains over 800,000 entities and over 1,500,000 explicit relationships among them Aim to continue the population of SWETO by further extraction of data Also plan to further investigate the use of semantic similarity for entity disambiguation

6/21/2004 SWETO Project Homepage  Project description, papers, presentations

6/21/2004 References [1] K. Anyanwu, and A. Sheth. “r-Queries: Enabling Querying for Semantic Associations on the Semantic Web”. Twelfth International World Wide Web Conference, Budapest, Hungary. May 20-24, 2003; pp [2] S. Bechhofer, F. Harmelen, J. Hendler, I. Horrocks, D. McGuinness, P. Patel-Schneider, et al. (2003). “OWL Web Ontology Language Reference”. W3C Proposed Recommendation, from [3] R. Guha and R. McCool. “Tap: A Semantic Web Test-Bed”. Journal of Web Semantics, 1(1), Dec. 2003, pp [4] B. Hammond, A. Sheth, K. Kochut. “Semantic Enhancement Engine: A Modular Docu-ment Enhancement Platform for Semantic Applications over Heterogeneous Content in Real World Semantic Web Applications”. V. Kashyap & L. Shklar, Eds., IOS Press, 2002 [5] O. Lassila, & R. Swick. “Resource Description Framework (RDF) Model and Syntax Specification”. W3C Recommendation, from