Revelytix SICoP Presentation DRM 3.0 with WordNet Senses in a Semantic Wiki Michael Lang February 6, 2007.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Federal Data Architecture Subcommittee Co-chairs: What Does the DRM Mean to Me? – The FEA DRM Management Strategy 19 July 2006 Bryan Aucoin, DNI Suzanne.
FEADRM Person Person Harmonization Workgroup Data Architecture Subcommittee Meeting January 11, 2007.
Meta Data Larry, Stirling md on data access – data types, domain meta-data discovery Scott, Ohio State – caBIG md driven architecture semantic md Alexander.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
OMG Architecture Ecosystem SIG Federal CIO Council Data Architecture Subcommittee May 2011 Cory Casanave.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Information and Business Work
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
224 Schilling Circle Suite 240 Hunt Valley, MD (410) Ontology-Driven Information Management Standards-Based Collaborative.
Environmental Terminology System and Services (ETSS) June 2007.
A Methodology for Developing a Taxonomy – A Subject Oriented Approach
Chapter 10: Analyzing Systems Using Data Dictionaries Instructor: Paul K Chen.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Semantic Interoperability Community of Practice (SICoP) Semantic Web Applications for National Security Conference Hyatt Regency Crystal City, Regency.
Domain Modelling the upper levels of the eframework Yvonne Howard Hilary Dexter David Millard Learning Societies LabDistributed Learning, University of.
FEA DRM Management Strategy 11 October 2006 “Build to Share”
SC32 WG2 Metadata Standards Tutorial Metadata Registries and Big Data WG2 N1945 June 9, 2014 Beijing, China.
SICoP Presentation A story about communication Michael Lang BEARevelytix May 2, 2007.
XBRL Seminar: The New Data Reference Model
1 Data Architecture, Modeling, and Networks Brand L. Niemann January 5, 2007.
Clément Troprès - Damien Coppéré1 Semantic Web Based on: -The semantic web -Ontologies Come of Age.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Environmental Terminology Research in China HE Keqing, HE Yangfan, WANG Chong State Key Lab. Of Software Engineering
Metadata Management Case Study Date: 10/21/2008 Dan McCreary President Dan McCreary & Associates (952) M D Metadata Solutions.
1 Building DRM 3.0 and Web 3.0 for Managing Context Across Multiple Documents and Organizations Mills Davis and Brand Niemann, SICoP Co-Chairs, and Lucian.
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
Cairo Corporation An Inc 500 Company ISO9001:2000 CERTIFIED 8(a) ۰ SDB ۰ WOB ۰ GSA ۰ GSA STARS The Dream of a Common Language: Extending the Role of the.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
Aude Dufresne and Mohamed Rouatbi University of Montreal LICEF – CIRTA – MATI CANADA Learning Object Repositories Network (CRSNG) Ontologies, Applications.
EPA’s Environmental Terminology System and Services (ETSS) Michael Pendleton Data Standards Branch, EPA/OEI Ecoiformatics Technical Collaborative Indicators.
Semantic Web - an introduction By Daniel Wu (danielwujr)
FEA DRM Management Strategy Presented by : Mary McCaffery, US EPA.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
W HAT IS I NTEROPERABILITY ? ( AND HOW DO WE MEASURE IT ?) INSPIRE Conference 2011 Edinburgh, UK.
1 Data and Information Architecture: Not Just for Enterprise Architects! Gartner Enterprise Architecture Conference June 2007, Nashville, TN Gaylord.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Building a Topic Map Repository Xia Lin Drexel University Philadelphia, PA Jian Qin Syracuse University Syracuse, NY * Presented at Knowledge Technologies.
1 © Copyright 2006 Data Foundations, Inc. CONFIDENTIAL & PROPRIETARY OneData and the FEA DRM Presented at SICOP 2006 February 10,
1 DAS Annual Review June 2008 “Build to Share” Suzanne Acar, US DOIAdrian Gardner, US National Weather ServiceCo-Chair, Federal DAS
Overview of FEA Geospatial Profile, Version 0.3 Doug Nebert FGDC Secretariat.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Working with Ontologies Introduction to DOGMA and related research.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Overview of SC 32/WG 2 Standards Projects Supporting Semantics Management Open Forum 2005 on Metadata Registries 14:45 to 15:30 13 April 2005 Larry Fitzwater.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
Information Architecture The Open Group UDEF Project
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
SICoP Presentation A story about communication Michael Lang BEARevelytix April 25, 2007.
Implementing the FEA DRM Michael C. Daconta Metadata Program Manager March 15, 2004.
EbXML Semantic Content Management Mark Crawford Logistics Management Institute
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Ontologies Reasoning Components Agents Simulations An Overview of Model-Driven Engineering and Architecture Jacques Robin.
Semantics and the EPA System of Registries Gail Hodge IIa/ Consultant to the U.S. Environmental Protection Agency 18 April 2007.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
OMG Architecture Ecosystem SIG Enterprise Data World 2011.
Agenda Federated Enterprise Architecture Vision
Data Reference Model Implementation Through Iteration & Testing
Collaborative Vocabulary Management
A New Enterprise Data Management Strategy for the US EPA
Brand Niemann, US EPA and
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
OneData and the FEA DRM Presented at SICOP 2006 February 10, 2006 Mathew Manathara Data Foundations, Inc.
The Re3gistry software and the INSPIRE Registry
One Language. One Enterprise.™
Business Process Management and Semantic Technologies
Presentation transcript:

Revelytix SICoP Presentation DRM 3.0 with WordNet Senses in a Semantic Wiki Michael Lang February 6, 2007

Agenda ► Semantic Matching using Wordnet ► Bootstrapping COI based vocabularies  With WordNet ► DRM 3.0 in a semantic Wiki  With WordNet integration  DRM implementation tool for the agencies ► Demonstration

DRM Mission ► Facilitate information sharing ► How can I know that a data element or service I have discovered is the one I really want?  Description  Context ► How can I describe and provide sufficient context for anyone to know they have found what they want ► Knowledge model  Excel, ISO 11179, DRM 2.0 will not be sufficient  OWL

Semantic Matching

► MatchIT  Extracts terms from data bases  Creates a MatchIT vocabulary based on the collection of terms  Uses WordNet to match terms in disparate systems  Uses WordNet to match terms to domain vocabularies (NIEM)  Attaches WordNet “senses” to the vocabulary terms ► MatchIT vocabularies can be exported as OWL models  With the WordNet senses and synsets ► MatchIT can use other knowledgebases to facilitate matching

Bootstrapping COI Vocabularies ► MatchIT vocabularies are imported  Either as OWL classes for vocabulary development  Or as OWL individuals for DRM development ► These vocabularies can enriched using Knoodl.com for community based development  Data dictionary  Vocabulary  Knowledge base

Conceptual / Logical / Physical Data Models Relational XMLXML XMLXML XMLXML XML Ontologies [OWL/RDF] Domain [UML/ER] Data Harmonization Complete Metadata Access Data/Content Access Ontological Semantics Access OWL / RDF Model Complete Import Export Representations Find Matches Ontological Semantics Access Enterprise Information Sources Custom Any Source XML File System JDBC RDMS Semantic Ontology Platform Fact Repositories Onomasticons Lexicons Domain Ontology Models & Files [versioned] Search Index Web Reporting Instance- level Match Schema- level Match Build Knoodl.com Third-Party Modeling Tool MatchIT Vocabulary Manager

Collaborative OWL editor

Information Management ► Knoodl is a new kind of modeling tool for modeling the structure, semantics and knowledge of any domain  The modeling process is necessarily collaborative  The process is necessarily extensible and additive  Community of Interest (COI) based tool  OWL based

Knoodl.com is … ► An internet application where people can collaborate with others in their communities of interest to  Create, edit, share and find  Vocabularies / ontologies ► OWL Repository  Free, but licensing controlled by COI’s ► Social Computing Paradigm  Users contribute content and benefit from the content  Vocabularies capture much of the institutional knowledge of an enterprise or community  Gain value over time  Used by people and machines

Knoodl.com ► Knoodl is a collaborative framework ► Interoperability depends on three groups of stakeholders contributing to the description and context of the services ► Businesspeople ► Technical people ► Data people  Knoodl provides the features for the business people to participate

FEADRM Person Person Harmonization Workgroup Data Architecture Subcommittee Meeting January 11, 2007

Gathering Information We asked those on the workgroup to share their models of PERSON with us. We received documents from the Department of the Interior (DOI), the Veterans’ Administration (VA), the Federal Aviation Administration (FAA), and the Environmental Protection Agency (EPA). You can view them on CORE.gov at

Analyzing the Data We compared the entities and attributes from all the documentation. We created an Excel Workbook. – The first sheet contains all the entities and attributes from each model. – The second sheet contains a mapping of the entities from the other agencies to those of the Social Security Administration (SSA) – The third sheets contains the entities, attributes, and their definitions from the SSA FEADRM Model The Excel document is named ‘Person Entities and Attributes from Various Feds’ and you can view it on CORE.gov at

Observations A data model should have a point of view, we should have a common one at the Federal level. Everyone should be modeling business data rather than creating logical data base models. PERSON is probably the area in which resides most of the non-administrative sharable data. This is what we at SSA call “common shared.” The definition of business concepts represented by entities at the “top” of the data model should not be in terms so rigorously tied to the business of any one agency. Data that are “regulated” require formal agreement to be sharable. PERSON cannot be addressed in a vacuum. The concepts of organization, party, and role should be addressed at the same time.

DRM 3.0

Communities of Interest (COI) Vision Each COI will implement the 3 pillar framework strategy. Business & Data Goals drive Information Sharing/Exchange (Services) Governance Data Strategy Data Architecture (Structure)

The FEA Data Reference Model 2.0 Source: Expanding E-Government, Improved Service Delivery for the American People Using Information Technology, December 2005, pages NIEM 1.0 NIEM Roadmap Pilot

DRM 2.0 Implementation Metamodel ► Definitions:  Metamodel: Precise definitions of constructs and rules needed for abstraction, generalization, and semantic models.  Model: Relationships between the data and its metadata - W3C.  Metadata: Data about the data for: Discovery, Integration, and Execution.  Data: Structured e.g. Table, Semi-Structured e.g. , and Unstructured e.g. Paragraph. Source: Professor Andreas Tolk, 2005.

The Revelytix Solution ► OWL MetaModel:  owl:Class  owl:Property ► DRM Model:  Topic (owl class)  Entity (owl class)  Relationship (owl object property) Use existing MetaModel languages to model the FEA DRM – OWL Model the DRM in a collaborative environment - Knoodl Extend the DRM to model the type of information that will be created – JDBC metadata, Wordnet synset and word data Use existing MetaModel languages to model the FEA DRM – OWL Model the DRM in a collaborative environment - Knoodl Extend the DRM to model the type of information that will be created – JDBC metadata, Wordnet synset and word data

DRM Implementation: Data Description Area ► Model JDBC Metadata to Data Description Area Entity Attribute DRM v2.0 Vocabulary View Column Table MatchIT Data Dictionary Vocabulary Relationship ForeignKey

DRM Implementation: Data Context Area ► Model Wordnet data to Data Context Area Relationship Topic DRM v2.0 Vocabulary Hyponym Synset Hypernym MatchIT Data Dictionary Vocabulary Taxonomy Wordnet

SICoP Knowledge Reference Model The point of this graph is that Increasing Metadata (from glossaries to ontologies) is highly correlated with Increasing Search Capability (from discovery to reasoning).

Demonstration

Contextualize (Interpret) Automated term tokenization Automated semantic linking using the default knowledge-base contained within MatchIT ArticleAmount AmountArticle Sum Assets Creation Synonym Type-of

Semantic Matching (Mediate) ► Relationships pre-established within the knowledge-base… Identify the Target and the Source(s) and run the match. ArticleAmount ProductShares Automatically linked by a specific % distance

Semantic Matching (Mediate) Not all direct matches are the most relevant… In many cases the most valuable match are the distant matches. By adding a domain knowledge-base these relationships become more obvious. Abstraction Evidence