Ontology-based Access to Digital Libraries. Sonia Bergamaschi, University of Modena and Reggio Emilia, Modena, Italy; Fausto Rabitti.


Ontology-based Access to Digital Libraries
Sonia Bergamaschi, University of Modena and Reggio Emilia, Modena, Italy
Fausto Rabitti, ISTI - Institute of Information Science and Technology, CNR, Pisa, Italy

Outline
- Digital Libraries on Internet
- Need for integrated access (Open Archives Initiative)
- Metadata in Digital Libraries
- Impact of XML on Digital Libraries
- Controlling semantics in XML (data and metadata interchange in Digital Libraries)
- Ontology-based approach

Digital Libraries on Internet
- The Internet is making a large and growing number of Digital Libraries, originally intended for specific and specialised groups of users, accessible to a wide range of potential users.
- The problem of controlling, exchanging and integrating the semantics associated with Digital Libraries (i.e., the associated metadata) is becoming more and more important.

Open Archives Initiative
- Need for integrated access to Digital Libraries.
- The Open Archives Initiative (OAI), in the US, aims to guarantee interoperability among Digital Libraries (e-print archives).
- It has established a set of relatively simple but potentially quite powerful interoperability specifications that facilitate the development of services implemented by third parties.

Metadata in Digital Libraries
- Metadata for bibliographic data in Digital Libraries are usually expressed according to models like Dublin Core or MARC.
- However, there is a need to generalise the description of data and metadata made available in a large variety of Digital Libraries.
- The wide acceptance of XML on the Web can be a decisive factor in this direction.
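
As an illustration of bibliographic metadata expressed in XML, here is a minimal sketch that builds a Dublin Core description with Python's standard library. The `record` wrapper element and the `dublin_core_record` helper are assumptions made for this example; only the Dublin Core element namespace and the `title` and `creator` element names come from the Dublin Core element set itself.

```python
import xml.etree.ElementTree as ET

# Namespace of the Dublin Core Metadata Element Set 1.1.
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)


def dublin_core_record(fields):
    """Build a record whose children are Dublin Core elements.

    `fields` is a list of (element_name, value) pairs, so that
    repeatable elements such as `creator` can appear more than once.
    The <record> wrapper is a hypothetical container, not a standard one.
    """
    record = ET.Element("record")
    for name, value in fields:
        child = ET.SubElement(record, f"{{{DC_NS}}}{name}")
        child.text = value
    return record


record = dublin_core_record([
    ("title", "Ontology-based Access to Digital Libraries"),
    ("creator", "Sonia Bergamaschi"),
    ("creator", "Fausto Rabitti"),
])
xml_text = ET.tostring(record, encoding="unicode")
print(xml_text)
```

Using a list of pairs rather than a dictionary keeps repeated elements, which bibliographic descriptions often need.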

What is XML
- XML: eXtensible Markup Language
  - XML is a simple, standard way to delimit text data
  - the ASCII of the Web:
    - use your favorite programming language to create an arbitrary data structure
    - share it with anyone using any other language on any other computing platform
- Proposed by the World Wide Web Consortium (W3C)
- XML is a subset of SGML
  - SGML - Standard Generalized Markup Language
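
The "create a data structure, then share it" idea can be sketched as a round trip: a nested Python structure is serialized to XML text and parsed back, as a receiving application on another platform would do. The `to_xml` helper, the `item` tag used for list entries, and the sample data are all assumptions made for this illustration.

```python
import xml.etree.ElementTree as ET


def to_xml(tag, value):
    """Recursively turn nested dicts, lists and scalars into an XML element."""
    elem = ET.Element(tag)
    if isinstance(value, dict):
        for key, child in value.items():
            elem.append(to_xml(key, child))
    elif isinstance(value, list):
        # List entries have no name of their own; "item" is an arbitrary choice.
        for entry in value:
            elem.append(to_xml("item", entry))
    else:
        elem.text = str(value)
    return elem


# Hypothetical data structure created by the sending application.
library = {"name": "Demo Library", "documents": [{"title": "A"}, {"title": "B"}]}

# Serialize to plain text that any XML-aware language can read.
xml_text = ET.tostring(to_xml("library", library), encoding="unicode")

# The receiving side only needs an XML parser to recover the content.
parsed = ET.fromstring(xml_text)
titles = [t.text for t in parsed.iter("title")]
print(xml_text)
print(titles)
```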

Why XML
- HTML, the current standard on the Web, is mainly concerned with presentation style
  - HTML fuses data and presentation
- XML is concerned not only with the presentation style of the document, but also with the formal description of data content
  - XML separates data and presentation
- XML intends to combine the flexibility and power of SGML with the widespread acceptance of HTML

W3C XML Technology
- Data description and modeling
  - XML structure
  - DTD - Document Type Definition
  - XML Schema
- Data presentation and styling
  - CSS - Cascading Style Sheets
  - XSL - Extensible Stylesheet Language
- Data processing
  - API for XML:
    - DOM - Document Object Model
    - SAX - Simple API for XML
  - Transforming XML:
    - XSLT and XPath
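
To make the difference between the two processing APIs concrete, the sketch below reads the same small document with DOM (the whole tree is built in memory and then queried) and with SAX (the parser fires events as it streams through the text). It uses Python's standard `xml.dom.minidom` and `xml.sax` modules; the sample document and the `DocIdHandler` class are assumptions made for the example.

```python
import xml.sax
from xml.dom import minidom

# Hypothetical document used by both approaches.
XML_TEXT = "<library><doc id='d1'>First</doc><doc id='d2'>Second</doc></library>"

# DOM: parse everything into a tree, then navigate it.
dom = minidom.parseString(XML_TEXT)
dom_ids = [node.getAttribute("id") for node in dom.getElementsByTagName("doc")]


# SAX: react to parsing events; no tree is kept in memory.
class DocIdHandler(xml.sax.ContentHandler):
    def __init__(self):
        super().__init__()
        self.ids = []

    def startElement(self, name, attrs):
        # Called once for every opening tag encountered by the parser.
        if name == "doc":
            self.ids.append(attrs["id"])


handler = DocIdHandler()
xml.sax.parseString(XML_TEXT.encode("utf-8"), handler)
sax_ids = handler.ids

print(dom_ids, sax_ids)
```

DOM is convenient when the document fits in memory and needs random access; SAX suits large documents that are read once.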

Controlling Semantics in XML
- XML is a powerful and flexible way to convey the semantics of data through a syntax:
  - it does not ensure the correctness of the process:
    - two applications may interoperate via XML and still give different meanings to the same data objects
- XML document tags can be used to describe the meaning of the document components. Controlling the semantics associated with XML tags will be a decisive task.
- W3C activity on metadata:
  - PICS: Platform for Internet Content Selection
  - RDF: Resource Description Framework
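
A small sketch of the interoperability problem described above: two applications parse the same well-formed XML document without error, yet assign different meanings to one element. The `<date>` element and the two date conventions are assumptions chosen for illustration; they are not taken from the slides.

```python
import xml.etree.ElementTree as ET
from datetime import datetime

# Both applications receive exactly the same document.
doc = ET.fromstring("<record><date>03/04/2001</date></record>")
raw = doc.findtext("date")

# Application A assumes month/day/year.
app_a = datetime.strptime(raw, "%m/%d/%Y").date()

# Application B assumes day/month/year.
app_b = datetime.strptime(raw, "%d/%m/%Y").date()

# The syntax was exchanged correctly, but the meaning was not.
print(app_a.isoformat(), app_b.isoformat())
print(app_a == app_b)
```

Nothing in the XML syntax itself resolves the disagreement, which is why shared metadata about the meaning of tags is needed.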

Impact of XML on Digital Libraries
- Controlling the semantics in XML will open new perspectives in accessing Digital Libraries, since XML is going to become the new interoperability standard for distributed Digital Libraries.
- We foresee a situation where XML will be used in Digital Libraries:
  - for exchanging digital documents (often multimedia) and their multi-modal presentations (via XSL)
  - for defining metadata, using XML DTD or Schema descriptions, with associated RDF (Resource Description Framework) schema descriptions.

Ontology-based approach
- The approach aims to build a Digital Library Ontology representing a global virtual view of distributed Digital Libraries.
- Mapping rules between local and global views are based on a "Common Thesaurus" of terminological relationships able to reconcile different representations of similar concepts.
- The starting point is the MOMIS system.

Mediator envirOnment for Multiple Information Sources (MOMIS) Project
- Information sharing from multiple heterogeneous sources
- Proposal: information integration to provide a global conceptual schema, allowing a user to pose a query and receive a single unified answer.
Internet:
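
The idea of posing one query against a global schema and receiving a single unified answer can be sketched as follows. This is a toy mediator under stated assumptions, not the MOMIS implementation: the two sources, their attribute names, the `MAPPINGS` table and the `query_global` function are all hypothetical.

```python
# Two hypothetical local sources that describe documents with different attribute names.
SOURCE_A = [{"titolo": "XML Basics", "autore": "Rossi"}]
SOURCE_B = [{"title": "RDF Primer", "creator": "Smith"}]
SOURCES = {"A": SOURCE_A, "B": SOURCE_B}

# Mapping rules: global attribute -> local attribute, one table per source.
MAPPINGS = {
    "A": {"title": "titolo", "author": "autore"},
    "B": {"title": "title", "author": "creator"},
}


def query_global(attribute, predicate):
    """Evaluate a predicate on a global attribute across every source.

    Each matching local row is rewritten into global attribute names,
    so the caller sees one homogeneous result list.
    """
    results = []
    for name, rows in SOURCES.items():
        mapping = MAPPINGS[name]
        local_attr = mapping[attribute]
        for row in rows:
            if predicate(row[local_attr]):
                results.append({g: row[l] for g, l in mapping.items()})
    return results


all_docs = query_global("author", lambda a: True)
rdf_docs = query_global("title", lambda t: "RDF" in t)
print(all_docs)
print(rdf_docs)
```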

MOMIS Architecture

MOMIS wrapper
- The wrappers are the access point for the data sources.
- The wrappers present each data source (XML, relational, object, ...) in a common data model (derived from the ODMG and I3/POB proposals).
- An XML wrapper wraps data sources that contain valid XML data:
  - Translation phase: from XML-DTD data structures to ODMG data structures
  - Querying phase: query translation from an ODMG-standard query language to an XML query language.
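
A rough sketch of what a translation phase could look like: DTD attribute-list declarations are read and turned into simple class descriptions. This is not the MOMIS wrapper; the regular expressions handle only the `ID`/`CDATA` and `#REQUIRED`/`#IMPLIED` forms shown in the example source, and the dictionary layout standing in for an ODMG-style class is an assumption.

```python
import re

# DTD fragment in the style of the example XML source.
DTD = """
<!ATTLIST Student StudentId ID #REQUIRED tutor CDATA #REQUIRED>
<!ATTLIST Professor Prof_code ID #REQUIRED Office_phone CDATA #IMPLIED>
"""

# One ATTLIST declaration: element name followed by its attribute definitions.
ATTLIST_RE = re.compile(r"<!ATTLIST\s+(\w+)\s+(.*?)>", re.DOTALL)
# One attribute definition: name, type, default declaration.
ATTR_RE = re.compile(r"(\w+)\s+(ID|CDATA)\s+(#REQUIRED|#IMPLIED)")


def dtd_to_classes(dtd_text):
    """Map each element with an ATTLIST to a list of attribute descriptions."""
    classes = {}
    for element, body in ATTLIST_RE.findall(dtd_text):
        attributes = classes.setdefault(element, [])
        for name, dtd_type, default in ATTR_RE.findall(body):
            attributes.append({
                "name": name,
                "type": "string",                   # both ID and CDATA carry text
                "key": dtd_type == "ID",            # an ID attribute identifies the object
                "optional": default == "#IMPLIED",  # #IMPLIED attributes may be absent
            })
    return classes


classes = dtd_to_classes(DTD)
for class_name, attributes in classes.items():
    print(class_name, attributes)
```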

Common Thesaurus
- Intensional and extensional intra- and inter-schema relationships between name concepts:
  - SYN (Synonym-of)
  - BT (Broader Terms), or hypernymy; NT (Narrower Terms), or hyponymy
  - RT (Related Terms), or positive association
- The relationships added to the Common Thesaurus are:
  - schema-derived
  - lexical-derived
  - designer-supplied
  - inferred
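
One possible way to hold such relationships is as a set of (term, relation, term) triples from which further relationships are inferred. The sketch below is an assumption-laden illustration, not the inference MOMIS actually performs: it only adds inverses (BT/NT, symmetric SYN and RT) and the transitive closure of BT. The term names and the reading "(a, BT, b) means a is broader than b" are also assumptions.

```python
# Inverse of each relation type; SYN and RT are treated as symmetric.
INVERSE = {"SYN": "SYN", "RT": "RT", "BT": "NT", "NT": "BT"}


def close_thesaurus(relations):
    """Return the input triples plus inverse and transitive BT/NT triples.

    A triple (a, "BT", b) is read as "a is a broader term than b".
    """
    closed = set(relations)
    # Step 1: add the inverse of every given relationship.
    for a, rel, b in relations:
        closed.add((b, INVERSE[rel], a))
    # Step 2: repeat until no new broader-term relationship can be derived.
    changed = True
    while changed:
        changed = False
        broader = [(a, b) for a, rel, b in closed if rel == "BT"]
        for a, b in broader:
            for c, d in broader:
                if b == c and a != d and (a, "BT", d) not in closed:
                    closed.add((a, "BT", d))
                    closed.add((d, "NT", a))
                    changed = True
    return closed


# Hypothetical designer-supplied and schema-derived relationships.
given = [
    ("Person", "BT", "Student"),
    ("Student", "BT", "PhDStudent"),
    ("Professor", "SYN", "Teacher"),
]
closed = close_thesaurus(given)
inferred = sorted(closed - set(given))
print(inferred)
```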

Lexical-derived relationships
- Lexical relationships holding between names, derived by mining the words used in the schemas.
- The WordNet lexical system is used to extract relationships and propose them to the designer.
  - The designer can confirm or reject these relationships and can provide further information.
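
The propose-then-confirm workflow can be sketched with a tiny hand-written lexicon standing in for WordNet. Nothing here calls WordNet itself: `TOY_HYPERNYMS`, `TOY_SYNONYMS`, the schema names and both functions are hypothetical, and a triple (a, NT, b) is read as "a is narrower than b".

```python
from itertools import permutations

# Toy stand-in for a lexical database: word -> direct hypernym, plus synonym groups.
TOY_HYPERNYMS = {"student": "person", "professor": "person"}
TOY_SYNONYMS = [{"professor", "teacher"}]


def propose_relationships(names):
    """Propose NT and SYN relationships between schema names from the lexicon."""
    proposals = []
    for a, b in permutations(names, 2):
        la, lb = a.lower(), b.lower()
        if TOY_HYPERNYMS.get(la) == lb:
            proposals.append((a, "NT", b))
        elif a < b and any({la, lb} <= group for group in TOY_SYNONYMS):
            # `a < b` keeps one direction only, since SYN is symmetric.
            proposals.append((a, "SYN", b))
    return proposals


def confirm(proposals, accept):
    """Keep only the proposals the designer accepts."""
    return [p for p in proposals if accept(p)]


names = ["Student", "Person", "Professor", "Teacher"]
proposals = propose_relationships(names)

# Here the designer is simulated by a rule that accepts only NT proposals.
confirmed = confirm(proposals, lambda p: p[1] == "NT")
print(proposals)
print(confirmed)
```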

Lexical-derived relationships: an example (NT, hyponymy)

Lexical-derived relationships: an example

SI-Designer

Example of XML Source
<!ATTLIST Student
    StudentId ID #REQUIRED
    tutor CDATA #REQUIRED>
<!ATTLIST Professor
    Prof_code ID #REQUIRED
    Office_phone CDATA #IMPLIED>
<!ATTLIST Division
    description CDATA #REQUIRED
    sector CDATA #REQUIRED>
…..