An Extension to XML Schema for Structured Data Processing Presented by: Jacky Ma Date: 10 April 2002.



Presentation Outline
- The Problems
- Research Objectives
- The Schema Extension: MMX
- MMX Query System
- Discussion
- Conclusion

The Problems
- Mapping XML data into relational tables
  - Not natural to XML structure
  - Efficient, but may not be an effective method
- Legacy application-specific structured data
  - Similar modeling but proprietary implementation
  - Not interoperable, and difficult to maintain
  - Lack of modular design, and thus difficult to combine into more complex data structures
- Meta-data can serve a wide range of needs, yet XML Schema is currently used only for physical data validation

Research Objectives
- To facilitate more effective searching and storing of XML content by making use of meta-data (XML Schema)
- To propose a data-oriented model that allows different storage mechanisms, processing models, and query models on XML content

Our Approach – MMX
- Use meta-data to map XML data into structured data objects
- Define the structured data models "conceptually" and link the models to the XML document structure "syntactically"
- Meta-data is defined as an extension of XML Schema
- The extension is called MMX (Multi Model XML)

Program Driven vs. Data Driven
[Diagram labels: Raw Data; Data with Program Codes; Data with Modeling Information; Structured Data (XML)]
- Program Driven: information for processing is hard-coded in the program
- Data Driven: processing instruction is hard-coded in the data?! MMX!

A Glance at the XML Data

A Glance at the Linked Schema

Schema Extension
- The extended schema is associated with a namespace
- The extended schema goes within a schema element, like in the example
- specify a single structure object instance
- Name association for elements and attributes
- Class hierarchies: -> -> finally to the structure specified in
- Additional properties in, and
- Schema writer has to know the structure model specification, while the XML writer only needs to know the given schema

Modeling
- For an instance of an "MMX data object":
  - As an encapsulated information object only accessible from the root, thus as a "single tree node"
  - As a mapping from root node, query method, and query parameters to the values at leaf nodes
- Leaf nodes may contain any valid XML content, as long as it is defined in the Schema
  - I.e., they may contain another "MMX data object"
- A query is modeled as a 3-tuple: [accessing-node, query-method, query-parameters]
  - Accessing-node is specified by XPath
  - Query-method is specified as a string value
  - Query-parameters are multi-dimensional, depending on the current model
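The query tuple above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the MMX implementation: the `Query` type, the `OBJECTS` table, the XPath `/map/points`, and the toy handler are all hypothetical names introduced here.

```python
from typing import Any, Callable, Dict, NamedTuple, Tuple


class Query(NamedTuple):
    """The slide's 3-tuple: [accessing-node, query-method, query-parameters]."""
    accessing_node: str       # XPath of the MMX object's root
    method: str               # query method name, a plain string
    params: Tuple[Any, ...]   # arity depends on the structure model


# Hypothetical store: XPath of an MMX root -> {method name -> handler}.
Handler = Callable[..., Any]
OBJECTS: Dict[str, Dict[str, Handler]] = {
    "/map/points": {
        # Toy handler: return the leaf value stored at coordinates (x, y).
        "spatial-search": lambda x, y: {(3, 5): "<point name='B'/>"}.get((x, y)),
    }
}


def run(query: Query) -> Any:
    """Resolve the accessing node, then dispatch on the method string."""
    handlers = OBJECTS[query.accessing_node]
    return handlers[query.method](*query.params)


result = run(Query("/map/points", "spatial-search", (3, 5)))
```

The object is reached only through its root, and the caller never sees how the handler stores its data, which is the "single tree node" view described above.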

Modeling (2)
[Diagram: Tree(1) and Tree(2) among XML elements, with points A and B]
Tree(1) is accessible from point A. A query (e.g. [A, "spatial-search", (3, 5)], assuming Tree(1) accepts spatial-search with two coordinates) may return point B as the answer, either as the XPath of B or as the XML subtree of B. From point B, the user may drill down the tree by issuing another query on Tree(2).

Query with and without MMX
- From the original XML data, we cannot assume the semantics of the data:
  - We can ONLY do XML-based queries such as XPath
  - We can do spatial queries ONLY IF we can map the data into an R-Tree
- After mapping the data into an R-Tree:
  - Spatial queries
    - Give me the point at (2,7)
    - Give me the point nearest to (4,4)
  - Nearest neighbor search
    - Give me the point nearest to "Franklin"
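The contrast between the two kinds of query can be sketched as follows. The XML document here is hypothetical (the slide's own data is not reproduced in the transcript), and a plain dictionary with a linear scan stands in for the R-Tree, so this shows the query semantics only, not R-Tree performance.

```python
import math
import xml.etree.ElementTree as ET

# Hypothetical data, assumed for illustration only.
DOC = """
<points>
  <point name="Franklin" x="2" y="7"/>
  <point name="Central" x="4" y="5"/>
  <point name="Harbour" x="9" y="1"/>
</points>
"""
root = ET.fromstring(DOC)

# Without a model: only structural, XPath-style lookups are possible.
franklin = root.find("./point[@name='Franklin']")

# With a spatial model: map each element to coordinates, then query by position.
index = {
    (float(p.get("x")), float(p.get("y"))): p.get("name")
    for p in root.iter("point")
}


def point_at(x, y):
    """Exact-match spatial query: 'give me the point at (x, y)'."""
    return index.get((float(x), float(y)))


def nearest(x, y):
    """Nearest point to (x, y); a linear scan replaces the R-Tree search."""
    return min(index.items(), key=lambda kv: math.dist(kv[0], (x, y)))[1]


def nearest_to_name(name):
    """Nearest neighbor of a named point, excluding the point itself."""
    origin = next(xy for xy, n in index.items() if n == name)
    others = [(xy, n) for xy, n in index.items() if n != name]
    return min(others, key=lambda kv: math.dist(kv[0], origin))[1]
```

The XPath lookup treats `x` and `y` as opaque attribute strings; only after the mapping step do they acquire spatial meaning.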

Processing
- Users might not know the "type" of the node (and do not need to). They are interested in what they can do with it
- Users retrieve the list of possible operations by issuing a LIST-OPERATION method to the root element of an MMX object
- Possible operations may include queries, updates, and other model-specific operations
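A minimal sketch of operation discovery, assuming a hypothetical `MMXObject` class whose operations are kept in a table keyed by method name. Only LIST-OPERATION comes from the slides; the `point-at` and `insert` operation names are invented for illustration.

```python
class MMXObject:
    """Toy MMX object: every operation is reached through invoke()."""

    def __init__(self, points):
        self._points = dict(points)  # name -> (x, y)
        self._operations = {
            "LIST-OPERATION": self._list_operations,
            "point-at": self._point_at,  # a query
            "insert": self._insert,      # an update
        }

    def invoke(self, method, *params):
        """Dispatch on the method string; reject anything not advertised."""
        if method not in self._operations:
            raise ValueError(f"unsupported operation: {method}")
        return self._operations[method](*params)

    def _list_operations(self):
        return sorted(self._operations)

    def _point_at(self, x, y):
        return [n for n, xy in self._points.items() if xy == (x, y)]

    def _insert(self, name, x, y):
        self._points[name] = (x, y)
        return name


obj = MMXObject({"Franklin": (2, 7)})
ops = obj.invoke("LIST-OPERATION")
```

A client that knows nothing about the underlying structure can call LIST-OPERATION first and then choose among the returned names.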

MMX Query System
- To show that the schema, modeling, and processing of the MMX extension are workable
- To illustrate how it assists in querying XML data
- To serve as a platform for testing implementations of arbitrary structured models
- Implemented with JDK 1.4

System Design
[Diagram labels: XML Schema; R-Tree Schema; structure models R-Tree, VP-Tree, X-Tree, …; DOM; MMX Document; Parse Schema; Fetch Classes; MMX Element; Node Data; Abstract MMX Element; Clients; edge labels: Extends class, Maps, (Partly) Defines]
The abstract class defines the common interface that has to be implemented in each MMX Element, such as LIST-OPERATION, QUERY, BUILD, etc.
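The abstract-class design can be sketched as below. The original system was written in Java (JDK 1.4); this Python version is an assumption-laden illustration, and `AbstractMMXElement`, `PointSetElement`, `MODEL_CLASSES`, and `load` are hypothetical names. The concrete model is a plain dictionary standing in for a structure such as an R-Tree.

```python
from abc import ABC, abstractmethod
import xml.etree.ElementTree as ET


class AbstractMMXElement(ABC):
    """Common interface every structure model must implement."""

    @abstractmethod
    def build(self, node):
        """BUILD: populate the structure from an XML subtree."""

    @abstractmethod
    def query(self, method, params):
        """QUERY: answer a model-specific query."""

    def list_operations(self):
        """LIST-OPERATION: names of the operations this element accepts."""
        return ["LIST-OPERATION", "QUERY", "BUILD"]


class PointSetElement(AbstractMMXElement):
    """Stand-in model: stores points in a dict instead of a real R-Tree."""

    def build(self, node):
        self.points = {
            (int(p.get("x")), int(p.get("y"))): p.get("name")
            for p in node.iter("point")
        }

    def query(self, method, params):
        if method == "point-at":
            return self.points.get(tuple(params))
        raise ValueError(f"unsupported query method: {method}")


# "Fetch Classes" step: model name -> implementing class. In the slides the
# name would come from the parsed schema; here it is passed in directly.
MODEL_CLASSES = {"point-set": PointSetElement}


def load(node, model_name):
    element = MODEL_CLASSES[model_name]()
    element.build(node)
    return element


xml_node = ET.fromstring('<points><point name="Franklin" x="2" y="7"/></points>')
element = load(xml_node, "point-set")
```

Adding another model then means writing one more subclass and registering it, without touching client code.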

Discussion - Pros
- Compatible with the relational approach, and supersedes it
- Modular design promotes reusability and maintainability
- XML "flattens" the legacy structured data, making it text-editable and easy to transport and process across different systems

Discussion - Cons
- There is no generic syntax to precisely describe all kinds of structure models
- An XML file is often larger than the legacy data file
- Each structure model needs additional implementation effort
- The schema specification grows quickly as the number of supported models increases

Conclusion
- Propose a representation to encapsulate data structures
- Describe XML data with the Schema conceptually as well as syntactically
- Map legacy structure models into the Schema, and map XML data to the structure models through the Schema
- Structured data repository with increased interoperability, reusability, and transportability

Q&A