Deriving Relation Keys from XML Keys by Qing Wang, Hongwei Wu, Jianchang Xiao, Aoying Zhou, Junmei Zhou Reviewed by Chris Ying Zhu, Cong Wang, Max Wang,

Slides:



Advertisements
Similar presentations
XML-XSL Introduction SHIJU RAJAN SHIJU RAJAN Outline Brief Overview Brief Overview What is XML? What is XML? Well Formed XML Well Formed XML Tag Name.
Advertisements

Chungnam National University DataBase System Lab
XML: Extensible Markup Language
Bottom-up Evaluation of XPath Queries Stephanie H. Li Zhiping Zou.
Semantics Static semantics Dynamic semantics attribute grammars
Chapter 9 Code optimization Section 0 overview 1.Position of code optimizer 2.Purpose of code optimizer to get better efficiency –Run faster –Take less.
Internet Technologies1 1 Lecture 4: Programming with XSLT.
XML, XML Schema, Xpath and XQuery Slides collated from various sources, many from Dan Suciu at Univ. of Washington.
Tree Data Structures &Binary Search Tree 1. Trees Data Structures Tree  Nodes  Each node can have 0 or more children  A node can have at most one parent.
May 2006CLINT-LN Parsing1 Computational Linguistics Introduction Approaches to Parsing.
From Semistructured Data to XML: Migrating The Lore Data Model and Query Language Roy Goldman, Jason McHugh, Jennifer Widom Stanford University
Storing and Querying XML Data in Databases Anupama Soli
XSL Concepts Lecture 7. XML Display Options What can XSL Transformations do? generation of constant text suppression of content moving text (e.g., exchanging.
1 SCHEMALESS APPROACH OF MAPPING XML DOCUMENTS INTO RELATIONAL DATABASE Ibrahim Dweib, Ayman Awadi, Seif Elduola Fath Elrhman, Joan Lu CIT 2008 Sydney,
XSL Transformations Lecture 8, 07/08/02. Templates The whole element is a template The match pattern determines where this template applies Result element(s)
1 COS 425: Database and Information Management Systems XML and information exchange.
Keys For XML Peter Buneman Susan Davidson Wenfei Fan Carmem Hara Wang Chiew Tan.
Containment and Equivalence for an XPath Fragment By Gerom e Mikla Dan Suciu Presented By Roy Ionas.
1 Indexing and Querying XML Data for Regular Path Expressions A Paper by Quanzhong Li and Bongki Moon Presented by Amnon Shochot.
Semi-structured Data. Facts about the Web Growing fast Popular Semi-structured data –Data is presented for ‘human’-processing –Data is often ‘self-describing’
XML(EXtensible Markup Language). XML XML stands for EXtensible Markup Language. XML is a markup language much like HTML. XML was designed to describe.
Sangam: A Transformation Modeling Framework Kajal T. Claypool (U Mass Lowell) and Elke A. Rundensteiner (WPI)
Algorithmics and Applications of Tree and Graph Searching Dennis Shasha, Jason T. L. Wang, and Rosalba Giugno Presenters: Jerod Watson & Christan Grant.
1 Advanced Topics XML and Databases. 2 XML u Overview u Structure of XML Data –XML Document Type Definition DTD –Namespaces –XML Schema u Query and Transformation.
Introduction to XPath Bun Yue Professor, CS/CIS UHCL.
4/20/2017.
1 XDO2: A Deductive Object-Oriented Query Language for XML Wei Zhang 1, Tok Wang Ling 1, Zhuo Chen 1, and Gillian Dobbie 2 School of Computing National.
IS432: Semi-Structured Data Dr. Azeddine Chikh. 1. Semi Structured Data Object Exchange Model.
XML-to-Relational Schema Mapping Algorithm ODTDMap Speaker: Artem Chebotko* Wayne State University Joint work with Mustafa Atay,
XML Anisha K J Jerrin Thomas. Outline  Introduction  Structure of an XML Page  Well-formed & Valid XML Documents  DTD – Elements, Attributes, Entities.
Lecture 6 of Advanced Databases XML Schema, Querying & Transformation Instructor: Mr.Ahmed Al Astal.
Graph Data Management Lab, School of Computer Science gdm.fudan.edu.cn XMLSnippet: A Coding Assistant for XML Configuration Snippet.
A Z Approach in Validating ORA-SS Data Models Scott Uk-Jin Lee Jing Sun Gillian Dobbie Yuan Fang Li.
1 CIS336 Website design, implementation and management (also Semester 2 of CIS219, CIS221 and IT226) Lecture 6 XSLT (Based on Møller and Schwartzbach,
XML as a Boxwood Data Structure Feng Zhou, John MacCormick, Lidong Zhou, Nick Murphy, Chandu Thekkath 8/20/04.
DASWIS NF-SS: A Normal Form for Semistructured Schemata Xiaoying Wu, Tok Wang Ling, Sin Yeung Lee, Mong Li Lee National University of Singapore.
Session IV Chapter 9 – XML Schemas
Chapter 27 The World Wide Web and XML. Copyright © 2004 Pearson Addison-Wesley. All rights reserved.27-2 Topics in this Chapter The Web and the Internet.
Processing of structured documents Spring 2003, Part 7 Helena Ahonen-Myka.
Validated Model Transformation Tihamér Levendovszky Budapest University of Technology and Economics Department of Automation and Applied Informatics Applied.
1 XSLT An Introduction. 2 XSLT XSLT (extensible Stylesheet Language:Transformations) is a language primarily designed for transforming the structure of.
XML Data Management 10. Deterministic DTDs and Schemas Werner Nutt.
Managing XML and Semistructured Data Lecture 13: XDuce and Regular Tree Languages Prof. Dan Suciu Spring 2001.
RRXS Redundancy reducing XML storage in relations O. MERT ERKUŞ A. ONUR DOĞUÇ
Lecture 6: XML Query Languages Thursday, January 18, 2001.
Database Systems Part VII: XML Querying Software School of Hunan University
Chapter 27 The World Wide Web and XML. Copyright © 2004 Pearson Addison-Wesley. All rights reserved.27-2 Topics in this Chapter The Web and the Internet.
Management of XML and Semistructured Data Lecture 11: Schemas Wednesday, May 2nd, 2001.
Early Profile Pruning on XML-aware Publish- Subscribe Systems Mirella M. Moro, Petko Bakalov, Vassilis J. Tsotras University of California VLDB 2007 Presented.
Management of XML and Semistructured Data Lecture 10: Schemas Monday, April 30, 2001.
The Semistructured-Data Model Programming Languages for XML Spring 2011 Instructor: Hassan Khosravi.
XML and Database.
Sept. 27, 2002 ISDB’02 Transforming XPath Queries for Bottom-Up Query Processing Yoshiharu Ishikawa Takaaki Nagai Hiroyuki Kitagawa University of Tsukuba.
Unit 3 — Advanced Internet Technologies Lesson 11 — Introduction to XSL.
More XML XPATH, XSLT CS 431 – February 23, 2005 Carl Lagoze – Cornell University.
Computing & Information Sciences Kansas State University Friday, 20 Oct 2006CIS 560: Database System Concepts Lecture 24 of 42 Friday, 20 October 2006.
Working with XML. Markup Languages Text-based languages based on SGML Text-based languages based on SGML SGML = Standard Generalized Markup Language SGML.
Efficient Discovery of XML Data Redundancies Cong Yu and H. V. Jagadish University of Michigan, Ann Arbor - VLDB 2006, Seoul, Korea September 12 th, 2006.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Page 1 Renzo Angles and Claudio Gutierrez University of Chile ACM Computing Surveys, 2008 Survey of Graph Database Models.
1 The XPath Language. 2 XPath Expressions Flexible notation for navigating around trees A basic technology that is widely used uniqueness and scope in.
Processing XML Streams with Deterministic Automata Denis Mindolin Gaurav Chandalia.
XML Extensible Markup Language
Modified Slides from Dr.Peter Buneman 1 XML Constraints Constraints are a fundamental part of the semantics of the data; XML may not come with a DTD/type.
Trees. Trees: – A trunk from the roots – Divides into branches – Ends in leaves.
Recursive Objects (Part 4)
Document Object Model (DOM): Objects and Collections
Alin Deutsch, University of Pennsylvania Mary Mernandez, AT&T Labs
XML Constraints Constraints are a fundamental part of the semantics of the data; XML may not come with a DTD/type – thus constraints are often the only.
Data Structures Using C++ 2E
Presentation transcript:

Deriving Relation Keys from XML Keys by Qing Wang, Hongwei Wu, Jianchang Xiao, Aoying Zhou, Junmei Zhou Reviewed by Chris Ying Zhu, Cong Wang, Max Wang, Blazej Kot

Introduction Motivation –XML data has been widely used on the internet for storing semi-structured data. Aim of work –Mapping XML constraint keys to relation keys. Results –A heuristic algorithm is introduced to achieve the aim.

Background What a XML database or document look like Networks Larry Peterson Foundation TCPIP ATM Applications Networks Shary Zhou Introduction Applications

Background XML Tree –DOM specification models XML as a tree

Background XML Tree defined T=(V, lab, ele, att, val, id, root), where –V is a set of nodes; –lab maps each node to its type of label. One node can be Element Attribute Text –ele defines element and text children of element node –att maps element node to its attribute children –val maps attribute node or text node to text –id a unique identifier to each node –root is the root of the tree

Background Simple Expression PLs p::= e| l.p – e an empty path –. a concatenation operator –Cannot contain wild card –E.g.. book.author –[[book.author]] set of node reach by book.author Path Expression –PL P::= e| l | P.P | _* –l stands for any label or node name; –_ wild card match one single node name or label –_* matches any arbitrary path –E.g. book._* – [[book._*]] set of node reach by book._*

Background XML Keys – (Q’, (Q, {P1,..., Pn})) –Q, Q’ are PL –Pi are PLs such that for all i ∈ {1,..., n} –Q’.Q.Pi is a valid path expression –Q’ context path, Q target path, –P1,..., Pn key paths. –E.g. (book._*,(chapter, {number}))

Background Transformation language from XML to relational schema –transformation from XML data to a relational database of schema R is composed of a collection of rules from XML data to relations in the relational database –Rule Path (1) X  Y.P, –Y a rule path, P a path expression –Y.P is a valid path expression. (2) xr, the root of XML tree. –A rule on a relation R is Rule(R) = {rule(R,l)| l ∈ att(R)}. –Given a relational schema R, a transformation from XML data to a relational database of schema R is = (Rule(R1),..., Rule(Rn), where R1,...,Rn are the relations in the relational database.

Background Relation Tree RT = (V’, xr, xb, label, parent, children, leaf, attribute) –V’ set of nodes –Xr root of the relation tree –Xb unique child of Xr –label to a path expression, and label(Xr) = r. If xb, label maps it to P of PL. Otherwise, label maps to simple path expression PLs –parent maps its parent node. For Xr, parent undefined –children maps each interior node to a set of its children. children(Xr) = {Xb}. For leaves, children is undefined –leaf returns true if v is a leaf. Otherwise, it returns false –attribute maps each leaf v to a string l which denotes one of attributes of the relation R. For interior nodes, it is undefined Any node in relational tree has corresponding node in the tree model

The Derivation Algorithm Simple Expression: Algorithm derivation( , Rule(R))   (Relation Keys )  The XML Key Set Rule(R) A rule in transformation schema Construct The Relation Tree Process Each Node from RT in post-order Collect the key paths in a bottom up approach Compute the relation keys while it reach the root node. Return the relation keys

The Derivation Algorithm Example: Rule (book-chapter) (Diagram From “Driving Relation Keys From XML Keys”)

The Derivation Algorithm The Result Relation Key Set: cid  isbn, cid  title, cid  author, cid  cnumber, cid  cname, (cnumber, isbn)  title, (cnumber, isbn)  author, (cnumber, isbn)  cid, (cnumber, isbn)  cname. These keys define the constraints of relation (book-chapter) in the relational datbase.

Summary xml data RDB xml keys RDB keys What this paper was about Transformation

Conclusion Fixed relational schema -> the presented algorithm provides a way of testing if the incoming XML data is valid Variable relational schema -> it may be updated by taking into account the relational keys generated by the algorithm from the incoming data and XML keys. Future work: given some XML keys for data, automatically create a good transformation between the XML data and relational database.