04-19-2001 Rainbow: Bridging XML and Relational Databases: Design, Implementation, and Evaluation. MQP Advisor: Prof. Elke A. Rundensteiner. Sponsor: Verizon.


Rainbow: Bridging XML and Relational Databases: Design, Implementation, and Evaluation
MQP Advisor: Prof. Elke A. Rundensteiner
Sponsor: Verizon Laboratories Incorporated
MQP Project Members: Tien Vu, Mirek Cymer, John Lee

HTML vs. XML
- Microsoft, IBM, Informix, Oracle, Sun, ...

XML Data Management by RDBMS
Advantages:
- Efficient query and analysis tools.
- Mature database tools available.
- Easy integration with existing business databases.
Issues:
- Mapping between the XML and relational models.
- Update propagation.
- Query translation and optimization.

Motivation for Mapping
- Query performance varies with how the data is mapped.
- Flexible mapping: fixed translation and restructure.
Figure: Alternate Mapping (car with make, model, year; values Ford, Mustang, 2001).
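To make the point about mapping concrete, here is a minimal sketch in Python with SQLite. It stores the same car data under two relational mappings and asks the same question of each. The table layouts are assumptions made for illustration (one table per element with iid/pid keys, versus a single inlined table named car_inlined); they are not taken from the Rainbow system itself.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Mapping 1 (assumed): one table per XML element, linked by pid -> iid.
CREATE TABLE car   (iid INTEGER PRIMARY KEY);
CREATE TABLE make  (iid INTEGER PRIMARY KEY, pid INTEGER, value TEXT);
CREATE TABLE model (iid INTEGER PRIMARY KEY, pid INTEGER, value TEXT);
CREATE TABLE year  (iid INTEGER PRIMARY KEY, pid INTEGER, value INTEGER);
INSERT INTO car   VALUES (1);
INSERT INTO make  VALUES (10, 1, 'Ford');
INSERT INTO model VALUES (11, 1, 'Mustang');
INSERT INTO year  VALUES (12, 1, 2001);

-- Mapping 2 (assumed): make, model and year inlined as columns of car.
CREATE TABLE car_inlined (iid INTEGER PRIMARY KEY, make TEXT, model TEXT, year INTEGER);
INSERT INTO car_inlined VALUES (1, 'Ford', 'Mustang', 2001);
""")

# Same question under both mappings: which models does Ford make?
nested_rows = conn.execute("""
    SELECT model.value
    FROM car, make, model
    WHERE make.pid = car.iid AND model.pid = car.iid AND make.value = 'Ford'
""").fetchall()

inlined_rows = conn.execute(
    "SELECT model FROM car_inlined WHERE make = 'Ford'"
).fetchall()

print(nested_rows, inlined_rows)
```

Both queries return the same answer, but the first needs a three-way join while the second is a single-table scan, which is the kind of difference the restructuring step is meant to exploit.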

Rainbow Architecture
Figure components: DTD, XML, XML Query, XML User, XML Query Engine, DTDM Manager, XML Manager, Restructuring Subsystem, RDBMS.
Legend: XML, Data, Subsystem.

Goals of our MQP
What:
- Implement and evaluate restructuring subsystems within the large-scale Rainbow system.
How:
- Learn about the database technologies and web tools.
- Translate research ideas to software system design.
- Practice software engineering techniques:
  - UML, engineer and reuse code.
- Design an experimental test plan and test bed.
- Conduct performance study and analysis.

Restructuring Subsystem
Figure components: DTD, XML, XML Query, XML User, XML Query Engine, DTDM Manager, XML Manager, Restructuring, Mapping, Restructure Operator Library, Restructurer, Query, Storage.
Legend: XML Model, Subsystem, Relational Model, Internal Process.

Restructuring Operators
11 Restructuring Operators:
- Rename Item/Attribute
- Switch Nesting
- Pushup/Pushdown Attribute
- Pushup/Pushdown Nesting
- Split/Merge Nesting
- Reference/Dereference

Mapping: Sequence of Restructure Operators
Mapping is modeled as a sequence of reversible restructuring operators, each given as Operator Name + Parameters.
For example:
    pushUpAttribute( 'account_number', 'value', 'invoice', 'account_number' );
    pushUpAttribute( 'bill_period', 'value', 'invoice', 'bill_period' );
    renameItem( 'invoice', 'summary' );
Figure labels: invoice, value, account_num, bill_period; summary, account_num, bill_period.
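The idea of a mapping as a reversible operator sequence can be sketched as follows. This is an illustration only: the schema is a plain dictionary from item name to attribute list, the parameter order (child, attribute, parent, new name) is an assumed reading of the example above, and the pushDownAttribute signature and the helper names are hypothetical.

```python
import copy

def push_up_attribute(schema, child, attr, parent, new_name):
    """Move attr from the child item to its parent, stored as new_name."""
    s = copy.deepcopy(schema)
    s[child].remove(attr)
    s[parent].append(new_name)
    return s

def push_down_attribute(schema, parent, attr, child, new_name):
    """Hypothetical inverse: move attr from the parent back to the child."""
    s = copy.deepcopy(schema)
    s[parent].remove(attr)
    s[child].append(new_name)
    return s

def rename_item(schema, old, new):
    """Rename one item, keeping its attributes."""
    return {(new if k == old else k): list(v) for k, v in schema.items()}

OPERATORS = {
    "pushUpAttribute": push_up_attribute,
    "pushDownAttribute": push_down_attribute,
    "renameItem": rename_item,
}

def inverse(op):
    """Return the operator that undoes op."""
    name, args = op
    if name == "pushUpAttribute":
        child, attr, parent, new_name = args
        return ("pushDownAttribute", (parent, new_name, child, attr))
    if name == "pushDownAttribute":
        parent, attr, child, new_name = args
        return ("pushUpAttribute", (child, new_name, parent, attr))
    if name == "renameItem":
        old, new = args
        return ("renameItem", (new, old))
    raise ValueError(f"unknown operator: {name}")

def apply_mapping(schema, mapping):
    """Apply each (name, args) operator in order."""
    for name, args in mapping:
        schema = OPERATORS[name](schema, *args)
    return schema

def invert_mapping(mapping):
    """Invert a mapping: inverse operators, in reverse order."""
    return [inverse(op) for op in reversed(mapping)]

original = {"invoice": [], "account_number": ["value"], "bill_period": ["value"]}
mapping = [
    ("pushUpAttribute", ("account_number", "value", "invoice", "account_number")),
    ("pushUpAttribute", ("bill_period", "value", "invoice", "bill_period")),
    ("renameItem", ("invoice", "summary")),
]
restructured = apply_mapping(original, mapping)
round_trip = apply_mapping(restructured, invert_mapping(mapping))
print(restructured)
```

Because every operator has an inverse, undoing a mapping only requires replaying the inverted operators in reverse order, which is what `round_trip` checks.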

SQLs for Push-Up Attributes
CREATE VIEW new.A (, a) AS
    SELECT A., B.b
    FROM old.A, old.B
    WHERE B.pid = A.iid
CREATE VIEW new.B () AS
    SELECT B.
    FROM old.B
Figure labels: A, B; Push-up; b, a.
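The view pattern above can be generated from an operator's parameters. The sketch below is one possible way to do that, not the Rainbow code: the function name, its argument list, and the assumption that every table carries explicit iid and pid columns are all introduced here for illustration.

```python
def push_up_attribute_sql(parent, child, attr, parent_cols, child_cols, new_name=None):
    """Build the two CREATE VIEW statements for pushing attr from child up to parent.

    parent_cols and child_cols are the column names of the old tables
    (assumed to include iid and pid); new_name is the column name the
    attribute takes in the new parent view.
    """
    new_name = new_name or attr
    # The child view keeps every column except the one that was pushed up.
    kept_child_cols = [c for c in child_cols if c != attr]
    parent_select = ", ".join(f"{parent}.{c}" for c in parent_cols)
    child_select = ", ".join(f"{child}.{c}" for c in kept_child_cols)
    parent_view = (
        f"CREATE VIEW new.{parent} ({', '.join(list(parent_cols) + [new_name])}) AS "
        f"SELECT {parent_select}, {child}.{attr} "
        f"FROM old.{parent}, old.{child} "
        f"WHERE {child}.pid = {parent}.iid"
    )
    child_view = (
        f"CREATE VIEW new.{child} ({', '.join(kept_child_cols)}) AS "
        f"SELECT {child_select} FROM old.{child}"
    )
    return parent_view, child_view

parent_view, child_view = push_up_attribute_sql(
    "invoice", "account_number", "value",
    parent_cols=["iid", "pid"],
    child_cols=["iid", "pid", "value"],
    new_name="account_number",
)
print(parent_view)
print(child_view)
```

The function only builds strings, so the old. and new. prefixes are passed through unchanged and nothing is executed against a database.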

Example SQLs
Inline: make.value into car as attribute make.
Mapping:
- pushUpAttribute( 'account_number', 'value', 'invoice', 'account_number' );
SQL statements:
CREATE VIEW new.invoice (iid, pid, account_number) AS
    SELECT invoice.iid, invoice.pid, account_number.value
    FROM old.invoice, old.account_number
    WHERE account_number.pid = invoice.iid
CREATE VIEW new.account_number (iid, pid) AS
    SELECT account_number.iid, account_number.pid
    FROM old.account_number
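The two views can be tried out with Python's built-in sqlite3 module. This is a sketch under stated assumptions: the deck targets Oracle 8i, the sample rows are invented, and the old. and new. schema prefixes are written as old_ and new_ table-name prefixes because a SQLite view cannot reference tables in another attached database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Old mapping: account_number is a separate table pointing at its invoice.
CREATE TABLE old_invoice (iid INTEGER PRIMARY KEY, pid INTEGER);
CREATE TABLE old_account_number (iid INTEGER PRIMARY KEY, pid INTEGER, value TEXT);
INSERT INTO old_invoice VALUES (1, 0), (2, 0);
INSERT INTO old_account_number VALUES (10, 1, 'ACC-001'), (11, 2, 'ACC-002');

-- New mapping: the value is pushed up into the invoice view.
CREATE VIEW new_invoice (iid, pid, account_number) AS
    SELECT old_invoice.iid, old_invoice.pid, old_account_number.value
    FROM old_invoice, old_account_number
    WHERE old_account_number.pid = old_invoice.iid;

-- The child view keeps its keys but no longer exposes the value.
CREATE VIEW new_account_number (iid, pid) AS
    SELECT old_account_number.iid, old_account_number.pid
    FROM old_account_number;
""")

invoice_rows = conn.execute(
    "SELECT iid, pid, account_number FROM new_invoice ORDER BY iid"
).fetchall()
account_rows = conn.execute(
    "SELECT iid, pid FROM new_account_number ORDER BY iid"
).fetchall()
print(invoice_rows)
print(account_rows)
```

Queries written against new_invoice read the account number as an ordinary column, while the join stays hidden inside the view definition.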

Rainbow Implementation
Development Tools:
- Java: Visual Café2, Javadocs, JAVA2
- Oracle 8i, XML 4J, JDBC1.2, SQL Queries
Code Facts:
- 44 total system classes
  - 17 classes of Rainbow
  - 27 classes reused
- ? lines of system code
  - ? lines of Rainbow code
  - ? lines of code reused

Screen Shot

Screen Shot

Rainbow Test & Experimental Evaluation
Experimental Setup:
- Oracle 8i
- Windows NT
Data:
- Created a DTD
- Randomly generated XML
- Hand-translated queries
Factors:
- Type of query
- Number of operations

Query Performance Evaluation

Rainbow Conclusions
Technical accomplishments:
- Functional prototype system
- Feasibility of Rainbow concepts
- Automated test bed designed
- Performance evaluations show that:
  - (Ideal) Moving up data on the embedded-relational level yields better query performance for join queries.
Knowledge gained:
- OO, Java, JDBC, SQL, RDBMS, XML, DTD
- Teamwork, software engineering, and software reuse
- Logistics of setting up an experiment
Future work:
- Experiment test plans and test beds to realize the full potential of the restructuring component.

Rainbow: XML and Relational Database Design, Implementation, and Evaluation
Project Members: Tien Vu, Mirek Cymer, John Lee
Advisor: Elke A. Rundensteiner
Ph.D. Student: Xin Zhang
Sponsored by: Verizon Laboratories Incorporated
Visit Rainbow at

Recycled!!!

XML: The Future of the Web
Benefits:
- Efficient query and analysis tools.
- Mature data warehousing support.
- Easy integration with existing business databases.
Applications:
- E-commerce
- Web-based industries
Figure labels: Jun 9 - Jul 8, 2000; Sprint; $0.25.

XML and Relational Database
Problem:
- Many applications change their data very frequently.
- e.g., flight reservation, online billing, inventory.
Current Solution:
- Reloading the complete XML document whenever it changes, which is very expensive.
Rainbow Solution:
- Incrementally propagate XML document updates to the stored XML data.
- Goal: XML repository implemented using an RDBMS
- Approach: Flexible mapping
- Features:
  - DTD metadata management in RDB
  - Automatic schema creation
  - Incremental update propagation
  - XML query optimization

Rainbow Analysis

Rainbow Analysis Cont..

HTML vs. XML
HTML: <h1>Car</h1><h2>Make</h2> Ford Mustang <h2>Seats</h2><p>5 Top Speed 70 m.p.h
XML: Car Ford Mustang 5 70