1 Mariposa system Witold Litwin. 2 Basic goals WAN oriented DDBS Multiple sites –e.g., 1000 Scalable Locally autonomous Easy to evolve.

Slides:



Advertisements
Similar presentations
Database Architectures and the Web
Advertisements

Distributed DBMS© M. T. Özsu & P. Valduriez Ch.6/1 Outline Introduction Background Distributed Database Design Database Integration Semantic Data Control.
Distributed Systems Topics What is a Distributed System?
1 Interoperability of a Scalable Distributed Data Manager with an Object-relational DBMS Thesis presentation Yakham NDIAYE November, 13 the 2001 November,
Distributed Systems 1 Topics  What is a Distributed System?  Why Distributed Systems?  Examples of Distributed Systems  Distributed System Requirements.
Distributed databases
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
Peer-to-peer archival data trading Brian Cooper Joint work with Hector Garcia-Molina (and others) Stanford University.
1 Minggu 12, Pertemuan 23 Introduction to Distributed DBMS (Chapter , 22.6, 3rd ed.) Matakuliah: T0206-Sistem Basisdata Tahun: 2005 Versi: 1.0/0.0.
CHAIMS: Compiling High-level Access Interfaces for Multisite Software Neal Sample Stanford University.
ABCSG - Distributed Database 1 Data Management Distributed Database Data Replication.
Database Software File Management Systems Database Management Systems.
June 1998 CHAIMS1 Challenge 1: Composition of Processes... versus composition and integration of Data »data-warehouses »wrapping data available on web.
Query Optimization. General Overview Relational model - SQL  Formal & commercial query languages Functional Dependencies Normalization Physical Design.
Module 14: Scalability and High Availability. Overview Key high availability features available in Oracle and SQL Server Key scalability features available.
TECHNIQUES FOR OPTIMIZING THE QUERY PERFORMANCE OF DISTRIBUTED XML DATABASE - NAHID NEGAR.
Chapter 3 Database Architectures and the Web Pearson Education © 2009.
1 Ömer Korçak Cmpe 521 Mariposa: A Wide Area Distributed Database System.
Database Architectures and the Web
1 Distributed and Parallel Databases. 2 Distributed Databases Distributed Systems goal: –to offer local DB autonomy at geographically distributed locations.
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
Database Systems Design, Implementation, and Management Coronel | Morris 11e ©2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or.
Technology Overview. Agenda What’s New and Better in Windows Server 2003? Why Upgrade to Windows Server 2003 ?  From Windows NT 4.0  From Windows 2000.
Database Architectures and the Web Session 5
CST203-2 Database Management Systems Lecture 2. One Tier Architecture Eg: In this scenario, a workgroup database is stored in a shared location on a single.
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
Computer Measurement Group, India Optimal Design Principles for better Performance of Next generation Systems Balachandar Gurusamy,
Master Thesis Defense Jan Fiedler 04/17/98
1 High-Availability LH* Schemes with Mirroring W. Litwin, M.-A. Neimat U. Paris 9 & HPL Palo-Alto
Mariposa: a wide-area distributed database system Kumar Ramdurgkar. CIS 661.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
Session-8 Data Management for Decision Support
Week 5 Lecture Distributed Database Management Systems Samuel ConnSamuel Conn, Asst Professor Suggestions for using the Lecture Slides.
Distributed Database Systems Overview
Announcements. Data Management Chapter 12 Traditional File Approach  Structure Field  Record  File  Fixed All records have common fields, and a field.
Intro – Part 2 Introduction to Database Management: Ch 1 & 2.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Database Architectures Database System Architectures Considerations – Data storage: Where do the data and DBMS reside? – Processing: Where.
Chapter 1 Introduction Yonsei University 1 st Semester, 2015 Sanghyun Park.
MongoDB is a database management system designed for web applications and internet infrastructure. The data model and persistence strategies are built.
Distributed DBMSs- Concept and Design Jing Luo CS 157B Dr. Lee Fall, 2003.
Kjell Orsborn UU - DIS - UDBL DATABASE SYSTEMS - 10p Course No. 2AD235 Spring 2002 A second course on development of database systems Kjell.
Databases Illuminated
Performance evaluation on grid Zsolt Németh MTA SZTAKI Computer and Automation Research Institute.
Distributed database system
1 Distributed Databases Chapter 21, Part B. 2 Introduction v Data is stored at several sites, each managed by a DBMS that can run independently. v Distributed.
Scalable Hybrid Keyword Search on Distributed Database Jungkee Kim Florida State University Community Grids Laboratory, Indiana University Workshop on.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Topic Distributed DBMS Database Management Systems Fall 2012 Presented by: Osama Ben Omran.
Distributed Information Systems (CSCI 5533) Presentation ID: 19 Query Processing In Distributed Multi - DBMS Submitted to: Dr. Liaw, Morris Submitted by:
Distributed Database Management Systems. Reading Textbook: Ch. 1, Ch. 3 Textbook: Ch. 1, Ch. 3 For next class: Ch. 4 For next class: Ch. 4 FarkasCSCE.
Peer-to-Peer Systems: An Overview Hongyu Li. Outline  Introduction  Characteristics of P2P  Algorithms  P2P Applications  Conclusion.
Distributed DBMS, Query Processing and Optimization
1 Information Retrieval and Use De-normalisation and Distributed database systems Geoff Leese September 2008, revised October 2009.
1 Overview of Query Evaluation Chapter Outline  Query Optimization Overview  Algorithm for Relational Operations.
Efficient Opportunistic Sensing using Mobile Collaborative Platform MOSDEN.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Igor EPIMAKHOV Abdelkader HAMEURLAIN Franck MORVAN
MIT – Laboratory for Computer Science
Database Architectures and the Web
Grid Computing.
Distributed DBMS Concepts of Distributed DBMS
Database Architectures and the Web
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Ch 4. The Evolution of Analytic Scalability
Query Processing CSD305 Advanced Databases.
Distributed Database Management Systems
Distributed Databases
Database System Concepts and Architecture
Course Instructor: Supriya Gupta Asstt. Prof
Presentation transcript:

1 Mariposa system Witold Litwin

2 Basic goals WAN oriented DDBS Multiple sites –e.g., 1000 Scalable Locally autonomous Easy to evolve

3 Solution Traditional DBMS techniques –a non-solution Economical paradigm rooted in –Computer Life Litwin & Wiederhold, 1990 Contract nets –Smith & ? –Implemented at Xerox Parc

4 DBMS architecture Object-relational data model Tables are fragmented –hash, range, or user defined fragments –replicas for better throughput –fragments can split or coalesce (group) –fragments can move among sites –SDDSs could be used for the fragmentation

5 Manipulations SQL-3 Queries are translated to operations on fragmentes An acceptable execution plan is produced –perhaps with strides The plan is optimized through an economical tuning –choose cheapest servers of subqueries

6Architecture Server query proc. Broker Bidder Client query proc. Server query proc. Bidder Server query proc. Bidder Name Server Bidder Name Server Bidder

7 Query Processing Overview Rule based language for bidders, brokers and storage managers on the servers –RUSH system (A.Sah & al) brokers start query auctions –sendout price / time curves in $$ provided to sites by the network bank –choose best bidders –adjust query plans decreases the price as long as the total time remains acceptable

8 Bidders Propose prices & timesPropose prices & times –for CPU, IO, Memory for fragments Advertise their capabilities –with name servers Select a, b from T -> projections of T Select a, b from T where c = '123' -> selections and projections OK Advertise multioperational pricing plans –volume reduction, coupons...

9 Pricing policy CPU & IO c = cost per time unit –e (c, q) estimate of the subquery q cost –l system current load l = 1,2... –final price p p = e * l Idle servers become work hungry Successful servers become expensive Work becomes naturally distributed

10 Memory services Similar rules as for queries, but Load l is the storage load factor The price is continuous per time unit Fragments can get sold among sites –moves or copies –The bidder bids only for queries to fragments it has permanently to temporary results it has from previous subquery

11 BiddingBidding Expensive protocol –auction out –collect replies –compute cheapest plan –sendout queries Cheap protocol –use only advertised capabilities –issue purchase orders to selected bidders –pay the bill (or go to court)

12 Name service Pretty classical naming schema thought : –name location discovery is dynamical –is payable –fragement moves are not synchronously posted to name servers as there are many name servers as in SDDSs for clients

13 Semantic Heterogeneity Name heterogeneity among sites resolved –locally –by name servers Data type and representation heterogeneity –resolved using canonical representation Integer CR Integer

14 Conclusion Mariposa is an attempt to build large MDBSs –based on human society management tools –claimed more efficient at large scale than the traditional DBMS tools –local autonomy –no free lunch It is on-going, and 1st version implemented First results are successful One searches to further improve the economical tools

15 End