HyperKVS Group Meeting Oracle Streams Dr. Volker Kuhr.

Slides:



Advertisements
Similar presentations
Yukon – What is New Rajesh Gala. Yukon – What is new.NET Framework Programming Data Types Exception Handling Batches Databases Database Engine Administration.
Advertisements

Database Architectures and the Web
High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ
SQL Server Replication
Using DSVM to Implement a Distributed File System Ramon Lawrence Dept. of Computer Science
Oracle Clustering and Replication Technologies CCR Workshop - Otranto Barbara Martelli Gianluca Peco.
Oracle Advanced Queuing Features Overview
ABCSG - Distributed Database 1 Data Management Distributed Database Data Replication.
DISTRIBUTED DATABASE. Centralized & Distributed Database  Single site database – centralized database –A database is located at a single site or distributed.
Data Replication with Materialized Views ISYS 650.
Chapter 9 : Distributed Database.
Overview Distributed vs. decentralized Why distributed databases
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
Chapter 12 Distributed Database Management Systems
Definition of terms Definition of terms Explain business conditions driving distributed databases Explain business conditions driving distributed databases.
Module 14: Scalability and High Availability. Overview Key high availability features available in Oracle and SQL Server Key scalability features available.
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
IMS 4212: Distributed Databases 1 Dr. Lawrence West, Management Dept., University of Central Florida Distributed Databases Business needs.
Oracle Streams--Simplifying Information Sharing in Oracle10 g Patricia McElroy Product Manager Oracle Corporation Session id:
Data Replication with Advanced Replication & Oracle Streams John Abrahams Technology Sales Consultant Oracle Nederland.
Database Design – Lecture 16
Databases and Statistical Databases Session 4 Mark Viney Australian Bureau of Statistics 5 June 2007.
Oracle Advanced Compression – Reduce Storage, Reduce Costs, Increase Performance Session: S Gregg Christman -- Senior Product Manager Vineet Marwah.
Massively Distributed Database Systems - Distributed DBS Spring 2014 Ki-Joune Li Pusan National University.
Lecture 5: Sun: 1/5/ Distributed Algorithms - Distributed Databases Lecturer/ Kawther Abas CS- 492 : Distributed system &
Database Systems: Design, Implementation, and Management Ninth Edition Chapter 12 Distributed Database Management Systems.
Right In Time Presented By: Maria Baron Written By: Rajesh Gadodia
Heterogeneous Database Replication Gianni Pucciani LCG Database Deployment and Persistency Workshop CERN October 2005 A.Domenici
Module 11: Introducing Replication. Overview Introduction to Distributed Data Introduction to SQL Server Replication SQL Server Replication Agents SQL.
Lecture # 3 & 4 Chapter # 2 Database System Concepts and Architecture Muhammad Emran Database Systems 1.
IS 325 Notes for Wednesday August 28, Data is the Core of the Enterprise.
Oracle's Distributed Database Bora Yasa. Definition A Distributed Database is a set of databases stored on multiple computers at different locations and.
Kjell Orsborn UU - DIS - UDBL DATABASE SYSTEMS - 10p Course No. 2AD235 Spring 2002 A second course on development of database systems Kjell.
Distributed Databases
Page 1. Data Integration Using Oracle Streams A Case Study Session #:
Oracle® Streams for Near Real Time Asynchronous Replication Nimar S. Arora Oracle USA.
INTRODUCTION TO DBS Database: a collection of data describing the activities of one or more related organizations DBMS: software designed to assist in.
Ing. Erick López Ch. M.R.I. Replicación Oracle. What is Replication  Replication is the process of copying and maintaining schema objects in multiple.
MBA 664 Database Management Systems Dave Salisbury ( )
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
7 Strategies for Extracting, Transforming, and Loading.
Introduction to Databases
REST By: Vishwanath Vineet.
Introduction to Distributed Databases Yiwei Wu. Introduction A distributed database is a database in which portions of the database are stored on multiple.
DATABASE REPLICATION DISTRIBUTED DATABASE. O VERVIEW Replication : process of copying and maintaining database object, in multiple database that make.
 Distributed Database Concepts  Parallel Vs Distributed Technology  Advantages  Additional Functions  Distribution Database Design  Data Fragmentation.
Chapter 1 Database Access from Client Applications.
Steams implementation with oracle e-business suite and discoverer
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Status of tests in the LCG 3D database testbed Eva Dafonte Pérez LCG Database Deployment and Persistency Workshop.
1 Information Retrieval and Use De-normalisation and Distributed database systems Geoff Leese September 2008, revised October 2009.
C Copyright © 2006, Oracle. All rights reserved. Integrating with Oracle Streams.
Making Sense of Service Broker Inside the Black Box.
20 Copyright © 2006, Oracle. All rights reserved. Best Practices and Operational Considerations.
Distributed Databases
WLCG Collaboration Workshop CMS online/offline replication
Chapter Name Replication and Mobile Databases Transparencies
Software Design and Architecture
MANAGING DATA RESOURCES
Database management concepts
Making Sense of Service Broker
MANAGING DATA RESOURCES
Cloud Data Replication with SQL Data Sync
Data Model.
Database management concepts
Introduction of Week 13 Return assignment 11-1 and 3-1-5
Database Management Systems
Introduction of Week 14 Return assignment 12-1
Presentation transcript:

HyperKVS Group Meeting Oracle Streams Dr. Volker Kuhr

Agenda Description of the environment of Distributed Databases Streams vs. Adv. Replication Oracle Streams Technology –Capture –Propagation –Apply Architecture Benefits of Streams Technology –High Availibility –Managebility –Performance –Guaranteed Future Conclusion

Distributed Databases AFR MEXSAOESPCZ VPM IBM-HOST (GATEWAYS) SHANGHAI CHANGCHUN KVSPF6 P97 PROD KVS is one of the worldwide biggest distributed DB systems using Oracle Adv. Replication

Streams vs. Adv. Replication – Replication: Adv. Replication since 7.3.x available Enhancements until Oracle Version 8i/9i Unique Position until Oracle Version 8i Replication based on Trigger functionality only for DML Static solution (Schema, Tables, Attributes must be the same) – Streams: Streams usable since for productive environments Key Companies are working with Streams Technology –Datawarehouse –Distributed Systems High focus on Enhancements (10gR1, 10gR2, 11g ) very dynamic Also useful for heterogeneous systems.

Streams Basic Elements –Three basic tasks of a stream: Capture Staging Apply (consumption) –A stream can perform multiple tasks across multiple databases. ApplyStagingCapture

You can place events in Streams: –Implicitly: Log-based capture of DML and DDL changes –Explicitly: Direct enqueue of user messages Capture

Propagation (Staging) Streams uses a staging area, which: – Is implemented as a queue in a queue table – Supports the self-describing data type, SYS.AnyData – Stages captured events and user-created events in the same queue – Retains events until they are consumed by all applicable tasks, processes, or applications. Propagation Staging

Apply – Events in the staging queue can be consumed: Implicitly by an apply process Explicitly by an application performing dequeue via open interfaces such as JMS, C, or PL/SQL – The apply process can: Apply data changes directly to the database objects Perform a series of operations based on the event by means of an apply handler Apply

Architecture Goal: – Reduction on simple Implementation Typ: Primary Database with Secondary Databases – Complex Implementation follows later: Primary Database with Extented Secondary Databases

Availibility – With Replication, there might be locking problems at high productive Slave sites Problems only on Slave Sites Tables with frequently changes Heart of the application: tree of documents Business Trend is increasing –Catia V5 –Amount of users / Application with very good consumer acceptance –Huge tables and high frequent changes on tables – Temporarily “lost” data in replication Environment Conflict resolution is handled at Master Site Consolidated information will be written back to slave – In Streams, all DB Sites have more equality & autonomy

Managebility – Replication Changes on Table structure in short Release cycles Lack of Support During DDL Changes on Tables High aggravation on guarantee of global consistence –Conflicts during the time of worldwide changes –“normal” DML Changes – bulk operations – Efficient Backup & Recovery Concept for Streams Well defined strategy for Recovery Issues Cloning of Streams Enhancements in Exp/Imp and data pump functionality – Overhead in Replication Reorganization of MLOG$-, USLOG$ Tables

Performance – Less Network workload Less network overhead than Adv. Replication Information is not written back to slave sites Support of DDL Statements –DDL Statements are (re)executed on slaves with streams – Data Integrity, Constraining Unique Indexes/Constraints, Foreign Keys  Optimizer – Faster User Interaktion Capture & Apply mechanism have its own background processes no Snapshot Logs / no Trigger interaction – Advanced Queuing (AQ) & usages of multiple queues Parallelisation vs. single queue concept of Replication mechanism

Guaranteed Future (1/2) – Support of DDL Operations Reduction of massive DML Operations Use of Rolling Partitions –Streams Support for Tablespace & Schema Replication About 130 Tables are in Replication environment –Easy Use of Instantiation The only way to implement huge tables (> 20 Mio Rows) –Journaling Tables –Security Secure Queues in STRMADMIN Schema –Data integrity & Transaction order guaranty Constraints, Foreign Keys

Guaranteed Future (2/2) – Test system Build Up with Streams Semi productive Environment:Coupling with Production System Downstream Capture: Ni impact on production Environment No double data volume ( comparable to Snapshots) No divergence between production and test by time No Loss of Test Data – Directed Networking & Multi-Client Capability Apply Forwarding & Queue Forwarding Table Subsets for different Secondary Sites (clients) – Support of transformations Different Columns of Tables can be replicated with streams Table-, Column- or Schema name can be different on systems –Support for (almost) all Data Types Abstract Data Types (XML as ADT) in progress of new Releases

Conclusion KVS is one of the worldwide biggest distributed DB systems using Oracle Adv. Replication Replication has been the only way of working in this distributed environment. There are several misbehaviours with Adv. Replication Streams is a new technology, based on Advanced Queuing and logminer functionality Streams is usable since Oracle 9i and fully developed in 10g(R2) With Streams, there will be less workload regarding user interaction, but there are background processes using resources There will be lots of benefits using Streams instead of Replication Argumentation is based Availibility, Managebility, Performance and guaranteed future Beside distributed database environments, Streams is also usable in the business of Test&Quality systems, heterogeneous systems and datawarehouse applications.