Data Pump in Oracle Database 10 g : Foundation for Ultra-High Speed Data Movement Utilities Session id: 40043 George H. Claborn Data Pump Technical Project.

Slides:



Advertisements
Similar presentations
new database engine component fully integrated into SQL Server 2014 optimized for OLTP workloads accessing memory resident data achive improvements.
Advertisements

18 Copyright © Oracle Corporation, All rights reserved. Transporting Data Between Databases.
CHAPTER 13 Data Pump. Tool that replaces legacy exp/imp utilities Data Pump is a scalable, feature-rich utility that allows you to extract objects and.
INTRODUCTION TO ORACLE Lynnwood Brown System Managers LLC Backup and Recovery Copyright System Managers LLC 2008 all rights reserved.
18 Copyright © 2005, Oracle. All rights reserved. Moving Data.
Dale Roberts 1 Department of Computer and Information Science, School of Science, IUPUI Dale Roberts, Lecturer Computer Science, IUPUI
18 Copyright © 2005, Oracle. All rights reserved. Moving Data.
17 Copyright © 2007, Oracle. All rights reserved. Moving Data.
Oracle Architecture. Instances and Databases (1/2)
The New Data Pump Caleb Small Next generation Import / Export New features Better performance Improved security Versatile interfaces.
INTRODUCTION TO ORACLE DATABASE ADMINISTRATION Lynnwood Brown System Managers LLC Introduction – Lecture 1 Copyright System Managers LLC 2007 all rights.
Self-Managing Database: Easy Upgrade Tom Graves Consulting Member of Technical Staff Oracle Corporation Session id:
Advanced Databases Basic Database Administration Guide to Oracle 10g 1.
A Guide to Oracle9i1 Introduction to Oracle9i Database Administration Chapter 11.
Check Disk. Disk Defragmenter Using Disk Defragmenter Effectively Run Disk Defragmenter when the computer will receive the least usage. Educate users.
9 Copyright © Oracle Corporation, All rights reserved. Oracle Recovery Manager Overview and Configuration.
Module 14: Scalability and High Availability. Overview Key high availability features available in Oracle and SQL Server Key scalability features available.
Week 5 – Chap. 5 Data Transfer DBAs often must transfer data to and from text files, Excel spreadsheets, Access, Oracle or other SQL Server databases This.
Introduction to Oracle Backup and Recovery
IS 4510 – Database Administration Module – 2 Database Backup 10/24/20141Compiled by: Zafar Iqbal Khan.
Backup & Recovery 1.
Module 8: Server Management. Overview Server-level and instance-level resources such as memory and processes Database-level resources such as logical.
Introduction to the Enterprise Library. Sounds familiar? Writing a component to encapsulate data access Building a component that allows you to log errors.
CHAPTER 21 Automating Jobs. Introduction to Automating Jobs DBAs rely heavily on automating jobs. DBAs cannot be effective without automation. Listed.
Chapter Oracle Server An Oracle Server consists of an Oracle database (stored data, control and log files.) The Server will support SQL to define.
Oracle 10g Data Pump Say good bye to exp and imp (or not)! 2006/02/23 Simon Pane.
11 Copyright © Oracle Corporation, All rights reserved. RMAN Backups.
By Lecturer / Aisha Dawood 1.  You can control the number of dispatcher processes in the instance. Unlike the number of shared servers, the number of.
CSE 781 – DATABASE MANAGEMENT SYSTEMS Introduction To Oracle 10g Rajika Tandon.
ORACLE
Extents, segments and blocks in detail. Database structure Database Table spaces Segment Extent Oracle block O/S block Data file logical physical.
Informix IDS Administration with the New Server Studio 4.0 By Lester Knutsen My experience with the beta of Server Studio and the new Informix database.
9 Copyright © 2004, Oracle. All rights reserved. Flashback Database.
The protection of the DB against intentional or unintentional threats using computer-based or non- computer-based controls. Database Security – Part 2.
Oracle Tuning Ashok Kapur Hawkeye Technology, Inc.
Scripting Pertemuan 4 Matakuliah: T0413 Tahun: 2009.
IT 456 Seminar 5 Dr Jeffrey A Robinson. Overview of Course Week 1 – Introduction Week 2 – Installation of SQL and management Tools Week 3 - Creating and.
Oracle 10g Database Administrator: Implementation and Administration Chapter 2 Tools and Architecture.
Oracle Data Integrator Procedures, Advanced Workflows.
Oracle9i Performance Tuning Chapter 12 Tuning Tools.
Module 10 Administering and Configuring SharePoint Search.
A Brief Documentation.  Provides basic information about connection, server, and client.
INFO1408 Database Design Concepts Week 15: Introduction to Database Management Systems.
1. When things go wrong: how to find SQL error Sveta Smirnova Principle Technical Support Engineer, Oracle.
Process Architecture Process Architecture - A portion of a program that can run independently of and concurrently with other portions of the program. Some.
Mark E. Fuller Senior Principal Instructor Oracle University Oracle Corporation.
INTRODUCTION TO ORACLE DATABASE ADMINISTRATION Lynnwood Brown President System Managers LLC Introduction – Lecture 1 Copyright System Managers LLC 2003.
The Self-managing Database: Proactive Space and Schema Object Management Amit Ganesh Director, Data, Space and Transaction Processing Oracle Corporation.
12 Copyright © 2009, Oracle. All rights reserved. Managing Backups, Development Changes, and Security.
ClearQuest XML Server with ClearCase Integration Northwest Rational User’s Group February 22, 2007 Frank Scholz Casey Stewart
Task #1 Create a relational database on computers in computer classroom 308, using MySQL server and any client. Create the same database, using MS Access.
Getting the Most from Oracle Data Pump Roy Swonger Director, Oracle Database Utilities January, 2007.
Batch Jobs Using the batch job functions. Use [Bulk Changes][Batch Job Utility] to start. Read the information panel. Check with TAMS Technical Support.
Distributed Logging Facility Castor External Operation Workshop, CERN, November 14th 2006 Dennis Waldron CERN / IT.
Oracle Architecture - Structure. Oracle Architecture - Structure The Oracle Server architecture 1. Structures are well-defined objects that store the.
3 Copyright © 2007, Oracle. All rights reserved. Using the RMAN Recovery Catalog.
​ TdBench 7.2 – tdb.sh Utility Script. 2 Created for TdBench 7.x release to consolidate tools Open architecture – looks for scripts in the./tools directory.
 CONACT UC:  Magnific training   
13 Copyright © 2007, Oracle. All rights reserved. Using the Data Recovery Advisor.
11 Copyright © 2004, Oracle. All rights reserved. Performing a Migration Using Oracle Migration Workbench (Part II)
23 Copyright © 2004, Oracle. All rights reserved. Preferring the Past: Flashback.
11 Copyright © 2009, Oracle. All rights reserved. Enhancing ETL Performance.
Databases and DBMSs Todd S. Bacastow January 2005.
Data, Space and Transaction Processing
Oracle Database for APM 9.1
Getting the Most from Oracle Data Pump
Oracle Architecture Overview
How to Thrive as a DBA in an Oracle10g World
UTILITY TO EXPORT/IMPORT DATA Gary M. Noble LDS Church
Presentation transcript:

Data Pump in Oracle Database 10 g : Foundation for Ultra-High Speed Data Movement Utilities Session id: George H. Claborn Data Pump Technical Project Leader Oracle Corporation New England Development Center

Data Pump: Overview  What is it?  Main Features  Architecture  Performance  Things to keep in mind  Some thoughts on original exp / imp

Data Pump: What is it?  Server-based facility for high performance loading and unloading of data and metadata  Callable: DBMS_DATAPUMP. Internally uses DBMS_METADATA  Data written in Direct Path stream format. Metadata written as XML  New clients expdp and impdp: Supersets of original exp / imp.  Foundation for Streams, Logical Standby, Grid, Transportable Tablespaces and Data Mining initial instantiation.

Features: Performance!!  Automatic, two-level parallelism – Direct Path for inter-partition parallelism – External Tables for intra-partition parallelism – Simple: parallel= – Dynamic: Workers can be added and removed from a running job in Enterprise Edition – Index builds automatically “parallelized” up to degree of job  Simultaneous data and metadata unload  Single thread of data unload: 1.5-2X exp  Single thread of data load: 15X-40X imp  With index builds: 4-10X imp

Features: Checkpoint / Restart  Job progress recorded in a “Master Table”  May be explicitly stopped and restarted later: – Stop after current item finishes or stop immediate  Abnormally terminated job is also restartable  Current objects can be skipped on restart if problematic

Features: Network Mode  Network import: Load one database directly from another  Network export: Unload a remote database to a local dumpfile set – Allows export of read-only databases  Data Pump runs locally, Metadata API runs remotely.  Uses DB links / listener service names, not pipes. Data is moved as ‘insert into select  Direct path engine is used on both ends  It’s easy to swamp network bandwidth: Be careful!

Features: Fine-Grained Object Selection  All object types are supported for both operations: export and import  Exclude: Specified object types are excluded from the operation  Include: Only the specified object types are included. E.g, just retrieve packages, functions and procedures  More than one of each can be specified, but use of both is prohibited by new clients  Both take an optional name filter for even finer granularity: – INCLUDE PACKAGE: “LIKE ‘PAYROLL%’ “ – EXCLUDE TABLE: “IN (‘FOO’,’BAR’, … )’ “

Features: Monitoring  Flexible GET_STATUS call  Per-worker status showing current object and percent done  Initial job space estimate and overall percent done  Job state and description  Work-in-progress and errors

Features: Dump File Set Management  Directory based: E.g, DMPDIR :export01.dmp where DMPDIR created as: SQL> create directory dmpdir as ‘/data/dumps’  Multiple, wildcarded file specifications supported: dumpfile=dmp1dir:full1%u.dmp, dmp2dir:full2%u.dmp – Files are created as needed on a round-robin basis from available file specifications  File size can be limited for manageability  Dump file set coherency automatically maintained

New Clients – expdp / impdp  Similar (but not identical) look and feel to exp / imp  All modes supported: full, schema, table, tablespace, transportable. Superset of exp / imp  Flashback is supported  Query supported by both expdp and impdp… and on a per-table basis!  Detach from and attach to running jobs  Multiple clients per job allowed; but a single client can attach to only one job at a time  If privileged, attach to and control other users’ jobs

New Clients – expdp / impdp  Interactive mode entered via Ctl-C: – ADD_FILE: Add dump files and wildcard specs. to job – PARALLEL: Dynamically add or remove workers – STATUS: Get detailed per-worker status and change reporting interval – STOP_JOB{=IMMEDIATE}: Stop job, leaving it restartable. Immediate doesn’t wait for workers to finish current work items… they’ll be re-done at restart – START_JOB: Restart a previously stopped job – KILL_JOB: Stop job and delete all its resources (master table, dump files) leaving it unrestartable – CONTINUE: Leave interactive mode, continue logging – EXIT: Exit client, leave job running

Features: Other Cool Stuff …  DDL transformations are easy with XML: – REMAP_SCHEMA – REMAP_TABLESPACE – REMAP_DATAFILE – Segment and storage attributes can be suppressed  Can extract and load just data, just metadata or both  SQLFILE operation generates executable DDL script  If a table pre-exists at load time, you can: skip it (default), replace it, truncate then load or append to it.  Space estimates based on allocated blocks (default) or statistics if available  Enterprise Manager interface integrates 9 i and 10g  Callable!

Architecture: Block Diagram Direct Path API External Table API Oracle_ Loader Metadata API: DBMS_METADATA Oracle_ DataPump expdpimpdp Enterprise Manager Data Pump DBMS_DATAPUMP Data/Metadata movement engine Other Clients: Data Mining, etc

Architecture: Flow Diagram Master Control Process User A’s shadow process Dump File Set: Data, metadata & master table User A: expdp User B’s shadow process Dynamic commands (stop, start, parallel etc) Master table Command and control queue Status queue: Work-in-progress and errors Worker B: Direct path Worker C: Ext. Table Worker A: metadata User B: OEM Parallel Proc. 01 Parallel Proc. 02 Log File

No Clients Required ! Master Control Process Dump File Set: Data, metadata & master table Master table Worker B: Direct path Worker C: Ext. Table Worker A: metadata Parallel Proc. 01 Parallel Proc. 02 Log File

Data Pump: Performance Tuning  Default initialization parameters are fine! – Make sure disk_asynch_io remains TRUE  Spread the I/O!  Parallel= no more than 2X number of CPUs: Do not exceed disk spindle capacity. – Corollary: SPREAD THE I/O !!!  Sufficient SGA for AQ messaging and metadata API queries  Sufficient rollback for long running queries That’s it!

Large Internet Company 2 Fact Tables: 16.2M rows, 2 Gb ProgramElapsed exp out of the box: direct=y 0 hr 10 min 40 sec exp tuned: direct=y buffer=2M recordlength=64K 0 hr 04 min 08 sec expdp out of the box: Parallel=1 0 hr 03 min 12 sec imp out of the box 2 hr 26 min 10 sec imp tuned: buffer=2M recordlength=64K 2 hr 18 min 37 sec impdp out of the box: Parallel=1 0 hr 03 min 05 sec With one index per table imp tuned: buffer=2M recordlength=64K 2 hr 40 min 17 sec impdp: Parallel=1 0 hr 25 min 10 sec

Oracle Applications Seed Database:  Metadata intensive: 392K objects, 200 schemas, 10K tables, 2.1 Gb of data total  Original exp / imp total: 32 hrs 50 min – exp: 2 hr 13 min imp: 30 hrs 37 min.  Data Pump expdp / impdp total: 15 hrs 40 min – expdp: 1 hr 55 min impdp: 13 hrs 45 min. – Parallel=2 for both expdp and impdp

Keep in Mind:  Designed for *big* jobs with lots of data. – Metadata performance is about the same – More complex infrastructure, longer startup  XML is bigger than DDL, but much more flexible  Data format in dump files is ~15% more compact than exp  Import subsetting is accomplished by pruning the Master Table

Original exp and imp  Original imp will be supported forever to allow loading of V5 – V9i dump files  Original exp will ship at least in 10 g, but may not support all new functionality.  9i exp may be used for downgrades from 10 g  Original and Data Pump dump file formats are not compatible

10g Beta Feedback  British Telecom: Ian Crocker, Performance & Storage Consultant “We have tested Oracle Data Pump, the new Oracle10 g Export and Import Utilities. Data Pump Export performed twice as fast as Original Export, and Data Pump Import performed ten times faster than Original Import. The new manageability features should give us much greater flexibility in monitoring job status.”  Airbus Deutschland: Werner Kawollek, Operation Application Management “We have tested the Oracle Data Pump Export and Import utilities and are impressed by their rich functionality. First tests have shown tremendous performance gains in comparison to the original export and import utilities"

Please visit our Demo! Oracle Database 10g Data Pump: Faster and Better Export / Import Try out Data Pump’s Tutorial in the Oracle By Example (OBE) area: “Unloading and Loading Data Base Contents”

Q & Q U E S T I O N S A N S W E R S