Tales of the Secret Bunker 2016 (231) Dump and Load Edition Mike Furgal – Director MDBA and Pro2 Services Gus Bjorklund - Lackey.

Slides:



Advertisements
Similar presentations
B3: Putting OpenEdge Auditing to Work: Dump and Load with (Almost) No Downtime David EDDY Senior Solution Consultant.
Advertisements

DB-7: OpenEdge® Database Performance Tuning
DB-03: A Tour of the OpenEdge™ RDBMS Storage Architecture Richard Banville Technical Fellow.
1 PUG Challenge Americas 2014 Click to edit Master title style PUG Challenge EMEA 2014 – Dusseldorf, Germany Tales from the Audit Trails Presented by:
T OP N P ERFORMANCE T IPS Adam Backman Partner, White Star Software.
OPS-10: Moving V8/V9 RDBMS to OpenEdge® 10
Database Management Systems, R. Ramakrishnan and J. Gehrke1 External Sorting Chapter 11.
FlareCo Ltd ALTER DATABASE AdventureWorks SET PARTNER FORCE_SERVICE_ALLOW_DATA_LOSS Slide 1.
1 PUG Challenge EU 2014 Click to edit Master title style PUG Challenge EMEA 2014 – Dusseldorf, Germany Common Database Problems Common Database Solutions.
1 External Sorting Chapter Why Sort?  A classic problem in computer science!  Data requested in sorted order  e.g., find students in increasing.
External Sorting CS634 Lecture 10, Mar 5, 2014 Slides based on “Database Management Systems” 3 rd ed, Ramakrishnan and Gehrke.
DB-14: Tales of the Bunker Gus Björklund, Progress Software Corporation John Harlow, Bravepoint, Inc. Dan Foreman, Bravepoint, Inc. Rich Banville,
1 External Sorting Chapter Why Sort?  A classic problem in computer science!  Data requested in sorted order  e.g., find students in increasing.
OpenEdge Replication Made Easy Adam Backman White Star Software
NovaBACKUP 10 xSP Technical Training By: Nathan Fouarge
New Generation of OpenEdge ® RDBMS Advanced Storage Architecture II Tomáš Kučera Principal Solution Engineer / EMEA Power Team.
MOVE-4: Upgrading Your Database to OpenEdge® 10 Gus Björklund Wizard, Vice President Technology.
Backup & Recovery 1.
Database Storage Considerations Adam Backman White Star Software DB-05:
Objectives Learn what a file system does
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 10 Database Performance Tuning and Query Optimization.
Chapter Oracle Server An Oracle Server consists of an Oracle database (stored data, control and log files.) The Server will support SQL to define.
Top Performance Enhancers Top Performance Killers in Progress Dan Foreman Progress Expert
© 2015 Progress Software Corporation. 1 abstract To paraphrase Benjamin Disraeli, there are lies, damned lies and benchmarks. Your intrepid band of benchmarkers.
Strength. Strategy. Stability.. Progress Performance Monitoring and Tuning Dan Foreman Progress Expert BravePoint BravePoint
DB-01 Upgrading to OpenEdge ® Practices & Initial Tuning Tom Harris, Managing Director, RDBMS Technology.
Component 4: Introduction to Information and Computer Science Unit 4: Application and System Software Lecture 3 This material was developed by Oregon Health.
9 Copyright © 2004, Oracle. All rights reserved. Flashback Database.
DONE-08 Sizing and Performance Tuning N-Tier Applications Mike Furgal Performance Manager Progress Software
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 External Sorting Chapter 13.
© 2008 Quest Software, Inc. ALL RIGHTS RESERVED. Perfmon and Profiler 101.
Database Management Systems, R. Ramakrishnan and J. Gehrke 1 External Sorting Chapter 13.
Common Database Problems Common Database Solutions Mike Furgal Managed Database Service EMEA PUG Challenge 2015, Copenhagen, Denmark 4 – 6 November, 2015.
© 2015 Progress Software Corporation. 1 abstract Your intrepid band of benchmarkers returns once more, with the results of testing a recent release of.
Introduction to Database Systems1 External Sorting Query Processing: Topic 0.
CPSC Why do we need Sorting? 2.Complexities of few sorting algorithms ? 3.2-Way Sort 1.2-way external merge sort 2.Cost associated with external.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 External Sorting Chapter 11.
External Sorting. Why Sort? A classic problem in computer science! Data requested in sorted order –e.g., find students in increasing gpa order Sorting.
Common Database Problems Common Database Solutions Mike Furgal PROGRESS Bravepoint – Database Services.
Log Shipping, Mirroring, Replication and Clustering Which should I use? That depends on a few questions we must ask the user. We will go over these questions.
AA202: Performance Enhancers for Laserfiche Connie Anderson, Technical Writer.
Same Plan Different Performance Mauro Pagano. Consultant/Developer/Analyst Oracle  Enkitec  Accenture DBPerf and SQL Tuning Training Tools (SQLT, SQLd360,
Platform and Data Migration With Little Downtime

Who, What, Where, When, Why and How
Behind The Scenes: Updating A Record
How Social is your data Fundamentals on database storage
Antonio Abalos Castillo
We Have Found Nirvana with Online Dump and Load (224)
Tree-Structured Indexes
Physical Database Design and Performance
External Sorting Chapter 13
Mike Furgal Director – DB and Pro2 Services March 20th, 2017
Chapter Overview Understanding the Database Architecture
Database Performance Tuning and Query Optimization
IB Computer Science Topic 2.1.1
Walking Through A Database Health Check
Physical Database Design
OPS-8: Effective OpenEdge® Database Configuration
So far… Text RO …. printf() RW link printf Linking, loading
External Sorting Chapter 13
Database Internals: How Indexes Work
Minimize Unplanned Downtime and Data Loss with OpenEdge
Chapter 11 Database Performance Tuning and Query Optimization
OPS-14: Effective OpenEdge® Database Configuration
External Sorting.
External Sorting Chapter 13
Unit 12 Index in Database 大量資料存取方法之研究 Approaches to Access/Store Large Data 楊維邦 博士 國立東華大學 資訊管理系教授.
Presentation transcript:

Tales of the Secret Bunker 2016 (231) Dump and Load Edition Mike Furgal – Director MDBA and Pro2 Services Gus Bjorklund - Lackey

2 find directions where location-name = "secret bunker"

3 Type II storage areas have lessened the need to do regular dump and load processing, however you still need to dump and load to maintain good performance. The frequency of the dump and load tends to be every 3 to 5 years.

4 The Bunkerteers took it upon themselves to test various dump and load technologies

5 What needs to be dumped?  Data Definitions  Data  Sequence Current Values  Security _user SQL Permissions  Audit Rules  Proutil Describe  Etc

6 Before you dump  Backup the database  Verify the backup Best approach would be to restore the database  Run a tabanlys Used to compare after the load to make sure we got all the data  Make a note of: db block size, ai block size, bi block size, code page, etc who has DBA rights  DO YOU HAVE ENOUGH DISK SPACE ?

7 How to Load  Data Definitions  Data  Sequence Current Values  Security _user SQL Permissions  Audit Rules  May be more

8 After You Load  Backup the database  Run a tabanlys Compare the row count to the original  Proutil Describe Compare it to the original Bi Blocksize, BI Cluster Size, AI Blocksize, etc  Enable After Imaging  Rebaseline OE Replication

what could go wrong?

10 mike

11 Data Dumping and Loading Options ASCII Options  Ascii Dump through the dictionary  Ascii Load through the dictionay With and without active indexes  Bulkload Buffer Copy  Buffer Copy from one database to another Binary Options  Binary Dump  Binary Load

12 On to the Tests

13

bunker

15 Bunker Machine  4 quad-core 2.4 GHz Intel processors bogomips  64 GB memory  8 x 146 GB 10,000 rpm sas drives 2 RAID 10 6 RAID 0 for /opt/tmp  16 x 300 GB 10,000 rpm drives RAID 10 for /opt/db  8 x 300 GB 10,000 rpm drives RAID 10 for /opt/db1  Centos Linux 6.7  OpenEdge New this machine costs $35,000 USD. Used we found it for $3,500 USD

BIGROW test runs in 4 seconds 24 MB/Second

17 Database Statistics  Size: 36 GB  Tables: 835  Indexes: 1,736  Areas 49 Data Areas 49 Index Areas TableRowsSize Table 1 13,495,7174.6G Table 2 52,307,5523.5G Table 3 1,873,6012.1G Table 4 2,432,8841.3G Table 5 9,430, M

18 gus

19 ASCII^H^H^H^H^HText Dump  Using the data dictionary to dump the data programatically by running prodict/dump_d.p  Where to put dump files? Same filesystem86:12 Different filesystems85:33  Making the filesystem cache smaller made a slight difference 86:12 vs 87:46

20 key order does not match record storage order Index Leaf Blocks K1K2K3K4K5K6K7K8 Index Keys Record Blocks

21 Scatter Matters  The database used was freshly dumped and loaded. To get past this, we ran a scatter analysis program which looks at the logical scatter of the database  Changed the primary index on the top 43 largest tables Row counts were greater than 500,000 % of rows logically scattered > 10  Comparison Non scattered dump: 87:46 Scattered dump: 133: % slower!! This is important!

22 Text Load  Using the dictionary load programmatically with prodict/load_d.p Loading into preallocated space:376:20 Loading into variable extents:365:06 Loading from a different filesystem:354:15  Loading no active indexes: 276:04 Loading the data:231:34 Index rebuild:44:33  Bulkload: 110:05 Loading the data:65:32 Index rebuild:44:33

Best Text Dump and Load Result 233:09 3 hours, 53 minutes, 9 seconds By: Dictionary Text dump, Bulkload, and Index Rebuild

25 mike

26 Buffer Copy  The approach here is to connect to thre source database and buffer-copy all the data to the target database.  Tests performed Single User Multi-User With active Indexes No active indexes Parallel FOR EACH s. NO-LOCK: BUFFER-COPY s. to t.. END.

27 Buffer Copy Results The parallel processing used 8 processors running at the same time

28 How to Parallelize the process  A bunch of sophisticated analysis was done to optimize this process.

30 How to Parallelize the process  A bunch of sophisticated analysis was done to optimize this process.  Take the size of the largest table and use that as a guide  Combing all the table into their respecive areas  Break them up into N sized units Where N is the size of the largest table  This process is very complex and borders on rocket science

The longest thread took 130 minutes The shortest thread took 68 minutes

32 gus

33 Binary Dump  Build scripts to dump out all the data.  Tests Single Users Read Only Multi user Parallel

34 It’s 4x slower to dump the scattered data Notice the similar times for the non-scattered and scattered table scan dumps. Even the (expert?) bunkerteers do stupid experiments Using -index 0 does a table scan. While this is the fasted dump method, the order of the rows will not be useful for most applications

35 Binary Dump Results

36 Binary Dump Results

Much special equipment was needed

38 Binary Dump Results

39 mike

40 A Simple Parallel Processor # Thread Scheduler for i in `cat tables` do currentThread=`ls -1 *.working 2> /dev/null | wc –l` if [ $currentThread -le $threads ] then./dump_table.sh $i > $i.out & else while [ 1 ] do currentThread=`ls -1 *.working 2> /dev/null | wc –l` if [ $currentThread -le $threads ] then break fi sleep 0.1 done./dump_table.sh $i > $i.out & fi sleep 0.2 done wait # Worker Thread echo $$ > $$.working proutil scattered -C dump $1./bdump rm -f $$.working

41

42 gus

43 Binary Load  Binary load tests include Single User Multi User Parallel loads by area Single user with build indexes Multi user with build indexes Parallel load by area with build indexes

44 Binary Load Results The best dump and load result is: Parallel dump by table 20:22 Parallel load by area and build indexes46:49 Total time excluding backups, etc67:06

45 Results Summary

46