Recovery Planning A Holistic View Adam Backman, President White Star Software

Slides:



Advertisements
Similar presentations
Which server is right for you? Get in Contact with us
Advertisements

Skyward Disaster Recovery Options
SQL Server Disaster Recovery Chris Shaw Sr. SQL Server DBA, Xtivia Inc.
Protect Your Business and Simplify IT with Symantec and VMware Presenter, Title, Company Date.
Business Continuity Section 3(chapter 8) BC:ISMDR:BEIT:VIII:chap8:Madhu N PIIT1.
© 2009 EMC Corporation. All rights reserved. Introduction to Business Continuity Module 3.1.
© 2007 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Data protection and disaster recovery.
1 Disk Based Disaster Recovery & Data Replication Solutions Gavin Cole Storage Consultant SEE.
FlareCo Ltd ALTER DATABASE AdventureWorks SET PARTNER FORCE_SERVICE_ALLOW_DATA_LOSS Slide 1.
Princeton PC Users Group Hard Drive Disaster! By Paul Kurivchack March 14, 2005.
June 23rd, 2009Inflectra Proprietary InformationPage: 1 SpiraTest/Plan/Team Deployment Considerations How to deploy for high-availability and strategies.
1 Pertemuan 23 Contingency Planning Matakuliah:A0334/Pengendalian Lingkungan Online Tahun: 2005 Versi: 1/1.
Reliability Week 11 - Lecture 2. What do we mean by reliability? Correctness – system/application does what it has to do correctly. Availability – Be.
MSIS 110: Introduction to Computers; Instructor: S. Mathiyalakan1 Systems Design, Implementation, Maintenance, and Review Chapter 13.
Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin CHAPTER FIVE INFRASTRUCTURES: SUSTAINABLE TECHNOLOGIES CHAPTER.
Preservasi Informasi Digital.  It will never happen here!  Common Causes of Loss of Data  Accidental Erasure (delete, power, backup)  Viruses and.
1© Copyright 2011 EMC Corporation. All rights reserved. EMC RECOVERPOINT/ CLUSTER ENABLER FOR MICROSOFT FAILOVER CLUSTER.
Cambodia-India Entrepreneurship Development Centre - : :.... :-:-
VIRTUALIZATION AND YOUR BUSINESS November 18, 2010 | Worksighted.
Back Up and Recovery Sue Kayton February 2013.
1 Disaster Recovery Planning & Cross-Border Backup of Data among AMEDA Members Vipin Mahabirsingh Managing Director, CDS Mauritius For Workgroup on Cross-Border.
Installing software on personal computer
National Manager Database Services
OpenEdge High Availabilty Adam Backman Grand Poobah – White Star Software.
CHAPTER OVERVIEW SECTION 5.1 – MIS INFRASTRUCTURE
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 14: Problem Recovery.
November 2009 Network Disaster Recovery October 2014.
Building Highly Available Systems with SQL Server™ 2005 Vineet Gupta Evangelist – Data and Integration Microsoft Corp.
Server Types Different servers do different jobs. Proxy Servers Mail Servers Web Servers Applications Servers FTP Servers Telnet Servers List Servers Video/Image.
INFO 355Week #61 Systems Analysis II Essentials of design INFO 355 Glenn Booker.
Windows Server MIS 424 Professor Sandvig. Overview Role of servers Performance Requirements Server Hardware Software Windows Server IIS.
Chapter 10 : Designing a SQL Server 2005 Solution for High Availability MCITP Administrator: Microsoft SQL Server 2005 Database Server Infrastructure Design.
Maintaining a Microsoft SQL Server 2008 Database SQLServer-Training.com.
Business Continuity and Disaster Recovery Chapter 8 Part 2 Pages 914 to 945.
CHAPTER FIVE INFRASTRUCTURES: SUSTAINABLE TECHNOLOGIES
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Turning Practice into Perfect Implementing Fathom 2.0 Adam Backman White Star Software
Local Area Networks (LAN) are small networks, with a short distance for the cables to run, typically a room, a floor, or a building. - LANs are limited.
Chapter Fourteen Windows XP Professional Fault Tolerance.
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
Chapter 16 Designing Effective Output. E – 2 Before H000 Produce Hardware Investment Report HI000 Produce Hardware Investment Lines H100 Read Hardware.
Introduction to Cloud Computing
Principles of Information Systems, Sixth Edition Systems Design, Implementation, Maintenance, and Review Chapter 13.
Cloud Computing Characteristics A service provided by large internet-based specialised data centres that offers storage, processing and computer resources.
©2006 Merge eMed. All Rights Reserved. Energize Your Workflow 2006 User Group Meeting May 7-9, 2006 Disaster Recovery Michael Leonard.
Chapter © 2006 The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/ Irwin Chapter 7 IT INFRASTRUCTURES Business-Driven Technologies 7.
Preventing Common Causes of loss. Common Causes of Loss of Data Accidental Erasure – close a file and don’t save it, – write over the original file when.
Module 9 Planning a Disaster Recovery Solution. Module Overview Planning for Disaster Mitigation Planning Exchange Server Backup Planning Exchange Server.
Creating a complete recovery solution Adam Backman Partner, White Star Software.
Mark A. Magumba Storage Management. What is storage An electronic place where computer may store data and instructions for retrieval The objective of.
Principles of Information Systems, Sixth Edition Systems Design, Implementation, Maintenance, and Review Chapter 13.
Project Cost Estimate Form PLAN IT! Todd Crosby. The Project Cost Estimate Form These are your best “educated guesses” – it is expected that data will.
ALMA Archive Operations Impact on the ARC Facilities.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
VMware vSphere Configuration and Management v6
High Availability in DB2 Nishant Sinha
Principles of Information Systems, Sixth Edition 1 Systems Design, Implementation, Maintenance, and Review Chapter 13.
Component 8/Unit 9aHealth IT Workforce Curriculum Version 1.0 Fall Installation and Maintenance of Health IT Systems Unit 9a Creating Fault Tolerant.
CDP Technology Comparison CONFIDENTIAL DO NOT REDISTRIBUTE.
Oracle Standby Implementation Tantra Invedy. Standby Database Introduction Fail over Solution Disaster Recovery Solution if remote Ease of implementation.
WHAT ARE BACKUPS? Backups are the last line of defense against hardware failure, floods or fires the damage caused by a security breach or just accidental.
Networking Objectives Understand what the following policies will contain – Disaster recovery – Backup – Archiving – Acceptable use – failover.
Planning for Application Recovery
Integrating Disk into Backup for Faster Restores
Adam Backman Chief Cat Wrangler – White Star Software
Back Up and Recovery Sue Kayton October 2015.
SpiraTest/Plan/Team Deployment Considerations
Disaster Recovery at UNC
IBM Tivoli Storage Manager
Presentation transcript:

Recovery Planning A Holistic View Adam Backman, President White Star Software

What We Will Cover Where to Start? Creating a plan – Who is involved? – What are you going to protect? – Where is it going to go? – When (how often) are you going to backup? Implementing the plan Automation Testing

Before we start Before starting any recovery of your system, backup what you have now as it may be your route of last resort if some part of your recovery plan fails. It is generally better to leave the “damaged” things alone and recover to a new piece of hardware or different disks.

What is recovery planning? Known by many names – Disaster recovery plan – Business process contingency plan A description of how an organization is to deal with events that make the continuation of business impossible Describes precautions taken to minimize or eliminate the effects of a disaster

Where to start? Determine who owns the data Determine the value of the data Determine the value of lost productivity – Time to rekey – Inventory worth less (no audit trail) – Cannot process as much or any business Determine stake holders (users of the data)

Creating a plan Goals (Event-based goals) – If we lose or corrupt data (Human error) – If we lose a disk (DB gone) – If we have a fire (Machine gone) – If we have a natural disaster (Facility gone) Hardware Software Data Other stuff

Where to start? Use your current plan – It is there, you do have a tested plan don’t you? – We have been using it for years and has always “worked” – If it is not broken why change, we might even test it Start from scratch – Your current plan was written by dummies (unless it was written by you, of course) – Archiving is more than throwing the tape in a drawer in the computer room. – You mean we have a plan now? – When is the last time you tested your backup?

Creating a plan - Goals Acceptable downtime (Generally cost based) Everyone wants zero but it is generally cost prohibitive Planned outages – Hardware install and maintenance – Software upgrade – O/S upgrade or patch Notifications (Both before and during outage) – Who – When – What do they do?

Goals Minimize the impact to the customers Lose a minimal amount of data Don’t build a plan that costs more than the data is worth Don’t build a process you cannot support – Too complex – Hard coded so maintenance is a problem – Build in the ability to change with the environment – Support multiple “exceptions”

Creating a plan - Hardware What to include – Computer hardware – Network – Phone, handheld devices, … Options – Duplication – Replication (Same storage capacity but less resources) – Co-location – External service

Hardware – What users need to access your application Database engine (Where your database resides) Application server(s) for n-tier application Web server(s) Client PC’s Network to connect it all together Internet Phone, FAX, External Interfaces, …

Creating a plan – Apps and software Applications Supporting applications Operating system Production data Transient data

Software – Keeping applications current Application – Remote mirroring – Automated via formal application deployment – Formal process-based application deployment Supporting application – Remote mirroring – Vendor supported deployment process to deal with applications that are licensed to a specific machine

Software – Keeping data current Replication – Real-Time with OpenEdge replication – Quasi Real-Time with Log-based replication – Disaster only recovery via restore and application of after image files. Transient Data (Example: EDI drops, ftp transfers, …) – Remote mirroring – Automated replication

Software – Keeping OS files current Operating system – Running virtualization to allow for quick cloning of your environment – Automated via customized scripts – Keeping two systems in sync via a formalized process – Use network definitions for users, printers, and other operating system resources

Creating a plan – Other stuff What makes your business run? – Phones – Faxes – Business to Business (EDI, XML Feed, …) Can people work from home? Do you have/need another location? Contact lists in case of major catastrophe – Kept up-to-date – Kept online and printed in an accessible location

Remember: Keep your plan simple

Implementing your plan First implementation should be a totally manual process to insure the steps work and allow for documentation Document the process as you go – Who are you logged in as? – Exactly what you typed – Where you were (console, remote, …) – Can things be done in parallel or sequentially – Where are the logs and what to look for in the logs

Documentation All recovery documentation should be VERY specific Create documents for normal maintenance – Backups – Database growth – Modification of OS, Application, printers, … Create scenario based recovery plans – Lose a disk (or disk pair) – Fire – Flood

Automation: Why automate your plan? When it is needed it will be a stressful time The person who best knows the plan will be on vacation Reduces the chance of human error You can duplicate the process for multiple databases The process can be audited provided logging is adequate

Automation: General rules Make sure you back things up before proceeding Automate as much as possible Have the process broken up logically to enable easier easier implementation and testing Make sure you create log(s) Checking the log(s) is part of implementation and testing

Single System – Testing decisions Questions – Do you have enough space? If not, you really do not care about recovery – Do you have enough throughput potential if you do have enough space? – Can you take an outage? If so, how long? May still need to test while running.

Dual Systems – Testing decisions Are the two systems sharing disks? – If yes Do you have enough space Do you have enough throughput potential to test recovery while production is running – If no Is there enough space to duplicate the whole system? Will throughput capacity allow you to give reliable time estimates? Are the two systems evenly configured for other resources beyond disks

Testing your plan Recovery plan testing is an ongoing process not test once then pray Test various different types of recoveries including a tape failure (Rolling forward multiple days of transactions) Make recovery plan testing part of someone’s job responsibilities and evaluation criteria or it is less likely to get done

Testing your plan Who does the test? – Not the person who wrote it – The backup person for the implementation – Someone who is “always” there regardless of technical ability How often to test? – Material data change (10% increase is a good target) – Any change in database configuration – Do you have a second site or redundant hardware? – Do you have enough disk capacity (space and throughput)

How to test your plan Fail over to your backup system Fail back to your primary system Contingency planning for personnel, physical plant and equipment (Lead time for resources)

Summary: Recovery planning Be inclusive when building your team Always backup what you have now, however little, before starting to recover Create and maintain a comprehensive plan – Include everything needed to use the application: Hardware, applications, and data Create and maintain a contact list both online and physical Test your plan periodically (At least annually)

Questions? THANK YOU