Chapter 14 Handling Online Redo Log Failures. Background RMAN doesn’t back up online redo logs You don’t use RMAN to recover from online redo log failures.

Slides:



Advertisements
Similar presentations
Oracle9i Database Administrator: Implementation and Administration 1 Chapter 2 Overview of Database Administrator (DBA) Tools.
Advertisements

ITEC474 Control File Maintenance
1 Chapter 17 Troubleshooting RMAN. 2 Background Authors thought this topic was often glazed over or not covered well Knew that every topic can’t be covered,
FlareCo Ltd ALTER DATABASE AdventureWorks SET PARTNER FORCE_SERVICE_ALLOW_DATA_LOSS Slide 1.
5 Copyright © 2006, Oracle. All rights reserved. Database Recovery.
5. SQL Restore tasks Objectives –Learn howto recover databases from disaster Contents –View database and transaction log backup files –Perform a complete.
RMAN Restore and Recovery
Backup and Recovery (2) Oracle 10g CAP364 1 Hebah ElGibreen.
Backup and Recovery Part 1.
Chapter 12 Performing Incomplete Recovery. Background Viewed as one of the more difficult chapters to write Thought it was important to put in material.
Harvard University Oracle Database Administration Session 11 Database Recovery.
ITEC474 Redo Log Files.
Database Backup & Recovery David Konopnicki. Introduction A major responsibility of the database administrator is to prepare for the possibility of hardware,
CHAPTER 16 User-Managed Backup and Recovery. Introduction to User Managed Backup and Recovery Backup and recover is one of the most critical skills a.
Chapter 5 Configuring the RMAN Environment. Objectives Show command to see existing settings Configure command to change settings Backing up the controlfile.
CHAPTER 5 Managing Control Files, Online Redo Logs, and Archiving.
Oracle9i Database Administrator: Implementation and Administration
7 Maintaining Redo Log Files. 7-2 Objectives Explaining the use of online redo log files Obtaining log and archive information Controlling log switches.
Backup Concepts. Introduction Backup and recovery procedures protect your database against data loss and reconstruct the data, should loss occur. The.
Agenda  Overview  Configuring the database for basic Backup and Recovery  Backing up your database  Restore and Recovery Operations  Managing your.
Introduction to Oracle Backup and Recovery
Using RMAN to Perform Recovery
CHAPTER 16 User-Managed Backup and Recovery. Introduction to User Managed Backup and Recovery Backup and recover is one of the most critical skills a.
Oracle Database High Availability Brandon Kuschel Jian Liu Source: Oracle Database 11g Release 2 High Availability An Oracle White Paper November 2010.
13 Copyright © Oracle Corporation, All rights reserved. RMAN Complete Recovery.
PPOUG, 05-OCT-01 Agenda RMAN Architecture Why Use RMAN? Implementation Decisions RMAN Oracle9i New Features.
7 Copyright © 2006, Oracle. All rights reserved. Dealing with Database Corruption.
SRUTHI NAGULAVANCHA CIS 764, FALL 2008 Department of Computing and Information Sciences (CIS) Kansas State University -1- Back up & Recovery Strategies.
11 Copyright © Oracle Corporation, All rights reserved. RMAN Backups.
Chapter 9 Scripting RMAN. Background Authors felt that scripting was a topic not covered well Authors wanted to cover both Unix/Linux and Windows environments.
15 Copyright © 2005, Oracle. All rights reserved. Performing Database Backups.
16 Copyright © 2007, Oracle. All rights reserved. Performing Database Recovery.
By Lecturer / Aisha Dawood 1.  You can control the number of dispatcher processes in the instance. Unlike the number of shared servers, the number of.
4 Copyright © 2006, Oracle. All rights reserved. Recovering from Noncritical Losses.
15 Copyright © 2007, Oracle. All rights reserved. Performing Database Backups.
9 Copyright © 2004, Oracle. All rights reserved. Flashback Database.
7202ICT – Database Administration
Mark A. Magumba Storage Management. What is storage An electronic place where computer may store data and instructions for retrieval The objective of.
CHAPTER 5 Managing Control Files and Online Redo Logs.
15 Copyright © Oracle Corporation, All rights reserved. RMAN Incomplete Recovery.
Week 3 Lecture 1 The Redo Log Files and Diagnostic Files.
11 Copyright © 2004, Oracle. All rights reserved. Dealing with Database Corruption.
Process Architecture Process Architecture - A portion of a program that can run independently of and concurrently with other portions of the program. Some.
14 Copyright © 2005, Oracle. All rights reserved. Backup and Recovery Concepts.
Backup Concepts. Introduction Backup and recovery procedures protect your database against data loss and reconstruct the data, should loss occur. The.
10 Copyright © Oracle Corporation, All rights reserved. User-Managed Backups.
© 2006 Northgate Information Solutions plc and its associated companies. All rights reserved. Slide 1.
Backup and Recovery Part 2. Online backup Online backup requirements: Backup all data files (file system copy) Backup all redo information produced during.
Overview of Oracle Backup and Recovery Darl Kuhn, Regis University.
12 Copyright © Oracle Corporation, All rights reserved. User-Managed Complete Recovery.
18 Copyright © 2004, Oracle. All rights reserved. Backup and Recovery Concepts.
6 Copyright © 2007, Oracle. All rights reserved. Performing User-Managed Backup and Recovery.
2 Copyright © 2007, Oracle. All rights reserved. Configuring for Recoverability.
8 Copyright © 2007, Oracle. All rights reserved. Using RMAN to Duplicate a Database.
18 Copyright © 2004, Oracle. All rights reserved. Recovery Concepts.
14 Copyright © 2005, Oracle. All rights reserved. Backup and Recovery Concepts.
9 Copyright © 2004, Oracle. All rights reserved. Incomplete Recovery.
14 Copyright © 2007, Oracle. All rights reserved. Backup and Recovery Concepts.
What is Flashback? Technology that allows you to revert the database to a point in time in the past Several versions of flashback available Different types.
Oracle Database High Availability
RMAN-Backup and Recovery The Most Essential But The Most Ignored
Database Administration
Maintaining Online Redo Log Files
Maximum Availability Architecture Enterprise Technology Centre.
Oracle Database High Availability
Oracle9i Database Administrator: Implementation and Administration
Recovering from Noncritical Losses
Performing Tablespace Point-in-Time Recovery
Performing Database Recovery
Chapter 5 The Redo Log Files.
Presentation transcript:

Chapter 14 Handling Online Redo Log Failures

Background RMAN doesn’t back up online redo logs You don’t use RMAN to recover from online redo log failures Authors and editor debated on whether this topic should be included in an RMAN book Authors felt strongly that some material should be added to show how to deal with online redo log failures This topic isn’t covered well in many other Oracle B&R books One of the first chapters written in the book

Online Redo Logs Store the transaction redo stream Used for recovery Considered a weak link in the Oracle architecture If not using a tool like Data Guard in maximum protection mode, you could lose transactions if you lose your current online redo log

Mitigating Online Redo Failure Multiplex groups to have multiple members Never allow two members of the same group to share the same controller Never put two members of the same group on the same physical disk Ensure operating system file permissions are set appropriately Use physical storage devices that are redundant (that is, RAID) Appropriately size the log files so that they switch and are archived at regular intervals Set the archive_lag_target initialization parameter to ensure that the online redo logs are switched at regular intervals

Determining How to Restore 1. Inspect the alert.log file to determine which online redo log files have experienced a media failure 2. Query V$LOG and V$LOGFILE to determine the status of the log group and degree of multiplexing 3. If there is still one functioning member of a multiplexed group, then see recipe 14-2 for details on how to fix the remaining failed member(s) 4. Depending on the status of the log group, use Table 14-1 to determine what action to take

Determining Status SQL> select a.group#, a.thread#, a.status grp_status, b.member member, b.status mem_status from v$log a, v$logfile b where a.group# = b.group# order by a.group#, b.member;

Status Column of V$LOG Inactive, not required for crash recovery Active, required for crash recovery Current, required for crash recovery, log writer is currently writing

Restoring After Losing One Member of Multiplexed Group Oracle continues to operate if it can write to one member of an online redo log group If one member is bad, an error is written to the alert.log file Steps to take: 1. Identify the online redo log file experiencing media failure 2. Ensure that the online redo log file is not part of the current online log group 3. Drop the damaged member 4. Add a new member to the group

Recovering After Loss of All Members of the INACTIVE Redo Log Group 1. Verify that all members of a group have been damaged 2. Verify that the log group status is INACTIVE 3. Re-create the log group with the clear logfile command 4. If the re-created log group has not been archived, then immediately back up your database

Dropping and Adding a Redo Log Group A log group has to have an inactive status before you can drop it. You can check the status of the log group as shown here: SQL> select group#, status, archived, thread#, sequence# from v$log; SQL> alter database drop logfile group ; SQL> alter database add logfile group ('\directory\file') SIZE K|M;

Recovering After Loss of All Members of the ACTIVE Redo Log Group 1. Verify the damage to the members 2. Verify that the status is ACTIVE 3. Attempt to issue a checkpoint 4. If the checkpoint is successful, the status should now be INACTIVE, and you can clear the log group 5. If the log group that was cleared was unarchived, back up your database immediately 6. If the checkpoint is unsuccessful, then you will have to perform incomplete recovery (see recipe 14-5 for options)

Recovering After Loss of All Members of the CURRENT Redo Log Group This is where things get a little scary Oracle needs the current online redo log to perform crash recovery You might have to perform an incomplete recovery

Options When Losing Current Online Redo Log Perform an incomplete recovery up to the last good SCN If flashback is enabled, flash your database back to the last good SCN If you’re using Oracle Data Guard, fail over to your physical or logical standby database Contact Oracle Support for suggestions

Background Authors, editor, and technical editor (who is an RMAN expert working for Oracle Support) debated on exactly which options they could include in the book when losing current online redo log group There are other options, but potentially not supported unless you enlist the help of Oracle Support Decided to go with the line “contact Oracle Support to discuss all options...”