Chapter 8 Database Redesign

Slides:



Advertisements
Similar presentations
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall 8-1 David M. Kroenke’s Chapter Eight: Database Redesign Database Processing:
Advertisements

Fundamentals, Design, and Implementation, 9/e Chapter 8 Database Redesign.
Database Administration Chapter Six DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 4 th Edition.
Database Management System MIS 520 – Database Theory Fall 2001 (Day) Lecture 13.
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall 8-1 David M. Kroenke’s Chapter Eight: Database Redesign Database Processing:
Database Administration Chapter Six DAVID M. KROENKE’S DATABASE CONCEPTS, 2 nd Edition.
Transaction Management and Concurrency Control
Prentice Hall © COS 346 Day Agenda Questions? Assignment 8 Due Assignment 9 posted –Due April 2:05 PM Quiz 2 Today –SQL Chaps 2-19.
Transaction Management and Concurrency Control
Fundamentals, Design, and Implementation, 9/e Chapter 11 Managing Databases with SQL Server 2000.
Transaction Management and Concurrency Control
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall 4-1 David M. Kroenke Database Processing Chapter 9 Managing Multi- User.
Prentice Hall © COS 346 Day Agenda Questions? Assignment 7 Corrected –4 A’s and 4 B’s Assignment 8 posted –Due April 6 Quiz 2 next class.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 10 Transaction Management and Concurrency Control.
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall 9-1 COS 346 Day 19.
COS 346 DAY 17.
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall 8-1 COS 346 Day 18.
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall 9-1 COS 346 Day 19.
Database Administration
Database Administration Chapter Six DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
David M. Kroenke and David J
Database Administration Part 1 Chapter Six CSCI260 Database Applications.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 10 Transaction Management and Concurrency Control.
Transaction Management and Concurrency Control
1 IT420: Database Management and Organization Transactions 31 March 2006 Adina Crăiniceanu
Database Administration
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall 9-1 Managing Multiuser Databases.
BIS Database Systems School of Management, Business Information Systems, Assumption University A.Thanop Somprasong Chapter # 10 Transaction Management.
1 IT420: Database Management and Organization Database Security 5 April 2006 Adina Crăiniceanu
David M. Kroenke and David J. Auer Database Processing Fundamentals, Design, and Implementation Chapter Nine: Managing Multiuser Databases 9-1 KROENKE.
Ch 10: Transaction Management and Concurrent Control.
1 IT420: Database Management and Organization Session Control Managing Multi-user Databases 24 March 2006 Adina Crăiniceanu
Prentice Hall © COS 346 Day Agenda Questions? Assignment 8 not Corrected yet Assignment 9 posted –Due April 2:05 PM Quiz 2 Corrected.
Database Processing: Fundamentals, Design, and Implementation, 9/e by David M. KroenkeChapter 9/1 Copyright © 2004 Please……. No Food Or Drink in the class.
© 2002 by Prentice Hall 1 Database Administration David M. Kroenke Database Concepts 1e Chapter 6 6.
Fundamentals, Design, and Implementation, 9/e Chapter 9 Managing Multi-User Databases.
Database Processing: Fundamentals, Design, and Implementation, 9/e by David M. KroenkeChapter 8/1 Copyright © 2004 Please……. No Food Or Drink in the class.
Fundamentals, Design, and Implementation, 9/e Chapter 8 Database Redesign.
1 IT420: Database Management and Organization Managing Multi-user Databases 29 March 2006 Adina Crăiniceanu
© 2006 ITT Educational Services Inc. Course Name: IT390 Business Database Administration Unit 8 Slide 1 IT 390 Business Database Administration Unit 8:
KROENKE and AUER - DATABASE CONCEPTS (3 rd Edition) © 2008 Pearson Prentice Hall 6-1 Chapter Objectives Understand the need for and importance of database.
David M. Kroenke and David J. Auer Database Processing: F undamentals, Design, and Implementation Chapter Nine: Managing Multiuser Databases 9-1 KROENKE.
10 1 Chapter 10_B Concurrency Control Database Systems: Design, Implementation, and Management, Rob and Coronel.
3 Database Systems: Design, Implementation, and Management CHAPTER 9 Transaction Management and Concurrency Control.
Fundamentals, Design, and Implementation, 9/e COS 346 DAY 17.
David M. Kroenke and David J. Auer Database Processing Fundamentals, Design, and Implementation Chapter Eight: Database Redesign.
David M. Kroenke and David J. Auer Database Processing Fundamentals, Design, and Implementation Chapter Nine: Managing Multiuser Databases.
Chapter 5 Managing Multi-user Databases 1. Multi-User Issues Database Administration Concurrency Control Database Security Database Recovery Page 307.
Chapter 13 Managing Transactions and Concurrency Database Principles: Fundamentals of Design, Implementation, and Management Tenth Edition.
SYSTEMS IMPLEMENTATION TECHNIQUES TRANSACTION PROCESSING DATABASE RECOVERY DATABASE SECURITY CONCURRENCY CONTROL.
Chapter 5 Database Design
Managing Multi-User Databases
David M. Kroenke and David J
David M. Kroenke and David J
Transaction Management and Concurrency Control
Functions of a Database Management System
Database Administration
Managing Multi-user Databases
Chapter 9 Managing Multi-User Databases
Multi-User Databases Chapter 9.
Transaction Management
Database Processing: David M. Kroenke’s Chapter Nine: Part One
Database Processing: David M. Kroenke’s Chapter Nine: Part Two
Transactions, Locking and Query Optimisation
Chapter 10 Transaction Management and Concurrency Control
Introduction of Week 13 Return assignment 11-1 and 3-1-5
Chapter 8 Database Redesign
Database Administration
Chapter 11 Managing Databases with SQL Server 2000
Database Processing: David M. Kroenke’s Chapter Nine: Part Two
Presentation transcript:

Chapter 8 Database Redesign Instructor: Dragomir R. Radev Fall 2005

Need For Database Redesign Database redesign is necessary To fix mistakes made during the initial database design To adapt the database to changes in system requirements New information systems cause changes in systems requirements because information systems and organizations create each other When a new system is installed, users can behave in new ways As the users behave in the new ways, they will want changes to the system to accommodate their new behaviors Copyright © 2004

Database Redesign Three principles for database redesign: Measure twice and cut once: understand the current structure and contents of the database before making any structure changes Test the new changes on a test database before making real changes Create a complete backup of the operational database before making any structure changes Technique: reverse engineering (RE) Copyright © 2004

Reverse Engineering Reverse engineering is the process of reading and producing a data model from a database schema A reverse engineered (RE) data model Provides a basis to begin the database redesign project Is neither truly a conceptual nor an internal schema as it has characteristics of both Should be carefully reviewed because it almost always has missing information Copyright © 2004

Example: RE Data Model Copyright © 2004

Database Backup and Test Databases Before making any changes to an operational database A complete backup of the operational database should be made Any proposed changes should be thoroughly tested Three different copies of the database schema used in the redesign process A small test database for initial testing A large test database for secondary testing The operational database Copyright © 2004

Database Redesign Changes Changing tables and columns Changing table names Adding and dropping table columns Changing data type or constraints Adding and dropping constraints Changing relationships Changing cardinalities Adding and deleting relationships Adding and removing relationship for de-normalization Copyright © 2004

Changing Table Names There is no SQL-92 command to change table name The table needs to be re-created under the new name, tested, and the old table is dropped Changing a table name has a surprising number of potential consequences Therefore, using views defined as table aliases is more appropriate Only views that define the aliases would need to be changed when the source table name is changed Copyright © 2004

Adding Columns To add null columns to a table ALTER TABLE WORK ADD COLUMN DateCreated Date NULL; Other column constraints, e.g., DEFAULT or UNIQUE, may be included with the column definition Newly added DEFAULT constraint will be applied to only new rows, existing rows will have null values Three steps to add a NOT NULL column: Add the column as NULL Add data to every row Alter the column constraint to NOT NULL Copyright © 2004

Dropping Columns To drop non-key columns ALTER TABLE WORK DROP COLUMN DateCreated; To drop a foreign key column, the foreign key constraint must first be dropped To drop the primary key, all foreign keys using the primary key must first be dropped; follow by dropping the primary key constraint Copyright © 2004

Changing Data Type or Constraints Use the ALTER TABLE ALTER COLUMN to change data types and constraints For some changes, data will be lost or the DBMS may refuse the change To change a constraint from NULL to NOT NULL, all rows must have a value first Copyright © 2004

Changing Data Type or Constraints Converting more specific data type, e.g., date, money, and numeric, to char or varchar will usually succeed Changing a data type from char or varchar to a more specific type can be a problem Example ALTER TABLE ARTIST ALTER COLUMN Birthdate Numeric (4,0) NULL; Copyright © 2004

Adding and Dropping Constraints Use the ALTER TABLE ADD (DROP) CONSTRAINT to add (remove) constraints Example ALTER TABLE ARTIST ADD CONSTRAINT NumericBirthYearCheck CHECK (Birthdate > 1900 and Birthdate < 2100); Copyright © 2004

Changing Minimum Cardinalities On the parent side: To change from zero to one, change the foreign key constraint from NULL to NOT NULL Can only be done if all the rows in the table have a value To change from one to zero, change the foreign key constraint from NOT NULL to NULL On the child side: Add (to change from zero to one) or drop (to change from one to zero) triggers that enforce the constraint Copyright © 2004

Changing Maximum Cardinalities Changing from 1:1 to 1:N If the foreign key is in the correct table, remove the unique constraint on the foreign key column If the foreign key is in the wrong table, move the foreign key to the correct table and do not place a unique constraint on that table Changing from 1:N to N:M Build a new intersection table and move the key and foreign key values to the intersection table Copyright © 2004

Reducing Cardinalities Reducing cardinalities may result in data loss Reducing N:M to 1:N Create a foreign key in the parent table and move one value from the intersection table into that foreign key Reducing 1:N to 1:1 Remove any duplicates in the foreign key and then set a uniqueness constraint on that key Copyright © 2004

Adding and Deleting Relationships Adding new tables and relationships Add the tables and relationships using CREATE TABLE statements with FOREIGN KEY constraints If an existing table has a child relationship to the new table, add a FOREIGN KEY constraint using the existing table Deleting relationships and tables Drop the foreign key constraints and then drop the tables Copyright © 2004

Adding Tables and Relationships for Normalization Steps: Use correlated subqueries to determine whether the normalization assumption is justified If not, fix the data before proceeding Create a new table and move the normalized data into the new table Define the appropriate foreign key Copyright © 2004

Removing Relationships for Denormalization Steps: Define the new columns in the table to be denormalized Fill the table with existing data Drop the child table and relationship Copyright © 2004

Forward Engineering Forward engineering is the process of applying data model changes to an existing database Results of forward engineering should be tested before using it on an operational database Some tools will show the SQL that will execute during the forward engineering process If so, that SQL should be carefully reviewed Copyright © 2004

Chapter 9 Managing Multi-User Databases Instructor: Dragomir R. Radev Winter 2005

Database Administration All large and small databases need database administration Data administration refers to a function concerning all of an organization’s data assets Database administration (DBA) refers to a person or office specific to a single database and its applications Copyright © 2004

DBA Tasks Managing database structure Controlling concurrent processing Managing processing rights and responsibilities Developing database security Providing for database recovery Managing the DBMS Maintaining the data repository Copyright © 2004

Managing Database Structure DBA’s tasks: Participate in database and application development Assist in requirements stage and data model creation Play an active role in database design and creation Facilitate changes to database structure Seek community-wide solutions Assess impact on all users Provide configuration control forum Be prepared for problems after changes are made Maintain documentation Copyright © 2004

Concurrency Control Concurrency control ensures that one user’s work does not inappropriately influence another user’s work No single concurrency control technique is ideal for all circumstances Trade-offs need to be made between level of protection and throughput Copyright © 2004

Atomic Transactions A transaction, or logical unit of work (LUW), is a series of actions taken against the database that occurs as an atomic unit Either all actions in a transaction occur or none of them do Copyright © 2004

Example: Atomic Transaction Copyright © 2004

Example: Atomic Transaction Copyright © 2004

Concurrent Transaction Concurrent transactions refer to two or more transactions that appear to users as they are being processed against a database at the same time In reality, CPU can execute only one instruction at a time Transactions are interleaved meaning that the operating system quickly switches CPU services among tasks so that some portion of each of them is carried out in a given interval Concurrency problems: lost update and inconsistent reads Copyright © 2004

Example: Concurrent Transactions Copyright © 2004

Example: Lost Update Problem Copyright © 2004

Resource Locking Resource locking prevents multiple applications from obtaining copies of the same record when the record is about to be changed Copyright © 2004

Lock Terminology Implicit locks are locks placed by the DBMS Explicit locks are issued by the application program Lock granularity refers to size of a locked resource Rows, page, table, and database level Large granularity is easy to manage but frequently causes conflicts Types of lock An exclusive lock prohibits other users from reading the locked resource A shared lock allows other users to read the locked resource, but they cannot update it Copyright © 2004

Example: Explicit Locks Copyright © 2004

Serializable Transactions Serializable transactions refer to two transactions that run concurrently and generate results that are consistent with the results that would have occurred if they had run separately Two-phased locking is one of the techniques used to achieve serializability Copyright © 2004

Two-phased Locking Two-phased locking Transactions are allowed to obtain locks as necessary (growing phase) Once the first lock is released (shrinking phase), no other lock can be obtained A special case of two-phased locking Locks are obtained throughout the transaction No lock is released until the COMMIT or ROLLBACK command is issued This strategy is more restrictive but easier to implement than two-phase locking Copyright © 2004

Deadlock Deadlock, or the deadly embrace, occurs when two transactions are each waiting on a resource that the other transaction holds Preventing deadlock Allow users to issue all lock requests at one time Require all application programs to lock resources in the same order Breaking deadlock Almost every DBMS has algorithms for detecting deadlock When deadlock occurs, DBMS aborts one of the transactions and rollbacks partially completed work Copyright © 2004

Example: Deadlock Copyright © 2004

Optimistic/Pessimistic Locking Optimistic locking assumes that no transaction conflict will occur DBMS processes a transaction; checks whether conflict occurred If not, the transaction is finished If so, the transaction is repeated until there is no conflict Pessimistic locking assumes that conflict will occur Locks are issued before transaction is processed, and then the locks are released Optimistic locking is preferred for the Internet and for many intranet applications Copyright © 2004

Example: Optimistic Locking Copyright © 2004

Example: Pessimistic Locking Copyright © 2004

Declaring Lock Characteristics Most application programs do not explicitly declare locks due to its complication Instead, they mark transaction boundaries and declare locking behavior they want the DBMS to use Transaction boundary markers: BEGIN, COMMIT, and ROLLBACK TRANSACTION Advantage If the locking behavior needs to be changed, only the lock declaration need be changed, not the application program Copyright © 2004

Example: Marking Transaction Boundaries Copyright © 2004

ACID Transactions Acronym ACID transaction is one that is Atomic, Consistent, Isolated, and Durable Atomic means either all or none of the database actions occur Durable means database committed changes are permanent Copyright © 2004

ACID Transactions (cont.) Consistency means either statement level or transaction level consistency Statement level consistency: each statement independently processes rows consistently Transaction level consistency: all rows impacted by either of the SQL statements are protected from changes during the entire transaction With transaction level consistency, a transaction may not see its own changes Copyright © 2004

ACID Transactions (cont.) Isolation means application programmers are able to declare the type of isolation level and to have the DBMS manage locks so as to achieve that level of isolation SQL-92 defines four transaction isolation levels: Read uncommitted Read committed Repeatable read Serializable Copyright © 2004

Transaction Isolation Level Copyright © 2004

Cursor Type A cursor is a pointer into a set of records It can be defined using SELECT statements Four cursor types Forward only: the application can only move forward through the recordset Scrollable cursors can be scrolled forward and backward through the recordset Static: processes a snapshot of the relation that was taken when the cursor was opened Keyset: combines some features of static cursors with some features of dynamic cursors Dynamic: a fully featured cursor Choosing appropriate isolation levels and cursor types is critical to database design Copyright © 2004

Database Security Database security ensures that only authorized users can perform authorized activities at authorized times Developing database security Determine users’ processing rights and responsibilities Enforce security requirements using security features from both DBMS and application programs Copyright © 2004

DBMS Security DBMS products provide security facilities They limit certain actions on certain objects to certain users or groups Almost all DBMS products use some form of user name and password security Copyright © 2004

DBMS Security Model Copyright © 2004

DBMS Security Guidelines Run DBMS behind a firewall, but plan as though the firewall has been breached Apply the latest operating system and DBMS service packs and fixes Use the least functionality possible Support the fewest network protocols possible Delete unnecessary or unused system stored procedures Disable default logins and guest users, if possible Unless required, never allow all users to log on to the DBMS interactively Protect the computer that runs the DBMS No user allowed to work at the computer that runs the DBMS DBMS computer physically secured behind locked doors Access to the room containing the DBMS computer should be recorded in a log Copyright © 2004

DBMS Security Guidelines (cont.) Manage accounts and passwords Use a low privilege user account for the DBMS service Protect database accounts with strong passwords Monitor failed login attempts Frequently check group and role memberships Audit accounts with null passwords Assign accounts the lowest privileges possible Limit DBA account privileges Planning Develop a security plan for preventing and detecting security problems Create procedures for security emergencies and practice them Copyright © 2004

Application Security If DBMS security features are inadequate, additional security code could be written in application program Application security in Internet applications is often provided on the Web server computer However, you should use the DBMS security features first The closer the security enforcement is to the data, the less chance there is for infiltration DBMS security features are faster, cheaper, and probably result in higher quality results than developing your own Copyright © 2004

SQL Injection Attack SQL injection attack occurs when data from the user is used to modify a SQL statement User input that can modify a SQL statment must be carefully edited to ensure that only valid input has been received and that no additional SQL syntax has been entered Example: users are asked to enter their names into a Web form textbox User input: Benjamin Franklin ' OR TRUE ' SELECT * FROM EMPLOYEE WHERE EMPLOYEE.Name = 'Benjamin Franklin' OR TRUE; Result: every row of the EMPLOYEE table will be returned Copyright © 2004

Database Recovery In the event of system failure, that database must be restored to a usable state as soon as possible Two recovery techniques: Recovery via reprocessing Recovery via rollback/rollforward Copyright © 2004

Recovery via Reprocessing Recovery via reprocessing: the database goes back to a known point (database save) and reprocesses the workload from there Unfeasible strategy because The recovered system may never catch up if the computer is heavily scheduled Asynchronous events, although concurrent transactions, may cause different results Copyright © 2004

Rollback/Rollforward Recovery via rollback/rollforward: Periodically save the database and keep a database change log since the save Database log contains records of the data changes in chronological order When there is a failure, either rollback or rollforward is applied Rollback: undo the erroneous changes made to the database and reprocess valid transactions Rollforward: restored database using saved data and valid transactions since the last save Copyright © 2004

Example: Rollback Before-images: a copy of every database record (or page) before it was changed Copyright © 2004

Example: Rollforward After-images: a copy of every database record (or page) after it was changed Copyright © 2004

Example: Transaction Log Copyright © 2004

Example: Database Recovery Copyright © 2004

Example: Database Recovery Copyright © 2004

Checkpoint A checkpoint is a point of synchronization between the database and the transaction log DBMS refuses new requests, finishes processing outstanding requests, and writes its buffers to disk The DBMS waits until the writing is successfully completed  the log and the database are synchronized Checkpoints speed up database recovery process Database can be recovered using after-images since the last checkpoint Checkpoint can be done several times per hour Most DBMS products automatically checkpoint themselves Copyright © 2004

Managing the DBMS DBA’s Responsibilities Generate database application performance reports Investigate user performance complaints Assess need for changes in database structure or application design Modify database structure Evaluate and implement new DBMS features Tune the DBMS Copyright © 2004

Maintaining the Data Repository DBA is responsible for maintaining the data repository Data repositories are collections of metadata about users, databases, and its applications The repository may be Virtual as it is composed of metadata from many different sources: DBMS, code libraries, Web page generation and editing tools, etc. An integrated product from a CASE tool vendor or from other companies The best repositories are active and they are part of the system development process Copyright © 2004

Chapter 9 Managing Multi-User Databases Instructor: Dragomir R. Radev Winter 2005