Midterm 2: April 28th Material:   Query processing and Optimization, Chapters 12 and 13 (ignore 12.5.5, 12.7, 13.4.4 and 13.5)   Transactions, Chapter.

Presentation transcript:

Midterm 2: April 28th
Material:
- Query Processing and Optimization, Chapters 12 and 13 (ignore 12.5.5, 12.7, 13.4.4, and 13.5)
- Transactions, Chapter 14
- Concurrency Control, Chapter 15, ignore 15.7 to
- Recovery System, Chapter 16 (ignore 16.8 and 16.9)
- Google File System
- LRU-K, article by the O'Neils and Weikum
- Continuous Media, article by Ghandeharizadeh & Muntz (first 11 pages)
- COSAR-CQN

Enterprise Data Management
Shahram Ghandeharizadeh
Computer Science Department, University of Southern California

Challenge: Managing Data is Expensive
Cost of managing data is $100K/TB/year:
- Downtime is estimated at thousands of dollars per minute.
- Loss of data results in lost productivity: 20 megabytes of accounting data requires 21 days and costs $19K to reproduce.
- 50% of companies that lose their data due to a disaster never reopen; 90% go out of business within 2 years!

Centralize Management of Storage
Before:
- Data stored locally.
After:
- Data stored across the network at a central location.
[Figure labels: Network, Data]

Centralize Management of Storage
Advantages:
- Many clients share storage and data: data remains available when a client fails.

Centralize Management of Storage
Advantages:
- Many clients share storage and data.
- Redundancy is implemented in one place, protecting all clients from disk failure.
- Centralized backup: the administrator does not need to know how many clients on the network share the storage.
[Figure labels: Network, High Availability, Data Backup, Data Sharing]

Network Failures
What about network failures?
- Two host bus adapters per server,
- Each server connected to a different switch.
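A minimal sketch of how the redundant paths above could mask a single network failure. It assumes each host bus adapter is cabled to its own switch; the `Path` and `Server` classes and the names `hba0`, `switchA`, etc. are hypothetical and not taken from the slides.

```python
from dataclasses import dataclass, field


@dataclass
class Path:
    """One route from a server to storage: an adapter plus the switch it is cabled to."""
    hba: str
    switch: str
    up: bool = True


@dataclass
class Server:
    name: str
    paths: list = field(default_factory=list)

    def send_io(self, request: str) -> str:
        """Issue the request on the first healthy path; fail only if every path is down."""
        for path in self.paths:
            if path.up:
                return f"{request} via {path.hba}/{path.switch}"
        raise ConnectionError(f"{self.name}: no path to storage")


# Hypothetical server with two adapters, each on a different switch.
server = Server("db1", [Path("hba0", "switchA"), Path("hba1", "switchB")])
before = server.send_io("read block 7")
server.paths[0].up = False  # the first adapter or its switch fails
after = server.send_io("read block 7")
print(before)
print(after)
```

With one path down, the request still reaches storage over the second adapter and switch; only the loss of both paths surfaces as an error.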

Centralize Management of Storage
Storage Area Network (SAN):
- Block-level access,
- Write to storage is immediate,
- Specialized hardware including switches, host bus adapters, disk chassis, battery-backed caches, etc.,
- Expensive,
- Supports transaction processing systems.
Network Attached Storage (NAS):
- File-level access,
- Write to storage might be delayed,
- Generic hardware,
- Inexpensive,
- Not appropriate for transaction processing systems.
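One plausible reading of why delayed writes matter for transaction processing can be sketched as follows. The slides do not spell out the reasoning, so this is an illustration under assumptions: `ImmediateStore` and `DelayedStore` are hypothetical models, and a "crash" simply discards anything not yet on stable storage.

```python
class ImmediateStore:
    """Models immediate writes: data is on stable storage when write() returns."""

    def __init__(self):
        self.stable = {}

    def write(self, key, value):
        self.stable[key] = value

    def crash(self):
        pass  # nothing is buffered, so nothing is lost


class DelayedStore:
    """Models delayed writes: data sits in a volatile buffer until flush()."""

    def __init__(self):
        self.stable = {}
        self.buffer = {}

    def write(self, key, value):
        self.buffer[key] = value  # acknowledged before reaching stable storage

    def flush(self):
        self.stable.update(self.buffer)
        self.buffer.clear()

    def crash(self):
        self.buffer.clear()  # buffered writes vanish with the failure


immediate, delayed = ImmediateStore(), DelayedStore()
for store in (immediate, delayed):
    store.write("balance", 90)  # the application reports the transaction as committed
    store.crash()               # failure strikes before any flush

print(immediate.stable)
print(delayed.stable)
```

In this model the committed update survives the crash only on the store that writes immediately, which is the durability property a transaction processing system depends on.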

Storage Area Network
Centralize management of storage:
- Storage Area Networks (SANs),
- Redundancy in data to tolerate disk failures,
- Regular backup,
- Disaster recovery.

Concepts and Terminology
Virtualization:
- Available storage is represented as one HUGE disk drive, e.g., a SAN with a thousand 1.5 TB disks provides on the order of 1 Petabyte of storage (1,000 x 1.5 TB = 1.5 Petabytes raw),
- Available storage is partitioned into Logical Unit Numbers (LUNs),
- A LUN is presented to one or more servers,
- A LUN appears as a disk drive to a server,
- The SAN places blocks across physical disks intelligently to balance load.
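The virtualization idea above can be sketched as a mapping from LUN block numbers to physical disks. This is a simplified illustration: round-robin striping stands in for whatever "intelligent" placement a real SAN uses, and the `San` class and its methods are hypothetical.

```python
class San:
    """Pools identical disks into one virtual address space and carves it into LUNs."""

    def __init__(self, num_disks, blocks_per_disk):
        self.num_disks = num_disks
        self.capacity = num_disks * blocks_per_disk
        self.next_free = 0  # next unallocated block in the pooled address space
        self.luns = {}      # LUN name -> (first pool block, size in blocks)

    def create_lun(self, name, size_blocks):
        if self.next_free + size_blocks > self.capacity:
            raise ValueError("not enough free storage in the pool")
        self.luns[name] = (self.next_free, size_blocks)
        self.next_free += size_blocks

    def locate(self, name, lun_block):
        """Translate a block of a LUN into (disk number, block offset on that disk)."""
        start, size = self.luns[name]
        if not 0 <= lun_block < size:
            raise IndexError("block outside the LUN")
        pool_block = start + lun_block
        # Round-robin striping: consecutive blocks land on consecutive disks.
        return pool_block % self.num_disks, pool_block // self.num_disks


san = San(num_disks=4, blocks_per_disk=100)
san.create_lun("lunA", 8)
san.create_lun("lunB", 6)
print([san.locate("lunA", b)[0] for b in range(8)])  # disks touched by lunA
print(san.locate("lunB", 0))
```

A server sees `lunA` as a single 8-block disk, while its blocks are actually spread over all four physical disks, so sequential access is served by several spindles at once.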

Question
Is it possible to present the same LUN to two different servers simultaneously? Yes!
Can two different servers read and write the files stored on the presented LUN? Yes!
- What are the consequences?
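The slides leave the consequences as an open question. One commonly cited consequence, sketched here under assumptions, is silent corruption when each server keeps its own private view of the file system metadata. `SharedLun`, `Server`, and the free-list scheme are hypothetical simplifications.

```python
class SharedLun:
    """A LUN presented to several servers at once; just an array of blocks."""

    def __init__(self, num_blocks):
        self.blocks = [None] * num_blocks


class Server:
    """Mounts the LUN with a private free list and no knowledge of other servers."""

    def __init__(self, name, lun):
        self.name = name
        self.lun = lun
        # Free list is computed once at mount time and never refreshed.
        self.free = [i for i, b in enumerate(lun.blocks) if b is None]

    def write_file(self, data):
        block = self.free.pop(0)  # allocate from the private, possibly stale, free list
        self.lun.blocks[block] = (self.name, data)
        return block


lun = SharedLun(4)
a, b = Server("A", lun), Server("B", lun)  # both mount the same empty LUN
block_a = a.write_file("payroll")
block_b = b.write_file("orders")
print(block_a, block_b, lun.blocks[0])
```

Both servers pick the same block, and server A's file is overwritten without either side noticing. Sharing a LUN for writes therefore needs coordination above the block layer, such as a cluster-aware file system or a lock manager.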

Concepts: Backup
Snapshot: state of a LUN at one instant in time.
Copy-on-write:
- A snapshot consists of the original blocks of a LUN,
- Every time an application writes a block, the SAN generates a new copy for the current LUN (the snapshot maintains the original).
Advantage: copies of blocks in support of backup are generated on demand.

Copy-on-Write
Original LUN and snapshot taken at midnight Sunday morning. Writing block 5 changes the current LUN: as blocks are written, the physical blocks of the snapshot materialize.
[Figure: the current LUN holds the new block 5; the snapshot holds the old block 5]
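The copy-on-write behavior described above can be sketched with a block map per view of the LUN. This is a minimal illustration, not how any particular SAN implements it: the `CowLun` class is hypothetical, and for simplicity every write allocates a fresh physical block even when no snapshot exists.

```python
class CowLun:
    """A LUN whose snapshots share physical blocks with the current version."""

    def __init__(self, contents):
        self.store = dict(enumerate(contents))     # physical block id -> data
        self.current = list(range(len(contents)))  # logical block -> physical block id
        self.next_id = len(contents)

    def snapshot(self):
        """Copy only the block map; no data blocks are duplicated."""
        return list(self.current)

    def write(self, logical, data):
        """Place new data in a new physical block; the old block stays for snapshots."""
        self.store[self.next_id] = data
        self.current[logical] = self.next_id
        self.next_id += 1

    def read(self, logical, block_map=None):
        """Read through the current map, or through a snapshot's map if given."""
        mapping = self.current if block_map is None else block_map
        return self.store[mapping[logical]]


lun = CowLun([f"old {i}" for i in range(8)])
sunday = lun.snapshot()   # snapshot taken before any writes
lun.write(5, "new 5")     # only now do the two views diverge
diverged = sum(s != c for s, c in zip(sunday, lun.current))
print(lun.read(5), "|", lun.read(5, sunday), "|", diverged)
```

After the write, the current LUN returns the new block 5 while the snapshot still returns the old one, and the two views differ in exactly one physical block. A backup can read through the snapshot map at leisure while applications keep writing.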

Hot Standby
An inexpensive server that is maintained on the side to assume responsibility for a failed server.
Goal: minimize downtime.
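One common way to decide when a hot standby should take over is a heartbeat with a timeout. The slides do not describe a detection mechanism, so the sketch below is an assumption; `HeartbeatMonitor`, its `timeout` parameter, and the explicit `now` timestamps are hypothetical.

```python
class HeartbeatMonitor:
    """Promotes the standby when the primary has been silent for longer than timeout."""

    def __init__(self, timeout):
        self.timeout = timeout
        self.last_beat = 0.0
        self.active = "primary"

    def beat(self, now):
        """Record a heartbeat received from the primary at time `now` (seconds)."""
        self.last_beat = now

    def check(self, now):
        """Return which server should be serving requests at time `now`."""
        if self.active == "primary" and now - self.last_beat > self.timeout:
            self.active = "standby"  # failover: the standby assumes responsibility
        return self.active


monitor = HeartbeatMonitor(timeout=3.0)
monitor.beat(0.0)
monitor.beat(1.0)              # last heartbeat before the primary fails
early = monitor.check(2.0)     # 1 second of silence: still within the timeout
late = monitor.check(5.0)      # 4 seconds of silence: exceeds the timeout
print(early, late)
```

In this model the timeout bounds how long a failure goes unnoticed, so a shorter timeout reduces downtime at the risk of promoting the standby on a brief network hiccup.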

Summary
- SAN and NAS are shared-disk architectures,
- SAN is appropriate for transaction processing systems,
- Hardware alone is not a substitute for a parallel, high-performance transaction processing system, e.g., Teradata, Oracle RAC, etc.