Clustering Technology For Scaleability Jim Gray Microsoft Research

Slides:



Advertisements
Similar presentations
Express5800/ft series servers Product Information Fault-Tolerant General Purpose Servers.
Advertisements

Past High Availability Standards Efforts Jim Gray Microsoft
Virtual Disk based Centralized Management for Enterprise Networks
Technology Analysis LINUX Alper Alansal Brian Blumberg Ramank Bharti Taihoon Lee.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice HP and Carrier Network System Availability.
Multiple Processor Systems
Distributed Processing, Client/Server and Clusters
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Introduction to Computer Administration Introduction.
Chapter 9: The Client/Server Database Environment
Ravi Sankar Technology Evangelist | Microsoft
Thanks to Microsoft Azure’s Scalability, BA Minds Delivers a Cost-Effective CRM Solution to Small and Medium-Sized Enterprises in Latin America MICROSOFT.
System Area Network Abhiram Shandilya 12/06/01. Overview Introduction to System Area Networks SAN Design and Examples SAN Applications.
2. Computer Clusters for Scalable Parallel Computing
Chapter 7 LAN Operating Systems LAN Software Software Compatibility Network Operating System (NOP) Architecture NOP Functions NOP Trends.
MCITP Guide to Microsoft Windows Server 2008 Server Administration (Exam #70-646) Chapter 11 Windows Server 2008 Virtualization.
Distributed Processing, Client/Server, and Clusters
Chapter 16 Client/Server Computing Patricia Roy Manatee Community College, Venice, FL ©2008, Prentice Hall Operating Systems: Internals and Design Principles,
Server Platforms Week 11- Lecture 1. Server Market $ 46,100,000,000 ($ 46.1 Billion) Gartner.
Operating Systems CS208. What is Operating System? It is a program. It is the first piece of software to run after the system boots. It coordinates the.
Advanced Distributed Software Architectures and Technology group ADSaT 1 Scalability & Availability Paul Greenfield CSIRO.
1 © Prentice Hall, 2002 The Client/Server Database Environment.
Installing software on personal computer
VMware vCenter Server Module 4.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
ATIF MEHMOOD MALIK KASHIF SIDDIQUE Improving dependability of Cloud Computing with Fault Tolerance and High Availability.
BMC Software confidential. BMC Performance Manager Will Brown.
PMIT-6102 Advanced Database Systems
1 Distributed Processing, Client/Server, and Clusters Chapter 13.
1 In Summary Need more computing power Improve the operating speed of processors & other components constrained by the speed of light, thermodynamic laws,

1 © Prentice Hall, 2002 Chapter 8: The Client/Server Database Environment Modern Database Management 6 th Edition Jeffrey A. Hoffer, Mary B. Prescott,
MBA 664 Database Management Systems Dave Salisbury ( )
What is Architecture  Architecture is a subjective thing, a shared understanding of a system’s design by the expert developers on a project  In the.
1 Lecture 20: Parallel and Distributed Systems n Classification of parallel/distributed architectures n SMPs n Distributed systems n Clusters.
Windows 2000 Advanced Server and Clustering Prepared by: Tetsu Nagayama Russ Smith Dale Pena.
High Performance Computing Cluster OSCAR Team Member Jin Wei, Pengfei Xuan CPSC 424/624 Project ( 2011 Spring ) Instructor Dr. Grossman.
High-Availability Linux.  Reliability  Availability  Serviceability.
A+ Guide to Managing and Maintaining Your PC Fifth Edition Chapter 14 Managing and Troubleshooting Windows 2000.
IMPROUVEMENT OF COMPUTER NETWORKS SECURITY BY USING FAULT TOLERANT CLUSTERS Prof. S ERB AUREL Ph. D. Prof. PATRICIU VICTOR-VALERIU Ph. D. Military Technical.
CLUSTER COMPUTING STIMI K.O. ROLL NO:53 MCA B-5. INTRODUCTION  A computer cluster is a group of tightly coupled computers that work together closely.
Components of Database Management System
© 2005 by Prentice Hall 1 Chapter 9: The Client/Server Database Environment Modern Database Management 7 th Edition Jeffrey A. Hoffer, Mary B. Prescott,
SXe on Windows 2000 Installing Windows 2000 Server, Progress, and SXe.
Server Systems Administration. Types of Servers Small Servers –Usually are PCs –Need a PC Server Operating System (SOS) such as Microsoft Windows Server,
EIDE Design Considerations 1 EIDE Design Considerations Brian Wright Portland General Electric.
PARALLEL PROCESSOR- TAXONOMY. CH18 Parallel Processing {Multi-processor, Multi-computer} Multiple Processor Organizations Symmetric Multiprocessors Cache.
HNC COMPUTING - Network Concepts 1 Network Concepts Network Concepts Network Operating Systems Network Operating Systems.
1 CEG 2400 Fall 2012 Network Servers. 2 Network Servers Critical Network servers – Contain redundant components Power supplies Fans Memory CPU Hard Drives.
2: Operating Systems Networking for Home & Small Business.
CLIENT SERVER COMPUTING. We have 2 types of n/w architectures – client server and peer to peer. In P2P, each system has equal capabilities and responsibilities.
Lecture 17 Page 1 CS 111 Online Single System Image Approaches Built a distributed system out of many more- or-less traditional computers – Each with typical.
SYSTEM MODELS FOR ADVANCED COMPUTING Jhashuva. U 1 Asst. Prof CSE
Chapter 16 Client/Server Computing Dave Bremer Otago Polytechnic, N.Z. ©2008, Prentice Hall Operating Systems: Internals and Design Principles, 6/E William.
Lecture 13 Parallel Processing. 2 What is Parallel Computing? Traditionally software has been written for serial computation. Parallel computing is the.
Chapter 9: The Client/Server Database Environment
Douglas Potter IBI Minneapolis User Group November 2008
High Availability 24 hours a day, 7 days a week, 365 days a year…
Managing Multi-User Databases
Netscape Application Server
The Client/Server Database Environment
Clustering Technology For Fault Tolerance
Distributed System Structures 16: Distributed Structures
Chapter 17: Database System Architectures
Scaleout vs. Scaleup Robert Barnes Microsoft
Chapter 17 Parallel Processing
Introduction To Distributed Systems
Database System Architectures
Distributed Systems and Concurrency: Distributed Systems
Presentation transcript:

Clustering Technology For Scaleability Jim Gray Microsoft Research

Cluster: Shared What? 4 Shared Memory Multiprocessor –Multiple processors, one memory –all devices are local –DEC, SG, Sun Sequent nodes –easy to program, not commodity 4 Shared Disk Cluster –an array of nodes –all shared common disks –VAXcluster + Oracle 4 Shared Nothing Cluster –each device local to a node –ownership may change –Tandem, SP2, Wolfpack

The Answer: BOTH SMP and Cluster? Grow Up with SMP 4xP6 is now standard Grow Out with Cluster Cluster has inexpensive parts Cluster of PCs

Clusters being built 4 Teradata 500 nodes (50k$/slice) 4 Tandem,VMScluster 150 nodes (100k$/slice) 4 Intel, 9,000 55M$ ( 6k$/slice) 4 IBM: m$ (200k$/slice) 4 PC clusters (bare handed) at dozens of nodes web servers (msn, PointCast,…), DB servers 4 KEY TECHNOLOGY HERE IS THE APPS. –Apps distribute data –Apps distribute execution

So, Whats New? 4 When slices cost 50k$, you buy 10 or When slices cost 5k$ you buy 100 or Manageability, programmability, usability become key issues (total cost of ownership). 4 PCs are MUCH easier to use and program

So, Whats New? 4 PCs create virtuous cycle New MPP & NewOS New App New MPP & NewOS New App New MPP & NewOS New App New MPP & NewOS New App Standard OS & Hardware Apps Customers Vicious Cycle No Customers! Virtuous Cycle: Standards allow progress and investment protection

What is Wolfpack? 4 A consortium of 60 HW & SW vendors (everybody who is anybody) 4 A set of APIs for clustering and fault tolerance 4 An enhancement to NT Server (in beta test ) 4 Key concepts –System: a particular node –Cluster: a collection of systems working together –resource: a hardware or software module –resource dependency: one resource needs another –resource group: fails over as a unit: dependencies do not cross group boundaries

What is Wolfpack? Cluster Api DLL Database Manager Event Processor Node Manager Failover Mgr ResourceMgr Communication Manager Resource Monitors Cluster Service Cluster Management Tools Physical Resource DLL Logical Resource DLL App Resource DLL Resource Management Interface App Resource DLL Non Aware App Cluster Aware App RPC Global Update Manager Open Online IsAlive LooksAlive Offline Close Other Nodes

Cluster Advantages 4 Clients and Servers made from the same stuff. –Inexpensive: Built with commodity components 4 Fault tolerance: –Spare modules mask failures 4 Modular growth –grow by adding small modules 4 Parallel data search –use multiple processors and disks

Single System Image:Is It Important? 4 Yes, if you dont have it you fail –parallel MPPs vs Tandem, Teradata, VAXcluster. 4 NUMA & Cluster: –some things are farther away. –Must program in parallel to utilize multiple cpus, disks, wires 4 OS, DBMS, TPmonitor, Web Server, ORB give transparency: load balance data and programs. 4 Administrator, Programmer, User –do not want to know about program & data location

What Happens When a Component Fails? 4 Redundant disk or path: configure around it. 4 Non-redundant software: restart. 4 Non-redundant hardware: migrate software to surviving nodes. 4 Fault detection: 1 ms to 10 sec. 4 Failover.1 sec to 1 min. 4 This is standard in Tandem, Teradata, VMScluster

What are Support Costs? 4 Cluster lowers support costs by –masking failures (instant repair via spare modules) –allowing online maintenance and upgrades. 4 Commodity parts are much cheaper –10$/MIPS vs 10,000$/MIPS –1k$/OS vs 30K$/month/OS 4 Moden OSs are easier to install, configure, manage –GUI –Self-tuning –Online and task-based help –Built in wizards