Large Scale Parallel File System and Cluster Management ICT, CAS.

Slides:



Advertisements
Similar presentations
Virtual Disk based Centralized Management for Enterprise Networks
Advertisements

Cloud Computing to Satisfy Peak Capacity Needs Case Study.
Adam Duffy Edina Public Schools.  The heart of virtualization is the “virtual machine” (VM), a tightly isolated software container with an operating.
Novell Server Linux vs. windows server 2008 By: Gabe Miller.
Copyright 2009 FUJITSU TECHNOLOGY SOLUTIONS PRIMERGY Servers and Windows Server® 2008 R2 Benefit from an efficient, high performance and flexible platform.
Silicon Graphics, Inc. Poster Presented by: SGI Proprietary Technologies for Breakthrough Research Rosario Caltabiano North East Higher Education & Research.
1 Recap (RAID and Storage Architectures). 2 RAID To increase the availability and the performance (bandwidth) of a storage system, instead of a single.
What is it? Hierarchical storage software developed in collaboration with five US department of Energy Labs since 1992 Allows storage management of 100s.
NPACI: National Partnership for Advanced Computational Infrastructure August 17-21, 1998 NPACI Parallel Computing Institute 1 Cluster Archtectures and.
Implementing Failover Clustering with Hyper-V
Session 3 Windows Platform Dina Alkhoudari. Learning Objectives Understanding Server Storage Technologies Direct Attached Storage DAS Network-Attached.
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
Microsoft Load Balancing and Clustering. Outline Introduction Load balancing Clustering.
RAID-x: A New Distributed Disk Array for I/O-Centric Cluster Computing Kai Hwang, Hai Jin, and Roy Ho.
Mass RHIC Computing Facility Razvan Popescu - Brookhaven National Laboratory.
CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Michal Kwiatek, Juraj Sucik, Rafal.
GDC Workshop Session 1 - Storage 2003/11. Agenda NAS Quick installation (15 min) Major functions demo (30 min) System recovery (10 min) Disassembly (20.
Data Deduplication in Virtualized Environments Marc Crespi, ExaGrid Systems
Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Google∗
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
1 The Google File System Reporter: You-Wei Zhang.
© Copyright 2010 Hewlett-Packard Development Company, L.P. 1 HP + DDN = A WINNING PARTNERSHIP Systems architected by HP and DDN Full storage hardware and.
Report : Zhen Ming Wu 2008 IEEE 9th Grid Computing Conference.
N-Tier Client/Server Architectures Chapter 4 Server - RAID Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept RAID – Redundant Array.
Copyright 2009 Fujitsu America, Inc. 0 Fujitsu PRIMERGY Servers “Next Generation HPC and Cloud Architecture” PRIMERGY CX1000 Tom Donnelly April
High Performance Computing Cluster OSCAR Team Member Jin Wei, Pengfei Xuan CPSC 424/624 Project ( 2011 Spring ) Instructor Dr. Grossman.
So, Jung-ki Distributed Computing System LAB School of Computer Science and Engineering Seoul National University Implementation of Package Management.
Ceph Storage in OpenStack Part 2 openstack-ch,
Building the Infrastructure Grid: Architecture, Design & Deployment Logan McLeod – Database Technology Strategist.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
Introduction to Apache Hadoop Zibo Wang. Introduction  What is Apache Hadoop?  Apache Hadoop is a software framework which provides open source libraries.
Virtualization for Storage Efficiency and Centralized Management Genevieve Sullivan Hewlett-Packard
Farm Management D. Andreotti 1), A. Crescente 2), A. Dorigo 2), F. Galeazzi 2), M. Marzolla 3), M. Morandin 2), F.
© 2006 IBM Corporation Flash Copy Solutions im Windows Umfeld TSM for Copy Services Wolfgang Hitzler Technical Sales Tivoli Storage Management
1 Selecting LAN server (Week 3, Monday 9/8/2003) © Abdou Illia, Fall 2003.
Large Scale Test of a storage solution based on an Industry Standard Michael Ernst Brookhaven National Laboratory ADC Retreat Naples, Italy February 2,
Selling the Storage Edition for Oracle November 2000.
Storage Tank in Data Grid Shin, SangYong(syshin, #6468) IBM Grid Computing August 23, 2003.
GStore: GSI Mass Storage ITEE-Palaver GSI Horst Göringer, Matthias Feyerabend, Sergei Sedykh
Microsoft Azure Storage. Networking Compute Storage Virtual Machine Operating System Applications Data & Access Runtime Provision.
CHEP04 Performance Analysis of Cluster File System on Linux Yaodong CHENG IHEP, CAS
Terascala – Lustre for the Rest of Us  Delivering high performance, Lustre-based parallel storage appliances  Simplifies deployment, management and tuning.
VMware vSphere Configuration and Management v6
High Availability in DB2 Nishant Sinha
Sep. 17, 2002BESIII Review Meeting BESIII DAQ System BESIII Review Meeting IHEP · Beijing · China Sep , 2002.
Enabling Technologies for Distributed Computing Dr. Sanjay P. Ahuja, Ph.D. Fidelity National Financial Distinguished Professor of CIS School of Computing,
WINDOWS SERVER 2003 Genetic Computer School Lesson 12 Fault Tolerance.
May l Washington, DC l Omni Shoreham Parallels Virtuozzo Containers Roadmap Andrey Moruga Virtualization Product Manager, Parallels.
CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Juraj Sucik, Michal Kwiatek, Rafal.
Tackling I/O Issues 1 David Race 16 March 2010.
Simple to deploy, easy to manage Consistently rich experience, regardless of deployment model 1 Platform 1 Experience Pooled virtual machines Highest.
Load Rebalancing for Distributed File Systems in Clouds.
Hands-On Microsoft Windows Server 2008 Chapter 7 Configuring and Managing Data Storage.
Extending Auto-Tiering to the Cloud For additional, on-demand, offsite storage resources 1.
BIG DATA/ Hadoop Interview Questions.
Peter Idoine Managing Director Oracle New Zealand Limited.
Page : 1 SC2004 Pittsburgh, November 12, 2004 DEISA : integrating HPC infrastructures in Europe DEISA : integrating HPC infrastructures in Europe Victor.
Bernd Panzer-Steindel CERN/IT/ADC1 Medium Term Issues for the Data Challenges.
PHD Virtual Technologies “Reader’s Choice” Preferred product.
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
Chapter 1: Introduction
Diskless Networks By Sam Morris.
The demonstration of Lustre in EAST data system
Introduction to Data Management in EGI
Large Scale Test of a storage solution based on an Industry Standard
Storage Virtualization
Hadoop Technopoints.
Introduction to Operating Systems
Specialized Cloud Architectures
PerformanceBridge Application Suite and Practice 2.0 IT Specifications
Presentation transcript:

Large Scale Parallel File System and Cluster Management ICT, CAS

About ICT, CAS Institute of Computing Technology, Chinese Academy of Science The first (from 1958) and largest national IT research institute in China The largest graduate school of Computer Science in China Builder of most Chinese systems in HPC TOP 500 Focusing on computing system architecture: CPU, Compiler, Network, Grid, HPC and Storage

Storage Centre of ICT Founded in 2001 Leader: Dr. Xu Lu (from HP Lab) Storage for scientific computing –BWFS: Parallel cluster file system –Service on Demand system: Storage-based cluster management system. Storage for business computing –VSDS: Virtual storage research project –Backup / Virtual Computing……

Initial Goals…… Cluster is the future of scientific computing Avoid the I/O bottleneck – BWFS –Easy to extend to large scale –Both in capacity and speed of parallel data access Reduce the Management Work – Service on Demand system –Easy to use and management when the scale is getting larger –Higher availability facing the hardware fault when the scale is getting larger Make the cluster easy to scale, use, management and access - SUMA

The Storage Bottleneck of Cluster NFS (Network File System) –Most widely used in clusters to provide shared data access –Simple and easy to use and management Scalability Problem –Multiple NFS server means multiple name space –Hard to extend in capacity. –The performance do not increase with the capacity Parallel Access Problem –Poor performance in I/O density computing –Weak MS Windows support

Parallel network file system –Support multiple storage appliances (8-128) in a single name space (Up to 512 TB) –Separated Data and Meta-Data access to provide parallel accessing between different storage appliance Global name space between clients with different platforms –Fully compatible with NFS (not 100% POSIX) –Support data sharing between Linux and Windows clients –Support IA32, IA64 and x86_64 hardware platforms What’s BWFS

Centralized Management –Web based management for the storage appliances and the storage sub-system –Integrated client management with Service on Demand system. Online extension –Add storage appliances to increase the capacity without stopping the application –The new data will be automatically stripped between all the storage appliances to get a high performance.

`` Meta-DataUser Data Data Access on NFS

User-Data Meta-Data Data Access on BWFS

Bandwidth of BWFS write large files(20G per node, 1MB record size) Number of client nodes Aggregate Bandwidth (MB/s) 1SN 2SN 4SN NFS read large files(20G per node, 1MB record size) Number of client nodes Aggregate Bandwidth(MB/s) 1SN 2SN 4SN NFS

Paradigm Epos 3 (China Petrol, Xinjiang)

Paradigm Disco (China Petrol, Xinjiang)

Management Interface

Service on Demand System Initially developed as a subsystem of BWFS to provide cluster management Reduce the management work especially in the system deployments Increase the availability against the storage components fail Enable the fast schedule in large server farms with multiple clusters Boot the system directly from the BWFS storage appliance without the need of local hard disks

Traditional Cluster Deployment System 20mins

Shortcoming 1: Inefficiency in Schedule 20 mins

Shortcoming 2: Inefficiency in Maintains Hard disk errors occupy 30%-50% of all the computer system errors

Shortcoming 3: Inefficiency in Capacity A 5GB system on a 74GB hard disk The disks are getting larger and larger but the system images are keeping small to reduce deployment time

Service on Demand System Diskless boot OS by TCP/IP –Virtual SCSI disk to support Windows and Linux –Fully compatible with applications Provide high performance snapshots to support fast cloning of system images –Copy on Write when the system image is modified –Online backup system image with snapshot Automatic take over on failed clients Integrated monitor engine to support automatic schedule or adaptive computing (still in researching)

Service on Demand System Service 1 Service 2 Service N Map to Local Disk Network

系统 Web 系统 Paradigm Image Fast Deployment and Schedule CGG Image Paradigm Snapshot Paradigm Snapshot Paradigm Snapshot Paradigm Snapshot Paradigm Services CGG Services Paradigm Snapshot CGG Snapshot

System Image System Snapshot Easy to maintain System Snapshot System Snapshot System Snapshot System Snapshot Maintenance

Management UI

Thanks 谢谢!