3/12/2013Computer Engg, IIT(BHU)1 INTRODUCTION-3.

Slides:



Advertisements
Similar presentations
High Performance Computing Course Notes Grid Computing.
Advertisements

Introduction CSCI 444/544 Operating Systems Fall 2008.
Seminar Grid Computing ‘05 Hui Li Sep 19, Overview Brief Introduction Presentations Projects Remarks.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Slides for Grid Computing: Techniques and Applications by Barry Wilkinson, Chapman & Hall/CRC press, © Chapter 1, pp For educational use only.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
Workload Management Massimo Sgaravatto INFN Padova.
1 GRID D. Royo, O. Ardaiz, L. Díaz de Cerio, R. Meseguer, A. Gallardo, K. Sanjeevan Computer Architecture Department Universitat Politècnica de Catalunya.
Cross Cluster Migration Remote access support Adianto Wibisono supervised by : Dr. Dick van Albada Kamil Iskra, M. Sc.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
DISTRIBUTED COMPUTING
CoG Kit Overview Gregor von Laszewski Keith Jackson.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
Grids and Portals for VLAB Marlon Pierce Community Grids Lab Indiana University.
What are the main differences and commonalities between the IS and DA systems? How information is transferred between tasks: (i) IS it may be often achieved.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Evaluation of Agent Teamwork High Performance Distributed Computing Middleware. Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007.
The Globus Project: A Status Report Ian Foster Carl Kesselman
The Anatomy of the Grid Mahdi Hamzeh Fall 2005 Class Presentation for the Parallel Processing Course. All figures and data are copyrights of their respective.
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
Middleware for Grid Computing and the relationship to Middleware at large ECE 1770 : Middleware Systems By: Sepehr (Sep) Seyedi Date: Thurs. January 23,
Perspectives on Grid Technology Ian Foster Argonne National Laboratory The University of Chicago.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
Commodity Grid Kits Gregor von Laszewski (ANL), Keith Jackson (LBL) Many state-of-the-art scientific applications, such as climate modeling, astrophysics,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
The Earth System Grid (ESG) Computer Science and Technologies DOE SciDAC ESG Project Review Argonne National Laboratory, Illinois May 8-9, 2003.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
ISERVOGrid Architecture Working Group Brisbane Australia June Geoffrey Fox Community Grids Lab Indiana University
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
GCRC Meeting 2004 BIRN Coordinating Center Software Development Vicky Rowley.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
1 Observations on Architecture, Protocols, Services, APIs, SDKs, and the Role of the Grid Forum Ian Foster Carl Kesselman Steven Tuecke.
7. Grid Computing Systems and Resource Management
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Globus Grid Tutorial Part 2: Running Programs Across Multiple Resources.
2/22/2001Greenbook 2001/OASCR1 Greenbook/OASCR Activities Focus on technology to enable SCIENCE to be conducted, i.e. Software tools Software libraries.
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
© Copyright AARNet Pty Ltd PRAGMA Update & some personal observations James Sankar Network Engineer - Middleware.
Background Computer System Architectures Computer System Software.
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Data Infrastructure in the TeraGrid Chris Jordan Campus Champions Presentation May 6, 2009.
System Software Laboratory Databases and the Grid by Paul Watson University of Newcastle Grid Computing: Making the Global Infrastructure a Reality June.
INTRODUCTION TO HIGH PERFORMANCE COMPUTING AND TERMINOLOGY.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Workload Management Workpackage
Clouds , Grids and Clusters
A. Rama Bharathi Regd. No: 08931F0040 III M.C.A
Grid Computing.
University of Technology
Grid Computing B.Ramamurthy 9/22/2018 B.Ramamurthy.
Parallel and Multiprocessor Architectures – Shared Memory
The Anatomy and The Physiology of the Grid
The Anatomy and The Physiology of the Grid
Presentation transcript:

3/12/2013Computer Engg, IIT(BHU)1 INTRODUCTION-3

Distributed Computing ● Concept has been used for two decades ● Basic idea: run scheduler across systems to runs processes on least-used systems first ➔ Maximize utilization ➔ Minimize turnaround time ● Have to load executable and input files to selected resource ➔ Shared file system ➔ File transfers upon resource selection

Distributed vs. Parallel Computing ● Different ➔ Distributed computing executes independent (but possibly related) applications on different systems; jobs do not communicate with each other ➔ Parallel computing executes a single application across processors, distributing the work and/or data but allowing communication between processes ● Non-exclusive: can distribute parallel applications to parallel computing system s

Grid Computing ● Enable communities (“virtual organizations”) to share geographically distributed resources as they pursue common goals—in the absence of central control, omniscience, trust relationships. ● Resources (HPC systems, visualization systems & displays, storage systems, sensors, instruments, people) are integrated via ‘middleware’ to facilitate use of all resources.

Grid Possibilities ● A biochemist exploits 10,000 computers to screen 100,000 compounds in an hour ● 1,000 physicists worldwide pool resources for petaflop analyses of petabytes of data ● Civil engineers collaborate to design, execute, & analyze shake table experiments ● Climate scientists visualize, annotate, & analyze terabyte simulation datasets ● An emergency response team couples real time data, weather model, population data

Grid Usage Models ● Distributed computing: job scheduling on Grid resources with secure, automated data transfer ● Workflow: synchronized scheduling and automated data transfer from one system to next in pipeline (e.g. HPC system to visualization lab to storage system) ● Coupled codes, with pieces running on different systems simultaneously ● Meta-applications: parallel apps spanning multiple systems

Grid Usages Models ● Some models are similar to models already being used, but are much simpler due to: ➔ single sign-on ➔ automatic process scheduling ➔ automated data transfers ● But Grids can encompass new resources likes sensors and instruments, so new usage models will arise

Grid Requirements ● Single allocation (or none needed) ● Single sign-on: authentication to any Grid resources authenticates for all others ● Single compute space: one scheduler for all Grid resources ● Single data space: can address files and data from any Grid resources ● Single development environment: Grid tools and libraries that work on all grid resources

The Security Problem ● Resources being used may be extremely valuable & the problems being solved extremely sensitive ● Resources are often located in distinct administrative domains ➔ Each resource may have own policies & procedures ● The set of resources used by a single computation may be large, dynamic, and/or unpredictable ➔ Not just client/server ● It must be broadly available & applicable ➔ Standard, well-tested, well-understood protocols ➔ Integration with wide variety of tools

The Resource Management Problem ● Enabling secure, controlled remote access to computational resources and management of remote computation ➔ Authentication and authorization ➔ Resource discovery & characterization ➔ Reservation and allocation ➔ Computation monitoring and control

The Grid Systems Technologies ● Systems and security problems addressed by new protocols & services. E.g., Globus: ➔ Grid Security Infrastructure (GSI) for security ➔ Globus Metadata Directory Service (MDS) for discovery ➔ Globus Resource Allocations Manager (GRAM) protocol as a basic building block Resource brokering & co-allocation services ➔ GridFTP for data movement

The Programming Problem ● How does a user develop robust, secure, long-lived applications for dynamic, heterogeneous, Grids? ● Presumably need: ➔ Abstractions and models to add to speed/robustness/etc. of development ➔ Tools to ease application development and diagnose common problems ➔ Code/tool sharing to allow reuse of code components developed by others

Grid Programming Technologies ● “Grid applications” are incredibly diverse (data, collaboration, computing, sensors, …) ➔ Seems unlikely there is one solution ● Most applications have been written “from scratch,” with or without Grid services ● Application-specific libraries have been shown to provide significant benefits ● No new language, programming model, etc., has yet emerged that transforms things ➔ But certainly still quite possible

Examples ● MPICH-G2: Grid-enabled message passing ● CoG Kits, GridPort: Portal construction, based on N-tier architectures ● GDMP, Data Grid Tools, SRB: replica management, collection management ● Condor-G: simple workflow management ● Legion: object models for Grid computing ● Cactus: Grid-aware numerical solver framework ➔ Note tremendous variety, application focus

MPICH-G2 ● A complete implementation of the Message Passing Interface (MPI) for heterogeneous, wide area environments ➔ Based on the Argonne MPICH implementation of MPI (Gropp and Lusk) ● Globus services for authentication, resource allocation, executable staging, output, etc. ● Programs run in wide area without change! ● See also: MetaMPI, PACX, STAMPI, MAGPIE

Grid Events ● Global Grid Forum: working meeting ➔ Meets 3 times/year, alternates U.S.-Europe, with July meeting as major event ● HPDC: major academic conference ➔ HPDC-11 in Scotland with GGF-8, July 2002 ● Other meetings include ➔ IPDPS, CCGrid, EuroGlobus, Globus Retreats

Future Trends in HPC ● Monitoring and understanding future trends in HPC is important: ➔ users: applications should be written to be efficient on current and future architectures ➔ developers: tools should be written to be efficient on current and future architectures ➔ computing centers: system purchases are expensive and should have upgrade paths

Commercialization ● Computing technologies (including HPC) are now propelled by profits, not sustained by subsidies ➔ Web servers, databases, transaction processing and especially multimedia applications drive the need for computational performance. ➔ Most HPC systems are ‘scaled up’ commercial systems, with less additional hardware and software compared to commercial systems. ➔ It’s not engineering, it’s economics.

Processors and Nodes ● Easy predictions: ➔ microprocessors performance increase continues at ~60% per year (Moore’s Law) for 5+ years. ➔ total migration to 64-bit microprocessors ➔ use of even more cache, more memory hierarchy. ➔ increased emphasis on SMPs ● Tougher predictions: ➔ resurgence of vectors in microprocessors? Maybe ➔ dawn of multithreading in microprocessors? Yes

SMPs ● More processors are faster, of course ➔ SMPs are simplest form of parallel systems ➔ efficient if not limited by memory bus contention: small numbers of processors ● Commercial market for high performance servers at low cost drives need for SMPs ● HPC market for highest performance, ease of programming drives development of SMPs

SMPs ● Trends are to: ➔ build bigger SMPs ➔ attempt to share memory across SMPs (cc- NUMA)

Web-based Grid Computing ● Web currently used mostly for content delivery ● Web servers on HPC systems can execute applications ● Web servers on Grids can launch applications, move/store/retrieve data, display visualizations, etc. ● NPACI HotPage already enables single sign-on to NPACI Grid Resources

Summary ● HPC systems will grow in performance but probably change little in design (5-10 years): ➔ HPC systems will be larger versions of smaller commercial systems, mostly large SMPs and clusters of inexpensive nodes ➔ Some processors will exploit vectors, as well as more/larger caches. ➔ Best HPC systems will have been designed ‘top-down’ instead of ‘bottom-up’, but all will have been designed to make the ‘bottom’ profitable. ➔ Multithreading is the only likely, near-term major architectural change.

Summary ● Using HPC systems will change much more: ➔ Grid computing will become widespread in HPC and in commercial computing ➔ Visual supercomputing and collaborative simulation will be commonplace. ➔ WWW interfaces to HPC resources will make transparent supercomputing commonplace. ● But programming the most powerful resources most effectively will remain difficult.