Report on Communication Architecture for Clusters (CAC) Workshop
Dhabaleswar K. (DK) Panda
Department of Computer and Info. Science, The Ohio State University

Objectives
Clusters are being targeted for high-end computing as well as high-performance servers
Requires
–High-performance communication and I/O subsystems
–Low-overhead programming environment support
–Support for QoS for emerging applications
–Study of the impact of emerging networking technologies and standards (VIA, InfiniBand)
Goals: Bring together researchers and practitioners from academia, industry, research labs, and national labs to discuss solutions as well as future trends for designing scalable, high-performance, and cost-effective communication and I/O architectures for clusters

Organization
Started in 2001
Takes place in conjunction with the Int’l Parallel and Distributed Processing Symposium (IPDPS)
–CAC ’01 (IPDPS ’01) in San Francisco, April 2001
–CAC ’02 (IPDPS ’02) in Ft. Lauderdale, April 2002
–CAC ’03 (IPDPS ’03) in Nice, France, April 2003
Co-organized by
–D. K. Panda (OSU)
–Jose Duato (Univ. of Valencia, Spain)
–Craig Stunkel (IBM TJ Watson)

Organization
Follow-up to the Communication and Architectural Support for Network-Based Parallel Computing (CANPC) Workshops, held in conjunction with HPCA conference from
Focuses on interaction between presenters and audience
Thanks to many people in this room
–PC members
–Keynote Speakers
–Authors
–Session Chairs
–Panel Moderators
–Panelists
Please do not blame me if your paper was not accepted
–We (organizers) simply followed the recommendations of the PC members

CAC ’01
14 papers accepted out of 24 submissions
Grouped into four sessions
–Low-level Messaging
–Interconnection and Communication
–Switch/Router and NIC Support
–Network Services and Communication
Keynote Talk by Prof. Thomas Sterling: “Commodity Clusters: The Third Wave for High Performance Computing”
–Focused on multiple issues related to cluster computing
–Emphasized next-generation cluster computing in space (using computers in satellites)

CAC ’01 (Cont’d)
Panel Session: “InfiniBand: The de-facto standard for system and local area networks or just a scalable replacement for PCI Buses?”
–Moderator: Timothy Pinkston (USC)
–Panelists: Jose Duato (Univ. of Valencia, Spain); Michael Krause (HP); Irving Robinson (Intel); Thomas Sterling (Caltech and JPL); Madhu Talluri (Sun); Alan Benner (IBM)
Six papers and a copy of the panel report, after another round of review, have been selected to appear in a special issue of the Cluster Computing journal, April 2003
Was attended by around people

CAC ’02
11 papers accepted out of 19 submissions (fewer submissions due to 9/11)
Grouped into four sessions
–Routing and Switching
–Remote Memory Communication
–I/O and NIC Support
–InfiniBand
Keynote Talk by Prof. Tony Skjellum: “Explicit Parallel Programming with Message Passing Interfaces: Legacy, Longevity, Optimizability, Evolvability”
–Focused on multiple issues related to past and current development of MPI
–Emphasized multiple aspects of optimizing MPI implementations

CAC ’02 (Cont’d)
Panel Session: “Cluster Interconnects Crystal Ball: Which will win in 2006?”
–Moderator: Craig Stunkel (IBM TJ Watson)
–Panelists: David Addison (Quadrics); Kevin Deierling (Mellanox); Patrick Geoffray (Myricom); Shubu Mukherjee (Intel); Renato J. Recio (IBM)
Was also attended by around people

CAC ’03
13 papers accepted out of 37 submissions
Grouped into five sessions
–Communications Hardware
–Network Interfaces and Collective Communication
–Communication Libraries
–System Services
–Performance Evaluation
Keynote Talk by Prof. Dan Reed: “Clusters: Challenges and Opportunities”
Panel Session: “Top 3 technologies that are limiting cluster interconnects”
–Moderator: Ron Brightwell (Sandia)
–Panelists: to be decided

Web Pointers
CAC home page