Course Wrap-Up Miodrag Bolic CEG4136. What was covered Interconnection network topologies and performance Shared-memory architectures Message passing.

Slides:



Advertisements
Similar presentations
Multiple Processor Systems
Advertisements

Shantanu Dutt Univ. of Illinois at Chicago
SE-292 High Performance Computing
Multiprocessors— Large vs. Small Scale Multiprocessors— Large vs. Small Scale.
Multiple Processor Systems
Parallel System Performance CS 524 – High-Performance Computing.
Parallel Architectures: Topologies Heiko Schröder, 2003.
History of Distributed Systems Joseph Cordina
Parallel Architectures: Topologies Heiko Schröder, 2003.
CSCI 8150 Advanced Computer Architecture Hwang, Chapter 2 Program and Network Properties 2.4 System Interconnect Architectures.
1 Lecture 23: Interconnection Networks Topics: communication latency, centralized and decentralized switches (Appendix E)
Parallel Computing Platforms
Course Outline Introduction in algorithms and applications Parallel machines and architectures Overview of parallel machines, trends in top-500 Cluster.
1 CSE SUNY New Paltz Chapter Nine Multiprocessors.
Parallel System Performance CS 524 – High-Performance Computing.
1 Lecture 25: Interconnection Networks Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix E) Review session,
Parallel Computer Architectures
1 Static Interconnection Networks CEG 4131 Computer Architecture III Miodrag Bolic.
ECE669 L16: Interconnection Topology March 30, 2004 ECE 669 Parallel Computer Architecture Lecture 16 Interconnection Topology.
Lecture 37: Chapter 7: Multiprocessors Today’s topic –Introduction to multiprocessors –Parallelism in software –Memory organization –Cache coherence 1.
MULTICOMPUTER 1. MULTICOMPUTER, YANG DIPELAJARI Multiprocessors vs multicomputers Interconnection topologies Switching schemes Communication with messages.
Performance and Power Efficient On-Chip Communication Using Adaptive Virtual Point-to-Point Connections M. Modarressi, H. Sarbazi-Azad, and A. Tavakkol.
Chapter 5 Array Processors. Introduction  Major characteristics of SIMD architectures –A single processor(CP) –Synchronous array processors(PEs) –Data-parallel.
Interconnect Network Topologies
1 Lecture 23: Interconnection Networks Topics: Router microarchitecture, topologies Final exam next Tuesday: same rules as the first midterm Next semester:
Interconnect Networks
Network Topologies Topology – how nodes are connected – where there is a wire between 2 nodes. Routing – the path a message takes to get from one node.
A brief overview about Distributed Systems Group A4 Chris Sun Bryan Maden Min Fang.
Multiple Processor Systems. Multiprocessor Systems Continuous need for faster and powerful computers –shared memory model ( access nsec) –message passing.
CS668- Lecture 2 - Sept. 30 Today’s topics Parallel Architectures (Chapter 2) Memory Hierarchy Busses and Switched Networks Interconnection Network Topologies.
1 Interconnects Shared address space and message passing computers can be constructed by connecting processors and memory unit using a variety of interconnection.
CSE Advanced Computer Architecture Week-11 April 1, 2004 engr.smu.edu/~rewini/8383.
August 15, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 12: Multiprocessors: Non-Uniform Memory Access * Jeremy R. Johnson.
Dynamic Interconnect Lecture 5. COEN Multistage Network--Omega Network Motivation: simulate crossbar network but with fewer links Components: –N.
Computer Architecture Distributed Memory MIMD Architectures Ola Flygt Växjö University
1 Dynamic Interconnection Networks Miodrag Bolic.
Chapter 6 Multiprocessor System. Introduction  Each processor in a multiprocessor system can be executing a different instruction at any time.  The.
Multiprocessor Interconnection Networks Todd C. Mowry CS 740 November 3, 2000 Topics Network design issues Network Topology.
Switches and indirect networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0.
Anshul Kumar, CSE IITD CSL718 : Multiprocessors Interconnection Mechanisms Performance Models 20 th April, 2006.
Computer Science and Engineering Parallel and Distributed Processing CSE 8380 January Session 4.
Course Outline Introduction in algorithms and applications Parallel machines and architectures Overview of parallel machines, trends in top-500, clusters,
1 Introduction CEG 4131 Computer Architecture III Miodrag Bolic.
Chapter 8-2 : Multicomputers Multiprocessors vs multicomputers Multiprocessors vs multicomputers Interconnection topologies Interconnection topologies.
Anshul Kumar, CSE IITD ECE729 : Advanced Computer Architecture Lecture 27, 28: Interconnection Mechanisms In Multiprocessors 29 th, 31 st March, 2010.
Birds Eye View of Interconnection Networks
1 Interconnection Networks. 2 Interconnection Networks Interconnection Network (for SIMD/MIMD) can be used for internal connections among: Processors,
Parallel Programming Sathish S. Vadhiyar. 2 Motivations of Parallel Computing Parallel Machine: a computer system with more than one processor Motivations.
YEAR 2006 The University of Auckland | New Zealand PRESENTATION Computer Science 703 Advance Computer Architecture 2006 Semester 1 Preparation for Test.
Interconnect Networks Basics. Generic parallel/distributed system architecture On-chip interconnects (manycore processor) Off-chip interconnects (clusters.
1 Introduction ELG 6158 Digital Systems Architecture Miodrag Bolic.
Super computers Parallel Processing
Computer Science and Engineering Copyright by Hesham El-Rewini Advanced Computer Architecture CSE 8383 May 2, 2006 Session 29.
Lecture 27 Multiprocessor Scheduling. Last lecture: VMM Two old problems: CPU virtualization and memory virtualization I/O virtualization Today Issues.
Spring EE 437 Lillevik 437s06-l22 University of Portland School of Engineering Advanced Computer Architecture Lecture 22 Distributed computer Interconnection.
LECTURE #1 INTRODUCTON TO PARALLEL COMPUTING. 1.What is parallel computing? 2.Why we need parallel computing? 3.Why parallel computing is more difficult?
CDA-5155 Computer Architecture Principles Fall 2000 Multiprocessor Architectures.
Computer Science and Engineering Parallel and Distributed Processing CSE 8380 April 28, 2005 Session 29.
Corse Overview Miodrag Bolic ELG7187 Topics in Computers: Multiprocessor Systems on Chip.
COMP8330/7330/7336 Advanced Parallel and Distributed Computing Tree-Based Networks Cache Coherence Dr. Xiao Qin Auburn University
Lynn Choi School of Electrical Engineering
Parallel Architecture
Auburn University COMP8330/7330/7336 Advanced Parallel and Distributed Computing Interconnection Networks (Part 2) Dr.
Lecture 23: Interconnection Networks
Connection System Serve on mutual connection processors and memory .
High Performance Computing & Bioinformatics Part 2 Dr. Imad Mahgoub
CS 6290 Many-core & Interconnect
Birds Eye View of Interconnection Networks
Chip&Core Architecture
Presentation transcript:

Course Wrap-Up Miodrag Bolic CEG4136

What was covered Interconnection network topologies and performance Shared-memory architectures Message passing architectures Scheduling Multicores and networks on chip Cloud computing Elements of system-on-chip design in Labs and sometimes during lectures Basic elements of parallel programming

What was not covered Parallel programming Operating systems Advanced manycore architectures

Final Format Duration: 3 hours Format: –1 theoretical problem –4 problems with multiple parts

Final Format Closed book, closed notes exam. No cheat sheet Only material cover in the class, DGDs, assignments and labs will be on the exam.

Complaining You can come to see the exam on –Fri (Dec 23rd) 13:00-14:00 To complain: you will fill the form and I will notify you about the decision.

Things to Review Assignments Lecture notes Quizzes from 2005, 2006 and 2007

Format 1.Theoretical question 2.Interconnection networks 3.Message passing and scheduling 4 Shared memory systems 5 Advanced architectures or scheduling

Example of the theoretical question for final Type of questions: Compare Define and explain What to study: Everything

Dynamic Interconnection Networks Properties –Network latency –Hardware complexity –Blocking/Nonblocking Switches –Permutations and legitimate states Multistage Interconnection networks –Omega network: »topology, »number of switches, stages and permutations, »routing protocol Crossbar

Static Interconnection Networks Network properties –Node degree d –Diameter D –Bisection width Complete Star Tree Linear array Ring Mesh Torus Hypercube –routing protocol k-ary n-cubes To prepare for dynamic and static interconnection networks use –slides, –assignment and

Message passing Message Passing Properties Store-and-forward routing Wormhole routing Virtual channels Deterministic routing algorithms Deadlocks

Scheduling Dependence graph Scheduling without considering communication –Scheduling inforest/outforest task graphs Heuristic algorithms –Communication Delay versus Parallelism –Clustering –Node duplication

Shared Memory Systems Cache coherence policies –Snooping protocols –Directory protocols

Parallel programing Parallel addition and matrix multiplication on shared memory and message passing systems Again – you will be required to modify the given program

Performance Amdahl law Speedup, Efficiency Parallelism profile, average parallelism, MIPS Scalability Understanding of performance of the program for parallel addition Classification of parallel processing systems

Embedded multicores Review terminology –Symmetric and Asymetric processing –Virtualization and hypervisor –Cache stashing –Run to completion –Posix and OpenMP Example of router implementation using processor cores

Cache coherence for manycores Coherence Bandwidth Requirements Broadcast vs. Directory Protocols Read hit and read miss procedure for: –Private L2 caches –Shared L2 caches

Router design for manycores Architecture of the virtual channel router Pipeline stages –Lookahead routing Buffer Organization Switch organization Arbiters and allocators –Round-robin arbiter

Cloud computing Terminology Services Advantages Comparison with grid and high-performance computing