E6200, Fall 07, Oct 24Ambale: CMP1 Bharath Ambale Venkatesh 10/24/2007.

Slides:



Advertisements
Similar presentations
Multi-core processors. 2 Processor development till 2004 Out-of-order Instruction scheduling Out-of-order Instruction scheduling.
Advertisements

The Interaction of Simultaneous Multithreading processors and the Memory Hierarchy: some early observations James Bulpin Computer Laboratory University.
Lecture 6: Multicore Systems
Intel Multi-Core Technology. New Energy Efficiency by Parallel Processing – Multi cores in a single package – Second generation high k + metal gate 32nm.
Structure of Computer Systems
MULTICORE PROCESSOR TECHNOLOGY.  Introduction  history  Why multi-core ?  What do you mean by multicore?  Multi core architecture  Comparison of.
June 30th, 2006 ICS’06 -- Håkan Zeffer: Håkan Zeffer Zoran Radovic Martin Karlsson Erik Hagersten Uppsala University Sweden TMA A Trap-Based.
Single-Chip Multiprocessor Nirmal Andrews. Case for single chip multiprocessors Advances in the field of integrated chip processing. - Gate density (More.
1 COMP 206: Computer Architecture and Implementation Montek Singh Mon, Dec 5, 2005 Topic: Intro to Multiprocessors and Thread-Level Parallelism.
Trevor Burton6/19/2015 Multiprocessors for DSP SYSC5603 Digital Signal Processing Microprocessors, Software and Applications.
Chapter 17 Parallel Processing.
1 Pipelining for Multi- Core Architectures. 2 Multi-Core Technology Single Core Dual CoreMulti-Core + Cache + Cache Core 4 or more cores.
How Multi-threading can increase on-chip parallelism
Simultaneous Multithreading: Multiplying Alpha Performance Dr. Joel Emer Principal Member Technical Staff Alpha Development Group Compaq.
1 Lecture 10: ILP Innovations Today: ILP innovations and SMT (Section 3.5)
Single-Chip Multi-Processors (CMP) PRADEEP DANDAMUDI 1 ELEC , Fall 08.
Multi-core Processing The Past and The Future Amir Moghimi, ASIC Course, UT ECE.
Joram Benham April 2,  Introduction  Motivation  Multicore Processors  Overview, CELL  Advantages of CMPs  Throughput, Latency  Challenges.
Computer performance.
Hyper-Threading, Chip multiprocessors and both Zoran Jovanovic.
Computer System Architectures Computer System Software
Semiconductor Memory 1970 Fairchild Size of a single core –i.e. 1 bit of magnetic core storage Holds 256 bits Non-destructive read Much faster than core.
8 – Simultaneous Multithreading. 2 Review from Last Time Limits to ILP (power efficiency, compilers, dependencies …) seem to limit to 3 to 6 issue for.
Simultaneous Multithreading: Maximizing On-Chip Parallelism Presented By: Daron Shrode Shey Liggett.
Multi Core Processor Submitted by: Lizolen Pradhan
Multi-core architectures. Single-core computer Single-core CPU chip.
Multi-Core Architectures
1 Multi-core processors 12/1/09. 2 Multiprocessors inside a single chip It is now possible to implement multiple processors (cores) inside a single chip.
Lecture 11 Multithreaded Architectures Graduate Computer Architecture Fall 2005 Shih-Hao Hung Dept. of Computer Science and Information Engineering National.
Parallel Processing - introduction  Traditionally, the computer has been viewed as a sequential machine. This view of the computer has never been entirely.
Nicolas Tjioe CSE 520 Wednesday 11/12/2008 Hyper-Threading in NetBurst Microarchitecture David Koufaty Deborah T. Marr Intel Published by the IEEE Computer.
SJSU SPRING 2011 PARALLEL COMPUTING Parallel Computing CS 147: Computer Architecture Instructor: Professor Sin-Min Lee Spring 2011 By: Alice Cotti.
Niagara: a 32-Way Multithreaded SPARC Processor
Outline  Over view  Design  Performance  Advantages and disadvantages  Examples  Conclusion  Bibliography.
Super computers Parallel Processing By Lecturer: Aisha Dawood.
SYNAR Systems Networking and Architecture Group CMPT 886: Computer Architecture Primer Dr. Alexandra Fedorova School of Computing Science SFU.
Shashwat Shriparv InfinitySoft.
SIMULTANEOUS MULTITHREADING Ting Liu Liu Ren Hua Zhong.
Multi-core processors. 2 Processor development till 2004 Out-of-order Instruction scheduling Out-of-order Instruction scheduling.
© Wen-mei Hwu and S. J. Patel, 2005 ECE 511, University of Illinois Lecture 4: Microarchitecture: Overview and General Trends.
Floating Point Numbers & Parallel Computing. Outline Fixed-point Numbers Floating Point Numbers Superscalar Processors Multithreading Homogeneous Multiprocessing.
MULTICORE PROCESSOR TECHNOLOGY.  Introduction  history  Why multi-core ?  What do you mean by multicore?  Multi core architecture  Comparison of.
HyperThreading ● Improves processor performance under certain workloads by providing useful work for execution units that would otherwise be idle ● Duplicates.
Computer performance issues* Pipelines, Parallelism. Process and Threads.
Hybrid Multi-Core Architecture for Boosting Single-Threaded Performance Presented by: Peyman Nov 2007.
Computer Organization CS224 Fall 2012 Lesson 52. Introduction  Goal: connecting multiple computers to get higher performance l Multiprocessors l Scalability,
By Kunle Olukotun, BasemA. Nayfeh, Lance Hammond, Ken Wilson, and Kunyung Chang Presented by Dheeraj Kumar Kaveti.
Computer Structure 2015 – Intel ® Core TM μArch 1 Computer Structure Multi-Threading Lihu Rappoport and Adi Yoaz.
SYNAR Systems Networking and Architecture Group CMPT 886: Computer Architecture Primer Dr. Alexandra Fedorova School of Computing Science SFU.
Introduction Goal: connecting multiple computers to get higher performance – Multiprocessors – Scalability, availability, power efficiency Job-level (process-level)
Carnegie Mellon /18-243: Introduction to Computer Systems Instructors: Anthony Rowe and Gregory Kesden 27 th (and last) Lecture, 28 April 2011 Multi-Core.
Processor Performance & Parallelism Yashwant Malaiya Colorado State University With some PH stuff.
Computer Architecture: Multi-Core Processors: Why? Prof. Onur Mutlu Carnegie Mellon University.
Fall 2012 Parallel Computer Architecture Lecture 4: Multi-Core Processors Prof. Onur Mutlu Carnegie Mellon University 9/14/2012.
COMP 740: Computer Architecture and Implementation
Parallel Processing - introduction
Lynn Choi School of Electrical Engineering
Simultaneous Multithreading
Multi-core processors
Computer Structure Multi-Threading
/ Computer Architecture and Design
Multi-Core Computing Osama Awwad Department of Computer Science
Hyperthreading Technology
Superscalar Pipelines Part 2
Levels of Parallelism within a Single Processor
Computer Architecture Lecture 4 17th May, 2006
Hardware Multithreading
Simultaneous Multithreading in Superscalar Processors
CSC3050 – Computer Architecture
Levels of Parallelism within a Single Processor
Presentation transcript:

E6200, Fall 07, Oct 24Ambale: CMP1 Bharath Ambale Venkatesh 10/24/2007

E6200, Fall 07, Oct 24Ambale: CMP2 Trends Increasing processor size/speed leads to increase in power required Increasing heat generated leads to need of cooling components Increase in network speeds is much slower than increase in processor speeds Future applications are becoming more and more parallel – multimedia, face recognition, voice recognition, etc… Future applications also becoming data intensive

E6200, Fall 07, Oct 24Ambale: CMP3 Instruction-level parallelism (ILP) Re-ordering of instructions so that they can be executed in parallel Pipelining Superscalars Maximum of 6-10 instructions per cycle for real applications Bottlenecks: Branch prediction

E6200, Fall 07, Oct 24Ambale: CMP4 Thread-level parallelism (TLP) Multiple threads spawned from same process (SMT) Loop-level parallelism Threads interact with each other Pentium 4 uses hyper- threading Bottleneck: Memory cache

E6200, Fall 07, Oct 24Ambale: CMP5 Process-level Parallelism (PLP) Run multiple independent processes controlled by the OS Symmetric Multiprocessors (SMP) : multiple independent processors connected by a network (cluster) Bottleneck: network

E6200, Fall 07, Oct 24Ambale: CMP6 Memory Access Cache miss: when data has to fetched from main memory Cache miss in superscalars leads to significant delay SMT leads to multiple processes accessing a shared cache – a cache is pushed to have more ports Memory bandwidth is a problem

E6200, Fall 07, Oct 24Ambale: CMP7 CMP Introduce multiple supersacalar processors each capable of running multiple threads Each processor has individual cache and also has a shared cache Processors need not be homogenous

E6200, Fall 07, Oct 24Ambale: CMP8 CMP Advantages Multi-tasking Shorter signal path Less-power consumed Memory bandwidth is not the limiting problem Disadvantages Specialized software to utilize multithreading Thermal management is more difficult Commercial CMP’s Intel and AMD’s dual-core, quad-core,..

E6200, Fall 07, Oct 24Ambale: CMP9 Cell Processor

E6200, Fall 07, Oct 24Ambale: CMP10 References "A D&T Roundtable: Are Single-Chip Multiprocessors in Reach?," IEEE Design and Test of Computers,vol. 18, no. 1, pp , January/February, Wenbin Yao, Dongsheng Wang, Weimin Zheng, Songliu Guo. “Current Trends in High Performance Computing and Its Applications.” Architecture Design of a Single-chip Multiprocessor, pp , L. Hammond, B. Nayfeh, and K. Olukotun. "A single-chip multiprocessor." IEEE Computer, vol. 30, no. 9, pp , September 1997.