Coe818 Advanced Computer Architecture

Slides:



Advertisements
Similar presentations
Lecture 4 Introduction to Digital Signal Processors (DSPs) Dr. Konstantinos Tatas.
Advertisements

Streaming SIMD Extension (SSE)
CS136, Advanced Architecture Limits to ILP Simultaneous Multithreading.
Instruction-Level Parallel Processors {Objective: executing two or more instructions in parallel} 4.1 Evolution and overview of ILP-processors 4.2 Dependencies.
Lecture 8 Dynamic Branch Prediction, Superscalar and VLIW Advanced Computer Architecture COE 501.
CPE 731 Advanced Computer Architecture ILP: Part V – Multiple Issue Dr. Gheith Abandah Adapted from the slides of Prof. David Patterson, University of.
CSE 490/590, Spring 2011 CSE 490/590 Computer Architecture VLIW Steve Ko Computer Sciences and Engineering University at Buffalo.
Anshul Kumar, CSE IITD CS718 : VLIW - Software Driven ILP Introduction 23rd Mar, 2006.
Dynamic Branch PredictionCS510 Computer ArchitecturesLecture Lecture 10 Dynamic Branch Prediction, Superscalar, VLIW, and Software Pipelining.
1 ILP (Recap). 2 Basic Block (BB) ILP is quite small –BB: a straight-line code sequence with no branches in except to the entry and no branches out except.
1 Advanced Computer Architecture Limits to ILP Lecture 3.
Chapter 4 Advanced Pipelining and Intruction-Level Parallelism Computer Architecture A Quantitative Approach John L Hennessy & David A Patterson 2 nd Edition,
EECC551 - Shaaban #1 Fall 2005 lec# Static Compiler Optimization Techniques We examined the following static ISA/compiler techniques aimed.
Single-Chip Multiprocessor Nirmal Andrews. Case for single chip multiprocessors Advances in the field of integrated chip processing. - Gate density (More.
1 COMP 206: Computer Architecture and Implementation Montek Singh Mon, Dec 5, 2005 Topic: Intro to Multiprocessors and Thread-Level Parallelism.
Instruction Level Parallelism (ILP) Colin Stevens.
12/1/2005Comp 120 Fall December Three Classes to Go! Questions? Multiprocessors and Parallel Computers –Slides stolen from Leonard McMillan.
RISC. Rational Behind RISC Few of the complex instructions were used –data movement – 45% –ALU ops – 25% –branching – 30% Cheaper memory VLSI technology.
Intel Architecture. Changes in architecture Software architecture: –Front end (Feature changes such as adding more graphics, changing the background colors,
Semiconductor Memory 1970 Fairchild Size of a single core –i.e. 1 bit of magnetic core storage Holds 256 bits Non-destructive read Much faster than core.
Simultaneous Multithreading: Maximizing On-Chip Parallelism Presented By: Daron Shrode Shey Liggett.
1 Multi-core processors 12/1/09. 2 Multiprocessors inside a single chip It is now possible to implement multiple processors (cores) inside a single chip.
VTU – IISc Workshop Compiler, Architecture and HPC Research in Heterogeneous Multi-Core Era R. Govindarajan CSA & SERC, IISc
Performance of mathematical software Agner Fog Technical University of Denmark
Hyper Threading (HT) and  OPs (Micro-Operations) Department of Computer Science Southern Illinois University Edwardsville Summer, 2015 Dr. Hiroshi Fujinoki.
SYNAR Systems Networking and Architecture Group CMPT 886: Computer Architecture Primer Dr. Alexandra Fedorova School of Computing Science SFU.
Spring 2003CSE P5481 Midterm Philosophy What the exam looks like. Definitions, comparisons, advantages & disadvantages what is it? how does it work? why.
Multi-core processors. 2 Processor development till 2004 Out-of-order Instruction scheduling Out-of-order Instruction scheduling.
CENTRAL PROCESSING UNIT. CPU Does the actual processing in the computer. A single chip called a microprocessor. Composed of an arithmetic and logic unit.
Final Review Prof. Mike Schulte Advanced Computer Architecture ECE 401.
Lab Activities 1, 2. Some of the Lab Server Specifications CPU: 2 Quad(4) Core Intel Xeon 5400 processors CPU Speed: 2.5 GHz Cache : Each 2 cores share.
SYNAR Systems Networking and Architecture Group CMPT 886: Computer Architecture Primer Dr. Alexandra Fedorova School of Computing Science SFU.
Lecture 1: Introduction CprE 585 Advanced Computer Architecture, Fall 2004 Zhao Zhang.
Processor Performance & Parallelism Yashwant Malaiya Colorado State University With some PH stuff.
COEN 6741 Grading Scheme ► Test#1: 30% ► Test#2: 30% ► Project: 40%
Elec/Comp 526 Spring 2015 High Performance Computer Architecture Instructor Peter Varman DH 2022 (Duncan Hall) rice.edux3990 Office Hours Tue/Thu.
Itanium® 2 Processor Architecture
CS 352H: Computer Systems Architecture
COMP 740: Computer Architecture and Implementation
Advanced Architectures
Chapter 1: Introduction
ECE 486/586 Computer Architecture Introductions Instructor and You
Computer Architecture Principles Dr. Mike Frank
CPE 731 Advanced Computer Architecture ILP: Part V – Multiple Issue
Multi-core processors
Parallel Computing Lecture
Computer Architecture Principles Dr. Mike Frank
CS203 – Advanced Computer Architecture
Lecture 10 Tomasulo’s Algorithm
Instruction Scheduling for Instruction-Level Parallelism
Array Processor.
Levels of Parallelism within a Single Processor
Computer Architecture Lecture 4 17th May, 2006
Symmetric Multiprocessing (SMP)
STUDY AND IMPLEMENTATION
Computer Architecture
CHAPTER 8: CPU and Memory Design, Enhancement, and Implementation
Mattan Erez The University of Texas at Austin
CC423: Advanced Computer Architecture ILP: Part V – Multiple Issue
EE 4xx: Computer Architecture and Performance Programming
Overview Prof. Eric Rotenberg
Mattan Erez The University of Texas at Austin
Levels of Parallelism within a Single Processor
Advanced Architecture +
CSC3050 – Computer Architecture
CS 286 Computer Organization and Architecture
The University of Adelaide, School of Computer Science
Figure 7-1: Non-Pipelined Instruction Execution vs. 2-stage Pipeline
Instruction Level Parallelism
What Are Performance Counters?
Presentation transcript:

Coe818 Advanced Computer Architecture OBJECTIVES Understand What is Used in Advanced Processor Why Industry is Using Multi-Core Performance Limitation and Challenges Facing Computers How can Applications Utilize Advanced Features in the Chip

Multi-core How Multi-Core Communicate Why need Cache Coherency Why need Synchronization The size of L3 in high end and low end CPUs are quite different.

What is Inside Single Core: branch predictor, dynamic scheduling and execution unit This break down is also an approximation.

What is inside a single processor -Parallelism with Pipelining: implementation, hazards, multi execution units and dynamic scheduling -Instruction Level Parallelism: Loop unrolling, Superscalar and VLIW -Parallel Single operation on multiple data: Vector Operations and MMX (SIMD) -Speculative Execution of Instructions: Branch Prediction