1 ECE 734 Final Project Presentation Fall 2000 By Manoj Geo Varghese MMX Technology: An Optimization Outlook.

Slides:



Advertisements
Similar presentations
Machine cycle.
Advertisements

Intel Pentium 4 ENCM Jonathan Bienert Tyson Marchuk.
Contents Even and odd memory banks of 8086 Minimum mode operation
Fall EE 333 Lillevik 333f06-l20 University of Portland School of Engineering Computer Organization Lecture 20 Pipelining: “bucket brigade” MIPS.
Pentium microprocessors CAS 133 – Basic Computer Skills/MS Office CIS 120 – Computer Concepts I Russ Erdman.
100 Performance ENGR 3410 – Computer Architecture Mark L. Chang Fall 2006.
Microprocessors I Time: Sundays & Tuesdays 07:30 to 8:45 Place: EE 4 ( New building) Lecturer: Bijan Vosoughi Vahdat Room: VP office, NE of Uni Office.
1 Microprocessor-based Systems Course 4 - Microprocessors.
Intel Labs Labs Copyright © 2000 Intel Corporation. Fall 2000 Inside the Pentium ® 4 Processor Micro-architecture Next Generation IA-32 Micro-architecture.
Embedded Systems Programming
IA- 32 Architecture Richard Eckert Anthony Marino Matt Morrison Steve Sonntag.
CSCE101 – 4.2, 4.3 October 17, Power Supply Surge Protector –protects from power spikes which ruin hardware. Voltage Regulator – protects from insufficient.
Advanced Micro Devices - Athlon Buddy Guest Mike Lewitt Bill McCorkle November 28, 2001.
EECS 470 Superscalar Architectures and the Pentium 4 Lecture 12.
1 Pipelining for Multi- Core Architectures. 2 Multi-Core Technology Single Core Dual CoreMulti-Core + Cache + Cache Core 4 or more cores.
The Pentium 4 CPSC 321 Andreas Klappenecker. Today’s Menu Advanced Pipelining Brief overview of the Pentium 4.
Copyright © 2006, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners Intel® Core™ Duo Processor.
Intel Pentium 4 Processor Presented by Presented by Steve Kelley Steve Kelley Zhijian Lu Zhijian Lu.
Hiep Hong CS 147 Spring Intel Core 2 Duo. CPU Chronology 2.
Inside The CPU. Buses There are 3 Types of Buses There are 3 Types of Buses Address bus Address bus –between CPU and Main Memory –Carries address of where.
Intel Architecture. Changes in architecture Software architecture: –Front end (Feature changes such as adding more graphics, changing the background colors,
Semiconductor Memory 1970 Fairchild Size of a single core –i.e. 1 bit of magnetic core storage Holds 256 bits Non-destructive read Much faster than core.
Simultaneous Multithreading: Maximizing On-Chip Parallelism Presented By: Daron Shrode Shey Liggett.
Computer Organization & Assembly Language
Copyright © 2007 Heathkit Company, Inc. All Rights Reserved PC Fundamentals Presentation 27 – A Brief History of the Microprocessor.
The Arrival of the 64bit CPUs - Itanium1 นายชนินท์วงษ์ใหญ่รหัส นายสุนัยสุขเอนกรหัส
Lecture 1 ECE Spring 2000 ECE 291 Spring 2000 Lecture 1: Microprocessor Evolution & Organization Constantine D. Polychronopoulos Professor, ECE.
High Performance Computing Processors Felix Noble Mirayma V. Rodriguez Agnes Velez Electric and Computer Engineer Department August 25, 2004.
History of Microprocessor MPIntroductionData BusAddress Bus
1 Acknowledgements Class notes based upon Patterson & Hennessy: Book & Lecture Notes Patterson’s 1997 course notes (U.C. Berkeley CS 152, 1997) Tom Fountain.
Computer Architecture By Chris Van Horn. CPU Basics “Brains of the Computer” Fetch Execute Cycle Instruction Branching.
Hyper Threading Technology. Introduction Hyper-threading is a technology developed by Intel Corporation for it’s Xeon processors with a 533 MHz system.
Introduction to MMX, XMM, SSE and SSE2 Technology
The Intel 86 Family of Processors
Playstation2 Architecture Architecture Hardware Design.
Chap 4: Processors Mainly manufactured by Intel and AMD Important features of Processors: Processor Speed (900MHz, 3.2 GHz) Multiprocessing Capabilities.
The Pentium Series CS 585: Computer Architecture Summer 2002 Tim Barto.
ALPHA 21164PC. Alpha 21164PC High-performance alternative to a Windows NT Personal Computer.
Stored Program Concept Learning Objectives Learn the meaning of the stored program concept The processor and its components The fetch-decode-execute and.
Pentium 4 Deeply pipelined processor supporting multiple issue with speculation and multi-threading 2004 version: 31 clock cycles from fetch to retire,
William Stallings Computer Organization and Architecture 6th Edition
Itanium® 2 Processor Architecture
Differences of 8086,80386,i7.
Protection in Virtual Mode
Microarchitecture.
CS 286 Computer Architecture & Organization
Visit for more Learning Resources
x86 Processor Architecture
Chapter 4 Processor Technology and Architecture
CIT 668: System Architecture
Multi-core processors
Matrix Multiplication Continued
Basic Computer Organization
CS203 – Advanced Computer Architecture
Introduction to Pentium Processor
عمارة الحاسب.
Microprocessors Chapter 4.
Special Instructions for Graphics and Multi-Media
The Microarchitecture of the Pentium 4 processor
Superscalar Pipelines Part 2
BIC 10503: COMPUTER ARCHITECTURE
Microprocessor & Assembly Language
Comparison of Two Processors
Comparison of AMD64, IA-32e extensions and the Itanium architecture
STUDY AND IMPLEMENTATION
Intel 8080 Processor The 8080 was an 8-bit processor
Coe818 Advanced Computer Architecture
Alex Saify Chad Reynolds James Aldorisio Brian Bischoff
CS 286 Computer Organization and Architecture
CSE 502: Computer Architecture
Presentation transcript:

1 ECE 734 Final Project Presentation Fall 2000 By Manoj Geo Varghese MMX Technology: An Optimization Outlook

2 Issues Facing MMX Today Latency. Cache Limits and Structure./Pipelining Floating Point Operation. Instruction Pairing/Mixing MMX instructions. Speed of Execution. Where are we in store for??

3 Next Generation Technology Intel ® Pentium ® 4 is an example Net Burst TM Architecture 1.5 GHz Processor. 400 MHz System Bus. Advanced Floating Point Operations. Trace Caches-Advanced Branch Prediction Algorithm. L1 Execution Cache + Data Cache. 256KB L2 Advanced Trace Cache. Hyper Pipelined Technology. 20 Level Pipelining. Rapid Execution Engine. Speed of ALU = 2 * Core frequency YES !!! (Atleast for integer operations) Do we stop here? Of Course No!!!!

4 But what about MMX Capability SSE2-Streaming SIMD Extensions new Instructions for Double Precision FP. Cache and Memory Management. Enable 3-D Graphics. Video Decoding/Encoding/ Speech Recognition. 128 bit MMX Registers. Separate Register for Data Movement. Visual internet- Voice over IP.

5 Work and Present Status Mentor Graphics Pipelining –Single level Clock cycle time reduced from ns !!! Imagine 20 Level Pipelining in P4!!! Cache Consideration. MMX instruction latencies. Pair MMX instructions. Mix MMX & Integer instructions. Study of Processors.