PentiumPro 450GX Chipset Synthesis Steen Larsen Presentation 1 for ECE572 Nov 10 2003.

Slides:



Advertisements
Similar presentations
VHDL Design of Multifunctional RISC Processor on FPGA
Advertisements

System Integration and Performance
Sumitha Ajith Saicharan Bandarupalli Mahesh Borgaonkar.
The First Microprocessor By: Mark Tocchet and João Tupinambá.
A reconfigurable system featuring dynamically extensible embedded microprocessor, FPGA, and customizable I/O Borgatti, M. Lertora, F. Foret, B. Cali, L.
Integrated Circuits Laboratory Faculty of Engineering Digital Design Flow Using Mentor Graphics Tools Presented by: Sameh Assem Ibrahim 16-October-2003.
Graduate Computer Architecture I Lecture 15: Intro to Reconfigurable Devices.
IO Controller Module Arbitrates IO from the CCP Physically separable from CCP –Can be used as independent data logger or used in future projects. Implemented.
University College Cork IRELAND Hardware Concepts An understanding of computer hardware is a vital prerequisite for the study of operating systems.
Programmable logic and FPGA
Recap – Our First Computer WR System Bus 8 ALU Carry output A B S C OUT F 8 8 To registers’ input/output and clock inputs Sequence of control signal combinations.
1 CS402 PPP # 1 Computer Architecture Evolution. 2 John Von Neuman original concept.
ECE 331 – Digital System Design Course Introduction and VHDL Fundamentals (Lecture #1)
Altera DE2 Board and Quartus II Software ECE 3450 M. A. Jupina, VU, 2014.
Ethernet Bomber Ethernet Packet Generator for network analysis Oren Novitzky & Rony Setter Advisor: Mony Orbach Started: Spring 2008 Part A final Presentation.
Implementation of DSP Algorithm on SoC. Mid-Semester Presentation Student : Einat Tevel Supervisor : Isaschar Walter Accompaning engineer : Emilia Burlak.
1 Chapter 7 Design Implementation. 2 Overview 3 Main Steps of an FPGA Design ’ s Implementation Design architecture Defining the structure, interface.
Implementation of DSP Algorithm on SoC. Characterization presentation Student : Einat Tevel Supervisor : Isaschar Walter Accompany engineer : Emilia Burlak.
Introduction to FPGA and DSPs Joe College, Chris Doyle, Ann Marie Rynning.
COMPUTER MEMORY Modern computers use semiconductor memory It is made up of thousands of circuits (paths) for electrical currents on a single silicon chip.
Lecture 7 Lecture 7: Hardware/Software Systems on the XUP Board ECE 412: Microcomputer Laboratory.
Chapter 4  Converts data into information  Control center  Set of electronic circuitry that executes stored program instructions  Two parts ◦ Control.
Final presentation Encryption/Decryption on embedded system Supervisor: Ina Rivkin students: Chen Ponchek Liel Shoshan Winter 2013 Part A.
Ross Brennan On the Introduction of Reconfigurable Hardware into Computer Architecture Education Ross Brennan
Basic Microcomputer Design. Inside the CPU Registers – storage locations Control Unit (CU) – coordinates the sequencing of steps involved in executing.
9/22/2010Lecture 1 - Introduction1 ECE 5465 Advanced Microcomputers.
Overview Introduction The Level of Abstraction Organization & Architecture Structure & Function Why study computer organization?
Trigger design engineering tools. Data flow analysis Data flow analysis through the entire Trigger Processor allow us to refine the optimal architecture.
ECE 545 Project 1 Part IV Key Scheduling Final Integration List of Deliverables.
Matrix Multiplication on FPGA Final presentation One semester – winter 2014/15 By : Dana Abergel and Alex Fonariov Supervisor : Mony Orbach High Speed.
Introduction CSE 410, Spring 2008 Computer Systems
Maurice Goodrick, Bart Hommels 1 CALICE-UK WP2.2 Slab Data Paths Plan: – emulate multiple VFE chips on long PCBs – study the transmission behaviour.
Spartan-II Memory Controller For QDR SRAMs Lobby Pitch February 2000 ®
ASIC/FPGA design flow. FPGA Design Flow Detailed (RTL) Design Detailed (RTL) Design Ideas (Specifications) Design Ideas (Specifications) Device Programming.
Complete CompTIA A+ Guide to PCs, 6e Chapter 2: On the Motherboard © 2014 Pearson IT Certification
GBT Interface Card for a Linux Computer Carson Teale 1.
Computers Are Your Future Eleventh Edition Chapter 2: Inside the System Unit Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall1.
Chapter 2 The CPU and the Main Board  2.1 Components of the CPU 2.1 Components of the CPU 2.1 Components of the CPU  2.2Performance and Instruction Sets.
SHA-3 Candidate Evaluation 1. FPGA Benchmarking - Phase Round-2 SHA-3 Candidates implemented by 33 graduate students following the same design.
FPGA (Field Programmable Gate Array): CLBs, Slices, and LUTs Each configurable logic block (CLB) in Spartan-6 FPGAs consists of two slices, arranged side-by-side.
1 Towards Optimal Custom Instruction Processors Wayne Luk Kubilay Atasu, Rob Dimond and Oskar Mencer Department of Computing Imperial College London HOT.
Array Synthesis in SystemC Hardware Compilation Authors: J. Ditmar and S. McKeever Oxford University Computing Laboratory, UK Conference: Field Programmable.
1 Fly – A Modifiable Hardware Compiler C. H. Ho 1, P.H.W. Leong 1, K.H. Tsoi 1, R. Ludewig 2, P. Zipf 2, A.G. Oritz 2 and M. Glesner 2 1 Department of.
PROCStar III Performance Charactarization Instructor : Ina Rivkin Performed by: Idan Steinberg Evgeni Riaboy Semestrial Project Winter 2010.
Introduction to FPGA Created & Presented By Ali Masoudi For Advanced Digital Communication Lab (ADC-Lab) At Isfahan University Of technology (IUT) Department.
Lecture 16: Reconfigurable Computing Applications November 3, 2004 ECE 697F Reconfigurable Computing Lecture 16 Reconfigurable Computing Applications.
EEE440 Computer Architecture
Part A Presentation Implementation of DSP Algorithm on SoC Student : Einat Tevel Supervisor : Isaschar Walter Accompanying engineer : Emilia Burlak The.
ECE 545 Project 2 Specification. Schedule of Projects (1) Project 1 RTL design for FPGAs (20 points) Due date: Tuesday, November 22, midnight (firm) Checkpoints:
Computer Organization & Assembly Language © by DR. M. Amer.
Computer Engineering 1502 Advanced Digital Design Professor Donald Chiarulli Computer Science Dept Sennott Square
Field Programmable Port Extender (FPX) 1 Modular Design Techniques for the FPX.
Lopamudra Kundu Reg. No. : of Roll No.:- 91/RPE/ Koushik Basak
© 2010 Altera Corporation - Public Lutiac – Small Soft Processors for Small Programs David Galloway and David Lewis November 18, 2010.
Digital Design Using VHDL and PLDs ECOM 4311 Digital System Design Chapter 1.
Chapter 3 System Buses.  Hardwired systems are inflexible  General purpose hardware can do different tasks, given correct control signals  Instead.
ASIC/FPGA design flow. Design Flow Detailed Design Detailed Design Ideas Design Ideas Device Programming Device Programming Timing Simulation Timing Simulation.
Field Programmable Port Extender (FPX) 1 Modular Design Techniques for the Field Programmable Port Extender John Lockwood and David Taylor Washington University.
FPGA Technology Overview Carl Lebsack * Some slides are from the “Programmable Logic” lecture slides by Dr. Morris Chang.
ECE 448 – FPGA and ASIC Design with VHDL George Mason University ECE 448 Lab 2 Implementing Combinational Logic in VHDL.
Introduction CSE 410, Spring 2005 Computer Systems
ALPHA 21164PC. Alpha 21164PC High-performance alternative to a Windows NT Personal Computer.
SUBJECT : DIGITAL ELECTRONICS CLASS : SEM 3(B) TOPIC : INTRODUCTION OF VHDL.
Introduction to Programmable Logic
Field Programmable Gate Array
Field Programmable Gate Array
RECONFIGURABLE PROCESSING AND AVIONICS SYSTEMS
第四章 80386的存贮器和输入/输出接口 作业:P335 5,7,13,17,21,25,36,37,41,44,45,46,48,52,65 21:46.
Presentation transcript:

PentiumPro 450GX Chipset Synthesis Steen Larsen Presentation 1 for ECE572 Nov

General Motive Investigate processor-memory compression methods in a “current” (Intel pipelined FSB and DIMM) architecture to further evaluate advantages of using memory compression architectures. Investigate processor-memory compression methods in a “current” (Intel pipelined FSB and DIMM) architecture to further evaluate advantages of using memory compression architectures. Quickly evaluate using FPGA synthesis different methods of address, instruction, and data compression by modifying and existing production chipset. Quickly evaluate using FPGA synthesis different methods of address, instruction, and data compression by modifying and existing production chipset.

What is memory compression? Processor reads and writes to main memory in 32 byte cache lines. Processor reads and writes to main memory in 32 byte cache lines. Optimize this path of information (instruction/address/data) Optimize this path of information (instruction/address/data) Currently simulations, and one implementation at IBM on instruction compression. Currently simulations, and one implementation at IBM on instruction compression. Example of many reads per write Example of many reads per write Example of predominance of “0” and “1” Example of predominance of “0” and “1” Benefit of 1-2% improvement Benefit of 1-2% improvement

More detail of 450GX DC/DP

Why synthesis to FPGA? Quick re-synthesis and adjustment to existing VHDL structures (reduce simulation and validation cycles) Quick re-synthesis and adjustment to existing VHDL structures (reduce simulation and validation cycles) Rapid advancement of FPGA logic and internal components. (Moore’s law on older chipset) Rapid advancement of FPGA logic and internal components. (Moore’s law on older chipset)

Altera Cyclone device Similar to other Altera and Xilinx devices. (LE, PLL, memory, IO, licensed IP) Similar to other Altera and Xilinx devices. (LE, PLL, memory, IO, licensed IP)

Altera LE internals

Orion 450GX details Focusing only on memory interface Focusing only on memory interface NEC ASIC written in VHDL NEC ASIC written in VHDL PLL, Dual port RAM PLL, Dual port RAM 66MHz 64bit data bus 66MHz 64bit data bus 1 st generation DIMM, ECC 1 st generation DIMM, ECC DC is 208 PQFP and DP is 240 PQFP DC is 208 PQFP and DP is 240 PQFP Plan to use existing platforms Plan to use existing platforms

Scope of DC/DP logic vhd/.vhdtmp files or 98MB (This includes a lot of testbench files!) vhd/.vhdtmp files or 98MB (This includes a lot of testbench files!) VHDL lines total of entity/architecture/package/config VHDL lines total of entity/architecture/package/config entity-architecture VHDL lines in DP logic (32 files) entity-architecture VHDL lines in DP logic (32 files) package VHDL lines in DP logic package VHDL lines in DP logic

Synthesis conversion from ASIC-> FPGA File format and locations (separate entity/architecture, autogenerated packages files for each entity architecture File format and locations (separate entity/architecture, autogenerated packages files for each entity architecture Packages converted from NEC ASIC to Altera logic Packages converted from NEC ASIC to Altera logic

Further issues PLLs are needed, need to regenerate PLLs are needed, need to regenerate Dual Port RAM is an ASIC primitive and needs to be converted to Altera RAM block. (Biggest time sink was conversion on unsigned (STD_LOGIC_VECTOR)) Dual Port RAM is an ASIC primitive and needs to be converted to Altera RAM block. (Biggest time sink was conversion on unsigned (STD_LOGIC_VECTOR))

VHDL syntax differences Maximum one WAIT statement in a VHDL process. Was used to ensure multi-clock error output generation. May need to correct in the long run. Maximum one WAIT statement in a VHDL process. Was used to ensure multi-clock error output generation. May need to correct in the long run. PLLs currently commented out, and generated off a single off-chip clock. PLLs currently commented out, and generated off a single off-chip clock.

Currently…

ScreenShot of layout

Now proceeding to DataControl chip 140 architecture files compared to 32 of the datapath chip 140 architecture files compared to 32 of the datapath chip Main packaging conversion difficulties should be solved. Main packaging conversion difficulties should be solved. Estimate 5-10X complexity of logic, so potential pitfalls Estimate 5-10X complexity of logic, so potential pitfalls

What is needed beyond synthesis Simulation of basic read/write processor access Simulation of basic read/write processor access Understanding of current chipset errata Understanding of current chipset errata Circuit board layout (GTL translation, bus clock rate down from 66MHz) Circuit board layout (GTL translation, bus clock rate down from 66MHz) Boot DOS/Linux/Windows Boot DOS/Linux/Windows Implement compression algorithms Implement compression algorithms

Questions?

If I have seen further it is by standing on the shoulders of giants. –Isaac Newton, Letter to Robert Hooke, February 5, 1675