HSSD Project Planning Nuno Miguel Cardanha Paulino PDEEC 2011-2012.



Implementation Overview
CPU (Microblaze)
– Runs application
LMB Injector
– Bus monitor/modifier
RPU
– Executes CDFGs
– No memory accesses
Figure 1 – Overview of Local Memory Architecture

Memory Access Architecture Proposal
Enhanced Injector
– Explicitly stall the CPU
– Bus access multiplexing
Loader/Storer
– Memory stimuli
– Clock gate RPU when FIFOs full
– 1 load and 1 store per
– Loader scheduling
Adapt RPU
– Generate/accept data/address at any row, or limit loads/stores to first/last rows
– Loader FIFO not empty stall
Select memory access benchmarks
– Vecsum, dotprod, max
Compare speedups with CatapultC
Figure 2 – Possible Memory access architecture
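
The slides name the three memory access benchmarks but do not define them. As a rough illustration only, the sketch below gives software reference versions that could serve as golden outputs when checking accelerated results. Every definition here is an assumption: `vecsum` is taken to be an element-wise sum of two vectors (so it exercises both loads and stores), `dotprod` a dot product, and `vecmax` the maximum of one vector. The function names and signatures are hypothetical and are not taken from the project.

```python
from typing import List, Sequence


def vecsum(a: Sequence[int], b: Sequence[int]) -> List[int]:
    """Element-wise sum of two equal-length vectors (assumed definition)."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    # Two loads and one store per element.
    return [x + y for x, y in zip(a, b)]


def dotprod(a: Sequence[int], b: Sequence[int]) -> int:
    """Dot product of two equal-length vectors (assumed definition)."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    # Two loads per element, one accumulated scalar result.
    acc = 0
    for x, y in zip(a, b):
        acc += x * y
    return acc


def vecmax(a: Sequence[int]) -> int:
    """Maximum element of a non-empty vector (assumed definition)."""
    if len(a) == 0:
        raise ValueError("vector must not be empty")
    # One load per element, one scalar result.
    best = a[0]
    for x in a[1:]:
        if x > best:
            best = x
    return best


if __name__ == "__main__":
    print(vecsum([1, 2, 3], [4, 5, 6]))
    print(dotprod([1, 2, 3], [4, 5, 6]))
    print(vecmax([3, 9, 2]))
```

Under these assumptions the three kernels differ mainly in their load/store mix, which is what the Loader/Storer would have to serve: only the element-wise sum writes a full output vector, while the other two reduce to a single value.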