Network On Chip Cache Coherency Final presentation – Part A Students: Zemer Tzach Kalifon Ethan Kalifon Ethan Instructor: Walter Isaschar Instructor: Walter.

Slides:



Advertisements
Similar presentations
Multiprocessors— Large vs. Small Scale Multiprocessors— Large vs. Small Scale.
Advertisements

Presentation of Designing Efficient Irregular Networks for Heterogeneous Systems-on-Chip by Christian Neeb and Norbert Wehn and Workload Driven Synthesis.
1 Message passing architectures and routing CEG 4131 Computer Architecture III Miodrag Bolic Material for these slides is taken from the book: W. Dally,
PRESENTED BY: PRIYANK GUPTA 04/02/2012 Generic Low Latency NoC Router Architecture for FPGA Computing Systems & A Complete Network on Chip Emulation Framework.
What is Flow Control ? Flow Control determines how a network resources, such as channel bandwidth, buffer capacity and control state are allocated to packet.
NETWORK ON CHIP ROUTER Students : Itzik Ben - shushan Jonathan Silber Instructor : Isaschar Walter Final presentation part A Winter 2006.
L2 to Off-Chip Memory Interconnects for CMPs Presented by Allen Lee CS258 Spring 2008 May 14, 2008.
Spring 2008 Network On Chip Platform Instructor: Yaniv Ben-Itzhak Students: Ofir Shimon Guy Assedou.
Network based System on Chip Final Presentation Part B Performed by: Medvedev Alexey Supervisor: Walter Isaschar (Zigmond) Winter-Spring 2006.
NETWORK ON CHIP ROUTER Students : Itzik Ben - shushan Jonathan Silber Instructor : Isaschar Walter Characterization presentation Winter 2006.
NETWORK ON CHIP ROUTER Students : Itzik Ben - shushan Jonathan Silber Instructor : Isaschar Walter Final presentation part B Spring 2006.
Network based System on Chip Performed by: Medvedev Alexey Supervisor: Walter Isaschar (Zigmond) Winter-Spring 2006.
Network based System on Chip Students: Medvedev Alexey Shimon Ofir Supervisor: Walter Isaschar (Zigmond) Winter-Spring 2006.
IP I/O Memory Hard Disk Single Core IP I/O Memory Hard Disk IP Bus Multi-Core IP R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R Networks.
Network based System on Chip Part A Performed by: Medvedev Alexey Supervisor: Walter Isaschar (Zigmond) Winter-Spring 2006.
Lei Wang, Yuho Jin, Hyungjun Kim and Eun Jung Kim
Performed by : Rivka Cohen and Sharon Solomon Instructor : Walter Isaschar המעבדה למערכות ספרתיות מהירות High Speed Digital Systems Laboratory הטכניון.
1 Lecture 13: Interconnection Networks Topics: flow control, router pipelines, case studies.
1 Lecture 25: Interconnection Networks Topics: flow control, router microarchitecture Final exam:  Dec 4 th 9am – 10:40am  ~15-20% on pre-midterm  post-midterm:
NETWORK ON CHIP ROUTER Students : Itzik Ben - shushan Jonathan Silber Instructor : Isaschar Walter PART A Midterm presentation Winter 2006.
Predictive Load Balancing Reconfigurable Computing Group.
1 Lecture 24: Interconnection Networks Topics: topologies, routing, deadlocks, flow control Final exam reminders:  Plan well – attempt every question.
Modern trends in computer architecture and semiconductor scaling are leading towards the design of chips with more and more processor cores. Highly concurrent.
1 E. Bolotin – The Power of Priority, NoCs 2007 The Power of Priority : NoC based Distributed Cache Coherency Evgeny Bolotin, Zvika Guz, Israel Cidon,
Reliable Data Storage using Reed Solomon Code Supervised by: Isaschar (Zigi) Walter Performed by: Ilan Rosenfeld, Moshe Karl Spring 2004 Midterm Presentation.
Issues in System-Level Direct Networks Jason D. Bakos.
1 Lecture 26: Interconnection Networks Topics: flow control, router microarchitecture.
Orion: A Power-Performance Simulator for Interconnection Networks Presented by: Ilya Tabakh RC Reading Group4/19/2006.
Network-on-Chip: Communication Synthesis Department of Computer Science Texas A&M University.
Performance and Power Efficient On-Chip Communication Using Adaptive Virtual Point-to-Point Connections M. Modarressi, H. Sarbazi-Azad, and A. Tavakkol.
1 The Turn Model for Adaptive Routing. 2 Summary Introduction to Direct Networks. Deadlocks in Wormhole Routing. System Model. Partially Adaptive Routing.
High Speed Digital Design Project SpaceWire Router Student: Asaf Bercovich Instructor: Mony Orbach Semester: Winter 2009/ Semester Project Date:
Communication issues for NOC By Farhadur Arifin. Objective: Future system of NOC will have strong requirment on reusability and communication performance.
On-Chip Networks and Testing
A Vertical Bubble Flow Network using Inductive-Coupling for 3D CMPs
José Vicente Escamilla José Flich Pedro Javier García 1.
Déjà Vu Switching for Multiplane NoCs NOCS’12 University of Pittsburgh Ahmed Abousamra Rami MelhemAlex Jones.
The Alpha Network Architecture By Shubhendu S. Mukherjee, Peter Bannon Steven Lang, Aaron Spink, and David Webb Compaq Computer Corporation Presented.
Author : Jing Lin, Xiaola Lin, Liang Tang Publish Journal of parallel and Distributed Computing MAKING-A-STOP: A NEW BUFFERLESS ROUTING ALGORITHM FOR ON-CHIP.
Computer Architecture Distributed Memory MIMD Architectures Ola Flygt Växjö University
Presenter: Min-Yu Lo 2015/10/19 Asit K. Mishra, N. Vijaykrishnan, Chita R. Das Computer Architecture (ISCA), th Annual International Symposium on.
TEMPLATE DESIGN © Hardware Design, Synthesis, and Verification of a Multicore Communication API Ben Meakin, Ganesh Gopalakrishnan.
Department of Computer Science and Engineering The Pennsylvania State University Akbar Sharifi, Emre Kultursay, Mahmut Kandemir and Chita R. Das Addressing.
CS 8501 Networks-on-Chip (NoCs) Lukasz Szafaryn 15 FEB 10.
© Sudhakar Yalamanchili, Georgia Institute of Technology (except as indicated) Switch Microarchitecture Basics.
Anshul Kumar, CSE IITD ECE729 : Advanced Computer Architecture Lecture 27, 28: Interconnection Mechanisms In Multiprocessors 29 th, 31 st March, 2010.
Performance Analysis of a JPEG Encoder Mapped To a Virtual MPSoC-NoC Architecture Using TLM 林孟諭 Dept. of Electrical Engineering National Cheng Kung.
Network On Chip Platform
The Alpha Network Architecture Mukherjee, Bannon, Lang, Spink, and Webb Summary Slides by Fred Bower ECE 259, Spring 2004.
Microprocessors and Microsystems Volume 35, Issue 2, March 2011, Pages 230–245 Special issue on Network-on-Chip Architectures and Design Methodologies.
OASIS NoC Revisited Adam Esch (m ). Outline Pre-Research OASIS Overview Research Contributions Remarks OASIS Suggestions Future Work.
Lecture 16: Router Design
Intel Slide 1 A Comparative Study of Arbitration Algorithms for the Alpha Pipelined Router Shubu Mukherjee*, Federico Silla !, Peter Bannon $, Joel.
Virtual-Channel Flow Control William J. Dally
Mohamed ABDELFATTAH Andrew BITAR Vaughn BETZ. 2 Module 1 Module 2 Module 3 Module 4 FPGAs are big! Design big systems High on-chip communication.
Performed by: Jonathan Silber Itzik Ben-Shushan Instructor: Isaschar walter המעבדה למערכות ספרתיות מהירות High speed digital systems laboratory הטכניון.
Network On Chip Cache Coherency Midterm presentation Students: Zemer Tzach Kalifon Ethan Kalifon Ethan Instructor: Walter Isaschar Instructor: Walter Isaschar.
Univ. of TehranIntroduction to Computer Network1 An Introduction to Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
Power-aware NOC Reuse on the Testing of Core-based Systems* CSCE 932 Class Presentation by Xinwang Zhang April 26, 2007 * Erika Cota, et al., International.
1 Lecture 22: Interconnection Networks Topics: Routing, deadlock, flow control, virtual channels.
Lecture 23: Interconnection Networks
Exploring Concentration and Channel Slicing in On-chip Network Router
Azeddien M. Sllame, Amani Hasan Abdelkader
Using Packet Information for Efficient Communication in NoCs
Natalie Enright Jerger, Li Shiuan Peh, and Mikko Lipasti
EE 122: Lecture 7 Ion Stoica September 18, 2001.
Lecture: Interconnection Networks
Lecture 25: Interconnection Networks
MACS: A Minimal Adaptive Routing Circuit Switched Architecture for Scalable and Parametric NoCs Rohit Kumar Dr. Ann Gordon-Ross Introduction MACS: A.
Multiprocessors and Multi-computers
Presentation transcript:

Network On Chip Cache Coherency Final presentation – Part A Students: Zemer Tzach Kalifon Ethan Kalifon Ethan Instructor: Walter Isaschar Instructor: Walter Isaschar Spring 2008

Agenda  Project’s general concepts.  Design architecture of the router.  Implementation of the router (using HDL Designer).  Router’s simulations.  Demonstration of our NoC. Network On Chip - Cache Coherency 2

3

General Background  Modern CPU’s are based on CMP - Multi-Core Processor.  Improved performance is achieved by “Distribution and Parallelism”.  Cores interact by using NoC – Network on Chip. Network On Chip - Cache Coherency 4

NoC’s General Diagram Network On Chip - Cache Coherency 5

NoC’s Characteristics  Wormhole packet routing.  Packet’s path is X-Y.  Units can communicate simultaneously.  Reduce power consumption.  Scalability. Network On Chip - Cache Coherency 6

Cache Coherency  Definition: CMP cores use only up to date data.  Originally, Cache Coherency in CMP was achieved by using a central memory control unit. Network On Chip - Cache Coherency 7

Cache Coherency Protocol Nowadays Network On Chip - Cache Coherency 8

Problem Description  Prior Cache Coherency protocols are irrelevant – NoC doesn’t have central unit.  Adding such unit will damage both NoC’s scalability and parallelism. Network On Chip - Cache Coherency 9

Solution Requirements  Won’t affect main NoC’s characteristics (e.g. scalability).  Avoid “Hot Spots” and “Bottle Necks”.  Minimize use of NoC’s resources. Network On Chip - Cache Coherency 10

Solution  Memory control distribution among a number of units according to memory spaces.  Placement of control units as part of the NoC. Network On Chip - Cache Coherency 11

Solution Diagram Network On Chip - Cache Coherency 12

Project’s Goals  Primary Goal: Design and implement Cache Coherency protocol for CMP.  Implement NoC (including NoC’s router).  Assemble CMP based on NoC. Network On Chip - Cache Coherency 13

14 Network On Chip - Cache Coherency

NoC Packet’s Structure  Packet is divided into flits.  There are four flit types: Start, Body, End and Idle. Network On Chip - Cache Coherency 15

Flit’s Structure  Flit contain two fields: Data and Type. Network On Chip - Cache Coherency 16

5 Ports Router  Direct packets according to X-Y routing.  5 ports – North, East, West, South and Processing Unit.  Processing Units are using the network’s communication protocol.  2 Virtual Channels (VC) per port. Network On Chip - Cache Coherency 17

5 Ports Router Structure Network On Chip - Cache Coherency 18

Input Port  Receives Flits from Router or from Processing unit.  Analyze and save the current packet direction.  Switch between Virtual Channels. Network On Chip - Cache Coherency 19

Input Port Structure Network On Chip - Cache Coherency 20

Output Port  Transmits Flits to Router or to Processing unit.  Each Virtual Channel save the currently serviced input port (CSIP).  Switch between Virtual Channels. Network On Chip - Cache Coherency 21

Output Port Structure Network On Chip - Cache Coherency 22

Cross Bar  Transfer Flits from Input Port to the matching Output Port.  Consists of 5 controllers – one for every Output Port. Network On Chip - Cache Coherency 23

Cross Bar Structure Network On Chip - Cache Coherency 24

25 Network On Chip - Cache Coherency

Network’s characteristics  The width of the Data bus is 8 bit.  The size of the ports’ buffers is 4 flits (can contain 4 flit at the most).  The NoC is composed of 9 routers, placed in 3x3 grid formation. Network On Chip - Cache Coherency 26

Input’s VC Implementation Network On Chip - Cache Coherency 27

Output’s VC Implementation Network On Chip - Cache Coherency 28

Input Port Implementation Network On Chip - Cache Coherency 29

Output Port Implementation Network On Chip - Cache Coherency 30

Output’s Controller Implementation Network On Chip - Cache Coherency 31

Crossbar_Mux Implementation Network On Chip - Cache Coherency 32

Cross Bar Implementation Network On Chip - Cache Coherency 33

Router Implementation Network On Chip - Cache Coherency 34

Synthesis Parameters Network On Chip - Cache Coherency 35

Network’s Performance  Latency of the router is 2 cycles.  Throughput of the router is 1 flit per cycle.  System’s clock frequency is 100 [MHz].  Packets can be routed simultaneously.  Packets can by-pass each other. Network On Chip - Cache Coherency 36

37 Network On Chip - Cache Coherency

Cross Transmit 38 Network On Chip - Cache Coherency  Routing two packets simultaneously: port 0 to port 2 and port 3 to port 2.

Traffic avoidance by using VC 39 Network On Chip - Cache Coherency  First packet from port 0 to port 1 get blocked in output port.  Packet from port 3 to port 1 by-pass it.

40 Network On Chip - Cache Coherency

Demonstration Diagram Network On Chip - Cache Coherency 41

General Description  Dummy units transmit packets.  Destination is being set by the switch- buttons.  The Dummy port start transmitting according to its push-button. Network On Chip - Cache Coherency 42

Project Schedule (1 st Semester)  Familiarize with design tools – 3 weeks.  Familiarize with VirtexII Pro FPGA (application & components) – 4 weeks.  Design & Implement NoC’s router – 5 weeks.  Assemble CMP using our router implementation – 2 weeks. Network On Chip - Cache Coherency 43

Project Schedule (2 nd Semester)  Assemble CMP using our router implementation – 4 weeks.  Design Cache Coherency protocol for CMP based on faculty research – 4 weeks.  Implement the protocol as part of the assembled CMP – 6 weeks. Network On Chip - Cache Coherency 44