80-Tile Teraflop Network-On- Chip 1. Contents Overview of the chip Architecture ▫Computational Core ▫Mesh Network Router ▫Power save features Performance.

Slides:



Advertisements
Similar presentations
Subthreshold SRAM Designs for Cryptography Security Computations Adnan Gutub The Second International Conference on Software Engineering and Computer Systems.
Advertisements

Dr. Leo Lehmann Vice Chairman ITU-T Study Group 13 Energy Saving for Future Networks "Future Networks: Cloud Computing, Energy Saving, Security and Virtualization"
A Novel 3D Layer-Multiplexed On-Chip Network
International Symposium on Low Power Electronics and Design Energy-Efficient Non-Minimal Path On-chip Interconnection Network for Heterogeneous Systems.
Computer Abstractions and Technology
Fall EE 333 Lillevik 333f06-l20 University of Portland School of Engineering Computer Organization Lecture 20 Pipelining: “bucket brigade” MIPS.
Power Reduction Techniques For Microprocessor Systems
THE RAW MICROPROCESSOR: A COMPUTATIONAL FABRIC FOR SOFTWARE CIRCUITS AND GENERAL- PURPOSE PROGRAMS Taylor, M.B.; Kim, J.; Miller, J.; Wentzlaff, D.; Ghodrat,
The Microprocessor is no more General Purpose. Design Gap.
Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 ECE 697F Reconfigurable Computing Lecture 9 Coarse Grained FPGA Architecture.
Introduction to CMOS VLSI Design Lecture 18: Design for Low Power David Harris Harvey Mudd College Spring 2004.
Keeping Hot Chips Cool Thermal Management for Green Computing Yang Ge Professor Qinru Qiu.
1 Lecture 26: Storage Systems Topics: Storage Systems (Chapter 6), other innovations Final exam stats:  Highest: 95  Mean: 70, Median: 73  Toughest.
MINIMISING DYNAMIC POWER CONSUMPTION IN ON-CHIP NETWORKS Robert Mullins Computer Architecture Group Computer Laboratory University of Cambridge, UK.
Lei Wang, Yuho Jin, Hyungjun Kim and Eun Jung Kim
1 Multi - Core fast Communication for SoPC Multi - Core fast Communication for SoPC Technion – Israel Institute of Technology Department of Electrical.
CAD and Design Tools for On- Chip Networks Luca Benini, Mark Hummel, Olav Lysne, Li-Shiuan Peh, Li Shang, Mithuna Thottethodi,
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 13: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
Lecture 5 – Power Prof. Luke Theogarajan
Lecture 7: Power.
Princess Sumaya Univ. Computer Engineering Dept. د. بســام كحـالــه Dr. Bassam Kahhaleh.
Scaling and Packing on a Chip Multiprocessor Vincent W. Freeh Tyler K. Bletsch Freeman L. Rawson, III Austin Research Laboratory.
Lecture#14. Last Lecture Summary Memory Address, size What memory stores OS, Application programs, Data, Instructions Types of Memory Non Volatile and.
Lecture 03: Fundamentals of Computer Design - Trends and Performance Kai Bu
International Symposium on Low Power Electronics and Design NoC Frequency Scaling with Flexible- Pipeline Routers Pingqiang Zhou, Jieming Yin, Antonia.
SMART: A Single- Cycle Reconfigurable NoC for SoC Applications -Jyoti Wadhwani Chia-Hsin Owen Chen, Sunghyun Park, Tushar Krishna, Suvinay Subramaniam,
Parallel and Distributed Systems Instructor: Xin Yuan Department of Computer Science Florida State University.
Copyright © 2007 Heathkit Company, Inc. All Rights Reserved PC Fundamentals Presentation 27 – A Brief History of the Microprocessor.
Multi-core Programming Introduction Topics. Topics General Ideas Moore’s Law Amdahl's Law Processes and Threads Concurrency vs. Parallelism.
Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.
1 Computer Architecture Research Overview Rajeev Balasubramonian School of Computing, University of Utah
Hardware Trends. Contents Memory Hard Disks Processors Network Accessories Future.
MS108 Computer System I Lecture 2 Metrics Prof. Xiaoyao Liang 2014/2/28 1.
[Tim Shattuck, 2006][1] Performance / Watt: The New Server Focus Improving Performance / Watt For Modern Processors Tim Shattuck April 19, 2006 From the.
Frank Casilio Computer Engineering May 15, 1997 Multithreaded Processors.
Network-on-Chip Energy-Efficient Design Techniques for Interconnects Suhail Basit.
Protein Explorer: A Petaflops Special Purpose Computer System for Molecular Dynamics Simulations David Gobaud Computational Drug Discovery Stanford University.
Computational Sprinting on a Real System: Preliminary Results Arun Raghavan *, Marios Papaefthymiou +, Kevin P. Pipe +#, Thomas F. Wenisch +, Milo M. K.
CPU Inside Maria Gabriela Yobal de Anda L#32 9B. CPU Called also the processor Performs the transformation of input into output Executes the instructions.
Exascale Computing. 1 Teraflops Chip Knight Corner will be manufactured with Intel’s 3-D Tri-Gate 22nm process and features more than 50 cores.
ATtiny23131 A SEMINAR ON AVR MICROCONTROLLER ATtiny2313.
XStream: Rapid Generation of Custom Processors for ASIC Designs Binu Mathew * ASIC: Application Specific Integrated Circuit.
Runtime Power Gating of On-Chip Routers Using Look-Ahead Routing
University of Michigan, Ann Arbor
Rabi Mahapatra Department of Computer Science & Engineering Texas A&M University.
Networks-on-Chip (NoC) Suleyman TOSUN Computer Engineering Deptartment Hacettepe University, Turkey.
ECEn 191 – New Student Seminar - Session 6 Digital Logic Digital Logic ECEn 191 New Student Seminar.
What is a Microprocessor ? A microprocessor consists of an ALU to perform arithmetic and logic manipulations, registers, and a control unit Its has some.
CPU/BIOS/BUS CES Industries, Inc. Lesson 8.  Brain of the computer  It is a “Logical Child, that is brain dead”  It can only run programs, and follow.
Computer Architecture Lecture 26 Past and Future Ralph Grishman November 2015 NYU.
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 6.1 EE4800 CMOS Digital IC Design & Analysis Lecture 6 Power Zhuo Feng.
Click to edit Master title style Progress Update Energy-Performance Characterization of CMOS/MTJ Hybrid Circuits Fengbo Ren 05/28/2010.
1 Dual-V cc SRAM Class presentation for Advanced VLSIPresenter:A.Sammak Adopted from: M. Khellah,A 4.2GHz 0.3mm 2 256kb Dual-V CC SRAM Building Block in.
IP Router Architecture Masoud Sabaei Assistant professor Computer Engineering and Information Technology Department, Amirkabir University of Technology.
By Chad Andrus. TILE-Gx100  100 Identical Processor Cores Each core has its own L2 & L3 cache Each can run its own OS or group together for multiprocessing.
CS203 – Advanced Computer Architecture
M V Ganeswara Rao Associate Professor Dept. of ECE Shri Vishnu Engineering College for Women Bhimavaram Hardware Architecture of Low-Power ALU using Clock.
History of Computers and Performance David Monismith Jan. 14, 2015 Based on notes from Dr. Bill Siever and from the Patterson and Hennessy Text.
C. Murad Özsert Intel's Tera Scale Processor Architecture.
Low-power Task Scheduling for GPU Energy Reduction Li Tang, Yiji Zhang.
Institute of Applied Microelectronics and Computer Engineering College of Computer Science and Electrical Engineering, University of Rostock Slide 1 Power.
Runtime Reconfigurable Network-on- chips for FPGA-based systems Mugdha Puranik Department of Electrical and Computer Engineering
Physical Memory and Physical Addressing ( Chapter 10 ) by Polina Zapreyeva.
High Performance Computer Architecture:
CS203 – Advanced Computer Architecture
Lynn Choi School of Electrical Engineering
Inc. 32 nm fabrication process and Intel SpeedStep.
Jim Held Intel Fellow & Director, Tera-scale Computing Research
A High Performance SoC: PkunityTM
Chapter 1 Introduction.
Presentation transcript:

80-Tile Teraflop Network-On- Chip 1

Contents Overview of the chip Architecture ▫Computational Core ▫Mesh Network Router ▫Power save features Performance Evaluation 2

Overview of the chip Purpose: High speed floating point calculations (research chip) Tile based Network- On-Chip Low power consumption 3

4 Computational core The Processing Engine inside the tile

80 GB/s throughput Mesochronous interface Data can be routed across 2 lanes 5 Mesh Network router Overview of the crossbar router

6 Mesh Network router Area reduction trough bit interleaving InterleavingRoutingDe-interleaving

7 Power saving features Sleep transistors: reduce standby leakage Body bias circuits: reduce active leakage Controlled by special instructions Operating voltage: V Operating frequency: 0-5.8GHz

Performance Extreme amount of FLOPS/Watt Low voltage performance still impressive: ▫11W, 310 GFLOPS 8

Evaluation Very scalable Energy efficient Heat spreading possible Fault tolerant Dynamic routing across mesh network 3d stacked memory very promising Not general purpose yet Communication with the outside world is hard Programming might be a problem 9

End of presentation 10