Power Aware Solutions for NoC Architecture Yaniv Ben-Itzhak Noc Seminar Winter 08.

Slides:



Advertisements
Similar presentations
Comparison Of Network On Chip Topologies Ahmet Salih BÜYÜKKAYHAN Fall.
Advertisements

Multi-core SoC Design is the Challenge! What is the Solution? Drew Wingard CTO Sonics, Inc.
System-level Trade-off of Networks-on-Chip Architecture Choices Network-on-Chip System-on-Chip Group, CSE-IMM, DTU.
International Symposium on Low Power Electronics and Design Energy-Efficient Non-Minimal Path On-chip Interconnection Network for Heterogeneous Systems.
Zhongkai Chen 3/25/2010. Jinglei Wang; Yibo Xue; Haixia Wang; Dongsheng Wang Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China This paper.
Presentation of Designing Efficient Irregular Networks for Heterogeneous Systems-on-Chip by Christian Neeb and Norbert Wehn and Workload Driven Synthesis.
REAL-TIME COMMUNICATION ANALYSIS FOR NOCS WITH WORMHOLE SWITCHING Presented by Sina Gholamian, 1 09/11/2011.
*time Optimization Heiko, Diego, Thomas, Kevin, Andreas, Jens.
Minimizing Expected Energy Consumption in Real-Time Systems through Dynamic Voltage Scaling Ruibin Xu, Daniel Mosse’, and Rami Melhem.
Networks on Chip : a very quick introduction! Jeremy Chan 11 May 2005.
Module R R RRR R RRRRR RR R R R R Technion – Israel Institute of Technology The Era of Many-Module SoC: Revisiting the NoC Mapping Problem Isask’har (Zigi)
Using one level of Cache:
Module R R RRR R RRRRR RR R R R R Efficient Link Capacity and QoS Design for Wormhole Network-on-Chip Zvika Guz, Isask ’ har Walter, Evgeny Bolotin, Israel.
System design-related Optimization problems Michela Milano Joint work DEIS Università di Bologna Dip. Ingegneria Università di Ferrara STI Università di.
Spring 2008 Network On Chip Platform Instructor: Yaniv Ben-Itzhak Students: Ofir Shimon Guy Assedou.
Performance and Energy Bounds for Multimedia Applications on Dual-processor Power-aware SoC Platforms Weng-Fai WONG 黄荣辉 Dept. of Computer Science National.
MINIMISING DYNAMIC POWER CONSUMPTION IN ON-CHIP NETWORKS Robert Mullins Computer Architecture Group Computer Laboratory University of Cambridge, UK.
Architecture and Real Time Systems Lab University of Massachusetts, Amherst An Application Driven Reliability Measures and Evaluation Tool for Fault Tolerant.
Architecture and Routing for NoC-based FPGA Israel Cidon* *joint work with Roman Gindin and Idit Keidar.
Issues in System-Level Direct Networks Jason D. Bakos.
ECE669 L23: Parallel Compilation April 29, 2004 ECE 669 Parallel Computer Architecture Lecture 23 Parallel Compilation.
Electronic Systems 1 Problem 20: MP3 mapped on NoC-based MPSoC Application Model SDF model (without auto-concurrency) of (modified) MP3 in a certain mode.
ECE-777 System Level Design and Automation Mapping
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
High Performance Embedded Computing © 2007 Elsevier Lecture 16: Interconnection Networks Embedded Computing Systems Mikko Lipasti, adapted from M. Schulte.
Course Outline DayContents Day 1 Introduction Motivation, definitions, properties of embedded systems, outline of the current course How to specify embedded.
Computer Science and Engineering CSE 8383 Advanced Computer Architecture1 Key Research Problems In NoC Design: A Holistic Perspective CSE 8383, March 30,
ECE 526 – Network Processing Systems Design Network Processor Architecture and Scalability Chapter 13,14: D. E. Comer.
VOLTAGE SCHEDULING HEURISTIC for REAL-TIME TASK GRAPHS D. Roychowdhury, I. Koren, C. M. Krishna University of Massachusetts, Amherst Y.-H. Lee Arizona.
High-Performance Networks for Dataflow Architectures Pravin Bhat Andrew Putnam.
10 years of research on Power Management (now called green computing) Rami Melhem Daniel Mosse Bruce Childers.
CHALLENGING SCHEDULING PROBLEM IN THE FIELD OF SYSTEM DESIGN Alessio Guerri Michele Lombardi * Michela Milano DEIS, University of Bologna.
“Low-Power, Real-Time Object- Recognition Processors for Mobile Vision Systems”, IEEE Micro Jinwook Oh ; Gyeonghoon Kim ; Injoon Hong ; Junyoung.
International Symposium on Low Power Electronics and Design NoC Frequency Scaling with Flexible- Pipeline Routers Pingqiang Zhou, Jieming Yin, Antonia.
Déjà Vu Switching for Multiplane NoCs NOCS’12 University of Pittsburgh Ahmed Abousamra Rami MelhemAlex Jones.
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
1 Customer-Aware Task Allocation and Scheduling for Multi-Mode MPSoCs Lin Huang, Rong Ye and Qiang Xu CHhk REliable computing laboratory (CURE) The Chinese.
TEMPLATE DESIGN © Hardware Design, Synthesis, and Verification of a Multicore Communication API Ben Meakin, Ganesh Gopalakrishnan.
L11: Lower Power High Level Synthesis(2) 성균관대학교 조 준 동 교수
ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU Under-project Meeting Network-on-Chip Group 2007/3/07 TA: 林書彥 黃群翔.
CS 8501 Networks-on-Chip (NoCs) Lukasz Szafaryn 15 FEB 10.
Resource Mapping and Scheduling for Heterogeneous Network Processor Systems Liang Yang, Tushar Gohad, Pavel Ghosh, Devesh Sinha, Arunabha Sen and Andrea.
Rabi Mahapatra Department of Computer Science & Engineering Texas A&M University.
Networks-on-Chip (NoC) Suleyman TOSUN Computer Engineering Deptartment Hacettepe University, Turkey.
Energy-Aware Resource Adaptation in Tessellation OS 3. Space-time Partitioning and Two-level Scheduling David Chou, Gage Eads Par Lab, CS Division, UC.
Dynamic Traffic Distribution among Hierarchy Levels in Hierarchical Networks-on-Chip Ran Manevich, Israel Cidon, and Avinoam Kolodny Group Research QNoC.
ADAM: Run-time Agent-based Distributed Application Mapping for on-chip Communication 林鼎原 Department of Electrical Engineering National Cheng Kung University.
1 November 11, 2015 A Massively Parallel, Hybrid Dataflow/von Neumann Architecture Yoav Etsion November 11, 2015.
Advanced Processor Group The School of Computer Science A Dynamic Link Allocation Router Wei Song, Doug Edwards Advanced Processor Group The University.
CprE 458/558: Real-Time Systems (G. Manimaran)1 Energy Aware Real Time Systems - Scheduling algorithms Acknowledgement: G. Sudha Anil Kumar Real Time Computing.
CprE 458/558: Real-Time Systems (G. Manimaran)1 CprE 458/558: Real-Time Systems Energy-aware QoS packet scheduling.
Jamie Unger-Fink John David Eriksen.  Allocation and Scheduling Problem  Better MPSoC optimization tool needed  IP and CP alone not good enough  Communication.
ECE692 Course Project Proposal Cache-aware power management for multi-core real-time systems Xing Fu Khairul Kabir 16 September 2009.
Effective bandwidth with link pipelining Pipeline the flight and transmission of packets over the links Overlap the sending overhead with the transport.
1 of 14 Lab 2: Formal verification with UPPAAL. 2 of 14 2 The gossiping persons There are n persons. All have one secret to tell, which is not known to.
Network On Chip Cache Coherency Final presentation – Part A Students: Zemer Tzach Kalifon Ethan Kalifon Ethan Instructor: Walter Isaschar Instructor: Walter.
A Low-Area Interconnect Architecture for Chip Multiprocessors Zhiyi Yu and Bevan Baas VLSI Computation Lab ECE Department, UC Davis.
1 of 14 Lab 2: Design-Space Exploration with MPARM.
Research Interests  NOCs – Networks-on-Chip  Embedded Real-Time Software  Real-Time Embedded Operating Systems (RTOS)  System Level Modeling and Synthesis.
Pradeep Konduri Static Process Scheduling:  Proceedance process model  Communication system model  Application  Dicussion.
M AESTRO : Orchestrating Predictive Resource Management in Future Multicore Systems Sangyeun Cho, Socrates Demetriades Computer Science Department University.
Runtime Reconfigurable Network-on- chips for FPGA-based systems Mugdha Puranik Department of Electrical and Computer Engineering
Pablo Abad, Pablo Prieto, Valentin Puente, Jose-Angel Gregorio
Nithin Michael, Yao Wang, G. Edward Suh and Ao Tang Cornell University
Babak Sorkhpour, Prof. Roman Obermaisser, Ayman Murshed
Spare Register Aware Prefetching for Graph Algorithms on GPUs
Fault and Energy Aware Communication Mapping with Guaranteed Latency for Applications Implemented on NoC Sorin Manolache, Petru Eles, Zebo Peng {sorma,
Buffer Space Optimisation with Communication Mapping and Traffic Shaping for NoCs Sorin Manolache, Petru Eles, Zebo Peng Linköping University, Sweden.
Department of Electrical Engineering Joint work with Jiong Luo
Communication Driven Remapping of Processing Element (PE) in Fault-tolerant NoC-based MPSoCs Chia-Ling Chen, Yen-Hao Chen and TingTing Hwang Department.
Presentation transcript:

Power Aware Solutions for NoC Architecture Yaniv Ben-Itzhak Noc Seminar Winter 08

Power Aware Various Solutions SoC Mapping blocks. Routing. Scheduling. CMP (MPSoC) Threads Allocation. Routing. Scheduling.

Power Aware Various Solutions SoC Mapping blocks. Routing. Scheduling. CMP (MPSoC) Threads Allocation. Routing. Scheduling.

SoC Radu Marculescu & Jingcao Hu: Exploiting the Routing Flexibility for Energy Performance Aware Mapping of Regular NoC Architectures (2003) Energy-Aware Communication and Task Scheduling for Network-on-Chip Architectures under Real-Time (2004) Energy-and Performance-Aware Mapping for Regular NoC Architectures (2005) Communication and Task Scheduling of Application-Specific Networks-on- Chip (2005)

Problem Formulation How to map each IP to which tile ? Total communication energy consumption is minimized

Communication Energy Model Energy consumed by the switch Energy consumed on the links Energy consumed by the buffer Energy consumed by the internal wires negligible

Communication Energy Model Energy consumed by the switch Energy consumed on the links

Communication Energy Model Energy consumed by the switch Energy consumed on the links Calculated independently of the underlying traffic model. Determined by the Manhattan distance between them.

Solution Application Characterization Graph (APCG) G = G(C,A) : c i represents one selected IP. a i, j characterizes the communication from c i to c j. v(a i, j ): communication volume (bits) from c i to c j. b(a i, j ): arc minimum bandwidth requirement from vertex c i to c j,

Solution Architecture Characterization Graph (ARCG) G =G(T,R): t i represents one tile in the architecture. r i, j represents the routing from t i to t j: P i, j : a set of candidate minimal paths from tile t i to tile t j. e(r i, j ):

Solution Find mapping function map() from APCG to ARCG and a deadlock-free, minimal routing function, which: communication volume (bits) from c i to c j

Solution The search space of the problem increases factorially with the system size. For System with 4X4 tiles, there are 16! Possible mappings !!

Solution Branch-and-Bound Algorithm

Power Aware Various Solutions SoC Mapping blocks. Routing. Scheduling. CMP Threads Allocation. Routing. Scheduling.

1 CMP (MPSoC( Threads Allocation Nero PhotoShop WinRar Quake 4 Threads Allocation

Nero PhotoShop WinRar Quake 4 Off CMP (MPSoC( Threads Allocation

CMP (MPSoC( Basic Processor Model Cache Miss T(p) Idle t(p) p Cache Miss …

CMP (MPSoC( Basic Processor Model Power = Power processors + Power NoC

CMP (MPSoC( Basic Processor Model Power = Power processors + Power NoC Power NoC calculated by bit energy model Depends on the Architecture (Cache in the Middle,Tiled CMP, Message Passing)

CMP (MPSoC( Basic Processor Model Threads Total Average Performance (All processors)

CMP (MPSoC( Power = Power processors + Power NoC P NoC calculated by bit energy model

CMP (MPSoC( Avg. Performance Threads Performance Average

CMP (MPSoC( Threads Allocation with constraints of: Asymmetry. Shared Memory –  Cache in the Middle  Tiled CMP Architecture. Message Passing. Threads Allocation with: Processors On/Off. DVS - Dynamic Voltage Scaling. DFS - Dynamic Frequency Scaling.

Questions ? Thank You !