Dual-Pipeline Heterogeneous ASIP Design Swarnalatha Radhakrishnan, Hui Guo, Sri Parameswaran School of Computer Science & Engineering University of New.

Slides:



Advertisements
Similar presentations
Instructional Technology vs. Educational Technology
Advertisements

Department of Computer Science and Engineering University of Washington Brian N. Bershad, Stefan Savage, Przemyslaw Pardyak, Emin Gun Sirer, Marc E. Fiuczynski,
Case Tools Trisha Cummings. Our Definition of CASE  CASE is the use of computer-based support in the software development process.  A CASE tool is a.
 Understanding the Sources of Inefficiency in General-Purpose Chips.
Zheming CSCE715.  A wireless sensor network (WSN) ◦ Spatially distributed sensors to monitor physical or environmental conditions, and to cooperatively.
Extensible Processors. 2 ASIP Gain performance by:  Specialized hardware for the whole application (ASIC). −  Almost no flexibility. −High cost.  Use.
Architecture Description Languages for Programmable Embedded Systems P. Mishra and N. Dutt IEE Proc.-Comput. Digit. Tech., Vol. 152, No. 3, May 2005 Speaker:
Superscalar Processors (Pictured above is the DEC Alpha 21064) Presented by Jeffery Aguiar.
Source Code Optimization and Profiling of Energy Consumption in Embedded System Simunic, T.; Benini, L.; De Micheli, G.; Hans, M.; Proceedings on The 13th.
Active Messages: a Mechanism for Integrated Communication and Computation von Eicken et. al. Brian Kazian CS258 Spring 2008.
6/25/2015Platform Design H.Corporaal and B. Mesman1 Platform Design TU/e 5kk70 Henk Corporaal Bart Mesman ASIP Application Specific Instruction-set Processor.
February 21, 2008 Center for Hybrid and Embedded Software Systems Mapping A Timed Functional Specification to a Precision.
The Effect of Data-Reuse Transformations on Multimedia Applications for Different Processing Platforms N. Vassiliadis, A. Chormoviti, N. Kavvadias, S.
Enhancing Embedded Processors with Specific Instruction Set Extensions for Network Applications A. Chormoviti, N. Vassiliadis, G. Theodoridis, S. Nikolaidis.
Reconfigurable Computing in the Undergraduate Curriculum Jason D. Bakos Dept. of Computer Science and Engineering University of South Carolina.
Trend towards Embedded Multiprocessors Popular Examples –Network processors (Intel, Motorola, etc.) –Graphics (NVIDIA) –Gaming (IBM, Sony, and Toshiba)
Platform Design TU/e 5kk70 Henk Corporaal Bart Mesman ASIP Application Specific Instruction-set Processor.
Educational Computer Architecture Experimentation Tool Dr. Abdelhafid Bouhraoua.
Dynamic Hardware Software Partitioning A First Approach Komal Kasat Nalini Kumar Gaurav Chitroda.
Effective Methods for Software and Systems Integration
L29:Lower Power Embedded Architecture Design 성균관대학교 조 준 동 교수,
Yongjoo Kim*, Jongeun Lee**, Jinyong Lee*, Toan Mai**, Ingoo Heo* and Yunheung Paek* *Seoul National University **UNIST (Ulsan National Institute of Science.
1 Miodrag Bolic ARCHITECTURES FOR EFFICIENT IMPLEMENTATION OF PARTICLE FILTERS Department of Electrical and Computer Engineering Stony Brook University.
Introspective 3D Chips S. Mysore, B. Agrawal, N. Srivastava, S. Lin, K. Banerjee, T. Sherwood (UCSB), ASPLOS 2006 Shimin Chen (LBA Reading Group Presentation)
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 1 Slide 1 An Introduction to Software Engineering.
Software Pipelining for Stream Programs on Resource Constrained Multi-core Architectures IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEM 2012 Authors:
Uncovering the Multicore Processor Bottlenecks Server Design Summit Shay Gal-On Director of Technology, EEMBC.
Automated Design of Custom Architecture Tulika Mitra
Section 10: Advanced Topics 1 M. Balakrishnan Dept. of Comp. Sci. & Engg. I.I.T. Delhi.
POLITECNICO DI MILANO Reconfiguration 4 Reliability design methodology for reliability assessment and enhancement of FPGA-based systems Dynamic Reconfigurability.
ASIP Architecture for Future Wireless Systems: Flexibility and Customization Joseph Cavallaro and Predrag Radosavljevic Rice University Center for Multimedia.
High Performance Embedded Computing © 2007 Elsevier Lecture 3: Design Methodologies Embedded Computing Systems Mikko Lipasti, adapted from M. Schulte Based.
System Design with CoWare N2C - Overview. 2 Agenda q Overview –CoWare background and focus –Understanding current design flows –CoWare technology overview.
High Performance Embedded Computing © 2007 Elsevier Chapter 1, part 2: Embedded Computing High Performance Embedded Computing Wayne Wolf.
SPREE RTL Generator RTL Simulator RTL CAD Flow 3. Area 4. Frequency 5. Power Correctness1. 2. Cycle count SPREE Benchmarks Verilog Results 3. Architecture.
1 Exploring Custom Instruction Synthesis for Application-Specific Instruction Set Processors with Multiple Design Objectives Lin, Hai Fei, Yunsi ACM/IEEE.
1 Towards Optimal Custom Instruction Processors Wayne Luk Kubilay Atasu, Rob Dimond and Oskar Mencer Department of Computing Imperial College London HOT.
TEMPLATE DESIGN © Hardware Design, Synthesis, and Verification of a Multicore Communication API Ben Meakin, Ganesh Gopalakrishnan.
ESL and High-level Design: Who Cares? Anmol Mathur CTO and co-founder, Calypto Design Systems.
ACMSE’04, ALDepartment of Electrical and Computer Engineering - UAH Execution Characteristics of SPEC CPU2000 Benchmarks: Intel C++ vs. Microsoft VC++
Introduction to Algorithms (2 nd edition) by Cormen, Leiserson, Rivest & Stein Chapter 1: The Role of Algorithms in Computing (slides by N. Adlai A. DePano)
CS 3610: Software Engineering – Fall 2009 Dr. Hisham Haddad – CSIS Dept. Chapter 6 System Engineering Overview of System Engineering.
A Graph Based Algorithm for Data Path Optimization in Custom Processors J. Trajkovic, M. Reshadi, B. Gorjiara, D. Gajski Center for Embedded Computer Systems.
Introduction to FPGA Created & Presented By Ali Masoudi For Advanced Digital Communication Lab (ADC-Lab) At Isfahan University Of technology (IUT) Department.
Compiling Several Classes of Communication Patterns on a Multithreaded Architecture Gagan Agrawal Department of Computer and Information Sciences Ohio.
Title of Selected Paper: IMPRES: Integrated Monitoring for Processor Reliability and Security Authors: Roshan G. Ragel and Sri Parameswaran Presented by:
This material exempt per Department of Commerce license exception TSU Xilinx Tool Flow.
Compilers for Embedded Systems Ram, Vasanth, and VJ Instructor : Dr. Edwin Sha Synthesis and Optimization of High-Performance Systems.
A Hybrid Design Space Exploration Approach for a Coarse-Grained Reconfigurable Accelerator Farhad Mehdipour, Hamid Noori, Hiroaki Honda, Koji Inoue, Kazuaki.
Compiler and Runtime Support for Enabling Generalized Reduction Computations on Heterogeneous Parallel Configurations Vignesh Ravi, Wenjing Ma, David Chiu.
ECEG-3202 Computer Architecture and Organization Chapter 7 Reduced Instruction Set Computers.
1 November 11, 2015 A Massively Parallel, Hybrid Dataflow/von Neumann Architecture Yoav Etsion November 11, 2015.
NISC set computer no-instruction
Sunpyo Hong, Hyesoon Kim
Embedded Computer Architecture
Hyunchul Park†, Kevin Fan†, Scott Mahlke†,
The Effect of Data-Reuse Transformations on Multimedia Applications for Application Specific Processors N. Vassiliadis, A. Chormoviti, N. Kavvadias, S.
POLITECNICO DI MILANO A SystemC-based methodology for the simulation of dynamically reconfigurable embedded systems Dynamic Reconfigurability in Embedded.
Array Multiplier Haibin Wang Qiong Wu. Outlines Background & Motivation Principles Implementation & Simulation Advantages & Disadvantages Conclusions.
Embedded Systems. What is Embedded Systems?  Embedded reflects the facts that they are an integral.
Programmable Hardware: Hardware or Software?
Dynamo: A Runtime Codesign Environment
Ph.D. in Computer Science
Evaluating Register File Size
Chapter 1: Introduction
Overview of System Engineering
Anne Pratoomtong ECE734, Spring2002
Dynamically Reconfigurable Architectures: An Overview
CS 8532: Advanced Software Engineering
HIGH LEVEL SYNTHESIS.
Presentation transcript:

Dual-Pipeline Heterogeneous ASIP Design Swarnalatha Radhakrishnan, Hui Guo, Sri Parameswaran School of Computer Science & Engineering University of New South Wales, Sydney, Australia Presenters: Firew & Ying Dec 21/2007

Dual-pipeline Heterogeneous ASIP Design Slide 2 of 18 Outline Background Motivation Objective and benefit Design scheme –Steps and tools used –The architecture template –Phase I: Target instruction Set Generation –Phase II: Dual Pipeline Instruction Set Creation –Phase III: Dual-Pipeline ASIP construction and code generation Simulation Result Conclusion

Dual-pipeline Heterogeneous ASIP Design Slide 3 of 18 Background Embedded systems are now everywhere and continuously growing in capability and complexity. Can be implemented using either GPP, ASIP or ASIC. GPP: –programmable, –availability of tools, –but consume more power –Huge team, manual design ASIC: –low power device, –having a small foot print –but inflexible –complex to design ASIP –compromise solution –Still programmable in the domain –Quick to design –Consume less power than GPP

Dual-pipeline Heterogeneous ASIP Design Slide 4 of 18 Motivation Why enhancements on ASIP? 98% of the market of all microprocessors May be constrained by –real-time performance –power utilization, –size and weight –silicon-area –safety –cooling, … Customized design that suits a particular need – increase in performance – reduce power – avoid unnecessary functional units Most researches focus on single-pipeline ASIP One way of enhancement: –enabling multiple instruction issue through a multi-pipeline ASIP design

Dual-pipeline Heterogeneous ASIP Design Slide 5 of 18 Objective and benefit Goal : –A two pipeline heterogeneous* ASIP design Benefits: –Exploit parallelism by dual pipelines –improve performance with minimal area penalty –greater design space for the designer –allows code generated for a single pipeline processor to be utilized without major modification –extension to multiple pipeline ASIP processor * Heterogeneity of the processor arises by the differing instruction sets issued to the two pipes

Dual-pipeline Heterogeneous ASIP Design Slide 6 of 18 Design Scheme(1/7) ASIP design Process Application/ s processor model OK? more appl.? yes no yes Estimations cycles/alg occupation HW design SW (code generation) Estimations nsec/cycle, area, power/instr go to phase 2 3 phases 1. exploration 2. hw design 3. design appl. sw

Dual-pipeline Heterogeneous ASIP Design Slide 7 of 18 Design Scheme(2/7) Basic Steps and Tools Used Assembling: GCC compiler Analyzing and re-ordering Merging Special Instructions generation Target Instruction Set generation Dual-pipeline instruction set generation Two distinct ASIP processors design: ASIPmeiste r Integrating Code generation Simulating: ModelSim and Synplify Phase I Phase II Phase III

Dual-pipeline Heterogeneous ASIP Design Slide 8 of 18 Design Scheme(3/7) The Architecture Template

Dual-pipeline Heterogeneous ASIP Design Slide 9 of 18 Design Scheme(4/7) Phase I: Target instruction Set Generation

Dual-pipeline Heterogeneous ASIP Design Slide 10 of 18 Design Scheme(5/7) –Phase II: Dual Pipeline Instruction Set Creation

Dual-pipeline Heterogeneous ASIP Design Slide 11 of 18 Design Scheme(6/7) Phase II

Dual-pipeline Heterogeneous ASIP Design Slide 12 of 18 Design Scheme(7/7) Phase III: Dual-Pipeline ASIP construction and code generation

Dual-pipeline Heterogeneous ASIP Design Slide 13 of 18 Simulation

Dual-pipeline Heterogeneous ASIP Design Slide 14 of 18 Result(1/2) PNF: Prime Number Finder IS: InSort GCD: Greatest Common Divisor BS: Bubblesort MM: Matrix Multiplication SS: ShellSort

Dual-pipeline Heterogeneous ASIP Design Slide 15 of 18 Result(2/2)

Dual-pipeline Heterogeneous ASIP Design Slide 16 of 18 Conclusion(1/2) In dual-pipeline ASIP design in comparison to single, a performance improvement of 27.6% and a reduction of switching activity by 6.1% for a number of benchmarks is achieved with an area penalty of about 16%. Allocating load/store operations to one of the pipes and ALU operations to the other pipe is preferred. Simple hardware modules reduces switching activity and hence power

Dual-pipeline Heterogeneous ASIP Design Slide 17 of 18 Conclusion(2/2) Availability of synthesis tools and well- flowing design methodology  predictable time-to-market Small design team But design effort seems proportionally growing with complexity of application

Dual-pipeline Heterogeneous ASIP Design Slide 18 of 18 Thank you! Questions?