Marcus T. Schmitz and Bashir M. Al-Hashimi

Slides:



Advertisements
Similar presentations
TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
Advertisements

You have been given a mission and a code. Use the code to complete the mission and you will save the world from obliteration…
Adders Used to perform addition, subtraction, multiplication, and division (sometimes) Half-adder adds rightmost (least significant) bit Full-adder.
Unit-iv.
© Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems Introduction.
Advanced Piloting Cruise Plot.
1 Vorlesung Informatik 2 Algorithmen und Datenstrukturen (Parallel Algorithms) Robin Pomplun.
Pricing for Utility-driven Resource Management and Allocation in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS)
Generative Design in Civil Engineering Using Cellular Automata Rafal Kicinger June 16, 2006.
Chapter 1 The Study of Body Function Image PowerPoint
1 Copyright © 2013 Elsevier Inc. All rights reserved. Chapter 1 Embedded Computing.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Chapter 3 CPUs.
Effective Change Detection Using Sampling Junghoo John Cho Alexandros Ntoulas UCLA.
UNITED NATIONS Shipment Details Report – January 2006.
and 6.855J Spanning Tree Algorithms. 2 The Greedy Algorithm in Action
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
Exit a Customer Chapter 8. Exit a Customer 8-2 Objectives Perform exit summary process consisting of the following steps: Review service records Close.
My Alphabet Book abcdefghijklm nopqrstuvwxyz.
Multiplying binomials You will have 20 seconds to answer each of the following multiplication problems. If you get hung up, go to the next problem when.
2 pt 3 pt 4 pt 5 pt 1 pt 2 pt 3 pt 4 pt 5 pt 1 pt 2 pt 3 pt 4 pt 5 pt 1 pt 2 pt 3 pt 4 pt 5 pt 1 pt 2 pt 3 pt 4 pt 5 pt 1 pt ShapesPatterns Counting Number.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Addition Facts
Year 6 mental test 5 second questions
Year 6 mental test 10 second questions
Around the World AdditionSubtraction MultiplicationDivision AdditionSubtraction MultiplicationDivision.
Predicting Performance Impact of DVFS for Realistic Memory Systems Rustam Miftakhutdinov Eiman Ebrahimi Yale N. Patt.
Evaluating Window Joins over Unbounded Streams Author: Jaewoo Kang, Jeffrey F. Naughton, Stratis D. Viglas University of Wisconsin-Madison CS Dept. Presenter:
Pinwheel Scheduling for Power-Aware Real-Time Systems Gaurav Chitroda Komal Kasat Nalini Kumar.
ABC Technology Project
Hash Tables.
Gate Sizing for Cell Library Based Designs Shiyan Hu*, Mahesh Ketkar**, Jiang Hu* *Dept of ECE, Texas A&M University **Intel Corporation.
Chapter 10: Virtual Memory
1 Undirected Breadth First Search F A BCG DE H 2 F A BCG DE H Queue: A get Undiscovered Fringe Finished Active 0 distance from A visit(A)
VOORBLAD.
Making Time-stepped Applications Tick in the Cloud Tao Zou, Guozhang Wang, Marcos Vaz Salles*, David Bindel, Alan Demers, Johannes Gehrke, Walker White.
1 Breadth First Search s s Undiscovered Discovered Finished Queue: s Top of queue 2 1 Shortest path from s.
“Start-to-End” Simulations Imaging of Single Molecules at the European XFEL Igor Zagorodnov S2E Meeting DESY 10. February 2014.
BIOLOGY AUGUST 2013 OPENING ASSIGNMENTS. AUGUST 7, 2013  Question goes here!
Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)
1..
© 2012 National Heart Foundation of Australia. Slide 2.
Adding Up In Chunks.
Lets play bingo!!. Calculate: MEAN Calculate: MEDIAN
Formal models of design 1/28 Radford, A D and Gero J S (1988). Design by Optimization in Architecture, Building, and Construction, Van Nostrand Reinhold,
Universität Kaiserslautern Institut für Technologie und Arbeit / Institute of Technology and Work 1 Q16) Willingness to participate in a follow-up case.
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
Chapter 5 Test Review Sections 5-1 through 5-4.
GG Consulting, LLC I-SUITE. Source: TEA SHARS Frequently asked questions 2.
Addition 1’s to 20.
Model and Relationships 6 M 1 M M M M M M M M M M M M M M M M
25 seconds left…...
H to shape fully developed personality to shape fully developed personality for successful application in life for successful.
Januar MDMDFSSMDMDFSSS
Week 1.
We will resume in: 25 Minutes.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
A SMALL TRUTH TO MAKE LIFE 100%
PSSA Preparation.
University of Minnesota Optimizing MapReduce Provisioning in the Cloud Michael Cardosa, Aameek Singh†, Himabindu Pucha†, Abhishek Chandra
Mani Srivastava UCLA - EE Department Room: 6731-H Boelter Hall Tel: WWW: Copyright 2003.
By Rasmussen College. 1. What majors or programs do you offer? 2. What is the average length of your programs? 3. What percentage of your students graduate?
Hard Real-Time Scheduling for Low- Energy Using Stochastic Data and DVS Processors Flavius Gruian Department of Computer Science, Lund University Box 118.
Presentation transcript:

Energy-Efficient Mapping and Scheduling for DVS Enabled Distributed Embedded Systems Marcus T. Schmitz and Bashir M. Al-Hashimi University of Southampton, United Kingdom Petru Eles Linköping University, Sweden

Contents Motivation & Introduction Dynamic Voltage Scaling Co-Synthesis with DVS Consideration DVS optimised Scheduling DVS optimised Mapping Experimental Results Conclusions

Motivation Low Energy: Portable Applications Autonomous Systems Feasibilty Issues (SoC - heat) Operational Cost and Environmental Reasons System Level Co-Design: Shrinking Time-To-Market Windows Reducing Production Cost High Degree of Optimisation Freedom

Introduction Dynamic Voltage Scaling System Level Co-Synthesis Energy-Efficient Co-Synthesis for DVS Sytems

Dynamic Voltage Scaling (DVS) Energy vs. Speed 1.2 DVS Processor 1 Frequency 0.8 VR f Reg. Energy 0.6 Voltage/Frequency 0.4 0.2 1 1.5 2 2.5 3 3.5 4 4.5 5 1/Speed Available from: Transmeta, AMD, Intel

Co-Synthesis for DVS Systems System Specification, Technology Lib. Allocation Mapping Designer driven Scheduling EE-GMA EE-GLSA Voltage Scaling Evaluation

DVS in Distributed Systems [23] Input: Scheduling (mapping) Power profile Output: scaled voltage for each DVS task Emax Esc < Emax P P Slack PE0 PE0 CL0 CL0 2.3V 3.3V 2.4V PE1 PE1 d d t t Voltage Scaling @ Vmax @ dyn. V

Energy-Efficient Scheduling Two objectives: Timing feasibility Garantee deadlines Low energy dissipation Optimisation DVS usability – Slack time Traditional scheduling technique focus mainly on timing feasibility! Problem due to power variations: Simply increase deadline slack leads to sub-optimal solutions!

Energy-Efficient Scheduling P E=71J E=65.6J P PE0 t t 4 5 t t 4 5 Slack  Savings  DVS PE1 t 1 t t 2 t t 1 t 2 PE2 Slack t 3 t 6 t 3 t 6 t t S2: E=71J E=53.9J P P Slack Slack  Savings  PE0 t 4 t 5 t 4 t 5 DVS PE1 t t t 2 1 t t 1 t 2 PE2 t t 3 t 3 6 t 6 t t

Energy-Efficient Scheduling Based on Genetic List Scheduling Algorithm [6,10] Task priorities are encoded into priorities strings t0 Schedule List Scheduler t1 t2 t3 t4 Duties of the Scheduler: Select ready task with highest priority Schedule selected task Update schedule and ready list Repeat until no un-scheduled task is left PS 4 3 9 7 2 t0 t1 t2 t3 t4

EE-GLSA Initial Population List Scheduler DVS Insertion Assign fitness 3 7 8 1 2 3 2 1 No Hole Filling! No Mapping! Initial Population List Scheduler DVS Insertion Timing, Energy Assign fitness Mutation Rank individuals Optimised Population low high Mating Selection GA

Advantages Optimisation can be based on an arbitrary complex fitness function, including: Timing Energy (DVS technique) Enlarged search space (|T+C|! different schedules) Trade-off freedom: Synthesis time <-> quality Easily adaptable to computing clusters Multiple populations with immigration scheme

Hole Filling Problem Hole filling t0 PE0 t4 t2 t3 t3 t1 t4 t2 PE1 t0 7 t2 t3 t3 1 d3 t1 d2 d3,4 4 t4 6 d4 t2 4 PE1 d2 t0 t1 Therefore, priorities decide solely upon execution order!

Task Mapping Why seperation from the list scheduling? Regardless of priorties, greedy mapping P t0 7 PE0 LS t1 4 d1 PE1 t t2 d1,2 5 d2

Task Mapping Make greedy mapping decision based on: Timing Energy t0 ? 7 PE0 ? LS t1 4 d1 PE1 ? t t2 d1,2 5 d2

Task Mapping Make mapping decision based on: Timing Energy t0 t0 LS t1 7 PE0 t0 LS t1 4 d1 PE1 t t2 d1,2 5 d2

Task Mapping Make mapping decision based on: Timing Energy t0 t0 ? LS 7 PE0 t0 ? LS t1 4 d1 PE1 ? t t2 d1,2 5 d2

Task Mapping Make mapping decision based on: Timing Energy t0 t0 LS t1 7 PE0 t0 LS t1 4 d1 PE1 t2 t t2 d1,2 5 d2

Task Mapping Make mapping decision based on: Timing Energy t0 t0 LS t1 7 PE0 t0 LS t1 4 d1 PE1 t2 t t2 d1,2 5 d2

Task Mapping Make mapping decision based on: Timing Energy t0 t0 LS t1 7 PE0 t0 LS t1 4 d1 PE1 t2 t1 t t2 d1,2 5 d2

Task Mapping Make mapping decision based on: Timing Energy t0 t0 t2 LS 7 PE0 t0 t2 LS t1 4 d1 PE1 t1 t t2 d1,2 5 d2

Genetic Mapping Algorithm [8] Task mapping are encoded into mapping strings task PE 0 1 1 2 2 3 4 5 6 CPU DVS-CPU ASIC 0 1 2 d 5 3 6 4 0 1 2 Chromosome

EE-GMA Including DVS Initial Population EE-GLSA Insertion Timing, Energy + Area Insertion Assign fitness Mutation Rank individuals Optimised Population low high Mating Selection GA

Experimental Results 4 Benchmark Sets: 27 generated by TGFF [7] 8 to 100 tasks: Power variations 2.6 2 Hou examples taken from [13] 8 to 20 tasks: Power variations 11 TG1 and TG2 taken from [11] 60 examples with 30 tasks, each: No power variations Measurement application taken from [3] 12 tasks: No power profile is provided Power and time overhead for DVS is neglected Average results of 5 optimisation runs

Schedule Optimisation

Schedule Optimisation

Mapping Optimisation

Conclusions DVS capability can achieve high energy savings in distributed embedded systems Proposed a new energy-efficient two-step mapping and scheduling approach Iterative improvement provides high savings / ad hoc constructive techniques are not suitable Optimisation times are reasonable Additional objectives can be easily included Consideration of power profile information leads to further energy reductions