1 of 14 1/34 Embedded Systems Design: Optimization Challenges Paul Pop Embedded Systems Lab (ESLAB) Linköping University, Sweden.

Slides:



Advertisements
Similar presentations
Energy-efficient Task Scheduling in Heterogeneous Environment 2013/10/25.
Advertisements

© 2004 Wayne Wolf Topics Task-level partitioning. Hardware/software partitioning.  Bus-based systems.
ECE-777 System Level Design and Automation Hardware/Software Co-design
1 of 14 1 /23 Flexibility Driven Scheduling and Mapping for Distributed Real-Time Systems Paul Pop, Petru Eles, Zebo Peng Department of Computer and Information.
Martha Garcia.  Goals of Static Process Scheduling  Types of Static Process Scheduling  Future Research  References.
Unrestricted © Siemens AG All rights reserved. The Digital Factory – Enabling a Holistic Approach to Automation MIT Industrial IoT Workshop – October.
Embedded Software Optimization for MP3 Decoder Implemented on RISC Core Yingbiao Yao, Qingdong Yao, Peng Liu, Zhibin Xiao Zhejiang University Information.
1 of 16 April 25, 2006 Minimizing System Modification in an Incremental Design Approach Paul Pop, Petru Eles, Traian Pop, Zebo Peng Department of Computer.
1 of 14 1/14 Design Optimization of Time- and Cost-Constrained Fault-Tolerant Distributed Embedded Systems Viaceslav Izosimov, Paul Pop, Petru Eles, Zebo.
Communication Scheduling for Time-Triggered SystemsSlide 1 Communication Scheduling for Time-Triggered Systems Paul Pop, Petru Eles and Zebo Peng Dept.
System design-related Optimization problems Michela Milano Joint work DEIS Università di Bologna Dip. Ingegneria Università di Ferrara STI Università di.
Aleksandra Tešanović Low Power/Energy Scheduling for Real-Time Systems Aleksandra Tešanović Real-Time Systems Laboratory Department of Computer and Information.
High-level System Modeling and Power Management Techniques Jinfeng Liu Dept. of ECE, UC Irvine Sep
Predictable Implementation of Real-Time Applications on Multiprocessor Systems-on-Chip Alexandru Andrei Embedded Systems Laboratory Linköping University,
CS244-Introduction to Embedded Systems and Ubiquitous Computing Instructor: Eli Bozorgzadeh Computer Science Department UC Irvine Winter 2010.
Bogdan Tanasa, Unmesh D. Bordoloi, Petru Eles, Zebo Peng Department of Computer and Information Science, Linkoping University, Sweden December 3, 2010.
1 of 30 June 14, 2000 Scheduling and Communication Synthesis for Distributed Real-Time Systems Paul Pop Department of Computer and Information Science.
1 of 10 April 25, 2001 Minimizing System Modification in an Incremental Design Approach Paul Pop, Petru Eles, Traian Pop, Zebo Peng Department of Computer.
Process Scheduling for Performance Estimation and Synthesis of Hardware/Software Systems Slide 1 Process Scheduling for Performance Estimation and Synthesis.
Performance and Energy Bounds for Multimedia Applications on Dual-processor Power-aware SoC Platforms Weng-Fai WONG 黄荣辉 Dept. of Computer Science National.
1 of 14 1/15 Schedulability Analysis and Optimization for the Synthesis of Multi-Cluster Distributed Embedded Systems Paul Pop, Petru Eles, Zebo Peng Embedded.
Reliability-Aware Frame Packing for the Static Segment of FlexRay Bogdan Tanasa, Unmesh Bordoloi, Petru Eles, Zebo Peng Linkoping University, Sweden 1.
1 of 14 1 Fault-Tolerant Embedded Systems: Scheduling and Optimization Viacheslav Izosimov, Petru Eles, Zebo Peng Embedded Systems Lab (ESLAB) Linköping.
Scheduling with Optimized Communication for Time-Triggered Embedded Systems Slide 1 Scheduling with Optimized Communication for Time-Triggered Embedded.
1 of 16 March 30, 2000 Bus Access Optimization for Distributed Embedded Systems Based on Schedulability Analysis Paul Pop, Petru Eles, Zebo Peng Department.
1 Oct 2, 2003 Design Optimization of Mixed Time/Event-Triggered Distributed Embedded Systems Traian Pop, Petru Eles, Zebo Peng Embedded Systems Laboratory.
1 of 14 1 Analysis and Synthesis of Communication-Intensive Heterogeneous Real-Time Systems Paul Pop Computer and Information Science Dept. Linköpings.
1 of 14 1/15 Design Optimization of Multi-Cluster Embedded Systems for Real-Time Applications Paul Pop, Petru Eles, Zebo Peng, Viaceslav Izosimov Embedded.
Courseware Basics of Real-Time Scheduling Jan Madsen Informatics and Mathematical Modelling Technical University of Denmark Richard Petersens Plads, Building.
1 of 14 1 Scheduling and Optimization of Fault- Tolerant Embedded Systems Viacheslav Izosimov Embedded Systems Lab (ESLAB) Linköping University, Sweden.
1 of 16 June 21, 2000 Schedulability Analysis for Systems with Data and Control Dependencies Paul Pop, Petru Eles, Zebo Peng Department of Computer and.
HW/SW Co-Synthesis of Dynamically Reconfigurable Embedded Systems HW/SW Partitioning and Scheduling Algorithms.
1 of 14 1 / 18 An Approach to Incremental Design of Distributed Embedded Systems Paul Pop, Petru Eles, Traian Pop, Zebo Peng Department of Computer and.
DATE Optimizations of an Application- Level Protocol for Enhanced Dependability in FlexRay Wenchao Li 1, Marco Di Natale 2, Wei Zheng 1, Paolo Giusto.
Exploring the Tradeoffs of Configurability and Heterogeneity in Multicore Embedded Systems + Also Affiliated with NSF Center for High- Performance Reconfigurable.
Tradeoff Analysis for Dependable Real-Time Embedded Systems during the Early Design Phases Junhe Gan.
Task Alloc. In Dist. Embed. Systems Murat Semerci A.Yasin Çitkaya CMPE 511 COMPUTER ARCHITECTURE.
Low Contention Mapping of RT Tasks onto a TilePro 64 Core Processor 1 Background Introduction = why 2 Goal 3 What 4 How 5 Experimental Result 6 Advantage.
COMP3221/9221: Microprocessors and Embedded Systems COMP3221: Microprocessors and Embedded Systems Lecture 31: Embedded Systems
Low-Power Wireless Sensor Networks
An efficient active replication scheme that tolerate failures in distributed embedded real-time systems Alain Girault, Hamoudi Kalla and Yves Sorel Pop.
Rensselaer Polytechnic Institute Rajagopal Iyengar Combinatorial Approaches to QoS Scheduling in Multichannel Wireless Systems Rajagopal Iyengar Rensselaer.
© Oxford University Press 2011 DISTRIBUTED COMPUTING Sunita Mahajan Sunita Mahajan, Principal, Institute of Computer Science, MET League of Colleges, Mumbai.
1 of 14 1/15 Synthesis-driven Derivation of Process Graphs from Functional Blocks for Time-Triggered Embedded Systems Master thesis Student: Ghennadii.
Dependable communication synthesis for distributed embedded systems Nagarajan Kandasamy, John P. Hayes, Brian T. Murray Presented by John David Eriksen.
1 Exploring Custom Instruction Synthesis for Application-Specific Instruction Set Processors with Multiple Design Objectives Lin, Hai Fei, Yunsi ACM/IEEE.
Energy/Reliability Trade-offs in Fault-Tolerant Event-Triggered Distributed Embedded Systems Junhe Gan, Flavius Gruian, Paul Pop, Jan Madsen.
Hardware/Software Co-design Design of Hardware/Software Systems A Class Presentation for VLSI Course by : Akbar Sharifi Based on the work presented in.
CS244-Introduction to Embedded Systems and Ubiquitous Computing Instructor: Eli Bozorgzadeh Computer Science Department UC Irvine Winter 2010.
Hard Real-Time Scheduling for Low- Energy Using Stochastic Data and DVS Processors Flavius Gruian Department of Computer Science, Lund University Box 118.
6. Application mapping 6.1 Problem definition
A Utility-based Approach to Scheduling Multimedia Streams in P2P Systems Fang Chen Computer Science Dept. University of California, Riverside
Wajid Minhass, Paul Pop, Jan Madsen Technical University of Denmark
1 of 14 1/15 Schedulability-Driven Frame Packing for Multi-Cluster Distributed Embedded Systems Paul Pop, Petru Eles, Zebo Peng Embedded Systems Lab (ESLAB)
Multi-Objective Optimization for Topology Control in Hybrid FSO/RF Networks Jaime Llorca December 8, 2004.
Task Mapping and Partition Allocation for Mixed-Criticality Real-Time Systems Domițian Tămaș-Selicean and Paul Pop Technical University of Denmark.
1 of 14 Lab 2: Formal verification with UPPAAL. 2 of 14 2 The gossiping persons There are n persons. All have one secret to tell, which is not known to.
Determining Optimal Processor Speeds for Periodic Real-Time Tasks with Different Power Characteristics H. Aydın, R. Melhem, D. Mossé, P.M. Alvarez University.
1 of 14 Lab 2: Design-Space Exploration with MPARM.
1 Chapter 5 Branch-and-bound Framework and Its Applications.
Pradeep Konduri Static Process Scheduling:  Proceedance process model  Communication system model  Application  Dicussion.
Optimization of Time-Partitions for Mixed-Criticality Real-Time Distributed Embedded Systems Domițian Tămaș-Selicean and Paul Pop Technical University.
Reliable energy management System reliability is affected by use of energy management The use of DVS increases the probability of faults, thus damaging.
Prabhat Kumar Saraswat Paul Pop Jan Madsen
Paul Pop, Petru Eles, Zebo Peng
Babak Sorkhpour, Prof. Roman Obermaisser, Ayman Murshed
Period Optimization for Hard Real-time Distributed Automotive Systems
Department of Electrical Engineering Joint work with Jiong Luo
Anand Bhat*, Soheil Samii†, Raj Rajkumar* *Carnegie Mellon University
Presentation transcript:

1 of 14 1/34 Embedded Systems Design: Optimization Challenges Paul Pop Embedded Systems Lab (ESLAB) Linköping University, Sweden

2 of 14 2/34 Outline  Embedded systems  Example area: automotive electronics  Embedded systems design  Optimization problems  Fault-tolerant mapping and scheduling  Voltage scaling  Communication delay analysis  Assessment and message

3 of 14 3/34 Embedded Systems General purpose systemsEmbedded systems Microprocessor market shares

4 of 14 4/34 Example Area: Automotive Electronics  What is “automotive electronics”?  Vehicle functions implemented with electronics  Body electronics  System electronics: chassis, engine  Information/entertainment

5 of 14 5/34 Automotive Electronics Market Size 8.9 Market ($billions) Cost of Electronics / Car ($) 90% of future innovations in vehicles: based on electronic embedded systems 2006: 25% of the total cost of a car will be electronics

6 of 14 6/34 Automotive Electronics Platform Example Source: Expanding automotive electronic systems, IEEE Computer, Jan. 2002

7 of 14 7/34 Outline  Embedded systems  Example area: automotive electronics  Embedded systems design  Optimization problems  Fault-tolerant mapping and scheduling  Voltage scaling  Communication delay analysis  Assessment and message

8 of 14 8/34 Embedded Systems Design Model of system implementation Estimation: exec. time System platform System model System-level design tasks Analysis Software synthesis Hardware synthesis  Growing complexity  Constraints  Time, energy, size  Cost, time-to-market  Safety, reliability  Heterogeneous  Hardware components  Comm. protocols  Mapping and scheduling  Voltage scaling  Communication delay analysis

9 of 14 9/34 Embedded System Design, Cont.  Goal: automated design optimization techniques  Successfully manage the complexity of embedded systems  Meet the constraints imposed by the application domain  Shorten the time-to-market  Reduce development and manufacturing costs Optimization: the key to successful design

10 of 14 10/34 Outline  Embedded systems  Example area: automotive electronics  Embedded systems design  Optimization problems  Fault-tolerant mapping and scheduling  Voltage scaling  Communication delay analysis  Assessment and message

11 of 14 11/34 Optimization Problems 1.Mapping and scheduling 1.1 Mapping to minimize communication 1.2 Mapping and scheduling 1.3 Fault-tolerant mapping and scheduling 2.Voltage scaling 2.1 Continuous voltage scaling 2.2 Discrete voltage scaling 3.Communication delay analysis

12 of 14 12/34  Given  Application: set of interacting processes  Platform: set of nodes Problem #1.1: Mapping P1P1 P4P4 P2P2 P3P3 m1m1 m2m2 m3m3 m4m4  Assessment  Optimal solutions even for large problem sizes N1N1 N2N2 P1P1 P4P4 P2P2 P3P3 m1m1 m2m2 m3m3 m4m4  Determine  Mapping of processes to nodes  Such that the communication is minimized

13 of 14 13/34 Problem #1.2: Mapping and Scheduling  Given  Application: set of interacting processes  Platform: set of nodes  Timing constraints: deadlines  Determine  Mapping of processes and messages  Schedule tables for processes and messages  Such that the timing constraints are satisfied S2S2 S1S1 P1P1 P4P4 P2P2 m1m1 m2m2 m3m3 m4m4 P3P3 N1N1 N2N2 Bus Schedule table Deadline P1P1 P4P4 P2P2 P3P3 m1m1 m2m2 m3m3 m4m4 N1N1 N2N2

14 of 14 14/34 Problem #1.2:  Assessment  Scheduling is NP-complete even in simpler context  D. Ullman, “NP-Complete Scheduling Problems”, Journal of Computer Systems Science, volume 10, pages 384–393,  ILP formulation  Can’t obtain optimal solutions for large problem sizes  Alternative: divide the problem  Scheduling  Heuristic: List scheduling  Mapping  Simulated annealing  Tabu-search  Problem-specific greedy algorithms

15 of 14 15/34 Messages: Schedule tables Messages: Fault-tolerant protocol Processes: Schedule tables Fault-Tolerant Mapping and Scheduling... Transient faults TDMA bus: TTP Processes: Re-execution and replication

16 of 14 16/34 Fault-Tolerance Techniques P1P1 P1P1 P1P1 Re-execution N1N1 P1P1 P1P1 P1P1 Replication N1N1 N2N2 N3N3 P1P1 P1P1 N1N1 N2N2 P1P1 Re-executed replicas 2

17 of 14 17/34 Problem #1.3: Formulation Application: set of process graphsArchitecture: time-triggered system Fault-model: transient faults  Given  Application: set of interacting processes  Platform: set of nodes  Timing constraints: deadlines  Fault model: number of transient faults in the system period  Determine  Mapping of processes and messages  Schedule tables for processes and messages  Fault-tolerance policy assignment  Such that the timing constraints are satisfied

18 of 14 18/34 P1P1 N1N1 N2N2 TTP P2P2 P3P3 S1S1 S2S2 P4P4 m2m2 Missed Fault-Tolerance Policy Assignment P1P1 P2P2 P3P3 N1N1 N2N P4P N1N1 N2N2 P1P1 P4P4 P2P2 P3P3 m1m1 m2m2 m3m3 P1P1 N1N1 N2N2 TTP P2P2 P3P3 S1S1 S2S2 m2m2 P4P4 N1N1 N2N2 P1P1 P3P3 S1S1 S2S2 P4P4 P2P2 P1P1 m1m1 m1m1 m2m2 m2m2 P2P2 m3m3 m3m3 P3P3 P4P4 Missed P1P1 N1N1 N2N2 TTP P2P2 P3P3 S1S1 S2S2 m2m2 P4P4 No fault-tolerance: application crashes N1N1 N2N2 P1P1 P3P3 S1S1 S2S2 P4P4 P2P2 P1P1 m2m2 m1m1 TTP Met Optimization of fault-tolerance policy assignment Deadline

19 of 14 19/34 Tabu-Search: Policy Assignment & Mapping P1P1 P2P2 P3P3 N1N1 N2N P4P N1N1 N2N2 P1P1 P4P4 P2P2 P3P3 m1m1 m2m2 m3m3 Design transformations

20 of 14 20/34 Problem #2: Voltage Scaling  GSM Phone:  Search  Radio link control  Talking  MP3 Player  Digital Camera:  Take photo  Restore photo Timing constraints Battery power Chip power Power constraints

21 of 14 21/34 Problem #2: Voltage Scaling deadline t P 11 22 33 Energy/speed trade-offs: varying the voltages V bs CPU V dd f1f1 f2f2 f3f3 Different voltages: different frequencies Mapping and scheduling: given (fastest freq.) Power deadline t 11 22 33 Slack

22 of 14 22/34 00 22 33 55 44 11 CPU1 CPU2 CPU3 Bus V dd V bs V dd V bs V dd V bs Problem #2.1: Continuous Voltage Scaling  Given  Application: set of interacting processes  Platform: set of nodes, each having supply voltage (V dd ) and body bias voltage (V bs ) inputs  Mapping and schedule table (including timing constraints) P deadline t 11 22 33 Slack Architecture and mapping Schedule table / processor

23 of 14 23/34 Problem #2.1: Continuous Voltage Scaling  Determine  Voltage levels V dd and V bs for each process  Such that system energy is minimized and  Deadlines are satisfied deadline t P 11 22 33 P t 11 22 33 Slack  Output  Input Height: voltage level Area: energy  Assessment  Convex nonlinear problem  Polynomial time solvable with an arbitrary good precision  A. Andrei, “Overhead-Conscious Voltage Selection for Dynamic and Leakage Energy Reduction of Time-Constrained Systems”, technical report, Linköping University, 2004

24 of 14 24/34 Problem #2.1: Formulation  Minimize energy  E[  0 ] + E[  1 ] + E[  2 ] + E_OH[  1 -  2 ]  Such that  T start [  0 ] + T exe [  0 ]  T start [  1 ]  T start [  1 ] + T exe [  1 ] + T oh [  1 -  2 ]  T start [  2 ]  T start [  2 ] + T exe [  2 ]  DL[  2 ] Energy due to processes Overhead due to voltage changes Precedence relationships Deadlines

25 of 14 25/34 Problem #2.2: Discrete Voltage Scaling  Problem formulation  Given discrete execution frequencies Processors can operate using a frequency from a fixed discrete set Changing the frequency incurs a delay and an energy penalty  Determine the set of frequencies for each task  Such that system energy is minimized and  Deadlines are satisfied t P deadline 22 33 11 f2f2 f2f2 f2f2 f3f3 f3f3 f1f1 Discrete

26 of 14 26/34  Given  1 processor: f  {50, 100, 150} MHz  3 processes   1: P={10, 20, 30} mW, dl=1ms, NC=100 cycles   2: P={12, 22, 32} mW, dl=1.5ms, NC=100 cycles   3: P={15, 25, 35} mW, dl=2ms, NC=100 cycles  Schedule: execution order is  1,  2,  3  Determine  For each process, number of clock cycles to be executed at each frequency  such that the energy is minimized Problem #2.2: Example

27 of 14 27/34 Problem #2.2: Example, Cont.  Assessment:  Strongly NP-hard problem  The frequencies are now a set of integers; identical to:  P. De, “Complexity of the Discrete Time-Cost Tradeoff Problem for Project Networks”, Operations Research, 45(2):302–306, March Each task has to execute the given number of cycles Task execution time Precedence constraints Deadline constraints Minimize energy  MILP formulation for the optimal solution

28 of 14 28/34 Embedded Systems Design Mapped and scheduled model Estimation: exec. time System platform System model System-level design tasks Analysis Software synthesis Hardware synthesis  Communication protocols  Communication delay analysis

29 of 14 29/34 Problem #3: FlexRay Analysis  FlexRay communication protocol  Becoming de-facto standard in automotive electronics  BMW, DaimlerChrysler, General Motors, Volkswagen, Bosch, Motorola, Philips  Deterministic data transmission, fault-tolerant, high data-rate... Communication protocol: FlexRay SR Worst-case communication delay  Problem  Given  Application: set of interacting processes  Platform: set of nodes connected by FlexRay  Implementation: Mapping and scheduling  Determine  Worst-case communication delays for messages

30 of 14 30/34 Problem #3: FlexRay Analysis, Cont. StaticDynamicStaticDynamic Generalized Time-division multiple access Flexible Time-division multiple access Bus cycle m4m4 m5m5 m6m6 m7m7 Arrive dynamically Off-line: Worst- case analysis Off-line: Schedule table m2m2 m3m3 m1m1 Statically assigned

31 of 14 31/34 Problem #3: Formulation and Example  Given  FlexRay bus  Length of the static phase  Length of the dynamic phase  Dynamically arriving messages  Priorities  Determine for each message  Worst-case communication delay Static Dynamic Fixed size bin m4m4 m5m5 m6m6 m7m7 Priority Dynamic Analyze this! m4m4 m5m5 m6m6 m7m7 Static Bin covering: two bins

32 of 14 32/34 Problem #3:  Assessment  ”Classic” bin covering problem  Given  Set of bins of fixed integer size  Set of items of integer size  Determine  Maximum number of bins that can be filled with the items  Assessment  Asymptotic fully polynomial time approximation  FlexRay dynamic phase analysis ≠ ”classic” bin covering  Bins have an upper limit: size of the dynamic phase  Assessment  Approximation algorithm does not exist  MILP formulation feasible up to 60 messages Wanted: better solution

33 of 14 33/34 Outline  Embedded systems  Example area: automotive electronics  Embedded systems design  Optimization problems  Fault-tolerant mapping and scheduling  Voltage scaling  Communication delay analysis  Assessment and message

34 of 14 34/34 Message  Optimization  Key to successful embedded systems design  Challenges  Classify the problems  Divide the problem into sub-problems  Formulate the problems  Solve the problems optimally  Fast and accurate heuristics for specific problems