Advantages of Time-Triggered Ethernet

Slides:



Advertisements
Similar presentations
Operating Systems Components of OS
Advertisements

Time-Triggered Protocol
COMPUTER NETWORK TOPOLOGIES
System Area Network Abhiram Shandilya 12/06/01. Overview Introduction to System Area Networks SAN Design and Examples SAN Applications.
Copyright © 2009, TTTech Computertechnik AG. All rights reserved; may be published with permission by MAPLD 2009 Reliable Synchronization.
Advantages of Deterministic Ethernet for Space Applications
Bus Architectures for Satety- Critical Embedded Systems --by Harit Desai.
Carnegie Mellon R-BATCH: Task Partitioning for Fault-tolerant Multiprocessor Real-Time Systems Junsung Kim, Karthik Lakshmanan and Raj Rajkumar Electrical.
Master/Slave Architecture Pattern Source: Pattern-Oriented Software Architecture, Vol. 1, Buschmann, et al.
Lecture Objectives: 1)Explain the limitations of flash memory. 2)Define wear leveling. 3)Define the term IO Transaction 4)Define the terms synchronous.
Fault Detection in a HW/SW CoDesign Environment Prepared by A. Gaye Soykök.
Overview of PTIDES Project
2/23/2009CS50901 Implementing Fault-Tolerant Services Using the State Machine Approach: A Tutorial Fred B. Schneider Presenter: Aly Farahat.
Chapter 13 Embedded Systems
TTP ® - As Predictable as Time 1 Copyright © 2002, TTTech Computertechnik AG. All rights reserved. TTTech Powernode As a Linux Prototyping Platform for.
7. Fault Tolerance Through Dynamic or Standby Redundancy 7.5 Forward Recovery Systems Upon the detection of a failure, the system discards the current.
The Rare Glitch Project: Verifying Bus Protocols for Embedded Systems Edmund Clarke, Daniel Kroening Carnegie Mellon University.
Page 1 Copyright © Alexander Allister Shvartsman CSE 6510 (461) Fall 2010 Selected Notes on Fault-Tolerance (12) Alexander A. Shvartsman Computer.
Internetworking Fundamentals (Lecture #2) Andres Rengifo Copyright 2008.
Network Topologies.
EtherCAT Protocol Implementation Issues on an Embedded Linux Platform
HRTC Meeting 12 September 2002, Vienna Smart Sensors Thomas Losert.
1 Fault Tolerance in the Nonstop Cyclone System By Scott Chan Robert Jardine Presented by Phuc Nguyen.
Weekly Meeting Time-Triggered Ethernet: Concepts and Switch Design Andrew Mortellaro William Garcia.
Fault Tolerance via the State Machine Replication Approach Favian Contreras.
1 System Models. 2 Outline Introduction Architectural models Fundamental models Guideline.
IMPROUVEMENT OF COMPUTER NETWORKS SECURITY BY USING FAULT TOLERANT CLUSTERS Prof. S ERB AUREL Ph. D. Prof. PATRICIU VICTOR-VALERIU Ph. D. Military Technical.
Eric Keller, Evan Green Princeton University PRESTO /22/08 Virtualizing the Data Plane Through Source Code Merging.
ARMADA Middleware and Communication Services T. ABDELZAHER, M. BJORKLUND, S. DAWSON, W.-C. FENG, F. JAHANIAN, S. JOHNSON, P. MARRON, A. MEHRA, T. MITTON,
Embedded TechCon Synchronizing mechatronic systems in real-time using FPGAs and Industrial Ethernet Sari Germanos
DEVICES AND COMMUNICATION BUSES FOR DEVICES NETWORK
© Oxford University Press 2011 DISTRIBUTED COMPUTING Sunita Mahajan Sunita Mahajan, Principal, Institute of Computer Science, MET League of Colleges, Mumbai.
Henri Kujala Integration of programmable logic into a network front-end of a telecontrol system Supervisor: Professor Patric Östergård Instructor: Jouni.
Heterogeneous Multikernel OS Yauhen Klimiankou BSUIR
In-Vehicle Communication SAN Group RTS Regular Meeting Presentation December 2008.
CONTI'20041 Event Management in Distributed Control Systems Gheorghe Sebestyen Technical University of Cluj-Napoca Computers Department.
UNIT -1. DATA COMMUNICATIONS The term telecommunication means communication at a distance. The word data refers to information presented in whatever form.
TTP and FlexRay. Time Triggered Protocols Global time by fault tolerant clock synchronisation Exact time point of a certain message is known (determinism)
Secure Systems Research Group - FAU 1 Active Replication Pattern Ingrid Buckley Dept. of Computer Science and Engineering Florida Atlantic University Boca.
Dtsi/Sol CEA System Software Activities 125/02/2005VD R&D topics Designing tools and system software for:  The management of parallelism Mono-processor.
Time Triggered Networks: use in space 2015 CCSDS spring SOIS Plenary 23 March 2015 Glenn Rakow/NASA-GSFC.
I/O Computer Organization II 1 Interconnecting Components Need interconnections between – CPU, memory, I/O controllers Bus: shared communication channel.
Hwajung Lee. One of the selling points of a distributed system is that the system will continue to perform even if some components / processes fail.
Cisco Network Devices Chapter 6 powered by DJ 1. Chapter Objectives At the end of this Chapter you will be able to:  Identify and explain various Cisco.
Chapter 11 Fault Tolerance. Topics Introduction Process Resilience Reliable Group Communication Recovery.
Mixed Criticality Systems: Beyond Transient Faults Abhilash Thekkilakattil, Alan Burns, Radu Dobrin and Sasikumar Punnekkat.
Tolerating Communication and Processor Failures in Distributed Real-Time Systems Hamoudi Kalla, Alain Girault and Yves Sorel Grenoble, November 13, 2003.
Synthesis of Communication Schedules for TTEthernet-based Mixed-Criticality Systems Domițian Tămaș-Selicean 1, Paul Pop 1 and Wilfried Steiner 2 1 Technical.
Middleware for Fault Tolerant Applications Lihua Xu and Sheng Liu Jun, 05, 2003.
Timing Analysis of Rate Constrained Traffic for the TTEthernet Communication Protocol Domițian Tămaș-Selicean 1, Paul Pop 1 and Wilfried Steiner 2 1 Technical.
1 May-2014 Automotive Protocols & Standards. 2 CAN (Controller Area Network)  Overview Controller Area Network is a fast serial bus designed to provide.
An Introduction to CAN CAN Basics 2 Renesas Interactive
A Survey of Fault Tolerance in Distributed Systems By Szeying Tan Fall 2002 CS 633.
DS - IX - NFT - 0 HUMBOLDT-UNIVERSITÄT ZU BERLIN INSTITUT FÜR INFORMATIK DEPENDABLE SYSTEMS Vorlesung 9 NETWORK FAULT TOLERANCE Wintersemester 99/00 Leitung:
Univ. of TehranIntroduction to Computer Network1 An Introduction to Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
Classifying fault-tolerance Masking tolerance. Application runs as it is. The failure does not have a visible impact. All properties (both liveness & safety)
Ensuring Reliable Networks Copyright © TTTech Computertechnik AG. All rights reserved.Page 1 Christian Fidi Product Manager Advantages cFS.
Deterministic Communication with SpaceWire
Overview Parallel Processing Pipelining
Christian Fidi Product Manager 2017/05/12
RAID RAID Mukesh N Tekwani
Fault Tolerance Distributed Web-based Systems
Modeling and Simulation of TTEthernet
DETERMINISTIC ETHERNET FOR SCALABLE MODULAR AVIONICS
Chapter 2: Operating-System Structures
Mark McKelvin EE249 Embedded System Design December 03, 2002
Co-designed Virtual Machines for Reliable Computer Systems
RAID RAID Mukesh N Tekwani April 23, 2019
Chapter 2: Operating-System Structures
Time-Triggered Architecture
Presentation transcript:

Advantages of Time-Triggered Ethernet Christian Fidi Product Manager October 28th, 2015

Space Application Requirements

Space Application Requirements

Architecture Theory A System needs to ensure the: Correctness of the data Voting or ensure that the received value is right Temporal correctness (time of use and order)  Synchronization There are two architectures supporting fault-tolerants: Voting architecture (voting or byzantine voting) Fail-Silent architecture (COM/MON or dual-core lock-step)

Replica Determinism: Example Stage Separation Consider a rocket launch. The real-time system responsible for the stage separation system has three redundant channels: Channel 1 – Separation and Fire Boosters Channel 2 – No Separation and do not Fire Boosters Channel 3 – No Separation and Fire Boosters (Fault)  Majority – No Separation and Fire Boosters!  Temporal order within spare time needs to be guaranteed!

Voting Architecture–MIL1553 (TT) 3 redundant busses/lanes (1FT but not covering byzantine faults) Each Computer has one bus master node (bus controller) All Computers receive the messages from the other lanes where they are slave Precise synchronization has to be done between the lanes to be able to vote (state exchange) If one node fails than whole lane may be lost Voting is done in a two out of three manner [© 2010 Data Device Corporation. Distributed and Reconfigurable Architecture for Flight Control System]

Disadvantages Additional point to point communication needed to ensure low latency synchronization Multiple protocols are needed For synchronization, Deterministic data, High speed data Additional wiring needed Software needs to take care of: Precise synchronization Redundancy management Support different protocols Testing effort and hardware (since this is application specific)

Time-Triggered Communication 1. Globale Notion of Time Local clocks – free running Local view of global time 2. Message Schedule Copyright © TTTech Computertechnik AG. All rights reserved. Page 8

Synchronization Services Clock Synchronization Service Clock Synchronization Service is executed during normal operation mode to keep the local clocks synchronized to each other. Startup/Restart Service is executed to reach an initial synchronization of the local clocks in the system. Integration/Reintegration Service is used for components to join an already synchronized system. Clique Detection Services are used to detect loss of synchronization and establishment of disjoint sets of synchronized components. Startup/Restart Service

FT Synchronized Global Time Fault-tolerant synchronization services are needed for establishing a robust global time base in the sub-microsecond area

Permanence of PCFs Using the transparent_clock value, a receiver can determine the “earliest safe” point in time when a PCF becomes permanent: permanence_delay = max_transmission_delay – transparent_clock permanence_point_in_time = receive_point_in_time + permanence_delay Example: max_transmission_delay in this network is 0:30 frame F1 is transmitted by node A at 10:00 frame F2 is transmitted by node B at 10:05 frame F1 has a transmission delay A  C of 0:20. This is visible in F1’s transparent_clock frame F2 has a transmission delay B  C of 0:05. This is visible in F2’s transparent_clock receiver C sees: F2 arrives at 10:10, becomes permanent at 10:10 + (0:30 - 0:05) = 10:35 receiver C sees: F1 arrives at 10:20, F1 becomes permanent at 10:20 + (0:30 - 0:20) = 10:30  F1 becomes permanent before F2 B F2 10:05 F1 10:10 A C 10:20 Comp 10:00

External Clock Synchronization External synchronization to e.g. PPS of the fault-tolerant clock

Time-triggered Traffic Timing Full control of timings in the system Defined latency and sub-microsecond jitter Minimum memory needs Fault-containment regions I’ll expect M between 11:05 and 11:15 I’ll accept M only between 10:40 and 10:50 I’ll accept M only between 10:55 and 11:05 M M M …but sender and receiver still only do “I’ll transmit M at…” and “I’ll expect M at…” – the added complexity is in the network, not in the nodes I’ll forward M at 11:00 M I’ll transmit M at 10:45 I’ll forward M at 11:10 Let’s see if I can receive M …a switch

TTEthernet Traffic Partitioning TTEthernet provides a set of time-triggered services implemented on top of standard IEEE 802.3 Ethernet. These services are designed to enable design of synchronous, highly dependable embedded computing and networking systems, capable of tolerating multiple faults. With TTEthernet, robustly partitioned multimedia data streams, critical control data and standard LAN messages can operate in one network without congestion or unintended interactions. On this slide, four synchronous time-triggered Ethernet streams are shown in red. They are robustly separated from other asynchronous priority-based or rate-constrained datastreams such as IEEE DCB or AVB, and other lower-priority standard Ethernet traffic. Page 14

Extensions & Standard Ethernet Time-triggered extensions for standard switched Gigabit-Ethernet Startup Recovery Robust fault-tolerant distributed clock Makes Ethernet viable for safety-critical distributed applications!

Fault-Containment Regions in TTEthernet TTEthernet defines Switches and End Systems as two kinds of Fault-Containment Regions. Frame loss is mapped to the respective sender. Depending on cost and reliability targets, switches and or end systems may be implemented with standard or high-integrity in order to be able to scale from single to dual fault tolerance. Protocol mechanisms can be configured to handle Strictly Omissive Asymmetric switch faults (HI) and fully Transmissive Asymmetric end system faults (SI).

High-Integrity: Self-Checking Pair High integrity design: Self checking pair Two processor that execute same function in parallel Comparator checks output of both processors. If one processor fails (maliciously) and generates wrong data, second processors shuts down. Self-checking pair ensures fail-silence !

Requirement: Easy “System of Systems” Fusion Priority 1 time-triggered Priority 2 SoS architecture with TTEthernet supports reconfiguration Several separate vehicles or elements fuse into a new combined network configuration architecture is coupled together through Virtual Backplane The Integrated Systems ideal for system of systems individual systems come together > coupled through fusion of Virtual Backplane predetermined, yet dynamic, re-configuration of the individual computing element’s configuration tables enables the several free-flying elements start a mission with individual state vectors, to fuse into a combined configuration that share a new, common set of state vectors. 18

TTE-Controller  Switch Controller COM  Switch Controller MON End System IP/UDP ARINC653 Partitions support in HW  CPU Management & Diagnostics Available in Q3/2016

Software Tools and Development Systems TTEthernet Products TTEthernet TTESwitches A664 Software Tools and Development Systems  TTEVerify (for DO cert.) TTETools (development) Switch Controller SMC 6U VPX* TTECOM TTESync Lib (middleware) TTEEnd Systems A664 Upcoming key product launches (2013+2014) PMC Lab End System Controller PMC Pro ARINC 653 v4.0 Linux v4.0

Cross Industry © NASA Sikorsky S97 Raider NASA Orion Vestas Wind Turbines TTEthernet Examples of Reliable Safety Critical Networks Audi Piloted Driving Aribus DS Ariane 6 Oil Platform

Conclusion The protocol and implementation supports Synchronization Deterministic communication Fault-tolerance But also allows the flexibility of the standard Ethernet  Reduces SW complexity Space graded components are up coming The environment is developed cross industry (embedded SW, tools, test- and development equipment)

Thank You! Any Questions?