A Real-Time Gracefully Degrading Avionics System for Unmanned Aerial Vehicles Published in: National Aerospace & Electronics Conference (NAECON), 2012.

Slides:

Advertisements

Similar presentations

I/O Organization popo.

Advertisements

Georgia Tech Aerial Robotics Dr. Daniel P Schrage Jeong Hur Fidencio Tapia Suresh K Kannan SUCCEED Poster Session 6 March 1997.

Christian Delbe1 Christian Delbé OASIS Team INRIA -- CNRS - I3S -- Univ. of Nice Sophia-Antipolis November Automatic Fault Tolerance in ProActive.

Arduino Autonomous Robotics System

Outline quad-copter Abstract Quad-Copter Movement Hand movement

Chapter 19: Network Management Business Data Communications, 5e.

FIU Chapter 7: Input/Output Jerome Crooks Panyawat Chiamprasert

ReVive: Cost-Effective Architectural Support for Rollback Recovery in Shared-Memory Multiprocessors Milos Prvulovic, Zheng Zhang, Josep Torrellas University.

Autonomous Quadrocopter Proposal Brad Bergerhouse, Nelson Gaske, Austin Wenzel Dr. Malinowski.

Chapter 19: Network Management Business Data Communications, 4e.

EEC 688/788 Secure and Dependable Computing Lecture 12 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University

Parth Kumar ME5643: Mechatronics UAV ATTITUDE AND HEADING HOLD SYSTEM.

CS 582 / CMPE 481 Distributed Systems Fault Tolerance.

Reegan Worobec & David Sloan In collaboration with UAARG.

1 ITC242 – Introduction to Data Communications Week 12 Topic 18 Chapter 19 Network Management.

Electrical and Computer Systems Engineering Postgraduate Student Research Forum 2001 Design and Development of a Distributed Avionics System for use in.

CS-550 (M.Soneru): Recovery [SaS] 1 Recovery. CS-550 (M.Soneru): Recovery [SaS] 2 Recovery Computer system recovery: –Restore the system to a normal operational.

2/23/2009CS50901 Implementing Fault-Tolerant Services Using the State Machine Approach: A Tutorial Fred B. Schneider Presenter: Aly Farahat.

7. Fault Tolerance Through Dynamic or Standby Redundancy 7.5 Forward Recovery Systems Upon the detection of a failure, the system discards the current.

1 of 14 1 Scheduling and Optimization of Fault- Tolerant Embedded Systems Viacheslav Izosimov Embedded Systems Lab (ESLAB) Linköping University, Sweden.

Page 1 Copyright © Alexander Allister Shvartsman CSE 6510 (461) Fall 2010 Selected Notes on Fault-Tolerance (12) Alexander A. Shvartsman Computer.

TCP: Software for Reliable Communication. Spring 2002Computer Networks Applications Internet: a Collection of Disparate Networks Different goals: Speed,

A Progressive Fault Tolerant Mechanism in Mobile Agent Systems Michael R. Lyu and Tsz Yeung Wong July 27, 2003 SCI Conference Computer Science Department.

Deon Blaauw Modular Robot Design University of Stellenbosch Department of Electric and Electronic Engineering.

Overview SAP Basis Functions. SAP Technical Overview Learning Objectives What the Basis system is How does SAP handle a transaction request Differentiating.

A Survey of Rollback-Recovery Protocols in Message-Passing Systems M. Elnozahy, L. Alvisi, Y. Wang, D. Johnson Carnegie Mellon University Presented by:

1 Rollback-Recovery Protocols II Mahmoud ElGammal.

Sérgio Ronaldo Barros dos Santos, Cairo Lúcio Nascimento Júnior,

S1.6 Requirements: KnightSat C&DH RequirementSourceVerification Source Document Test/Analysis Number S1.6-1Provide reliable, real-time access and control.

Lecture 6: Introduction to Distributed Computing.

Control Over WirelessHART Network S. Han, X. Zhu, Al Mok University of Texas at Austin M. Nixon, T. Blevins, D. Chen Emerson Process Management.

Guide to Linux Installation and Administration, 2e 1 Chapter 9 Preparing for Emergencies.

1 Fault Tolerance in the Nonstop Cyclone System By Scott Chan Robert Jardine Presented by Phuc Nguyen.

DISTRIBUTED ALGORITHMS Luc Onana Seif Haridi. DISTRIBUTED SYSTEMS Collection of autonomous computers, processes, or processors (nodes) interconnected.

Computational Mechanics and Robotics The University of New South Wales

IMPROUVEMENT OF COMPUTER NETWORKS SECURITY BY USING FAULT TOLERANT CLUSTERS Prof. S ERB AUREL Ph. D. Prof. PATRICIU VICTOR-VALERIU Ph. D. Military Technical.

1. Introduction 1.1 Background 1.2 Real-time applications 1.3 Misconceptions 1.4 Issues in real-time computing 1.5 Structure of a real-time system.

A Survey of Rollback-Recovery Protocols in Message-Passing Systems.

Sérgio Ronaldo Barros dos Santos (ITA-Brazil)

Distributed Systems: Concepts and Design Chapter 1 Pages

F Networked Embedded Applications and Technologies Lab Department of Computer Science and Information Engineering National Cheng Kung University, TAIWAN.

A Proposal of Application Failure Detection and Recovery in the Grid Marian Bubak 1,2, Tomasz Szepieniec 2, Marcin Radecki 2 1 Institute of Computer Science,

SafetyNet: improving the availability of shared memory multiprocessors with global checkpoint/recovery Daniel J. Sorin, Milo M. K. Martin, Mark D. Hill,

12. Recovery Study Meeting M1 Yuuki Horita 2004/5/14.

Lecture 4: Sun: 23/4/1435 Distributed Operating Systems Lecturer/ Kawther Abas CS- 492 : Distributed system & Parallel Processing.

Phong Le (EE) Josh Haley (CPE) Brandon Reeves (EE) Jerard Jose (EE)

Parallelization and Characterization of Pattern Matching using GPUs Author: Giorgos Vasiliadis 、 Michalis Polychronakis 、 Sotiris Ioannidis Publisher:

Specifications: WEIGHT: 58 Kg Walking speed: km/h Walking Cycle: Cycle adjustable stride adjustable Grasping force: 0.5 kg/hand (5 finger hand)

Cooperative Air and Ground Surveillance Wenzhe Li.

CprE 458/558: Real-Time Systems

Operating System Principles And Multitasking

The palm was created using a modular cavity design. It was designed using ProEngineer and printed using Rapid Prototype. The fingers were made using Polymorph.

FUFO project Final report.

1 SOARS Matt Edwards Arseny Dolgov John Shelton Johnny Jannetto Galina Dvorkina Nick Driver Eric Kohut Kevin Eberhart Self Organizing Aerial Reconnaissance.

Why Do Airplanes Crash? Investigating Air Data Inertial Reference Units Department of Electrical and Computer Engineering INTRODUCTION Modern aircraft.

The life of an instruction in EV6 pipeline Constantinos Kourouyiannis.

University of Westminster – Checkpointing Mechanism for the Grid Environment K Sajadah, G Terstyanszky, S Winter, P. Kacsuk University.

Ground Control Station Flight conTrol

Middleware for Fault Tolerant Applications Lihua Xu and Sheng Liu Jun, 05, 2003.

FTOP: A library for fault tolerance in a cluster R. Badrinath Rakesh Gupta Nisheeth Shrivastava.

FTC-Charm++: An In-Memory Checkpoint-Based Fault Tolerant Runtime for Charm++ and MPI Gengbin Zheng Lixia Shi Laxmikant V. Kale Parallel Programming Lab.

Autonomous Quadcopter Andrew Martin, Baobao Lu, Cindy Xin Ting Group 37 TA: Katherine O'Kane.

KNU RTLAB A Real-Time Linux System For Autonomous Navigation And Flight Attitude Control Of An Uninhabited Aerial Vehicle Charles E. Hall, Jr. Mechanical.

1 Fault Tolerance and Recovery Mostly taken from

Week#3 Software Quality Engineering.

Prepared by Ertuğrul Kuzan

Operating Systems (CS 340 D)

Supporting Fault-Tolerance in Streaming Grid Applications

Fault Tolerance Distributed Web-based Systems

University of Wisconsin-Madison Presented by: Nick Kirchem

Presentation transcript:

A Real-Time Gracefully Degrading Avionics System for Unmanned Aerial Vehicles Published in: National Aerospace & Electronics Conference (NAECON), 2012 IEEE Authors: Belal H. Sababha Princess Sumaya University for Technology, Amman, Jordan Osamah A. Rawashdeh and Waseem A. Sa’deh Oakland University, Rochester, Michigan, USA Presented by: Srinivas Narne, Yash Kulkarni and Shefali Gundecha # of 23

Overview  Fault-tolerance  Graceful Degradation  Test-Bed  Avionics system  Gracefully degraded avionics system  Recovery Manager 2 of 23

Fault Tolerance  Fault tolerance is the ability of a system to continue operation in presence of hardware and software faults.  In safety-critical applications, the correct operation is vital, requiring the use of fault tolerant techniques in applications.  It is typically achieved through redundancy in hardware and software to enable fault detection and recovery. 3 of 23

Graceful Degradation  Reconfiguration necessitates the preservation of the recent history for every software task that is responsible of certain functionality in order to be able to restart the task later from the point that it failed at  Saving the state of a task to stable storage is referred to as checkpointing the state of that task  A system manager that is responsible for fault detection, rollback and recovery is implemented on a separate MCU 4 of 23

The Test-Bed  A quadrotor system is used as testbed  Ground Station  Flight Control  Data Processing  Run-time Monitoring  Aerial Vehicle  Avionics System  Payload System 5 of 23

The Test-Bed (Cont.)  Connections between the subsystems  1.3 GHz link for video down-streaming  75 MHz traditional R/C radio for manual flight control  2.4 GHz ZigBee link for telemetry and reconfiguration control  Body of quadrotor consists of magnesium hub joining four carbon fiber arms.  Mounted at the end of each arm is a magnesium motor mount that holds a brushless motor and propeller assembly.  In addition to the processors, other hardware components include an IMU, altimeter, and a modular GPS unit for attitude, altitude, and position estimation, respectively. 6 of 23

The Avionics System  The avionics system is a triple processor setup  Telemetry Processor  Interface Processor  Control Processor  All three processors are Freescale HCS12 microcontrollers running μC/OS-II, the real-time operating system (RTOS)  All MCUs are connected to a CAN bus. A forth dedicated MCU collects checkpoints written by other MCUs from the CAN bus and stores them to stable storage. 7 of 23

The Avionics System (Cont.)  The control processor is responsible for vehicle stabilization and navigation  These functions are achieved by executing software tasks implementing the Proportional- Integral- Differential (PID) control loops  PID gains are found experimentally, and are wirelessly reconfigured by the reconfiguration host on the ground station to allow in-flight tuning 8 of 23

The Gracefully Degrading Avionics System  The attitude stability tasks:  Roll, pitch and yaw PID control tasks  The control feedback task:  IMU task 9 of 23

The Continuously repeated data communication pattern During the process of attitude stability control 10 of 23

Checkpointing 101  Checkpointing protocol : BCS  Chosen due to the relatively low amount of overhead it induces compared to other checkpointing protocols  What exactly is the BCS protocol?  For a consistent global checkpoint for all processes  Every process maintains and propagates an index idx  Process p i initializes idx i to 0 and increments it after a checkpoint is taken  When p i sends a message, it piggybacks idx i onto it  When p i receives a message m with idx m > idx i, it takes a forced checkpoint 11 of 23

Checkpointing for our system  Tasks chosen for checkpointing:  The three attitude stability tasks and the motors control task (T motors_control )  T motors_control is a task inside the sensor-actuator module  Why?  The computations they perform depend on historical data from previous computations  Data that are checkpointed:  For each of the three PID tasks, the accumulative error used in the integral (I) term of the PID controller as well as the previous error used by the differential (D) term of the PID controller  For T motors_control, the current corrections that are used to adjust the speed of the four motors propelling the quadrotor.  Stored in dedicated arrays 12 of 23

Checkpointing Loop 13 of 23

The Recovery Manager ! 14 of 23

Recovery Manager  Each checkpointing task has a unique heart beat message broadcasted through the CAN network as long as it is functioning correctly  Fault injection  Through an external MCU connected to the CAN bus  Can send a command to the RTOS running on any other MCU in the network to kill any specific task executed by that MCU  The recovery manager can send a recover command to the RTOS running on any other MCU in the network  The recover command asks the RTOS to recreate a task and initialize it to a certain state (recovery state).  The recovery state is sent in combination with the recover command 15 of 23

Recovery Manager 16 of 23

Experimental Setup  1 KB memory – checkpointing storage  Acceptable execution rates for the three attitude stability PID control loops  Roll PID: 20 ms (i.e. 50 Hz)  Pitch PID: 20 ms (i.e. 50 Hz)  Yaw PID: 60 ms (i.e Hz)  Fault injected into the roll PID control task.  Recovery time noted. 17 of 23

PERFORMANCE RESULTS Frequency of exchanging application messages is constant. With increase in local checkpoint frequency, maintained checkpointing indexes increase rapidly. But no such increase in the exchange of application messages. Recovery manager side - more checkpoints to be stored 18 of 23

In case of lack of synchronization, checkpoints most likely do not have same index value => More time to find a consistent recovery line From the study, Recovery time for the system at 1 checkpoint per 30 execution loops = 120ms Execution rate for the fastest task in system = 20ms  System will miss 6 (120 ms/20 ms) execution loops - Acceptable in this case In general, to judge that a recovery time delay is acceptable or not depends on the application & how many execution loops are lost (during the recovery process). 19 of 23

Conclusion  Graceful Degradation – promising technique  Dependability + Reduced cost, size, weight & power  Paper overviews a graceful degradation approach to achieve fault tolerance  System included checkpointing coordination, checkpoint management, stable storage, and recovery management  Implementation in form of an avionics system - 3 control loops in parallel  Faults injected during run-time causing the system’s stability control tasks to fail  The system was able to recover in a very short time duration 20 of 23

Our Take on the Paper  Reduction in SWaP but no data available for comparison  “Our flight testing” – no specifications of test conditions mentioned  No insight provided on the mentioned “applications” 21 of 23

REFERENCES Quadrotors and the Future of Engineering : Gustavo M. D. Vieira, Islene C. Garcia, Luiz E. Buzato, “Systematic Analysis of Index-Based Checkpointing Algorithms using Simulation”, IX Brazilian Symposium on Fault Tolerant Computing, of 23

THANK YOU FOR YOUR ATTENTION ! ANY QUESTIONS ? 23 of 23