December 5, 2003. Mars and Beyond: NASA’s Software Challenges in the 21st Century. Dr. Michael R. Lowry, NASA Ames Research Center.


Mars and Beyond: NASA’s Software Challenges in the 21st Century
Dr. Michael R. Lowry, NASA Ames Research Center
December 5, 2003

Outline
NASA’s Mission
The Role of Software within NASA’s Mission
The Challenge: Enable Dependable Software-Based Systems
Technical Challenges:
–Scaling!
–The system-software barrier
–Software is opaque and brittle in the large
Reasons for Optimism

NASA’s Vision:
–To improve life here
–To extend life to there
–To find life beyond
NASA’s Mission:
–To understand and protect our home planet
–To explore the universe and search for life
–To inspire the next generation of explorers
…as only NASA can
Five Strategic Enterprises, One NASA: Space Science, Earth Science, Biological & Physical Research, HEDS (Human Exploration and Development of Space), Aerospace Technology.

Software Growth in Aerospace Missions: Software Enables NASA’s Missions
[Chart: flight software size (equivalent memory locations, K instructions, log scale from 1 to 10,000) versus year, for unpiloted systems (Mariner, Surveyor, Viking, Voyager, Galileo, Titan, Pershing, Poseidon, Trident) and piloted systems (Mercury, Gemini, Apollo, Shuttle, and aircraft from the A7D/E and C-5A through the B-2 and F-22). Software size has been doubling every 3 or 4 years. Source: AF Software Technology Support Center.]

The Challenge: Software Risk Factors

Mars Climate Orbiter
Launched: 11 Dec 1998
Mission:
–interplanetary weather satellite
–communications relay for Mars Polar Lander
Fate:
–arrived 23 Sept 1999
–no signal received after initial orbit insertion
Cause:
–faulty navigation data caused by failure to convert imperial to metric units

MCO Events
Locus of error:
–A ground software file called “Small Forces” gives thruster performance data. This data is used to process telemetry from the spacecraft: the spacecraft signals each Angular Momentum Desaturation (AMD) maneuver, and the Small Forces data is used to compute its effect on the trajectory.
–The software underestimated the effect by a factor of 4.45.
Cause of error:
–The Small Forces data was given in pound-force seconds (lbf-s); the specification called for Newton-seconds (N-s).
Result of error:
–As the spacecraft approached orbit insertion, the trajectory was corrected, aiming for a periapse of 226 km on the first orbit.
–Estimates were adjusted as the spacecraft approached orbit insertion: 1 week prior, the first periapse was already estimated well below the aim point; 1 hour prior, the estimate was down to 110 km. The minimum periapse considered survivable is 80 km.
–MCO entered Mars occultation 49 seconds earlier than predicted. The signal was never regained after the predicted 21-minute occultation. Subsequent analysis estimates a first periapse of 57 km.
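
A minimal Java sketch of this failure mode (illustrative only; the class, numbers, and structure are invented, not MCO flight code): a producer reports impulse in lbf-s while the consumer interprets the value as N-s, silently shrinking every maneuver’s modeled effect by the 4.45 unit-conversion factor.

// The MCO class of error: impulse produced in lbf-s, consumed as N-s.
public class UnitMismatchDemo {
    // 1 lbf-s = 4.448222 N-s
    static final double NS_PER_LBFS = 4.448222;

    // What the ground file effectively did: emit the raw lbf-s value.
    static double reportedImpulseBuggy(double impulseLbfS) {
        return impulseLbfS; // missing conversion; the spec required N-s
    }

    // What the Software Interface Specification required.
    static double reportedImpulseCorrect(double impulseLbfS) {
        return impulseLbfS * NS_PER_LBFS;
    }

    public static void main(String[] args) {
        double amdImpulseLbfS = 10.0; // hypothetical AMD maneuver impulse
        double wrong = reportedImpulseBuggy(amdImpulseLbfS);
        double right = reportedImpulseCorrect(amdImpulseLbfS);
        // The consumer treats both numbers as N-s, so the trajectory model
        // underestimates the maneuver's effect by a factor of ~4.45.
        System.out.printf("underestimation factor: %.2f%n", right / wrong);
    }
}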

Contributing Factors
For the first 4 months, AMD data was unusable due to file format errors:
–Navigators calculated the data by hand.
–The file format was fixed by April 1999; anomalies in the computed trajectory became apparent almost immediately.
Limited ability to investigate the anomalies:
–Thrust effects are measured along the Earth–spacecraft line of sight using doppler shift, but AMD thrusts are mainly perpendicular to that line of sight.
Failure to communicate between teams:
–e.g., the issue tracking system was not properly used by the navigation team, so anomalies were not properly investigated.
Inadequate staffing:
–The operations team was monitoring three missions simultaneously (MGS, MCO, and MPL).
Operations navigation team unfamiliar with the spacecraft:
–It was a different team from the development and test team, and it did not fully understand the significance of the anomalies.
–It assumed familiarity with the previous mission (Global Surveyor) was sufficient, and did not understand why AMD was performed … times more often (MCO had asymmetric solar panels, whereas MGS had symmetric panels).
Inadequate testing:
–The Software Interface Specification was not used during unit testing of the Small Forces software.
–End-to-end testing of the ground software was never completed.
–The ground software was not considered “mission critical,” so it did not have independent V&V.
Inadequate reviews:
–Key personnel were missing from critical design reviews.

Analysis
Software size S is increasing exponentially (doubling every three or four years).
Errors, cost over-runs, and schedule slip are due primarily to non-local dependencies during integration: they grow as S^N with N < 2 (best calibration: N = 1.2).
Source: Professor Barry Boehm, author of software cost modeling.
[Chart: errors versus software size.]

Predicted Errors as LOC Grows (Current Software Practices and Technology)
Errors = e · S^N, where S is the number of modules (LOC/M) and the error rate e = 1/10,000.
[Chart: predicted errors versus lines of code, with Cassini and MPL marked as data points.]
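
The model is easy to evaluate. Below is a minimal Java sketch; the module size M = 100 LOC and the size ladder are hypothetical choices for illustration. Since N > 1, each doubling of size multiplies predicted errors by 2^1.2 ≈ 2.3 rather than 2, which is the slide’s point about non-local integration effects.

// Minimal sketch of the slide's error-growth model: Errors = e * S^N.
// M (LOC per module) and the LOC values are hypothetical.
public class ErrorGrowthModel {
    static double predictedErrors(double loc, double locPerModule,
                                  double e, double n) {
        double modules = loc / locPerModule; // S: number of modules
        return e * Math.pow(modules, n);     // Errors = e * S^N
    }

    public static void main(String[] args) {
        double e = 1.0 / 10_000;   // error rate from the slide
        double n = 1.2;            // best calibration per the slide
        double m = 100;            // hypothetical module size (LOC)
        for (double loc : new double[]{1e5, 2e5, 4e5, 8e5, 1.6e6}) {
            System.out.printf("%,.0f LOC -> %.2f predicted residual errors%n",
                    loc, predictedErrors(loc, m, e, n));
        }
    }
}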

Future Mars Exploration: MSL (Mars Smart Lander) and MSR (Mars Sample Return)

Beyond Mars: JIMO (Jupiter Icy Moons Orbiter) and TPF (Terrestrial Planet Finder)

Technical Challenges and Opportunities
The system-software barrier:
–verification is easy, validation is hard.
Software is transparent and malleable in the small…
–but opaque and brittle in the large.
General-purpose software dependability tools work well in the small…
–but fail to scale to systems in the large.
But there is reason for optimism:
–align software architectures with system analysis;
–the success of formal methods in the related field of digital hardware;
–scaling through specialization;
–divide and conquer: compositional reasoning;
–beyond correctness: exploiting the lattice between true and false for software understanding;
–providing the research community with realistic experimental testbeds at scale.

Scaling through Specialization: Practical Static Analysis
[Chart: precision (80% to 95%) versus scalability (50 KLoc to 1 MLoc). General-purpose analyzers (PolySpace C-Verifier, DAEDALUS, Coverity) trade precision against scalability; the specialized analyzer C Global Surveyor (NASA Ames) achieves both high precision and million-line scalability.]
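
To make the kind of analysis concrete: the sketch below is a toy interval-domain abstract interpretation in Java, the style of reasoning used to prove array accesses in bounds without running the program. It is a teaching sketch under our own simplifications, not the algorithm or API of any tool named above.

// Toy abstract interpretation over intervals: prove that the loop
// "for (i = 0; i < LEN; i++) a[i] = ..." never indexes out of bounds.
public class IntervalSketch {
    record Interval(int lo, int hi) {
        Interval join(Interval o) { // least upper bound: interval hull
            return new Interval(Math.min(lo, o.lo), Math.max(hi, o.hi));
        }
        Interval add(int k) { return new Interval(lo + k, hi + k); }
        Interval boundedAbove(int maxExcl) { // refine by branch i < maxExcl
            return new Interval(lo, Math.min(hi, maxExcl - 1));
        }
        boolean within(int min, int max) { return lo >= min && hi <= max; }
    }

    public static void main(String[] args) {
        final int LEN = 10;
        Interval atLoopHead = new Interval(0, 0); // i = 0 on entry
        Interval inBody;
        while (true) {
            inBody = atLoopHead.boundedAbove(LEN);  // take branch i < LEN
            Interval afterInc = inBody.add(1);      // effect of i++
            Interval next = atLoopHead.join(afterInc);
            if (next.equals(atLoopHead)) break;     // fixpoint reached
            atLoopHead = next; // a real analyzer would widen here to scale
        }
        System.out.println("index range in loop body: " + inBody
                + ", in bounds: " + inBody.within(0, LEN - 1));
    }
}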

Explaining the Cause of an Error
Code with a transient error:

void add(Object o) {
  buffer[head] = o;
  head = (head+1)%size;
}
Object take() {
  tail = (tail+1)%size;
  return buffer[tail];
}

Hard to show the error: testing cannot reliably make the error appear, since doing so may require specific environment actions (inputs) or a specific scheduling (for concurrency errors).
Hard to find the cause of the error: even once we know a way to show the error, it is difficult to localize its root cause.
The software model checker JPF produces an error trace: a model checker can automatically find a trace that makes the error appear.
Error explanation then localizes the cause: we can automatically find an explanation for the error from the error trace produced by the model checker and the original program. The algorithm uses model checking to find similar traces that also cause the error (negatives) and traces that do not cause the error (positives):
–set of positives: traces that do not show the error;
–set of negatives: traces that show different versions of the error.
Analysis:
1. Source-code similarities explain control errors: code that appears only in negatives, in all negatives, and in only and all negatives (causal).
2. Data invariants explain errors in data.
3. Minimal transformations that create a negative from a positive show the essence of an error.
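
For readers who want to run the example, here is one way to complete the slide’s fragment into a self-contained Java class; the completion (fields, capacity, test harness) is our assumption, not the original NASA code. The defect is the one the slide describes: add() and take() are unsynchronized and there is no empty/full coordination, so only some thread schedules expose the error. That is exactly why a model checker such as JPF, which explores schedules systematically, can exhibit the defect as a concrete trace on demand.

// Ring buffer with a transient concurrency error (sketch).
public class RingBuffer {
    private final Object[] buffer;
    private final int size;
    private int head = 0;
    private int tail = -1; // take() pre-increments, so the first read is slot 0

    RingBuffer(int size) { this.size = size; this.buffer = new Object[size]; }

    void add(Object o) {              // unsynchronized: racy
        buffer[head] = o;
        head = (head + 1) % size;     // read-modify-write, not atomic
    }

    Object take() {                   // unsynchronized: racy
        tail = (tail + 1) % size;
        return buffer[tail];          // may read a slot never written
    }

    public static void main(String[] args) throws InterruptedException {
        RingBuffer b = new RingBuffer(4);
        Thread producer = new Thread(() -> {
            for (int i = 0; i < 1_000; i++) b.add(i);
        });
        Thread consumer = new Thread(() -> {
            for (int i = 0; i < 1_000; i++) {
                if (b.take() == null)   // transient: appears only under
                    System.out.println("read an unwritten slot"); // some schedules
            }
        });
        producer.start(); consumer.start();
        producer.join(); consumer.join();
    }
}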

Generalized Symbolic Execution for Model Checking and Testing
Our novel symbolic execution framework:
–extends model checking to programs that have complex inputs with unbounded (very large) data;
–automates test input generation.
Framework: the input program is instrumented, and the instrumented program is model checked against a correctness specification. Decision procedures tell the model checker whether to continue or backtrack along each path, and the output is a set of counterexamples / a test suite, each characterized by a heap configuration, numeric constraints, and a thread scheduling. The architecture is modular: different model checkers and decision procedures can be plugged in.
Future mission software:
–is concurrent, with complex, dynamically allocated data structures (e.g., lists or trees);
–is highly interactive, with complex inputs and a large environment;
–should be extremely reliable.
Example: a rover executive (startExecutive() runs the threads; executePlan() loops over executeCurrentPlanNode()) executes actions against the environment/rover status, combining a complex input structure (the plan), large environment data, concurrency, and dynamic data (lists, trees).
Current practice in checking complex software:
–Testing requires manual input, is typically done for a few nominal input cases, and is not good at finding concurrency bugs or at dealing with complex data structures.
–Model checking is automatic and good at finding concurrency bugs, but it is not good at dealing with complex data structures, and is feasible only with a small environment and a small set of input values.
Code under analysis:

class Node {
  int elem;
  Node next;
  Node deleteFirst() {
    if (elem < 10) return next;
    else if (elem < 0) assert(false);
    …
  }
}

Analysis of deleteFirst with our framework: “simulate” the code using symbolic values instead of program data, and enumerate the input structures lazily (lazy initialization: a field not yet touched is “unknown yet”). Structural constraints, under the precondition that the list is acyclic, enumerate all structures: head node e0 with next null, or with next pointing to a fresh node e1, and so on. Numeric constraints go to decision procedures: e0 < 10 is satisfiable (true branch), e0 ≥ 10 is satisfiable (false branch), but e0 ≥ 10 ∧ e0 < 0 is unsatisfiable (FALSE), so the assertion is unreachable.
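
The deleteFirst() analysis can be replayed by hand. The sketch below is our illustration, not the framework’s API: it tracks a path condition over the single symbol e0, using trivial interval reasoning in place of a real decision procedure, and shows why the assert(false) path is infeasible.

// Hand-rolled path-condition check for the deleteFirst() example.
public class SymbolicDeleteFirstSketch {
    // Path condition over one integer symbol e0: lo <= e0 <= hi.
    record PathCondition(int lo, int hi) {
        boolean satisfiable() { return lo <= hi; }
        PathCondition and(int newLo, int newHi) { // conjoin a bound on e0
            return new PathCondition(Math.max(lo, newLo), Math.min(hi, newHi));
        }
    }

    public static void main(String[] args) {
        PathCondition top =
                new PathCondition(Integer.MIN_VALUE, Integer.MAX_VALUE);

        // Path 1: if (elem < 10) return next;
        PathCondition p1 = top.and(Integer.MIN_VALUE, 9);       // e0 < 10
        System.out.println("e0 < 10 (return next): " + p1.satisfiable());

        // Path 2: else if (elem < 0) assert(false);
        // Reached only with e0 >= 10, then additionally requires e0 < 0.
        PathCondition p2 = top.and(10, Integer.MAX_VALUE)       // e0 >= 10
                              .and(Integer.MIN_VALUE, -1);      // e0 < 0
        System.out.println("e0 >= 10 && e0 < 0 (assert): " + p2.satisfiable());
        // Prints false: the failing path is infeasible, so the model
        // checker backtracks and no test input is generated for it.
    }
}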

System-Level Verification
Check (system-level) integration properties based on module specifications:
–the module hierarchy and interfaces are used for incremental abstraction;
–architectural patterns are potentially reusable;
–generate module/environment assumptions.
Check implementation modules against their design specifications.
Monitor properties that cannot be verified; monitor environment assumptions.

Module Verification
Modules may require context information to satisfy a property:
Assumption || Module ⊨ Property (assume-guarantee reasoning).
[Diagram: an Environment and a Module interacting over actions a, b, c; the Property holds under the Assumption.]
How are assumptions obtained?
–The developer encodes them.
–As abstractions of the environment, if it is known.
–Automatically generate the exact assumption A: for any environment E, (E || Module ⊨ Property) iff E ⊨ A. Demonstrated on the Rover example (Automated Software Engineering 2002).
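
A code-shaped reading of assume-guarantee may help (an invented example, not the Ames assumption-generation tool): the module guarantees its property only under an explicit assumption about the environment, and when the assumption cannot be verified for the real environment, it can at least be monitored at the interface, as the previous slide suggests.

// Module guarantee: no out-of-bounds write, PROVIDED the environment
// satisfies assumption A: "never add() to a full buffer".
public class AssumeGuaranteeSketch {
    interface Assumption { void onAdd(int count, int capacity); }

    // Runtime monitor for the environment assumption A.
    static class AddOnlyWhenNotFull implements Assumption {
        public void onAdd(int count, int capacity) {
            if (count >= capacity)
                throw new IllegalStateException(
                        "assumption violated: add() on a full buffer");
        }
    }

    static class BoundedBuffer {
        private final int[] data;
        private int count = 0;
        private final Assumption assumption;

        BoundedBuffer(int capacity, Assumption a) {
            data = new int[capacity];
            assumption = a;
        }

        void add(int x) {
            assumption.onAdd(count, data.length); // check A at the interface
            data[count++] = x;                    // guarantee holds under A
        }
    }

    public static void main(String[] args) {
        BoundedBuffer b = new BoundedBuffer(2, new AddOnlyWhenNotFull());
        b.add(1);
        b.add(2);
        try {
            b.add(3); // the environment breaks A
        } catch (IllegalStateException e) {
            // The monitor flags the violation at the point of blame,
            // not as a mysterious corruption later.
            System.out.println("monitor caught: " + e.getMessage());
        }
    }
}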

Mission Manager Viewpoint: Asking the Right Questions
When can we stop testing? What process should we use? What is the value of formal methods?
Qualitative correlative model:
–peer review is superior to testing for incorrect specifications;
–model checking for uncertain environments.
Quantitative predictive model:
–mission trade studies: how much cost for acceptable risk;
–development: optimize the use of assurance technologies;
–mission: increase the use of CPU cycles for software monitoring.

HDCP Goals
The overall mission of the HDCP project is to increase NASA’s ability to engineer highly dependable software systems.
Method:
–Science of dependability: develop better ways to measure and predict software dependability. What are the potential measurables for the various attributes? How can we move past the present surrogates and approach the artifact more directly?
–Empirical evaluation of NASA and NASA-contractor dependability problems, and of technologies and engineering principles to address those problems.
–Testbeds: development of realistic testbeds for empirical evaluation of technologies and attributes.
–Intervention technologies.

Active MDS Testbed Projects
Golden Gate project:
–demonstrate that RT-Java is suitable for mission systems; drive an MDS/RTSJ rover at JavaOne.
–Collaborators: Champlin, Giovannoni.
SCRover project:
–develop a rover testbed; collect defect and process data for an experience base.
–Collaborators: Boehm, Madachy, Medvidovic, Port.
Dependability cases:
–develop dependability cases for time management and software architectures.
–Collaborators: Goodenough, Weinstock, Maxion, Hudak.
Analysis of the MDS architectural style:
–analysis based on MDS’s use of architectural component types.
–Collaborator: Garlan.
Process improvement:
–data collection from mainline MDS and SCRover development efforts.
–Collaborators: Johnson, Port.

MDS in 1 Minute
Approach: product-line practice to exploit commonalities across missions:
1. An information and control architecture to which missions/products conform.
2. A systems engineering process that is analytical, disciplined, and methodical.
3. Reusable and adaptable framework software.
Problem domain: mission information, control, and operations of physical systems.
–Developed for unmanned space science missions; scope includes flight, ground, and simulation/test.
–Applicable to robots that operate autonomously to achieve goals specified by humans.
–Architecturally suited for complex systems where “everything affects everything.”
MDS products: a unified flight, ground, and test architecture; an orderly systems engineering methodology; frameworks (C++ and Java); processes, tools, and documentation; examples; reusable software.

Managing Interactions
Complex interactions make software difficult:
–Elements that work separately often fail to work together.
–The combinatorics of interaction is staggering, so it is not easy to get right.
–This is a major source of unreliability.
“A unified approach to managing interactions is essential.” There are two approaches to this in MDS:
–Component-based architecture: handles interactions among elements of the system software; inward-looking; addresses software engineering issues.
–State-based architecture: handles interactions among elements of the system under control; outward-looking; addresses systems engineering issues.

MDS is… a State-Based Architecture
State variables hold state values, including degree of uncertainty.
Estimators interpret measurement and command evidence to estimate state.
Controllers issue commands, striving to achieve goals.
Hardware proxies provide access to hardware busses, devices, and instruments.
Models express mission-specific relations among states, commands, and measurements.
A goal is a constraint on the value of a state variable over a time interval.
Key features:
–systems analysis/design organized around states and models;
–state control architecturally separated from state determination;
–the system is operated via specifications of intent: goals on state.
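
A toy rendering of this vocabulary in Java may help fix the roles (invented names and toy numerics; not the actual MDS C++/Java framework API): a goal constrains a state variable over a time interval, an estimator folds evidence into the state estimate and its uncertainty, and a controller issues commands that strive to drive the state into the goal region.

// Toy state-based pattern: state variable + goal + estimator + controller.
public class StateBasedSketch {
    // State knowledge with an explicit degree of uncertainty.
    static class StateVariable {
        double value;       // best estimate (say, battery charge in %)
        double uncertainty; // how unsure the estimate is
    }

    // A goal: constrain the state variable's value over a time interval.
    record Goal(double min, double max, long startMs, long endMs) {
        boolean satisfiedBy(StateVariable sv, long nowMs) {
            return nowMs >= startMs && nowMs <= endMs
                    && sv.value - sv.uncertainty >= min
                    && sv.value + sv.uncertainty <= max;
        }
    }

    // Estimator: interprets measurement evidence to update state knowledge.
    static class Estimator {
        void update(StateVariable sv, double measurement) {
            sv.value = 0.8 * sv.value + 0.2 * measurement; // toy filter
            sv.uncertainty *= 0.9;                         // evidence helps
        }
    }

    // Controller: issues commands, striving to achieve the goal.
    static class Controller {
        double command(StateVariable sv, Goal g) {
            double target = (g.min() + g.max()) / 2;
            return target - sv.value; // toy proportional command
        }
    }

    public static void main(String[] args) {
        StateVariable charge = new StateVariable();
        charge.value = 40; charge.uncertainty = 10;
        Goal keepCharged = new Goal(50, 100, 0, 3_600_000);
        Estimator estimator = new Estimator();
        Controller controller = new Controller();
        estimator.update(charge, 55);  // new measurement arrives
        double cmd = controller.command(charge, keepCharged);
        System.out.printf("estimate %.1f (±%.1f), command %+.1f, goal met: %b%n",
                charge.value, charge.uncertainty, cmd,
                keepCharged.satisfiedBy(charge, 1_000));
    }
}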

From Theory to Flight: the JPL Transition Path
Mars Smart Lander (MSL) technology infusion:
–scheduled launch: 2009;
–MSL has baselined MDS technology (system engineering, software frameworks).
MSL technology gates:
–PMSR: August 2004;
–integrated demo: June 2005;
–PDR: February 2006.
MSL sample technology categories:
–software architecture with infused technologies;
–verification and validation tools and methodologies;
–processes and supporting tools;
–cost modeling for system engineering, software adaptation, and autonomy validation.
MDS-compatible technologies are directly relevant to MSL.

Conclusions
The system-software barrier:
–verification is easy, validation is hard.
Software is transparent and malleable in the small…
–but opaque and brittle in the large.
General-purpose software dependability tools work well in the small…
–but fail to scale to systems in the large.
But there is reason for optimism:
–align software architectures with system analysis;
–the success of formal methods in the related field of digital hardware;
–scaling through specialization;
–divide and conquer: compositional reasoning;
–beyond correctness: exploiting the lattice between true and false for software understanding;
–providing the research community with realistic experimental testbeds at scale.