The Failure of the London Ambulance Service Michael McDougall CIS 573 November 16 th, 1999.

Slides:



Advertisements
Similar presentations
Week 1 (2) 2008IS33 IS Failure Cases 1 COMP3470 IS33 People-Centred Information Systems Development Week 1 : Lecture 2 IS Failure Case Studies School of.
Advertisements

Jeremy Siviter, IBI Group, Project Manager May 18th, 2011
Effectively applying ISO9001:2000 clauses 6 and 7.
Test-First Programming. The tests should drive you to write the code, the reason you write code is to get a test to succeed, and you should only write.
Test process essentials Riitta Viitamäki,
Roll-out Carts Proactive Replacement Programs. Outline – Proactive Replacement Program Background What (is proactive replacement)? Why (proactive replacement)?
Excalibur Communications Paul Crow Corporate Sales Manager.
Step by Step Guide. A new car depreciates (decreases in value) quickly - after 3 years is worth 60-70% of its original value ($20,000 = $12,000) – as.
Chapter 6 Scheduling. 222 Learning Objectives Estimate the duration for each activity Establish the estimated start time and required completion time.
©Ian Sommerville 2004Software Engineering Case Studies Slide 1 The London Ambulance fiasco l The London Ambulance Service (LAS) Computer Aided Despatch.
Technology Services Strategic Plan for Operational Efficiency
Advanced Database Projects In Access © Hodder Education 2008 Access Projects – Problem Specification.
Transfer from primary to secondary school September 2014 Secondary School Admissions 2014.
Methods: Deciding What to Design In-Young Ko iko.AT. icu.ac.kr Information and Communications University (ICU) iko.AT. icu.ac.kr Fall 2005 ICE0575 Lecture.
Marwan Al-Namari Week 2. ADSL : Asymmetric Digital Subscriber Line Ethernet networks - 10BASE-T - 100BASE-TX BASE-T BASE-TX (Cat5e.
WHY THEY FAILED AND LESSONS TO BE DRAWN Samuel Franklin G53QAT: Quality Assurance and Testing Famous Software Failures.
Slide 1 Client / Server Paradigm. Slide 2 Outline: Client / Server Paradigm Client / Server Model of Interaction Server Design Issues C/ S Points of Interaction.
SWE Introduction to Software Engineering
William Stallings Data and Computer Communications 7th Edition
Universal Medical Record. What is a medical record –Sources of information –Uses –How is it maintained –What are its component parts Medical Record.
Functional areas Retail Business.
Topic 10Summer London Ambulance System Some of the slides created by Sommerville.
Chapter 2 A Strategy for the Appraisal of Public Sector Investments.
The London Ambulance fiasco
CONTENTS:-  What is Event Log Service ?  Types of event logs and their purpose.  How and when the Event Log is useful?  What is Event Viewer?  Briefing.
Information Technology Project Management by Jack T. Marchewka Power Point Slides by Jack T. Marchewka, Northern Illinois University Copyright 2006 John.
Information Technology Project Management By Denny Ganjar Purnama, MTI Universitas Pembangunan Jaya May 2014.
A COMPUTER is an electronic device. Every computer performs 4 general operations: 1. Input 2. Process 3. Output 4. Storage.
Fleet Safety: Managing technology integration, training, and legal risks Teresa Prisbrey, Director of Marketing
Types of Operating System
AICT5 – eProject Project Planning for ICT. Process Centre receives Scenario Group Work Scenario on website in October Assessment Window Individual Work.
Commercial Database Applications Testing. Test Plan Testing Strategy Testing Planning Testing Design (covered in other modules) Unit Testing (covered.
AS Level ICT Mrs. Ghazaal. In the past, when a customer wanted to talk to someone in a company they would usually be able to telephone and be put through.
IT Business Continuity Briefing March 3,  Incident Overview  Improving the power posture of the Primary Data Center  STAGEnet Redundancy  Telephone.
9/12/2015Dr Andy Brooks1 Lecture 3 London Ambulance System (LAS) 26 & 27 October and November FOR0383 Software Quality Assurance.
Payment Automation Network, Inc. Click your mouse to continue. Press to quit
資工 4A 陳怡秀 Microsoft Visual Studio’s Journey to Continuous Delivery.
SOFTWARE ENGINEERING1 Introduction. Software Software (IEEE): collection of programs, procedures, rules, and associated documentation and data SOFTWARE.
Maintaining File Services. Shadow Copies of Shared Folders Automatically retains copies of files on a server from specific points in time Prevents administrators.
Royal Latin School. Spec Coverage: a) Explain the advantages of networking stand-alone computers into a local area network e) Describe the differences.
BT Young Scientists & Technology Exhibition App Risk Management.
2011 Census 2007 Census Test – emerging findings Garnett Compton, ONS Updated 4 September 2007 BSPS – 12 September 2007.
Plan Design Analyze Develop Test Implement Maintain Systems Development Life Cycle MAT Dirtbikes.
Data and Computer Communications Chapter 10 – Circuit Switching and Packet Switching (Wide Area Networks)
McLean HIGHER COMPUTER NETWORKING Lesson 15 (a) Disaster Avoidance Description of disaster avoidance: use of anti-virus software use of fault tolerance.
Step 5: Complete Your Project. Setting the scene Suppose you have been running a project to write a small piece of computer software for a business. The.
SOFTWARE ENGINEERING1 Introduction. SOFTWARE ENGINEERING2 Software Q : If you have to write a 10,000 line program in C to solve a problem, how long will.
Who Says Servers Can’t Crash? Rocky Mountain PBS Survives Multiple Server Crashes and Lives to tell about it! Presented By Michelle Nesmith Rocky Mountain.
Marketing Research Approaches. Research Approaches Observational Research Ethnographic Research Survey Research Experimental Research.
Vinay Paul. CONTENTS:- What is Event Log Service ? Types of event logs and their purpose. How and when the Event Log is useful? What is Event Viewer?
WATERFALL DEVELOPMENT MODEL. Waterfall model is LINEAR development lifecycle. This means each phase must be completed before moving onto the next!!! WHAT.
The London Ambulance Service Case. The Manual System 999 call to BT Call switched to LAS call takers –Record details & map reference onto a form Send.
Use of Mobile Technology for Data Collection in Zimbabwe Experiences Gained and Lessons Learnt By Rodgers M. Sango Zimbabwe National Statistics Agency.
Cervion Systems Customer Service Customer Service Overview.
DEFENSIVE DRIVING TRAINING What's difficult about driving? Increasing amount of vehicles on the road Other drivers attitudes Weather conditions Heavy.
Observing the Current System Benefits Can see how the system actually works in practice Can ask people to explain what they are doing – to gain a clear.
Nursing Informatics MNS 5103 MASTER OF NURSING SCIENCE (MNS)
Discovering Sensor Networks: Applications in Structural Health Monitoring Summary Lecture Wireless Communications.
1 Chapter 1- Introduction How Bugs affect our lives What is a Bug? What software testers do?
1 Software Testing and Quality Assurance Motivation and Review of Software Verification & Validation (2)
Written by: Harry Goldstein Published: IEEE Spectrum, September
Project Failure RYAN BIBBY – PROJECT 5 ASSIGNMENT 2 TASK 1.4.
Data and Computer Communications Chapter 7 Circuit Switching and Packet Switching.
Introduction SOFTWARE ENGINEERING.
The Accident On October 26th 1992 the London Ambulance System failed.
Your Facility Your Information
1.2 System Design Basics.
Computer Automated Dispatch Project
Defining project management
The London Ambulance fiasco
Presentation transcript:

The Failure of the London Ambulance Service Michael McDougall CIS 573 November 16 th, 1999

The Accident On October 26 th 1992 the London Ambulance System failed. Phones rang for up to 10 minutes Ambulance response times were delayed Some calls were lost On November 2 nd the system crashed completely. Software was a major cause of the failures.

Outline London Ambulance Service Computer Aided Despatch (CAD) system Background Planning the system Developing the system How it failed ISO – Software Development Standard LAS failure w.r.t. ISO standard

Background The London ambulance service (LAS) is was the largest ambulance service in the world. 6.8 million residents – much higher during daytime. Services 5000 patients a day. Handles between 2000 and 2500 calls a day (more than 1 per minute). Employs 2700 full-time staff.

Background In 1990 the LAS was not meeting the U.K. standards for ambulance response times. Other parts of the U.K. National Health Service had undergone reforms throughout the 80’s but the LAS had not changed much since Staff/Management relations were low.

Despatch system The despatch system was responsible for: Taking emergency calls Deciding which ambulance to send Sending information to ambulances Managing allocation of ambulances

Despatch system Take Call Collection Point Paper Regional Allocator Paper RA Paper Despatcher Voice Ambulance

Despatch system The UK national standard required that this take less than 3 minutes. The LAS system in 1990 had a number of inefficiencies which made it impossible to meet the standard.

Inefficiencies Take Call Finding the location of an accident was often difficult and time consuming.

Inefficiencies Paper Moving pieces of paper took unnecessary time

Inefficiencies Collection Point Identifying duplicate calls relied on human memory and was therefore slow and error prone.

Inefficiencies Regional Allocator Paper Despatcher Voice Ambulance Voice communication was slow Allocating ambulances was done by hand. Relied on memory of allocator.

Improving the system The LAS was under pressure from their superiors, MPs, the public and the media to improve performance. LAS management decided that a Computer Aided Despatch system was the fastest way to improve service.

The Plan LAS wanted to radically change the despatch system. In Autumn 1991 they began to write the system requirements for the new system.

CAD system goals Take Call Finding the location of an accident was often difficult and time consuming. Software connected to public telephones will locate incidents automatically

CAD system goals Paper Moving pieces of paper took unnecessary time Information will move through a network between workstations.

CAD system goals Collection Point Identifying duplicate calls relied on human memory and was therefore slow and error prone. AI will try to identify duplicate calls.

CAD system goals Regional Allocator Allocating ambulances was done by hand. Relied on memory of allocator. Allocation of nearest ambulance will be done by computer in most cases.

CAD system goals Despatcher Voice Ambulance Voice communication was slow Digital communication to and from ambulances

LAS ambitions The new system was intended to mobilize an ambulance in less than 1 minute. The system would be the most ambitious of its time. A much more modest system had been planned for the LAS, but this was abandoned when it failed load-testing. No independent audit of the system requirements was carried out.

CAD requirements LAS wanted a one-phase delivery LAS decided that the system should cost £1,500,000 LAS decided that the system would take 6 months to implement (though a project of this scale would usually take 18 months) These requirements were not based on any analysis of the design. They appear to be arbitrary.

Asking for tenders In early 1991, LAS publicized the requirements and asked for bids Many potential suppliers expressed doubts that the project could be finished on time with the required budget LAS replied that the timetable was not negotiable

Bids Many potential suppliers submitted bids for the project Most of the bids required more time and/or money The bids were evaluated by LAS staff who had no experience with information technology

Selecting a contractor Only one bid was under £1,500,000 and promised an implementation system in 6 months. This bid was selected. The winning bid was from Systems Options Ltd (SO), a small software house with no experience in safety- critical software. SO had never managed a large project

The Contract LAS signed a contract with SO in September The system was supposed to go on-line on January 8 th, The contract did not specify who would act as project manager or who would be responsible for quality assurance. No acceptance criteria was defined

Developing the system Suppliers failed to meet deadlines SO initially handled the project management, but this shifted to LAS as the project proceeded No independent QA or audit was performed; LAS intended to save money by leaving QA to the suppliers

Problem tracking There was a formal procedure for reporting, analyzing and fixing bugs but… this was often skipped so that the software could be changed quickly to satisfy users

Training problems Users were trained long before the system was on-line. The training was often out of date or forgotten by the time the system was available Users were only trained for their part of the system

Partial deployment The complete system was not ready by Jan 8; systems was deployed in pieces Bugs encountered System needed perfect vehicle information Every 53 rd vehicle was unavailable Workstations froze often (Windows 3.0) Vehicle allocation could not be overridden Sending the wrong vehicle

Expected to fail Interacting with the system was often awkward and frustrating The LAS Staff had little confidence in the system

No testing No testing of the full system was ever done Nobody ever tested to see if radio system could handle traffic Management did not know what resources were required to maintain service; the CAD system was supposed to give this information

Failure 1 On October 26th the LAS management decided to switch to the full CAD system. This decision was made even though the system was never tested there were outstanding bugs which were considered ‘severe’

Failure 1 Initially the system worked; there were some errors but the staff were able to correct them As the load increased the system response time decreased and the ambulance location data became less and less reliable

Feedback problems Bad data Crew frustration Fewer available vehicles More calls Longer waits for ambulance Bad allocation

Design errors Some of the design decisions made it harder to recover from errors Allocators could only get info on ambulances by reserving an ambulance Control room layout made it hard for operators to communicate System could not handle operators overriding computer decisions

Consequences At the height of the accident emergency calls were ringing for 10 minutes before being answered Some calls were lost because the list of calls was too big for the terminals 80% of ambulances took more than 15 minutes to respond. (Average was 67%).

Consequences cont. The media reported that patients died because of the failure. A coroner later concluded that this was false.

Failure 2 After the first failure LAS went back to the semi-automated system in use before October 26th. On November 4th the system froze The cause was a server that had run out of memory

Memory leak The server software had been changed 3 weeks before. This change introduced a small memory leak. The server had been running out of memory ever since

Backup system There was a backup server, but it was only designed to work in the full CAD system

Consequences At the time of the 2nd failure the load was light enough that the staff recovered all the information lost in the crash. No calls were missed. LAS went back to the original paper system

Next class ISO Software life cycle processes Would standards have prevented the LAS failure? Are standards worth it?