Erik Hollnagel, PhD University of Linköping 16 May :00 PM

Slides:



Advertisements
Similar presentations
Elements of an Effective Safety and Health Program
Advertisements

Accident and Incident Investigation
Health and Safety Chapter 10.
Protect the Force Through Risk Management AC1 ACCIDENT CAUSATION.
PATH216 Mohamed M. B. Alnoor SAFETY and INJURY PREVENTION.
Humans in safety critical systems
Terminal Safety. Objectives Identify main causes Outline terminal safety organization State the safe working practices.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 20 Slide 1 Critical systems development.
Introduction to effective Incident/Accident Analysis
Accident Investigations
Federal Motor Vehicle Safety Standards. Applicability Each standard of this part applies to all motor vehicles or items of motor vehicle equipment manufactured.
October 2008 International Rail Safety Conference 2008 Denver, Colorado, USA.
Workplace Safety and Health Program
Responsible CarE® Employee health and Safety Code David Sandidge Director, Responsible Care American Chemistry Council June 2010.
©Ian Sommerville 2006Critical Systems Slide 1 Critical Systems Engineering l Processes and techniques for developing critical systems.
PERMIT REQUIRED CONFINED SPACES PERMIT REQUIRED CONFINED SPACES.
Safety and Health Programs
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 2 Slide 1 Systems engineering 1.
Accident /Incident Investigation Paul Thornton
Occupational Road Risk Health and safety issues for vehicles and drivers Mike Lewis MIOSH, RSP.
Protection Against Occupational Exposure
Hazard Analysis Critical Control Point (HACCP)
Control environment and control activities. Day II Session III and IV.
PHILOSOPHY OF ACCIDENT PREVENTION
Software Dependability CIS 376 Bruce R. Maxim UM-Dearborn.
Pipeline Qra Seminar Title slide Title slide.
Hazard Identification
The Field Guide to Human Error Investigations- The Old View (Chapters 1 – 6) By Dekker AST 425.
Topic 5 Understanding and learning from error. LEARNING OBJECTIVE Understand the nature of error and how health care can learn from error to improve patient.
Levels of safety Priorities for eliminating hazards in the workplace Eliminate the hazard through the machine design stage Apply safeguarding technology.
George Firican ICAO EUR/NAT Regional Officer Almaty, 5 to 9 September 2005 SAFETY MANAGEMENT SYSTEMS.
Science What is “Safety” Freedom from danger Safety is the condition of being protected against failure, breakage, error, accidents, or harm. (Protection.
How Hospitals Protect Your Health Information. Your Health Information Privacy Rights You can ask to see or get a copy of your medical record and other.
XpsOES : A New Tool for Improving Safety at Workplace Yasar Kucukefe, Ph.D., National Power Energy.
Health and Safety Policy
Safety Induction to the Lift & Escalator Industry
 Injury  Hazards  Conditional Circumstances  Legal Responsibilities  Guidelines for Safe Products/systems  Safety Hierarchy  Safe Design Principles.
Situational Awareness Numerous aircraft and operational displays, when combined with effective and efficient communications and facilities, provide Air.
M. Frize, Winter 2003 Reliability and Medical Devices Prof. Monique Frize, P. Eng., O.C. ELG5123/ February 2003.
Safety-Critical Systems T Ilkka Herttua. Safety Context Diagram HUMANPROCESS SYSTEM - Hardware - Software - Operating Rules.
Why Do We Have Accidents?
Historical Perspective and Overview. Accident Causation  Why do accidents happen?  This question has concerned safety & health decision makers for decades.
Hazard Identification
Quality Assurance.
Safety-Critical Systems 7 Summary T V - Lifecycle model System Acceptance System Integration & Test Module Integration & Test Requirements Analysis.
A guide to... Safe Systems of Work.
What is an accident and why should it be investigated?
EliminationSubstitutionEngineeringAdministrationPPE Incident Reporting.
Safe Systems of Work. Legislation w HSWA Section 2 (2) (a): Provide and maintain plant and systems of work that are, so far as is reasonably practicable,
Presented to: By: Date: Federal Aviation Administration AIRWORTHINESS Positive Safety Culture Failure to Follow Procedures 1 R1.
WHAT IF ANALYSIS USED TO IDENTIFY HAZARDS HAZARDOUS EVENTS
Behind the Mirror of Safety Steve Danon Director, Risk Control Services Marcotte Insurance Agency.
SafeMARINERTM Helping Companies Get to Zero
An introduction to… Lockout Tagout. Aim Every year thousands of workers get killed or injured while performing repairs or maintenance on industrial equipment.
RISK MANAGEMENT FOR COMMUNITY EVENTS. Today’s Session Risk Management – why is it important? Risk Management and Risk Assessment concepts Steps in the.
EFFECTIVE ACCIDENT/INCIDENT INVESTIGATION 15 FEBRUARY 2013 PHILIPPINE ASSOCIATION OF SAFETY ENGINEERS -QATAR- -QATAR- COMMITTEE ON SAFETY EDUCATION 2013.
BMS4667 Laboratory Leadership and Management Dr. David Ricketts.
LECTURE 7 AVIATION SAFETY & SECURITY
Process Safety Management Soft Skills Programme Nexus Alliance Ltd.
Health and Safety Legislation E5: Human Factors Design IB Technology.
ACCIDENT CAUSATION.
Guide for the application of CSM design targets (CSM DT)
Construction Safety Management
ACCIDENT INVESTIGATION PRESENTATION
Reporting Incidents and Hazards Accident Prevention
Elements of an Effective Safety and Health Program
Elements of an Effective Safety and Health Program
Presentation transcript:

Erik Hollnagel, PhD University of Linköping 16 May 2002 2:00 PM 2002 Human-Technology Integration Colloquium Series Air Force Research Laboratory Human Effectiveness Directorate Barrier Analysis and Accident Prevention Erik Hollnagel, PhD University of Linköping 16 May 2002 2:00 PM

Understanding and predicting accidents 'My dear friend Copperfield,' said Mr. Micawber, … ‘Accidents will occur in the best-regulated families; they may be expected with confidence, and must be borne with philosophy.’ Systems; organisations The probability that a specified event will occur. The degree of certainty by which accidents can be expected. The principles (models and theories) for describing and analysing accidents The lessons learned and the approaches to system design (prevention, protection). Charles Dickens David Copperfield (1850) Chapter 28

Changes in attributed cause types Technology, equipment Human performance Organisation 100 90 ? 80 % Attributed cause 70 60 50 40 ? 30 20 ? 10 1960 1965 1970 1975 1980 1985 1990 1995 2000

Model contents and model form Accident meta-model The change in the contents (attributed causes) of models still refers to the same accident meta-model. Technology, equipment Human performance Organisation

Causality assumption Every cause has an effect Cause Effect 1. If we know what this is ... 2. then we can look for this! Every event (effect) has a prior cause Cause Effect 2. then we can find out what this is! 1. If we can see what this is ...

Cause and effect Isaac Newton: Classical mechanics, clear relations between cause and effect (1st, 2nd, 3rd Law) David Hume: Causality = priority in time of cause to effect, contiguity in space and time, necessary connection Willard Gibbs: Statistical mechanics, probabilistic relations between cause and effect Non-linear dynamics (chaos theory): Confluence, coincidence

Sequential accident models Linear chain of events Domino model Sequential models Tree models Event tree Network models Critical path Principle of accident analysis Goal of accident analysis Search for recognisable, specific causes and well-defined cause effect links. Causes - when found - can be eliminated or contained.

Sequential accident model Direction of causality Unsafe act Unexpected, unwanted consequence Direction of reasoning

Domino model (Heinrich, 1930) Ancestry person Hazards Accident Injury Social environment Fault of Mechanical & physical Unsafe act Accident Injury

Domino model - cause elimination Hazards Mechanical & physical Unsafe act Accident Injury Ancestry Accident Injury person Social environment Fault of

Anatomy of an accident Green, 1988 Normal condition Abnormal condition Unexpected event Loss of control Failure of control Accident The accident is described as a sequence of co-occurring events / conditions Lack of defence

Spill at Cadarache (F) Tank overflow alarm fails Water spills into low level radiation tank Tap in eye rinsing basin not turned off Basin overflows to storage tank 10-12 m3 water flows into sump Tank overflow alarm fails Contaminated water in outside rainwater tank Sump pump is connected to outside rainwater tank

Epidemiological accident models Latent conditions Iceberg model Epidemiological models Carriers-barriers Swiss cheese model Pathological systems Principle of accident analysis Goal of accident analysis Search for “carriers” and latent conditions; define indications of general system “health”. Make defences and barriers stronger. … but … causality cannot be attributed solely on the basis of a temporal relation (A prior to B)

Epidemiological accident model Direction of causality Latent system conditions Performance deviation Working conditions Unexpected, unwanted consequence Latent system conditions Direction of reasoning

Epidemiological model (Suchman, 1960) Risk-taking; Appraising margin of error Injury; Damage Predisposition characteristics Situational characteristics Accident conditions Accident effects Susceptible host; Hazardous environment; Injury-producing agent Unexpected; Unavoidable; Unintentional

NY Subway Crash A NYC subway train on the Williamsburg Bridge crashed into the rear end of another train on 5 June 1995. Motorman apparently ran through a red light, was still applying power at time of crash. Motorman was killed, 54 were injured. ATC is supposed to apply emergency brakes whenever a train runs a red light. The brakes did work, but: Distance to train ahead was 288 ft Breaking distance at 32 mph is 360 ft Collision speed was about 14-18 mph Signal spacing was defined in 1918 (sic). At that time trains were shorter, lighter, and slower than modern trains. Trains had been upgraded, but control systems had not.

Systemic accident models Control theory Sharp end Blunt end Systemic models Coincidence Stochastic resonance Principle of accident analysis Goal of accident analysis Search for unusual dependencies and “common conditions” Performance variability can be detected and controlled

Systemic accident model Latent system conditions Barriers defences Function failure at the “sharp end” Unexpected, unwanted consequence Function failure at the “blunt end” Latent system conditions

M/S Stockholm – July 20, 2000 Captain + first officer on bridge Two crewmembers (AB) in engine room Complete loss of electrical power One AB accidentally shuts off fuel to main generator Captain connects shaft generator Captain sets pitch control = 0; rudder to port to avoid rock face SG overload trip after 10 seconds Emergency generator does not start Captain tries to reconnect SG First officer turns off unnecessary electrical equipment (galley, etc.) Captain stops engines; emergency clutch out Grounding of M/S Stockholm Emergency generator started

Factors at local workplace Sharp end - blunt end Factors at local workplace Morals, social norms Unsafe acts Management Company Regulator Government “Blunt end” factors are removed in space and time “Sharp end” factors work here and now

Sharp-end - blunt-end Everybody’s blunt end is someone else’s sharp end. Accident Operational staff Work actions Company Management Government Regulators Source: K. Roberts, 2001

Accident meta-models Sequential accident model Epidemiological accident model Systemic accident model Search principle of accident analysis Specific causes and well-defined links. Carriers, barriers, and latent conditions. Functional dependencies and common conditions Goal of accident analysis Eliminate or contain causes. Strengthen defences and barriers . Monitor & control performance variability

Evolving concept of causes Latent failure conditions Barriers Resources Other Safety culture Organisational failures Quality management Pathogenic organisations Accident / event Technical failures Software failures Violations Operation Heuristics Cognitive functions “Human error” Information processes Management Maintenance Design Simple causality Complex coincidences

Axioms of industrial safety (1-5) The occurrence of an injury invariably results from a completed sequence of factors - the last one of these being the accident itself. The accident in turn is invariably caused or permitted directly by the unsafe act of a person and/or a mechanical or physical hazard. 2 The unsafe acts of persons are responsible for a majority of accidents. 3 The person who suffers a disabling injury caused by an unsafe act, in the average case has had over 300 narrow escapes from serious injury as a result of committing the very same unsafe act. Likewise, persons are exposed to mechanical hazards hundreds of times before they suffer injury. 4 The severity of an injury is largely fortuitous - the occurrence of the accident that results in injury is largely preventable. 5 The four basic motives or reasons for the occurrence of unsafe acts provide a guide to the selection of appropriate corrective measures.

Axioms of industrial safety (6-10) Four basic methods … for preventing accidents - engineering revision, persuasion and appeal, personnel adjustment, and discipline. 7 Methods of most value in accident prevention are analogous with the methods required for the control of the quality, cost, and quantity of production. 8 Management has the best opportunity and ability to initiate the work of prevention; therefore it should assume the responsibility. 9 The supervisor or foreman is the key man in industrial accident prevention. His … … supervision to the control of worker performance is the factor of greatest influence in successful accident prevention. … 10 The humanitarian incentive for preventing accidental injury is supplemented by two powerful economic factors: (1) the safe establishment is efficient productively and the unsafe establishment is inefficient; (2) the direct employer cost of industrial injuries for compensation claims and for medical treatment is but one-fifth of the total cost which the employer must pay.

Exploding steam engines US 1816-1848 233 steamboat explosions 2.562 persons killed; 2.097 injured Property loss in excess of 3.000.000 $ Most accidents were blamed on owners and operators. BUT Boiler technology lagged behind improvements in steam engines. Little understanding of build-up of steam pressure, effects of corrosion, causes of boiler explosions. Engineers lacked proper training and skills.

Counterfactual reasoning “Why didn’t they do A”? “Why didn’t they do B”? Actual outcome Possible outcome 1 Possible outcome 2 Going back through a sequence, investigators often wonder why opportunities to avoid the bad outcome were missed. This, however, does not explain the failure

Performance deviations "Knowledge and error flow from the same mental sources, only success can tell one from the other." (Mach, 1905) Actions with a negative outcome. Human performance is inherently variable! HUMAN ERROR! Both types are performance deviations, and may have the same “causes” Work conditions are inherently variable! Actions with a beneficial outcome. CREATIVITY, LEARNING

Multiple meanings of “error” Error-as-cause Oil spill was caused by human error Cause Consequence (observable failure) Error-as-outcome Error-as-event Error-as-action I left the key in the lock; latent “human error” I forgot to check the water level

What is an “error”? Correctly performed actions Actual outcomes = intended outcomes Failure detected and recovered Actual outcomes  intended outcomes Failure detected but tolerated Immediate effects Latent effects Failure detected but not recovered Failure not detected

A cynical definition of causes A “cause” is the identification, after the fact, of a limited set of aspects of the situation that are seen as the necessary and sufficient conditions for the effect(s) to have occurred. A “cause” has the following characteristics: It can unequivocally be associated with a system structure or function (people, components, procedures, etc.) It is possible to do something to reduce or eliminate the cause within accepted limits of cost and time. It conforms to the current “norms” for explanations. The determination of the “cause” is a relative (pragmatic) rather than absolute (scientific) process.

Analysis-prediction dilemma ? Looking back, we acknowledge that accidents reflect complex coincidences Looking ahead, accident “models” are still mostly linear or sequential.

Are there any known or valid indicators for accident build-up? Accident prevention To prevent accidents, we must know: Are there any known or valid indicators for accident build-up? What Which types of accidents are possible in a system? Which types of accidents are possible in a system? Are there effective means (barriers, defences) to guard against accidents? Where Where in the system can accidents occur? How What are the “mechanisms” of an accident? When Under which conditions are accidents likely?

Barriers and safety Barrier purposes (WHY) Barrier function (WHAT) a barrier is an obstacle, obstruction or hindrance that may: prevent an action or event from taking place protect against or diminish the negative consequences of an action or event that has taken place. Barrier function (WHAT) The specific manner by which the barrier achieves its purpose Barrier system (HOW) The foundation or basis for the barrier function, the required organisational and/or physical structure Barriers can be single or combined (defence-in-depth) Barriers are effective even if the cause is unknown or uncertain.

Prevention and protection Accident Initiating event, failure mode (“Incorrect” action) Protection (safety barriers): Active barrier functions that deflect consequences Protection (boundaries): Passive barrier functions that minimise consequences Prevention (control barriers): Active or passive barrier functions that prevent the initiating event from occurring.

Barrier system types Physical, material Functional Symbolic Immaterial Obstructions, hindrances, ... Functional Mechanical (interlocks) Logical, spatial, temporal Symbolic Signs & signals Procedures Interface design Immaterial Rules, laws, principles Ten Commandments, Laws of Robotics

Types of barrier systems Material barriers Physically prevents an action from being carried out, or prevents the consequences from spreading Functional (active or dynamic) barriers Hinders the action via preconditions (logical, physical, temporal) and interlocks (passwords, synchronisation, locks) Symbolic barriers (perceptual, conceptual barriers) Requires an act of interpretation to work, i.e. an intelligent and perceiving agent (signs, signals alarms, warnings) Immaterial barriers (non-material barriers) Not physically present in the situation, rely on internalised knowledge (rules, restrictions, laws)

Barriers systems on the road Symbolic: requires interpretation Physical: works even when not seen Symbolic: requires interpretation Symbolic: requires interpretation

Barrier systems / barrier functions Examples Containing Walls,fences, tanks, valves Material, physical Restraining Safety belts, cages Keeping together Safety glass Dissipating Air bags, sprinklers Preventing (hard) Locks, brakes, interlocks Functional Preventing (soft) Passwords, codes, logic Hindering Distance, delays, synchronisation Countering Function coding, labels, warnings Regulating Instructions, procedures Symbolic Indicating Signs, signals, alarms Permitting Work permits, passes Communicating Clearance, approval Immaterial Monitoring Monitoring Prescribing Rules, restrictions, laws

Spill at Cadarache (F) Tank overflow alarm fails Function: indicating System: symbolic Barrier: Tank overflow alarm Water spills into low level radiation tank Tap in eye rinsing basin not turned off Basin overflows to storage tank Barrier: Tank overflow alarm 10-12 m3 water flows into sump Contaminated water in outside rainwater tank Tank overflow alarm fails Sump pump is connected to outside rainwater tank

Train accident Temporary incapacitation Illness Performance variability Inattention Speed: Too fast Observation missed Speed: Too fast Temporary incapacitation Memory failure Inadequate plan Barrier: ATC Train derailed ATC not working: Equipment not activated

Double role of barriers Train out of control Automation acts as barrier if engineer fails Engineer acts as barrier if automation fails Barrier: ATC Barrier: Engineer Engineer misses a signal ATC does not function

Glasgow bus accident Bus collides with bridge Bus collides with bridge Bridge too low for bus Bus driver doesn’t notice low bridge Bridge too low for bus Bus driver doesn’t notice low bridge Unusual route Bus driver tired Party late September 18, 1994

Glasgow bus accident Bus collides with bridge Automatic braking Low bridge too close Bridge too low for bus Bus driver doesn’t notice low bridge Acoustic signal Low bridge approaching Unusual route Bus driver tired Party late

Barrier evaluation criteria Efficiency: how efficient the barrier is expected to be in achieving its purpose. Robustness: how resistant the barrier is w.r.t. variability of the environment (working practices, degraded information, unexpected events, etc.). Delay: Time from conception to implementation. Resources required. Costs in building and maintaining the barrier. Safety relevance: Applicability to safety critical tasks. Evaluation: How easy it is to verify that the barrier works. Other: Maintenance needs; complexity; reusability; …

Evaluation of barrier quality Material barriers Functional barriers Symbolic barriers Immaterial barriers Efficiency High High Medium Low Robustness (reliability) Medium-High Medium-High Low-medium Low Delay Long Long Medium Short Resource needs Medium-High Medium-High Medium Low Safety relevance Low Medium Low (uncertain interpretation) Low Evaluation Easy Medium Easy Difficult

Requirements for effective barrier functions Barrier system Barrier function relies on Pre/condition for proper functioning Reliance on humans Material Physical properties Reliable construction, possibly regular maintenance Low (maintenance) Functional Mechanical Reliable construction, regular maintenance. Low Logical Verified implementation, adequate security. Low Spatio-temporal Reliable construction, regular maintenance. Low Monitoring Reliable performance of monitor Medium Symbolic Interface design Valid design specification, verified implementation, systematic updating Medium Information High-quality interface design, reliable functioning. High Signs, signals and symbols Regular maintenance, systematic modification, High Permission or authorisation High compliance by users. High Immaterial Communicative, interpersonal Nominal working conditions (no stress, noise, distraction, etc. High Rules, cautions, prohibitions High compliance by users. High

Redundancy in accident prevention (Reason, 1997) Provide means of escape and rescue Contain and eliminate hazards Interpose safety barriers between hazards and losses Restore system to safe state in off-normal conditions Provide alarms when danger is imminent Give clear guidance on safe operation Create understanding of hazards In the meaning of redundancy, Reason proposed the seven step of defences-in-depth as shown here. Concrete Abstract

Diversity in accident prevention Immaterial barrier system Laws, rules, principles, … Guide humans on safe performance Need to be interpreted Symbolic barrier system Signs, signals, procedures, … Functional barrier system The idea of barrier system that is proposed Prof. Hollnagel is related to the diversity of defences. The barrier systems are classified into four types. That is, immaterial barriers such as laws, rules, and procedures, symbolic barriers such as signs and signals, functional barriers such as interlock, and password, and material barriers such as walls and guardrails. Note here that immaterial and symbolic barriers can guide humans to behave safely, but they need to be interpreted. So these barriers can be breached if the interpretation is not successful. On the other hand, functional barriers and material barriers can not guide the human behavior but prevents unsafe acts and its consequences. So these barriers need not to be interpreted. Interlocks, passwords, … Prevent unsafe acts and their consequences Do not need to be interpreted Material barrier system Walls, guardrails, …

Structure of Defences-in-Depth (Itoh, 2001) Help escaping Contain hazards Interpose barriers Restore system Provide alarms Give guidance Create understan-ding Immaterial Prescribing Symbolic Indicating Countering, regulating Permission, communi-cation Functional Preventing Preventing, hindering Monitoring What I want to claim in this talk is very simple. That is, combination of the notion of redundancy and diversity is useful to analyze the structure of defences. This table illustrates the redundancy-diversity matrix. Examples of barrier function type that are mentioned by Hollnagel in 1999 are shown in each cell. Material Dissipating Restoring, Keeping together Containing, protecting

Conclusions Accident model determines analysis and responses “Causes” reflect the assumptions of the underlying models Performance variability management rather than accident / “error” prevention The misleading simplicity of “human error” Human performance is inherently variable - but not unreliable Variability reflects work conditions Performance deviations have positive and negative consequences: “errors” as an opportunity for learning. Barriers are effective even if causes are unknown or uncertain Distinction between barrier systems and barrier functions Requirements for effective barrier functions. Accident prevention: redundancy and diversity