Patrick Hudson Tim Hudson Hudson Global Consulting

Slides:



Advertisements
Similar presentations
Risk Assessment. Objectives By the end of this presentation you will know: What risk assessment is; Where the need for risk assessment comes from; and.
Advertisements

Integra Consult A/S Safety Assessment. Integra Consult A/S SAFETY ASSESSMENT Objective Objective –Demonstrate that an acceptable level of safety will.
Safety Management Systems (SMS) An Introduction for Senior Management
Accident and Incident Investigation
Major Accident Prevention Policy (MAPP) and Safety Management System (SMS) in the Context of the Seveso II Directive.
Session No. 1 Basic Contemporary Safety Concepts
PROJECT RISK MANAGEMENT
Marine Corps Artillery Detachment, Fort Sill Veterans Day 2008 ORM.
ICAO Aerodrome Safety Workshop Almaty, Kazakhstan – 18 to 22 November 2002 NON-CONFORMITIES AND EXEMPTIONS AERONAUTICAL STUDIES.
Safety System & Scene. Overview Safety Terms Hazards Hazard Lists Worst Case Conditions Hazard Characteristics Analysis Sumary.
Project Management.
Aviation Safety, Security & the Environment: The Way Forward Vince Galotti Chief/Air Traffic Management ICAO Safety and Efficiency An ICAO Perspective.
1 Software Testing and Quality Assurance Lecture 38 – Software Quality Assurance.
“ Hard work and concern for the society is the key to success ” - O P Jindal On-site and Off-site Emergency Plans Based on Integral Risk Management – Key.
Computer Engineering 203 R Smith Risk Management 7/ Risk Management The future can never be predicted with 100% accuracy. Failure to plan for risks.
The Australian/New Zealand Standard on Risk Management
1 Risk evaluation Risk treatment. 2 Risk Management Process Risk Management Process.
HSE for Supervisors HEMP Module
Software Project Risk Management
Risk Management. RISK RISK = the probability and severity of loss linked to hazards. RISK = the probability and severity of loss linked to hazards. The.
What SMS means for an Operator’s relationship with the CAA
Bureau of Workers’ Comp PA Training for Health & Safety (PATHS)
A Safety Management System (SMS) is: “A systematic approach to managing safety, including the necessary organizational structures, accountabilities,
Health and Safety Executive IChemE – Process Safety KPIs Workshop session Ian Travers Head of Chemical Industries Strategy Unit Hazardous Installations.
April 2006ICAO Seminar Baku Principles and Elements of SMS A Review Patrick Hudson ICAO/Leiden University.
 Students will be able to:  List items in a AI plan  List items to include in an AI kit  Explain why human error could be a cause or a symptom of.
Risk assessments made easy! Phillip Crisp EcoSolve Australia Pty Ltd.
Software Dependability CIS 376 Bruce R. Maxim UM-Dearborn.
Nicosia, May 15th, th International Conference Challenges facing aviation safety - An airlines’ perspective.
Testing and Cost / Benefit Tor Stålhane. Why cost / benefit – 1 For most “real” software systems, the number of possible inputs is large. Thus, we can.
George Firican ICAO EUR/NAT Regional Officer Almaty, 5 to 9 September 2005 SAFETY MANAGEMENT SYSTEMS.
Science What is “Safety” Freedom from danger Safety is the condition of being protected against failure, breakage, error, accidents, or harm. (Protection.
Risk Management - the process of identifying and controlling hazards to protect the force.  It’s five steps represent a logical thought process from.
Risk Management Project Management Digital Media Department Unit Credit Value : 4 Essential Learning time : 120 hours.
Incident Investigation and Analysis
Safety Induction to the Lift & Escalator Industry
Chapter 12 Project Risk Management
Software Testing and Quality Assurance Software Quality Assurance 1.
Chapter(3) Qualitative Risk Analysis. Risk Model.
Jacques Vanier ICAO EUR/NAT Regional Officer Almaty, 5 to 9 September 2005 SAFETY MANAGEMENT SYSTEMS RISK VERSUS SAFETY.
Hazard Identification
IT Risks and Controls Revised on Content Internal Control  What is internal control?  Objectives of internal controls  Types of internal controls.
Introduction to Hazards Risk Management
The Risk Management Process
I DENTIFYING C AUSES OF A CCIDENTS Surface vs. Root Causes Surface causes are: the hazardous conditions or unsafe work practices that directly or indirectly.
TUI GROUP | RSC | Martin Ring | November 2015 Compliance vs Performance based risk management Martin Ring Head of Risk, Safety and Compliance.
1 Project Management C53PM Session 4 Russell Taylor Staff Work-base – 1 st Floor
Jacques Vanier ICAO EUR/NAT Regional Officer Almaty, 5 to 9 September 2005 SAFETY MANAGEMENT SYSTEMS SAFETY PERFORMANCE.
Monitoring, review and audit.
Development, Validation, Implementation and Enhancement for a Voluntary Protection Programs Center of Excellence (VPP CX) Capability for the Department.
RISK MANAGEMENT FOR COMMUNITY EVENTS. Today’s Session Risk Management – why is it important? Risk Management and Risk Assessment concepts Steps in the.
1 Software Testing and Quality Assurance Lecture 38 – Software Quality Assurance.
COSCAP-SA1 AERODROME CERTIFICATION COURSE AERODROME CERTIFICATION COURSE Safety Management System An introduction to the principles and concepts associated.
Department of Defense Voluntary Protection Programs Center of Excellence Development, Validation, Implementation and Enhancement for a Voluntary Protection.
RISK & ITS MANAGEMENT. Risk A crisis situation involves : - a threat to resource & people, - a loss of control, - visible and / or invisible effects on.
LECTURE 7 AVIATION SAFETY & SECURITY
Safety Management Systems Session Two Safety Risk Management APTA Webinar April 28, 2016.
Auditoria Interna e Gestão de Risco Operacional
Approaches to Defining Risk
NON-CONFORMITIES AND EXEMPTIONS AERONAUTICAL STUDIES
Monitoring, Review and Audit of OHS&E
DEFINITIONS.
HUMAN RESOURCE GOVERNANCE, RISK MANAGEMENT AND COMPLIANCE
Air Carrier Continuing Analysis and Surveillance System (CASS)
RISK ASSESSMENT TOOL PREVIEW
HSE Case: Risk Based Approach.
UA Road Transport Safety Forum Hazard & Effects Management Program
Architectural and Engineering Approaches to Infection Control
BHOPAL Industrial Disaster Management Cycle: OECD 2004.
A Safety Management System (SMS) is: “A systematic approach to managing safety, including the necessary organizational structures, accountabilities,
Presentation transcript:

Patrick Hudson Tim Hudson Hudson Global Consulting Risk management tools Patrick Hudson Tim Hudson Hudson Global Consulting

How can we manage risk? We can manage risk by hoping it won’t happen We can manage risk by offering sacrifices to the Gods We can manage risk by understanding what we are doing The first two don’t work The third is what a Safety Management System does

Risk Risk is a complex concept Combination of to different components RISK = Outcome x Probability of that outcome Outcomes – what could happen Usually seen as a scenario Worst case - conservative Most credible worst case Probability of those outcomes Often measured as frequency of occurrence Needs to be applied before anything has gone wrong Probabilities are difficult to estimate Knowing the probability may change its value

Building World Class SMS Session 16 Building World Class SMS

No Structure Structure HSE Policy There is more to an SMS than lots of good intentions No Structure Structure TRIPOD Organization Structure HSE Policy safety management system Alcohol & Drugs Policy Unsafe Act Audit Road Safety Plan Feedback Plan Audit Plans HAZARDS & EFFECTS MGMT. Continuous Improvement Engage Objectives Targets Incident Potential Matrix HSE Plan Check Health Risk Assess. Do EA

DISASTER BANKRUPTCY Safety Management System (SMS) Production Better defenses converted to increased production BANKRUPTCY Protection

Safety Management System (SMS) DISASTER Best practice operations under SMS Production BANKRUPTCY Protection

Generic HSE Management System (Shell) 1- Leadership and Commitment 2 - Policy and Strategic Objectives PLAN 3 - Organisation, Responsibilities Resources and Standards 4 - Hazards & Effects Mgt (Risk Mgt) DO FEEDBACK 5 - Planning & Procedures 6 – Implementation, Monitoring Corrective Action Corrective Action CHECK 7 - Audit 8 - Management Review Corrective Action

Hazard-based approach HEMP - Hazard and Effects Management Process Identify - What are the hazards? Assess - how big are those hazards? Control - how do we control the hazards? Recover - what if it still goes wrong?

Step 1. Identification First identify your hazards What is going to hurt you? Needs to be specific enough to manage practically E.g. not just potential and kinetic energy General enough to manage specifics in the same way Accumulate in a list – Hazard Register A range of tools and methods help here Brainstorming - proactive HAZID Incident analyses - reactive Reporting

Step 2. Assess How big is the risk you are taking and running? A wide range of tools available Not an exact science – whatever anyone tells you Small risks can be ignored Large risks may not be taken Usually framed in terms of ALARP As Low as Reasonably Practicable Not intended to be as low as possible Risk assessment should point to what to do about the hazard in question

Step 3. Manage and control Primarily preventative Success is measured by nothing going wrong Prevention involves a variety of approaches Use of the hierarchy of controls Barriers to keep hazards in place Controls to prevent them escaping Management is directly responsibility for the provision of controls and barriers Requires resourcing, procurement and continuous evaluation Front line personnel is responsible for their use once provided and supported Requires ability to operate the controls and barriers

Step 4. Recovery Recovery is necessary after control over a hazardous process has been lost But before the worst case consequences have been achieved Recovery controls and barriers are reactive The term Mitigation applies best here These controls are usually much more expensive than preventative controls Sometimes challenged because “We’ve never used that so we can get rid of it and save money”

Tools Risk management tools are intended to help one or more of the 4 steps Usually applied continuously to improve Especially on the feedback loops Audits Incident investigations Reporting Performance assessment for predictive improvement Identify – discover unexpected hazards Assess – evaluate what needs to be done Control – systematically list the controls to see if they are adequate to reduce the risk to acceptable levels Recover – identify what will reduce the consequences Successful risk management allows us to take the risks that enable us to get the benefits without disaster These can easily be mapped onto the ICAO components Not just the risk management elements Also all the other elements

Minimising Regret Maximising Opportunity No Regret Normal Operations Go Incident No-Go Missed Opportunity Safe

Risk Assessment Matrices A simple way of supporting the product of outcome and probability Not a discrete set of values, but an easy way of representing the distributions of severity of outcomes and their probabilities So – there is no single CORRECT Matrix

Risk Assessment Matrix The colour determines the level of active risk management required

Risk Calculations 1 2 3 4 Mitigation Right side Now After 5 6 8 10 11 1 2 3 4 Mitigation Right side Now After 5 6 8 10 11 7 9 12 13 14 Reduced exposure Left side

Risk matrix alternative 2 2 4 4 Mitigation Right side 5 8 12 15 28 8 20 40 100 200 Reduced exposure Left side The numbers are a reflection of how unacceptable the matrix cell is

What is ALARP? Risk Options ALARP = As Low As Reasonably Practical 20 20 40 60 80 100 120 1 2 3 4 5 6 Risk Risk to stakeholders Cost Legal mimimum requirements Options

How can we understand our controls? The Bowtie is an industry standard in many high-hazard activities Bowties cover both control and recovery Bowties are not primarily intended to be quantitative, but can be computed with Bowties visually express the extent and types of control and are easy for managers to understand Is everything procedural Does one person have to do everything

Bow-tie Concept HAZARD CONSEQUENCES Events and Circumstances Harm to people and damage to assets or environment CONTROLS HAZARD CONSEQUENCES Explain model Barriers to stop progression from left to right Risk management is about identifying hazards, putting barriers in place and keeping them in place. Barriers can be : hardware, procedures, competence emergency preparedness etc Activities required to achieve this: policy, organisation, standards, improvement loops etc. EXPLAIN VARIOUS DOCUMENTED SYSTEMS: HSE-MS, demonstration of assurance that there is a system to identify barriers and keep them in place HSE-Case, specifying which specific barriers have been put in place, to what specification, who is responsible for keeping it in place, required competency etc. PTW, workplace hazards management system; controlling the Hazards during execution of tasks Most of these elements have been discussed during the previous days. I will concentrate on the various improvement loops and, as many of our activities involve contractors, the management of contractors Undesirable event with potential for harm or damage Engineering activities Maintenance activities Operations activities

Successive layers of defenses, barriers and safeguards The “Swiss Cheese” Model of Accident Causation The layers of Swiss cheese represent the “Defenses/Barriers” in an organization, according to the Reason Model. Successive layers of defenses, barriers and safeguards Some holes due to “active” failures Hazards Some holes due to “latent” conditions Losses

Bow-tie Concept for a specific event Events and Circumstances Harm to people and damage to assets or environment RISK CONTROLS HAZARD CONSEQUENCES Explain model Barriers to stop progression from left to right Risk management is about identifying hazards, putting barriers in place and keeping them in place. Barriers can be : hardware, procedures, competence emergency preparedness etc Activities required to achieve this: policy, organisation, standards, improvement loops etc. EXPLAIN VARIOUS DOCUMENTED SYSTEMS: HSE-MS, demonstration of assurance that there is a system to identify barriers and keep them in place HSE-Case, specifying which specific barriers have been put in place, to what specification, who is responsible for keeping it in place, required competency etc. PTW, workplace hazards management system; controlling the Hazards during execution of tasks Most of these elements have been discussed during the previous days. I will concentrate on the various improvement loops and, as many of our activities involve contractors, the management of contractors Undesirable event with potential for harm or damage Engineering activities Maintenance activities Operations activities

A problem for aviation Simple models have difficulty in capturing recent major commercial aviation incidents Asiana 214, QF 32, AF 447, BA 38

A Diversion - Causality Simple accidents are simply caused Linear and deterministic Complex accidents are more complex 80-20 rule suggests simple accidents are 80% Remaining 20% require us to recognize complexity

Theory 1 - how accidents are caused Linear causes – A causes B causes C Deterministic - either it is a cause or it isn’t We can compute both backwards and forwards People are seen as the problem – human error etc Probably good enough to catch 80% of the accidents we are likely to have Covers most of private and GA operators

Private users

Theory 2 - how accidents are caused Non-Linear causes Cause and consequence may be disproportionate These causes are organizational, not individual Deterministic dynamics- either it is a cause or it isn’t We can compute both backwards and forwards Increasingly difficult with non-linear causes This is the Organizational Accident Model Probably good enough to catch 80% of the residual accidents = 96% Probably best GA and professional operations

Oilfield operations

Non-linearity The size of an effect (consequence) is linearly proportional to the input – linearity Non-linearity is different The size of an effect (bad consequences) gets bigger (or smaller after a while) as a function of the input The improvement in performance gets smaller (almost always) even though the input gets bigger Linearity works fine to start with, but only 80% of the cases

Linear and non-linear functions Effect Effect Cause Cause Suddenly gets a lot worse

More non-linear functions Effect Effect Cause Cause It can’t get much worse Both – starts bad, tails off

Determinism A Causes B If A happens, then B will happen next

Non-determinism Move from A causes B to A makes B more likely Causation is probabilistic Probabilities are distributions, not points

Conditionalize on latest aircraft generation 4 th generation aircraft have dominantly weird accidents

Types of accidents Theory 1 Simple models may cover 80% of all accidents These are the simple personal accidents Theory 2 The next step gets 80% of the remainder = 96% These are the complex personal accidents and some organizational accidents Theory 3 The probabilistic approach may net the next 80% = 99.2% These are the complex process accidents

Theory 3 - how accidents are caused Non-Linear causes Non-Deterministic dynamics Probabilistic rather than specific Influences on outcomes by people and the organisation Probabilities may be distributions rather that single values We cannot compute both backwards and forwards The dominant accidents that remain are WEIRD WILDLY ERRATIC INCIDENTS RESULTING IN DISASTER Prior to an event there may be a multitude of possible future outcomes

Unusual or WEIRD Accidents In commercial aviation major accidents are now extremely rare Simple risk assessment and analysis models often fail to capture how these accidents are caused We need to understand our risk space better The Rule of Three is an example of how to do this

The Rule of Three Accidents have many causes (50+) A number of dimensions were marginal Marginal conditions score as Orange NO-Go conditions score as Red The Rule of 3 is Three Oranges = Red

Aircraft Operation Dimensions Crew Factors Experience, Duty time, CRM Aircraft Perf. Category, Aids, Fuel, ADDs Weather Cloud base, wind, density alt, icing, wind Airfield Nav Aids, ATC, Dimensions, Topography Environment Night/day, Traffic, en route situation Plan Change, Adequacy, Pressures, Timing Platform Design, Stability, Management

The Rule of Three Crash Big Sky Outcome We fixed it Problem No problem 1/2 1 1/2 2 1/2 3 1/2 No of Oranges

Why does the rule work? People use cognitive capacity to allow for increasing risk As the oranges increase the remaining available capacity is reduced At 3 oranges there is little available capacity remaining Any trigger can de-stabilize the system An accident suddenly becomes very likely

How random numbers combine Load > strength Normal upper limit Normal lower limit

The danger zone/safe zone – safe operating envelope concept

Risk Risk is a complex concept Classically probability x outcome Safety management is about: Taking risk – acceptable (ALOS) vs unacceptable Running risk – getting away with it Can be based on luck or on professionalism The granularity of the outcomes and how they can be reached is essential Most approaches are crude Salami slicing is a way to evade regulation

Risk Space High Risk areas Low risk/resilient areas

Single distribution A Known danger zone

Single distribution B Known danger zone

Single distribution C Known danger zones Known danger zone

Combined distribution (A,B,C)

Combined distribution (A,B,C) Known danger zone Known danger zones

Combined distribution (A,B,C) Known danger zone Known danger zones Unexpected danger zone

Simple view of combined distribution

Simple view of combined distribution Low average risk despite danger zone

Simple view of combined distribution Medium average risk despite danger zone

Simple view of combined distribution High average risk due to sufficient granularity

Mission Creep and Drift into Danger Success with risks makes people willing to accept greater risks This is a consequence of risk homeostasis This can look like complacency, but is a natural consequence of their successes, so far Failure to understand the finer detail of the risk space makes this drift into danger more likely

Conclusion Conventional risk assessment involves uncovering the potential for bad consequences Modern commercial aviation is very safe, so the accidents we wish to avoid may not be caught by standard techniques Advanced risk analysis involves increasing our understanding of the risk space we operate in