The Empirical Bayes Method for Before and After Analysis

Slides:



Advertisements
Similar presentations
FUTURE CMF RESEARCH AND CHALLENGES Traffic Records Forum October 27, 2014 Daniel Carter, UNC HSRC.
Advertisements

Lec 33, Ch.5, pp : Accident reduction capabilities and effectiveness of safety design features (Objectives) Learn what’s involved in safety engineering.
Spring Before-After Studies Recap: we need to define the notation that will be used for performing the two tasks at hand. Let: be the expected number.
Spring  Crash modification factors (CMFs) are becoming increasing popular: ◦ Simple multiplication factor ◦ Used for estimating safety improvement.
Department of Civil Engineering University of Washington Quantitative Safety Analysis for Intersections on Washington State Two-lane Rural Highways Master’s.
Investigation of Varied Time Intervals in Crash Hotspot Identification Authors: Wen Cheng, Ph.D., P.E., Fernando Gonzalez, EIT, & Xudong Jia; California.
Developing an Intelligent Decision Support System for the Proactive Implementation of Traffic Safety Strategies Hongyi Chen, Ph.D. Assistant Professor.
Spring Sampling Frame Sampling frame: the sampling frame is the list of the population (this is a general term) from which the sample is drawn.
Spring INTRODUCTION There exists a lot of methods used for identifying high risk locations or sites that experience more crashes than one would.
Empirical Bayes Estimate Spring Empirical Bayes Model For the EB method, a different weight is assigned to the prior distribution and standard estimate.
Incorporating Safety into the Highway Design Process.
Need to know in order to do the normal dist problems How to calculate Z How to read a probability from the table, knowing Z **** how to convert table values.
Generalized Linear Models
Incorporating Temporal Effect into Crash Safety Performance Functions Wen Cheng, Ph.D., P.E., PTOE Civil Engineering Department Cal Poly Pomona.
The Empirical Bayes Method for Safety Estimation Doug Harwood MRIGlobal Kansas City, MO.
1 Validation and Implication of Segmentation on Empirical Bayes for Highway Safety Studies Reginald R. Souleyrette, Robert P. Haas and T. H. Maze Iowa.
Evaluation of Alternative Methods for Identifying High Collision Concentration Locations Raghavan Srinivasan 1 Craig Lyon 2 Bhagwant Persaud 2 Carol Martell.
1 CEE 763 Fall 2011 Topic 1 – Fundamentals CEE 763.
Safety management software for state and local highway agencies: –Improves identification and programming of site- specific highway safety improvements.
Role of SPFs in SafetyAnalyst Ray Krammes Federal Highway Administration.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
HSM: Another Tool for Safety Management in Wyoming 1 Excellence in Transportation.
SPF Development and Data Needs John Milton Ph.D., P.E., Washington State Department of Transportation National Safety Performance Function Summit July.
1 7. What to Optimize? In this session: 1.Can one do better by optimizing something else? 2.Likelihood, not LS? 3.Using a handful of likelihood functions.
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
1 CH1. What is what CH2. A simple SPF CH3. EDA CH4. Curve fitting CH5. A first parametric SPF CH6: Which fit is fitter CH7: Choosing the objective function.
CE 552 Week 9 Crash statistical approaches Identification of problem areas - High crash locations.
July 29 and 30, 2009 SPF Development in Illinois Yanfeng Ouyang Department of Civil & Environmental Engineering University of Illinois at Urbana-Champaign.
Calibrating Highway Safety Manual Equations for Application in Florida Dr. Siva Srinivasan, Phillip Haas, Nagendra Dhakar, and Ryan Hormel (UF) Doug Harwood.
 2006 National Council on Compensation Insurance, Inc. Slide 1 of 17 A Claim Counts Model for Discerning the Rate of Inflation from Raw Claims Data Spring.
Fall  Crashes are “independent” and “random” events (probabilistic events)  Estimate a relationship between crashes and covariates (or explanatory.
Sampling Design and Analysis MTH 494 Lecture-21 Ossam Chohan Assistant Professor CIIT Abbottabad.
Session 2 History How did SPF come into being and why is it here to stay? Geni Bahar, P.E. NAVIGATS Inc.
Role of Safety Performance Functions in the Highway Safety Manual July 29, 2009.
Evaluating the performance of three different network screening methods for detecting high collision concentration locations using empirical data Prepared.
LOW COST SAFETY IMPROVEMENTS Practitioner Workshop The Tools – Identification of High Crash Locations – Session #2.
1 The Highway Safety Manual Predictive Methods. 2 New Highway Safety Manual of 2010 ►Methodology is like that for assessing and assuring the adequacy.
Low Cost Safety Improvements Pooled Fund Study Analytical Basics Dr. Bhagwant Persaud.
SUR-2250 Error Theory.
Impact of Intersection Angle on Safety
Caldwell and Wilson (1999) 1. Determine primary rating factor for a road section based on traffic volume and user types 2. Primary rating factor is then.
The Empirical Bayes Method for Before and After Analysis
Artificial Realistic Data (ARD)
HSM Applications to Multilane Rural Highways and Urban Suburban Streets Predicting Crash Frequency and CMFs for Rural Divided Multilane Highways - Session.
Physics 114: Exam 2 Review Weeks 7-9
Generalized Linear Models
Solve Systems of Linear Equations by Elimination
Types of Poisson Regression. Offset Regression  A variant of Poisson Regression  Count data often have an exposure variable, which indicates the number.
8.1 – 8.3 Solving Proportions, Geometric Mean, Scale Factor of Perimeter and Area Since these sections are all very short, we are combining them into one.
Statistical Process Control
Physics 114: Exam 2 Review Material from Weeks 7-11
Estimating
Exploratory Analysis of Crash Data
Before-After Studies Part I
26th CARSP Conference, Halifax, June 5-8, 2016
Transportation Engineering Basic safety methods April 8, 2011
Network Screening & Diagnosis
Doug Harwood Midwest Research Institute
KS3 Mathematics A5 Functions and graphs
HW 7b: HSM Practitioner’s Guider for Two-Lane Rural Highways Workshop
Day 71 – Verifying dilations
If the question asks: “Find the probability if...”
Pull 2 samples of 10 pennies and record both averages (2 dots).
Solving Systems of Equations by the Substitution and Addition Methods
Solving Inequalities Solving inequalities follows the same procedures as solving equations. There are a few special things to consider with.
HSM Practitioner’s Guider for Two-Lane Rural Highways Workshop
Algebra 1 Section 4.6.
HSM Practitioner’s Guider for Two-Lane Rural Highways Workshop
OUTLINE Questions? Quiz Go over homework Next homework Forecasting.
CARSP Conference May 26-29, 2019 Calgary
Presentation transcript:

The Empirical Bayes Method for Before and After Analysis

Key Reference Hauer, E., D.W. Harwood, F.M. Council, M.S. Griffith, “The Empirical Bayes method for estimating safety: A tutorial.” Transportation Research Record 1784, pp. 126-131. National Academies Press, Washington, D.C.. 2002 http://www.engr.uky.edu/~rsouley/CE%20635/docs/Bayes_tutor_hauer.pdf Open This Document and read through as you go along on PPT

EB Procedures Abridged Full Last 2-3 years data Traffic volume Can use more data Includes other factors

Empirical Bayes Weight should be based on sound logic and real data

The SPF – Safety Performance Function So what is the expected number of crashes for facilities of this type? Develop a (negative binomial) regression model to fit all the data – must have data to do this.* An example SPF: μ=average crashes/km-yr (or /yr for intersections) So, if ADT = 4000 Note: this SPF depends only on ADT … it needn’t * Can also use equations from HSM, but need “phi”

The overdispersion parameter The negative binomial is a generalized Poisson where the variance is larger than the mean (overdispersed) The “standard deviation-type” parameter of the negative binomial is the overdispersion parameter φ Variance of crash counts = η[1+η/(φL)] Where … μ=average crashes/km-yr (or /yr for intersections) η=μYL (or μY for intersections) = number of crashes/time φ=estimated by the regression (units must be complementary with L, for intersections, L is taken as one)

Example 1: How many crashes should we expect next year???

Example 1: road segment, 1 yr. of data

Example 1: computing the weight What happens when Y is large? When μ is small (compared to φ)?

Example 1 (cont): = 4.71 ± 1.19 accidents/km/year Note that the variance of the EB estimate, 2.142 = 4.58, is less than the variance of the accident counts on the segment, 9.44 = 4.71 ± 1.19 accidents/km/year

Example 2: 3 years of data: 12, 7, 8 4000 vpd Step 1: Step 2: Step 3: Note effect of more data… weight is smaller, placing less emphasis on the model estimate As before = 7.97 ± 1.44 accidents/year for the section (compare to previous estimate and reliability) From ex 1) 8.48 ± 2.14

Example 3: AMFs 1.2 meter shoulders (instead of 1.5) AMF (CMF) = 1.04 (4% increase in crashes) Step 1: Step 2: Step 3: Why is weight lower? 1) 4.71 ± 1.19 2) 4.43 ± 0.80 3) 4.47 ± 0.81

Example 4: subsections Total length = 1.5km, 11 crashes in 2 years

Large for fatals (helps you not to “chase” them Example 5: Severity 2.41 x 0.019 = 0.046 … 1.8 x 3 x 0.046 = 0.247 Note: φ stays same (mult dist. by constant); Large for fatals (helps you not to “chase” them Note: 20.357 ≠ 23.9 (from prob 2) … why? What is the suggested an ad hoc solution?

Example 6: intersection ADT=4520 SPF = 6.54×10-5 ×ADTmainline0.82×ADTminor road0.51 ADT=230 AMF = 1.27 7 crashes in 3 years Step 1: Step 2: Step 3: So, what can you conclude about the site?

Example 7: group of intersections 11 crashes in 3 years Applies if you don’t know what crashes happened at what intersection Step 1: Step 2: (simplistic) However, not clear what to use

Example 7: (cont)

Example 7: (cont) Step 3: using w=0.088, Why so much confidence in the actual number? Is it because we have 3 yrs of data? Is it because 11 is smaller than 20.7? What would happen if 11 had been, say, 32?

Example 8 (The full procedure) 1.8 km, 9 yrs. Unchanged road ADT varies, AMF = 0.95, 74 total crashes μ =

Example 8, cont. (If all μ are equal) Why so small???

Example 9: Secular Trends Yearly multipliers can be used like AMFs to account for weather, technology changes (must be able to get them) Make much difference?

Example 10: Projections Projections can be made by using a simple ratio of ADTs (raised to the appropriate power) multiplied by the corresponding ratio of AMFs or yearly multipliers

Some thought questions Does EB eliminate RTM as stated? What happened if the SPF is not appropriate for your site What does appropriate mean?

Software for Homework You will need some software to develop the NB regression model for your SPF – that is the “R project” program. Investigate that now (see HW). http://www.r-project.org/ (info on “R”) Download R 2.15.0 for Windows (47 megabytes, 32/64 bit) Installation and other instructions New features in this version: Windows specific, all platforms.

EB estimate

EB estimate Shouldn't THIS be the true safety effect?

Professor, May I be excused? My brain is full. Gary Larson, The Far Side, ©1986