1 Validation and Implication of Segmentation on Empirical Bayes for Highway Safety Studies Reginald R. Souleyrette, Robert P. Haas and T. H. Maze Iowa.

Slides:



Advertisements
Similar presentations
Notes Sample vs distribution “m” vs “µ” and “s” vs “σ” Bias/Variance Bias: Measures how much the learnt model is wrong disregarding noise Variance: Measures.
Advertisements

Ridge Regression Population Characteristics and Carbon Emissions in China ( ) Q. Zhu and X. Peng (2012). “The Impacts of Population Change on Carbon.
FUTURE CMF RESEARCH AND CHALLENGES Traffic Records Forum October 27, 2014 Daniel Carter, UNC HSRC.
Lec 33, Ch.5, pp : Accident reduction capabilities and effectiveness of safety design features (Objectives) Learn what’s involved in safety engineering.
Spring Before-After Studies Recap: we need to define the notation that will be used for performing the two tasks at hand. Let: be the expected number.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Session #4, Forrest Council, Slides. The Potential Impacts of a Towaway Reporting Threshold on Driver/User and Roadway Safety Programs Forrest M. Council.
Spring  Crash modification factors (CMFs) are becoming increasing popular: ◦ Simple multiplication factor ◦ Used for estimating safety improvement.
Roadway Safety For Local Agencies Doug Bish Traffic Services Engineer Oregon Department of Transportation.
Investigation of Varied Time Intervals in Crash Hotspot Identification Authors: Wen Cheng, Ph.D., P.E., Fernando Gonzalez, EIT, & Xudong Jia; California.
Spring  Types of studies ◦ Naïve before-after studies ◦ Before-after studies with control group ◦ Empirical Bayes approach (control group) ◦ Full.
Spring Sampling Frame Sampling frame: the sampling frame is the list of the population (this is a general term) from which the sample is drawn.
Spring INTRODUCTION There exists a lot of methods used for identifying high risk locations or sites that experience more crashes than one would.
Empirical Bayes Estimate Spring Empirical Bayes Model For the EB method, a different weight is assigned to the prior distribution and standard estimate.
Exploration and Exploitation Strategies for the K-armed Bandit Problem by Alexander L. Strehl.
Incorporating Safety into the Highway Design Process.
Incorporating Temporal Effect into Crash Safety Performance Functions Wen Cheng, Ph.D., P.E., PTOE Civil Engineering Department Cal Poly Pomona.
1 CHAPTER M4 Cost Behavior © 2007 Pearson Custom Publishing.
Session 10 Training Opportunities Brief Overview of Related Courses in USA / Canada Geni Bahar, P.E. NAVIGATS Inc.
Esri International User Conference | San Diego, CA Technical Workshops | Spatial Statistics: Best Practices Lauren Rosenshein, MS Lauren M. Scott, PhD.
Understanding Statistics
University of Maryland Department of Civil & Environmental Engineering By G.L. Chang, M.L. Franz, Y. Liu, Y. Lu & R. Tao BACKGROUND SYSTEM DESIGN DATA.
The Empirical Bayes Method for Safety Estimation Doug Harwood MRIGlobal Kansas City, MO.
Network Screening 1 Module 3 Safety Analysis in a Data-limited, Local Agency Environment: July 22, Boise, Idaho.
2-1 LOW COST SAFETY IMPROVEMENTS The Tools – Identification of High Crash Locations – Session #2.
Chapter 9: Speed, travel time, and delay studies
Evaluation of Alternative Methods for Identifying High Collision Concentration Locations Raghavan Srinivasan 1 Craig Lyon 2 Bhagwant Persaud 2 Carol Martell.
1 CEE 763 Fall 2011 Topic 1 – Fundamentals CEE 763.
Safety management software for state and local highway agencies: –Improves identification and programming of site- specific highway safety improvements.
Incorporating Safety into Transportation Planning for Small and Medium-Sized Communities Teng Wang 10/23/ Program of Study.
Role of SPFs in SafetyAnalyst Ray Krammes Federal Highway Administration.
University of Minnesota Intersection Decision Support Research - Results of Crash Analysis University of Minnesota Intersection Decision Support Research.
Week 41 Estimation – Posterior mean An alternative estimate to the posterior mode is the posterior mean. It is given by E(θ | s), whenever it exists. This.
SPF Development and Data Needs John Milton Ph.D., P.E., Washington State Department of Transportation National Safety Performance Function Summit July.
Pheno_opt_rice Brief introduction by Pepijn van Oort.
Highway Safety Information System (HSIS) Evaluation of Lane Reduction “Road Diet” Measures on Crashes and Injuries Herman F. Huang and Charles V. Zegeer.
Putting Together a Safety Program Kevin J. Haas, P.E.—Traffic Investigations Engineer Oregon Department of Transportation Traffic—Roadway Section (Salem,
CE 552 Week 9 Crash statistical approaches Identification of problem areas - High crash locations.
July 29 and 30, 2009 SPF Development in Illinois Yanfeng Ouyang Department of Civil & Environmental Engineering University of Illinois at Urbana-Champaign.
Calibrating Highway Safety Manual Equations for Application in Florida Dr. Siva Srinivasan, Phillip Haas, Nagendra Dhakar, and Ryan Hormel (UF) Doug Harwood.
Session 9: Local SPF Use- Iowa Michael Pawlovich, Ph.D., P.E. Iowa Dept. of Trans., Traffic and Safety.
Fall  Crashes are “independent” and “random” events (probabilistic events)  Estimate a relationship between crashes and covariates (or explanatory.
Impact of Intersection Angle on Safety HSIS Annual Liaison Meeting David Harkey, Bo Lan, Daniel Carter, Raghavan Srinivasan, Anusha Patel Nujjetty May.
Transpo 2012 Yan Xiao, Mohammed Hadi, Maria Lucia Rojas Lehman Center for Transportation Research Department of Civil and Environmental Engineering Florida.
Session 2 History How did SPF come into being and why is it here to stay? Geni Bahar, P.E. NAVIGATS Inc.
California’s Experience with HFST 1.  Overview  Challenging Locations  Collision Reduction (State and Local Roads)  Typical Placement and current.
Evaluating the performance of three different network screening methods for detecting high collision concentration locations using empirical data Prepared.
Indicate the length up to one estimated value…. A. B. C.
LOW COST SAFETY IMPROVEMENTS Practitioner Workshop The Tools – Identification of High Crash Locations – Session #2.
1 The Highway Safety Manual Predictive Methods. 2 New Highway Safety Manual of 2010 ►Methodology is like that for assessing and assuring the adequacy.
Low Cost Safety Improvements Pooled Fund Study Analytical Basics Dr. Bhagwant Persaud.
DETECTION AND ASSESSMENT OF SAFETY PROBLEMS WITHIN ROAD TRANSPORT DECISION MAKING Prof. Dr. Nikolay Georgiev eng. Violina Velyova ‘Todor Kableshkov’ University.
Chapter 12 Understanding Research Results: Description and Correlation
Impact of Intersection Angle on Safety
The Empirical Bayes Method for Before and After Analysis
Artificial Realistic Data (ARD)
DETERMINING ROAD SURFACE AND WEATHER CONDITIONS WHICH HAVE A SIGNIFICANT IMPACT ON TRAFFIC STREAM CHARACTERISTICS   Reza Golshan Khavas1 and Bruce Hellinga2.
Planning and Prioritizing At-Grade Railroad Crossings
Exploratory Analysis of Crash Data
Before-After Studies Part I
Predict Failures with Developer Networks and Social Network Analysis
Results and Status of State Crash Analyses
26th CARSP Conference, Halifax, June 5-8, 2016
Establishing Safe and Realistic Speed Limits
Network Screening & Diagnosis
The Empirical Bayes Method for Before and After Analysis
Doug Harwood Midwest Research Institute
Systematic Identification of High Crash Locations
Results and Status of State Crash Analyses
Canadian Associate of Road Safety Professionals Conference May 2019
Presentation transcript:

1 Validation and Implication of Segmentation on Empirical Bayes for Highway Safety Studies Reginald R. Souleyrette, Robert P. Haas and T. H. Maze Iowa State University, SAIC and Iowa State University ENVIRONMENTAL HEALTH RISK 2007 Fourth International Conference on The Impact of Environmental Factors on Health MALTA; June, 2007

2 The highway safety problem Source: World Health Organization

3 Mitigation approaches – 4Es Education Enforcement Emergency Response Engineering

4 Engineering studies Limited resources Highest benefit desired High Crash Locations Before and After Studies Small sample size  high variance Selection bias  regression to the mean (RTM)

5 Objectives Validate the state of the art statistical approach, known as empirical Bayesian Demonstrate tradeoffs between model quality and data quantity Investigate effect of data aggregation … to improve identification and therefore mitigation of high crash locations

6

7

8 Statistical approaches we could take… Use long periods Use large number of locations Use Empirical Bayes (EB) –Substitutes “similar” locations for longer observation time –“Weights” site and similar-site data

9 Mr. Smith Mr. Smith had no crashes last year The average of similar drivers is 0.8 crashes per year What do we expect is the number of crashes Mr. Smith will have next year … 0?, 0.8? … Answer … use both pieces of information and weight the expectation Hauer, E., D.W. Harwood, F.M. Council, M.S. Griffith, “The Empirical Bayes method for estimating safety: A tutorial.” Transportation Research Record 1784, pp National Academies Press, Washington, D.C

10 Empirical Bayes (EB) We have two types of information We compute an estimate which is an average of both How much to weight the two depends on… –Quantity –Quality Accepted practice… small scale What should the weight be???

11 ________ 1+(μ∙Y)/φ 1 w = mean # crashes/year from model number of years overdispersion factor weight applied to model estimate EB estimate = w∙(model estimate) + (1-w)∙(site average) Need: site data Need: - model for similar sites (neg. binomial)

12 Objective #1 Test effectiveness of EB by comparing: 1.a single year of data from many locations, with different models and the Empirical Bayes formula, vs. 2.several years of crash data at specific locations

13 Objective #2 explore the relationship between segmentation and accuracy of estimates

14 Description of Data Roads (Iowa) –All (19,400km) –Freeways (1400km) –Multilane (8000km) –2-lane (10,000km) Low ADT (1200 VPD) Med ADT (2400 VPD) High ADT (4400 VPD) –Segments 400m (short) 4km (med) 6.8km (long)

15 Description of Data Intersections (California) –Multiphase (873) –Single Phase (374) –Thru-stop (3047) 5 years of data large-scale validation

16 Analysis – Intersections Three model forms: a)Crashes = α(mainline traffic) β, b)Crashes = α(mainline traffic) β (cross street traffic) γ c)Crashes = α (mainline traffic) β (cross street lanes) δ Three types of intersections –multiphase signals –Single phase signals –Stop sign control Intersection model parameters and descriptive statistics

17 Example intersection crash models (only 2 dimensions shown)

18 Intersection Results Top 10 high crash locations in 2003* * California HSIS Multiphase 4 leg Not intuitive Highest in 2003 Trying to predict this EB model “a” lowest error 4 year average “better” slightly more often than EB

19 Using 4 years of data + EB Now, EB better more often Now, model “d” never best estimate, but still best model four times?

20 Intersection Results Effect on Ranking EB does slightly better than 4 year average, or 2003 alone all models “comparable”

21 Analysis – Roads crashes=α(length)(ADT) β 3 types of roads –Freeway –Multilane divided –2-lane 3 segmentations –0.4, 3.8, and 11.6 km, on average 3 traffic ranges (L,M,H) 15 models Road segment model parameters and descriptive statistics

22 Effect of Segmentation on Correction Freeway-type segments Longest segments Average length 11.6 km Medium segments Average length 3.8 km Shortest segments Average length 0.4 km Note higher EB correction for short segments

23 Conclusions EB+1yr ≈ 4yrs of data Better model did not necessarily improve prediction (at least for the 10 intersections selected) Longer segment models are more accurate Intersection 4-year averages and models are relatively poor predictors –But when combined using EB, better

24 Thank you