Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Slides:



Advertisements
Similar presentations
COMPUTER INTENSIVE AND RE-RANDOMIZATION TESTS IN CLINICAL TRIALS Thomas Hammerstrom, Ph.D. USFDA, Division of Biometrics The opinions expressed are those.
Advertisements

Modeling of Complex Social Systems MATH 800 Fall 2011.
Disease emergence in immunocompromised populations Jamie Lloyd-Smith Penn State University.
Holland on Rubin’s Model Part II. Formalizing These Intuitions. In the 1920 ’ s and 30 ’ s Jerzy Neyman, a Polish statistician, developed a mathematical.
Epidemiologisten tartuntatautimallien perusteita Kari Auranen Rokoteosasto Kansanterveyslaitos Matematiikan ja tilastotieteen laitos Helsingin yliopisto.
A VERY IMPORTANT CONCEPT Disease epidemiology is first and foremost a population biology problem Important proponents: Anderson, May, Ewald, Day, Grenfell.
In biology – Dynamics of Malaria Spread Background –Malaria is a tropical infections disease which menaces more people in the world than any other disease.
RD processes on heterogeneous metapopulations: Continuous-time formulation and simulations wANPE08 – December 15-17, Udine Joan Saldaña Universitat de.
Population dynamics of infectious diseases Arjan Stegeman.
Persistence and dynamics of disease in a host-pathogen model with seasonality in the host birth rate. Rachel Norman and Jill Ireland.
Mathematical models for interventions on drug resistance Hsien-Ho Lin.
SCB : 1 Department of Computer Science Simulation and Complexity SCB : Simulating Complex Biosystems Susan Stepney Department of Computer Science Leo Caves.
Chapter 11 Multiple Regression.
Network modeling of the Ebola Outbreak Ahmet Aksoy.
The Paradigm of Econometrics Based on Greene’s Note 1.
How does mass immunisation affect disease incidence? Niels G Becker (with help from Peter Caley ) National Centre for Epidemiology and Population Health.
Single and Multiple Spell Discrete Time Hazards Models with Parametric and Non-Parametric Corrections for Unobserved Heterogeneity David K. Guilkey.
Methods to Study and Control Diseases in Wild Populations Steve Bellan, MPH Department of Environmental Sci, Pol & Mgmt University of California at Berkeley.
Modified SIR for Vector- Borne Diseases Gay Wei En Colin 4i310 Chua Zhi Ming 4i307 Jacob Savos AOS Katherine Kamis AOS.
Motive Konza: understanding disease, since there is no apparent reason to manage native pathogens of native plants Also have background information in.
Dengue virus Climate changes might play an important role in sustaining the transmission cycle between vectors and human hosts and the spread of transmission.
Population Biology: PVA & Assessment Mon. Mar. 14
Infectious disease, heterogeneous populations and public healthcare: the role of simple models SIAM CSE 2009 K.A. Jane White Centre for Mathematical Biology.
Emerging Infectious Disease: A Computational Multi-agent Model.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Spatial patterns on the edge of stability: measles in the Sahel.
Introduction: Why statistics? Petter Mostad
V5 Epidemics on networks
Modelling infectious diseases Jean-François Boivin 25 October
A Data Intensive High Performance Simulation & Visualization Framework for Disease Surveillance Arif Ghafoor, David Ebert, Madiha Sahar Ross Maciejewski,
Epidemiology With Respect to the Dynamics of Infectious Diseases Huaizhi Chen.
DSc 3120 Generalized Modeling Techniques with Applications Part II. Forecasting.
Data and modeling issues in population biology  Alan Hastings (UC Davis) and many colalborators  Acknowledge support from NSF.
BASICS OF EPIDEMIC MODELLING Kari Auranen Department of Vaccines National Public Health Institute (KTL), Finland Division of Biometry, Dpt. of Mathematics.
Modeling of the effect of pneumococcal conjugate vaccination on carriage and transmission of Streptococcus pneumoniae in Kenyan children John Ojal KEMRI-Wellcome.
Sanja Teodorović University of Novi Sad Faculty of Science.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
This presentation is made available through a Creative Commons Attribution- Noncommercial license. Details of the license and permitted uses are available.
Regression Chapter 16. Regression >Builds on Correlation >The difference is a question of prediction versus relation Regression predicts, correlation.
Heterogeneity (next 3 classes) 2 case studies –Age –Sexual activity –Others: Day care vs. not, Genetic heterog. Use data Understand behavior of complicated.
A Stochastic Model of Paratuberculosis Infection In Scottish Dairy Cattle I.J.McKendrick 1, J.C.Wood 1, M.R.Hutchings 2, A.Greig 2 1. Biomathematics &
Modeling frameworks Today, compare:Deterministic Compartmental Models (DCM) Stochastic Pairwise Models (SPM) for (I, SI, SIR, SIS) Rest of the week: Focus.
STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis.
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
Introduction to Chaos by: Saeed Heidary 29 Feb 2013.
Epidemic (Compartment) Models. Epidemic without Removal SI Process Only Transition: Infection Transmission SIS Process Two Transitions: Infection and.
ECE-7000: Nonlinear Dynamical Systems Overfitting and model costs Overfitting  The more free parameters a model has, the better it can be adapted.
Analyzing wireless sensor network data under suppression and failure in transmission Alan E. Gelfand Institute of Statistics and Decision Sciences Duke.
SIR Epidemic and Vaccination
GEO Meningitis Environmental Risk Consultative Meeting, sept 2007, WHO, Geneva Project : Long-term epidemiology of Meningococcal Meningitis in the.
MA354 An Introduction to Math Models (more or less corresponding to 1.0 in your book)
ECE-7000: Nonlinear Dynamical Systems 2. Linear tools and general considerations 2.1 Stationarity and sampling - In principle, the more a scientific measurement.
Simple Infection Model Huaizhi Chen. Simple Model The simple model of infection process separates the population into three classes determined by the.
Stats Term Test 4 Solutions. c) d) An alternative solution is to use the probability mass function and.
MA354 Math Modeling Introduction. Outline A. Three Course Objectives 1. Model literacy: understanding a typical model description 2. Model Analysis 3.
EC 827 Module 2 Forecasting a Single Variable from its own History.
1 Immunisation with a Partially Effective Vaccine Niels G Becker National Centre for Epidemiology and Population Health Australian National University.
Rubella Surveillance and Control in Low-Resource Settings: Limitations, Biases, and Potential for Strengthening Amy Winter, PhD Candidate, Princeton University.
Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.
Mean Field Methods for Computer and Communication Systems Jean-Yves Le Boudec EPFL Network Science Workshop Hong Kong July
Review of Probability Theory
Models.
Vaccine Efficacy, Effectiveness and Impact
The Role of Supplementary Immunization Activity Scheduling on Measles Incidence in Nigeria Kurt Frey April 20, 2017 IDM Disease Modeling Symposium Measles.
Chapter 1 – Ecological Data
Modelling infectious diseases
Regression Models - Introduction
Epidemiological Modeling to Guide Efficacy Study Design Evaluating Vaccines to Prevent Emerging Diseases An Vandebosch, PhD Joint Statistical meetings,
Susceptible, Infected, Recovered: the SIR Model of an Epidemic
Presentation transcript:

Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics, Penn State University

Focus on time series analysis of incidence analysis 0 50k 100k 150k 200k 250k 300k 350k 400k ‘44‘55 Quarterly measles incidence y 1-3y 3-5y 5-10y 10-15y 15- y

Outline: ~ 1993 then –> now current challenges mostly anecdotal personal reflections

~ 1993

~1993 maturing* mathematical formalism incorporating: (cf yesterdays talks by Mick and Val) –Seasonal forcing –Age-structure & non-homogenous mixing –Spatial diffusion & metapopulation dynamics –Plausible scenarios of scaling of transmission with pop size –Stochasticity But, early days w.r.t. letting these general models loose on data … * for directly transmitted persistent (SI), fully immunizing (SIR) or fully non-immunizing (SIS)

~1993 Early days w.r.t. letting these general models loose on data … … because of many challenges The obvious: –Absence of data on key state variables (eg susceptible) –Disparity between key state variables and observed quantities (eg incidence is not prevalence) The less obvious: –Weariness regarding whether a detailed quantitative match to data should be a critical characteristic of mathematical models

then –> now

Today’s expectation for match (1) eg TSIR forecast for E&W measles (E&W) Incidence TSIR forecast

S - Susceptibles, B - Births, I - Infected and Infective, - epidemic intensity,  - correction for time discretization, β - seasonal transmission rate Host dynamics Transmission dynamics Stochasticity TSIR: Discrete time ‘piecewise constant’ B-D process (cf Chain-binomial) If l is small relative to S, then the chain can be approximated by an unconstrained B-D process; The conditional distribution of I t+1 is the sum of I t Geometric distributions -> NegBin with clumping I t

Today’s expectation for match (2) eg age-structured TSIR forecast for rubella (South Africa) Metcalf et al 2013 * difference in heat intensity is due to underreporting *

Yesteryear’s expectations Kot and Schaffer (1985) JTB: “One way of resolving the problem is to view the motion in phase space, i.e. in a vector space whose axes are the state independent variables. However, for most real world ecological and epidemiological systems, this requirement is not easily met. It is often difficult even to enumerate all of the state variables, much less to follow their magnitudes over time. Put another way, the variables studied in nature are generally embedded in more complex systems. As a practical matter, it is unlikely that population dynamicists will ever be able to write down the complete governing equations for any natural system.” ID complexities: age-structured mixing, age-specific seasonality in transmission, spatial heterogeneity, heterogeneities in susceptibility, etc, etc

Journey from there to here (1) -> Many ‘Obvious’ challenges were painstakingly resolved along the way

Journey from there to here (1) -> Many ‘Obvious’ challenges were painstakingly resolved along the way 1) Perhaps models may have some qualitative relevance? - Nonparametric forecasting to distinguish cycles from chaos (Suigihara &c). - Nonparametric Lyapunov exponent estimators (Ellner &c).

Journey from there to here (1) -> Many ‘Obvious’ challenges were painstakingly resolved along the way 1) Perhaps models may have some qualitative relevance? - Nonparametric forecasting to distinguish cycles from chaos (Suigihara &c). - Nonparametric Lyapunov exponent estimators (Ellner &c). 2) If we can somehow reconstruct the unobserved susceptible class, would it be egregiously ambitious to compare model simulations and data? - Semiparametric models with smart embedding (Ellner &c) - Susceptible reconstruction (Bobashev &c; Finkenstadt &c)

Journey from there to here (1) -> Many ‘Obvious’ challenges were painstakingly resolved along the way 1) Perhaps models may have some qualitative relevance? - Nonparametric forecasting to distinguish cycles from chaos (Suigihara &c). - Nonparametric Lyapunov exponent estimators (Ellner &c). 2) If we can somehow reconstruct the unobserved susceptible class, would it be egregiously ambitious to compare model simulations and data? - Semiparametric models with smart embedding (Ellner &c) - Susceptible reconstruction (Bobashev &c; Finkenstadt &c) 3) A seasonal chain-binomial model can in fact be recast as a non- autonomous autoregressive regression: I dear you! - Time-series SIR ver 1 (Finkenstadt & Grenfell) and TSIR ver 2

Journey from there to here (1) -> Many ‘Obvious’ challenges were painstakingly resolved along the way 1) Perhaps models may have some qualitative relevance? - Nonparametric forecasting to distinguish cycles from chaos (Suigihara &c). - Nonparametric Lyapunov exponent estimators (Ellner &c). 2) If we can somehow reconstruct the unobserved susceptible class, would it be egregiously ambitious to compare model simulations and data? - Semiparametric models with smart embedding (Ellner &c) - Susceptible reconstruction (Bobashev &c; Finkenstadt &c) 3) A seasonal chain-binomial model can in fact be recast as a non- autonomous autoregressive regression: I dear you! - Time-series SIR ver 1 (Finkenstadt & Grenfell) and TSIR ver 2 4) Why in the world does the TSIR seem to fit measles in E&W? - ‘Emergent simplicity’ (Grenfell); Dynamic homogeneity (Earn &c)

Journey from there to here (2) 5) We believe! Real dynamics can be predicted by simple mechanistic models (that incorporates key idiosyncrasies) - POMP et al (King &c) - Hierarchical models with observation process (Cauchemez &c). - Age-structured TSIR (Metcalf &c). ….. (cf Simon’s talk)

Journey from there to here (2) 5) We believe! Real dynamics can be predicted by simple mechanistic models (that incorporates key idiosyncrasies) - POMP et al (King &c) - Hierarchical models with observation process (Cauchemez &c). - Age-structured TSIR (Metcalf &c). ….. (cf Simon’s talk) Lessons from last 20 years: - ‘All models are wrong …’ Some much less than we expected. - Emergent simplicity once key idiosyncrasies are identified - ?Tactical/strategical? models may be more relevant than we expected. [The prevailing notion that computation was the important driver in the field is wrong (Cambridge MRCs BUGS has been around since 20 years)]

Some current challenges

Some critical issues are: (i) use nonlinear stochastic modeling to identify all potentially undesirable side effects of intervention-induced reduction in circulation. -Rubella and CRS (cf Jess’ talk) -Chikenpox vaccine and increased shingles incidence -Whooping cough and the role of natural antigen circulation in maintaining immune memory. The possibility of long-term vaccine failure. More case law!

Vaccine introduced The first decades of vaccine induced control was extremely successful … Mass-vaccination introduced in most rich countries in mid ‘40s - early ’50s

Vaccine introduced The first decades of vaccine induced control was extremely successful … … Then even in very high cover areas throughout the developed world (e.g. Massachusetts with consistent >95% cover) the disease re-emerged! Mass-vaccination introduced in most rich countries in mid ‘40s - early ’50s

Re-emergence is associated with a completely different core group Massachusets age-incidence patterns Lavine, King and Bjornstad PNAS

The ‘anamnestic’ 4 compartment SIR model – re-exposure helps maintain immune memory S – suceptible I – Infected R – Highly immune W – Waning: resistant to infection and will get boosted or loose immunity depending on competing rates - force of infection  - boosting coeffiecient  - recovery rate  - rate of loss of immunity - rate of loss of immunity Lavine, King and Bjornstad PNAS

As long as the anamnestic response is at least 10x greater than the naïve response: Pre-vaccination prediction Age

Post-vaccination prediction: Lavine, King and Bjornstad PNAS

     Natural immune boosting in pertussis dynamics and the potential for long-term vaccine failure Incidence Vaccine coverage ‘SIR’ ‘SIS’

Predicted public health consequences of a booster vaccine at age 15 … … The booster may push circulation towards adults of childbearing age and increase perinatal infection and increase severe disease. (cf CRS but different mechanism)

Some critical issues are: (ii) robust forecasting in the face of rapidly changing demographies and vaccination schedules

(ii) robust forecasting in the face of rapidly changing demographies and vaccination schedules e.g. Measles Incidence in China: 3 provinces From Matt Ferrari

(ii) robust forecasting in the face of rapidly changing demographies and vaccination schedules; * * Perreti et al PNAS : (‘Model-free forecasting outperforms the correct mechanistic model for simulated and experimental data’) is not the way to go

(iii) probabilistically projecting possible/probable build-up of ‘susceptible pockets’ in the face of imperfect vaccination programs

Burkina Faso: 2009 Malawi: 2010 >135,000 cases following 10 years of low incidence France, 2011 Sao Paolo: 1997 Eg Measles (from Matt Ferrari)

(iv) Important challenge: We need accurate parametric anchoring of mechanistic models Inference for mechanistic models usually reveal severe multicollinearity among parameters: -Various parameter combinations can fit observed data equally well, -but will not make the same out of sample predictions Log-likelihood Intrastage β Interstage β Eg 2-stage PDV model (cf Klepac et al. 2009)

- ‘All models are wrong …’ Some much less than we expected. - Emergent simplicity once key idiosyncrasies are identified. -?Tactical/strategical? models may be more relevant than we expected. - We have an enormous arsenal of model fitting tools. - Multicollinearity makes anchoring of estimates a critical challenge. Current modeling challenges: - Study unanticipated Public health consequences - Consequences of rapidly changing demographics - Understand build-up of susceptible pockets Thank you!