Are the Mortality Effects of PM 10 the Result of Inadequate Modeling of Temperature and Seasonality? Leah Welty EBEG February 2, 2004 Joint work with Scott.

Slides:



Advertisements
Similar presentations
Bayesian Distributed Lag Models: Estimating Effects of Particulate Matter Air Pollution on Daily Mortality Leah J. Welty, PhD Department of Preventive.
Advertisements

Dr. Christos Giannakopoulos Dr. Basil Psiloglou Meteorological and pollution factors affecting hospital admissions in Athens, Greece ENSEMBLES WP6.2 Meeting.
Continued Psy 524 Ainsworth
Topic 12: Multiple Linear Regression
Tests of Significance for Regression & Correlation b* will equal the population parameter of the slope rather thanbecause beta has another meaning with.
Economics 20 - Prof. Anderson1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 7. Specification and Data Problems.
Hypothesis Testing Steps in Hypothesis Testing:
Improving health worldwide George B. Ploubidis The role of sensitivity analysis in the estimation of causal pathways from observational.
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
1 Regression Analysis Regression used to estimate relationship between dependent variable (Y) and one or more independent variables (X). Consider the variable.
Model Choice in Time Series Studies of Air Pollution and Health Roger D. Peng, PhD Department of Biostatistics Johns Hopkins Blomberg School of Public.
Multiple Regression and Correlation Analysis
Health Risks of Exposure to Chemical Composition of Fine Particulate Air Pollution Francesca Dominici Yeonseung Chung Michelle Bell Roger Peng Department.
Getting Started with Hypothesis Testing The Single Sample.
Relationships Among Variables
Single and Multiple Spell Discrete Time Hazards Models with Parametric and Non-Parametric Corrections for Unobserved Heterogeneity David K. Guilkey.
AM Recitation 2/10/11.
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
Hypothesis Testing:.
Answering questions about life with statistics ! The results of many investigations in biology are collected as numbers known as _____________________.
Non-continuous Relationships If the relationship between the dependent variable and an independent variable is non-continuous a slope dummy variable can.
Simple Linear Regression
The Impact of Air Pollution on Infant Mortality: Evidence from Geographic Variation in Pollution Shocks Induced by a Recession Kenneth Y. Chay and Michael.
Correlation.
Moderation & Mediation
Discriminant Function Analysis Basics Psy524 Andrew Ainsworth.
8-1 COMPLETE BUSINESS STATISTICS by AMIR D. ACZEL & JAYAVEL SOUNDERPANDIAN 6 th edition (SIE)
Epi-on-the-Island Time Series Regression (TSR) 6-10 July 2015 Wed 1: DLNMs Ben Armstrong London School of Hygiene and Tropical Medicine.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Term 4, 2005BIO656 Multilevel Models1 Hierarchical Models for Pooling: A Case Study in Air Pollution Epidemiology Francesca Dominici.
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
HOW HOT IS HOT? Paul Wilkinson Public & Environmental Health Research Unit London School of Hygiene & Tropical Medicine Keppel Street London WC1E 7HT (UK)
Public Policy Analysis ECON 3386 Anant Nyshadham.
The Statistical Analysis of Data. Outline I. Types of Data A. Qualitative B. Quantitative C. Independent vs Dependent variables II. Descriptive Statistics.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
Comparison of Pooled and Household-Level Usage Impact Analysis Jackie Berger Ferit Ucar IEPEC Conference – August 14, 2013.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Choosing tools to present numbers: Tables, charts, and prose Jane E. Miller, PhD.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
2005 Hopkins Epi-Biostat Summer Institute1 Module 3: An Example of a Two-stage Model; NMMAPS Study Francesca Dominici Michael Griswold The Johns Hopkins.
A generalized bivariate Bernoulli model with covariate dependence Fan Zhang.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
Introduction Outdoor air pollution has a negative effect on health. On days of high air pollution, rates of cardiovascular and respiratory events increase.
Introduction to testing statistical significance of interactions Jane E. Miller, PhD The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition.
Forecasting (prediction) limits Example Linear deterministic trend estimated by least-squares Note! The average of the numbers 1, 2, …, t is.
Model-Data Synthesis of CO 2 Fluxes at Niwot Ridge, Colorado Bill Sacks, Dave Schimel NCAR Climate & Global Dynamics Division Russ Monson CU Boulder Rob.
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
1 Statistics 262: Intermediate Biostatistics Regression Models for longitudinal data: Mixed Models.
July Hopkins Epi-Biostat Summer Institute Module II: An Example of a Two- stage Model; NMMAPS Study Francesca Dominici and Scott L. Zeger.
ASSESSMENT OF THE ANNUAL VARIATION OF MALARIA AND THE CLIMATE EFFECT BASED ON KAHNOOJ DATA BETWEEN 1994 AND 2001 Conclusions 1. One month lag between predictors.
Incorporating Climate and Weather Information into Growth and Yield Models: Experiences from Modeling Loblolly Pine Plantations Ralph L. Amateis Department.
Factorial BG ANOVA Psy 420 Ainsworth. Topics in Factorial Designs Factorial? Crossing and Nesting Assumptions Analysis Traditional and Regression Approaches.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
LECTURE 16: BEYOND LINEARITY PT. 1 March 28, 2016 SDS 293 Machine Learning.
1 Module IV: Applications of Multi-level Models to Spatial Epidemiology Francesca Dominici & Scott L Zeger.
LECTURE 17: BEYOND LINEARITY PT. 2 March 30, 2016 SDS 293 Machine Learning.
Methodological Considerations in Assessing Effects of Air Pollution on Human Health Rebecca Klemm, Ph.D. Klemm Analysis Group, Inc. American Public Health.
Beach Modelling: Lessons Learnt from Past Scheme Performance Project: SC110004/S Specific Modelling Tools & Techniques.
Skill of Generalized Additive Model to Detect PM 2.5 Health Signal in the Presence of Confounding Variables Office of Research and Development Garmisch.
Belinda Boateng, Kara Johnson, Hassan Riaz
Central Limit Theorem, z-tests, & t-tests
Simple Linear Regression
What is a Biome? Chapter 6.1 Environmental.
Module 3: An Example of a Two-stage Model; NMMAPS Study
Our theory states Y=f(X) Regression is used to test theory.
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Presentation transcript:

Are the Mortality Effects of PM 10 the Result of Inadequate Modeling of Temperature and Seasonality? Leah Welty EBEG February 2, 2004 Joint work with Scott Zeger

Is temperature/seasonality to blame? Temperature and season are important predictors of mortality Temperature and season are associated with air pollution (PM10) Seasonal effects > Temperature effects > PM 10 effects Previous NMMAPS analyses criticized for potential confounding of air pollution with temperature and seasonal effects

“Beyond controlling for time, of particular interest is the question of how fully to control for the potential effects of weather. […] The current analyses continue to include terms for weather, and the sensitivity analyses have further attempted to make estimates in the absence of extreme weather days (e.g., temperatures above or below a certain point). Smaller estimates of effect result from the newly revised analyses, however, and the possibility of subtler effects of weather within the normal climatic ranges continues. Thus, further exploration of weather effects is merited (for example by considering correlated or cumulative effects of multi-day temperature or humidity).” -HEI Special Report, May 2003

Further Investigation: We consider two alternate schemes for modeling temperature and season on mortality Part I: Allow temperature effects to vary by season Part II: Allow for non-linear functions of temperature to affect mortality over longer time periods

Outline Intro Distributed Lag Models (DLMs) Part I: Seasonality of Temperature Effects –Exploratory Analysis –Potential Model –Results on association of PM 10 and mortality Part II: Longer and non-linear DLMs –Potential Models –Results on association of PM 10 and mortality

Distributed Lag Models (DLMs) Allow response to depend on exposure over several days Constrained distributed lags; step function

Example distributed lag functions

‘Total’ Temperature Effect A one degree increase in temperature over each of the past 14 days increases log expected mortality by = area under the distributed lag function. And is approx % increase in mortality for 10 degree increase in temperature over past 14 days.

Distributed Lag Function by Month (New York City)

“Temperature” effects estimated in 4 month periods,

Individual Distributed Lag Coefficients by Time (New York City)

DLMs with Time Varying Coefficients

(covariates: intercept, age category (factor), dow (factor), dom, age by time interaction)

Estimated distributed lag coefficients and time trend (df = 4/yr) (New York City)

Total temp effect & time trend by df in time trend (Chicago)

Combined Temperature and Time Trend Predictors (Chicago)

Model Comparison by df/year in Time Trend (Chicago)

Status Report Temperature alone (even with distributed lags with seasonally varying coefficients) cannot explain all of the seasonal variation in mortality Models should include a season effect Difficult to distinguish practically between temperature and seasonality effects once seasonality is included Next Step: add PM 10

DLMs with Time Varying Coefficients and PM 10 as a Predictor (covariates: intercept, age category (factor), dow (factor), dom, age by time interaction) Model formulation: Consider PM 10 effect for

Estimated PM 10 effects by PM 10 lag, df in smooth of time (Los Angeles)

Estimated PM 10 effects by city, PM 10 lag, df in smooth of time

Part I: Conclusions The slightly bad news –Cannot fully separate temperature effects on mortality from seasonal effects on mortality The good news –Total temperature effects may vary by the degrees of freedom in the time trend, but pollution effect estimates do not change substantially Under our step function DLM for temperature with time varying coefficients: Next step: What if we use other (more flexible) DLMs?

Part II: Non-linear DLMs

PM 10 effect by df, lag of PM 10, and # dist lag vars in model

Part II: Conclusions The bad news –There really isn’t any. The good news –PM 10 effect estimates do not change substantially by the number of lag variables in the model, the df for the smoothness of the lags, or by the lag of PM 10. Continuing Work Pooled PM 10 effect estimates for proposed DLMs

Higher order formulations of DLMs, in particular for modeling interactions in distributed lags for temperature and for pollution 1. Alternate parametric forms 2.Semi-parametric, penalized splines with tapering 3.Bayesian formulation on lags