Skill of Generalized Additive Model to Detect PM 2.5 Health Signal in the Presence of Confounding Variables Office of Research and Development Garmisch.

Slides:



Advertisements
Similar presentations
Lecture 11 (Chapter 9).
Advertisements

Sampling: Final and Initial Sample Size Determination
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
Fighting the Great Challenges in Large-scale Environmental Modelling I. Dimov n Great challenges in environmental modelling n Impact of climatic changes.
Correlation and regression
4-1 Operations Management Forecasting Chapter 4 - Part 2.
U.S. EPA Office of Research & Development October 30, 2013 Prakash V. Bhave, Mary K. McCabe, Valerie C. Garcia Atmospheric Modeling & Analysis Division.
Introduction to Regression Analysis
Department of Engineering and Public Policy Carnegie Mellon University Integrated Assessment of Particulate Matter Exposure and Health Impacts Sonia Yeh.
Model Choice in Time Series Studies of Air Pollution and Health Roger D. Peng, PhD Department of Biostatistics Johns Hopkins Blomberg School of Public.
Health Risks of Exposure to Chemical Composition of Fine Particulate Air Pollution Francesca Dominici Yeonseung Chung Michelle Bell Roger Peng Department.
Air quality standards –a statistician’s perspective Peter Guttorp Northwest Research Center for Statistics and the Environment
1 icfi.com | 1 HIGH-RESOLUTION AIR QUALITY MODELING OF NEW YORK CITY TO ASSESS THE EFFECTS OF CHANGES IN FUELS FOR BOILERS AND POWER GENERATION 13 th Annual.
Two and a half problems in homogenization of climate series concluding remarks to Daily Stew Ralf Lindau.
General relationship between health and air quality widely accepted Empirical estimates should inform public policy Difficult to establish cause/effect.
Are the Mortality Effects of PM 10 the Result of Inadequate Modeling of Temperature and Seasonality? Leah Welty EBEG February 2, 2004 Joint work with Scott.
Inference for regression - Simple linear regression
Using Air Quality Models for Emissions Management Decisions
Non-continuous Relationships If the relationship between the dependent variable and an independent variable is non-continuous a slope dummy variable can.
The Impact of Air Pollution on Infant Mortality: Evidence from Geographic Variation in Pollution Shocks Induced by a Recession Kenneth Y. Chay and Michael.
CMAS Conference, October 16 – 18, 2006 The work presented here was performed by the New York State Department of Environmental Conservation with partial.
 Is there a comparison? ◦ Are the groups really comparable?  Are the differences being reported real? ◦ Are they worth reporting? ◦ How much confidence.
TPB Technical Committee May 7, Forecast and Ozone Action Day Program.
WHO European Centre for Environment and Health Overview of health impacts of particulate matter in Europe Michal Krzyzanowski WHO ECEH Bonn Office Joint.
MODELING OF REGIONAL CLIMATE CHANGE EFFECTS ON GROUND-LEVEL OZONE AND CHILDHOOD ASTHMA Perry E. Sheffield, Kim Knowlton, Jessie L. Carr, Patrick L. Kinney.
Air Resources Board California Environmental Protection Agency Air Quality Effects of Trap-Related Emissions John DaMassa April 18, 2002 Originally Presented.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
HOW HOT IS HOT? Paul Wilkinson Public & Environmental Health Research Unit London School of Hygiene & Tropical Medicine Keppel Street London WC1E 7HT (UK)
Predicting Long-term Exposures for Health Effect Studies Lianne Sheppard Adam A. Szpiro, Johan Lindström, Paul D. Sampson and the MESA Air team University.
4-1 Operations Management Forecasting Chapter 4 - Part 2.
Impact of Air Pollution on Public Health: Transportability of Risk Estimates Jonathan M. Samet, MD, MS NERAM V October 16, 2006 Vancouver, B.C. Department.
C. Hogrefe 1,2, W. Hao 2, E.E. Zalewsky 2, J.-Y. Ku 2, B. Lynn 3, C. Rosenzweig 4, M. Schultz 5, S. Rast 6, M. Newchurch 7, L. Wang 7, P.L. Kinney 8, and.
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
Evaluating temporal and spatial O 3 and PM 2.5 patterns simulated during an annual CMAQ application over the continental U.S. Evaluating temporal and spatial.
Exposure Assessment for Health Effect Studies: Insights from Air Pollution Epidemiology Lianne Sheppard University of Washington Special thanks to Sun-Young.
Opening Remarks -- Ozone and Particles: Policy and Science Recent Developments & Controversial Issues GERMAN-US WORKSHOP October 9, 2002 G. Foley *US EPA.
Importance of chemistry-climate interactions in projections of future air quality Loretta J. Mickley Lu Shen, Daniel H. Cusworth, Xu Yue Earth system models.
Comparison of NAAQS RIA and Risk Assessments - Bryan Hubbell.
Technical Details of Network Assessment Methodology: Concentration Estimation Uncertainty Area of Station Sampling Zone Population in Station Sampling.
Evaluation of CMAQ Driven by Downscaled Historical Meteorological Fields Karl Seltzer 1, Chris Nolte 2, Tanya Spero 2, Wyat Appel 2, Jia Xing 2 14th Annual.
1 Simple Linear Regression and Correlation Least Squares Method The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES.
Montserrat Fuentes Statistics Department NCSU Research directions in climate change SAMSI workshop, September 14, 2009.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
INTEGRATING SATELLITE AND MONITORING DATA TO RETROSPECTIVELY ESTIMATE MONTHLY PM 2.5 CONCENTRATIONS IN THE EASTERN U.S. Christopher J. Paciorek 1 and Yang.
Impact of Temporal Fluctuations in Power Plant Emissions on Air Quality Forecasts Prakash Doraiswamy 1, Christian Hogrefe 1,2, Eric Zalewsky 2, Winston.
Quantifying Health Benefits with Local Scale Air Quality Modeling Presentation to CMAS October 7 th, 2008 Bryan Hubbell, Karen Wesson and Neal Fann U.S.
FORECASTIN G Qualitative Analysis ~ Quantitative Analysis.
Preliminary Analysis by: Fawn Hornsby 1, Charles Rogers 2, & Sarah Thornton 3 1,3 North Carolina State University 2 University of Texas at El Paso Client:
Time-series studies for the relationship between air pollution and the population health in Beijing Xiao-chuan Pan Dept. of Occupational and Environmental.
Methodological Considerations in Assessing Effects of Air Pollution on Human Health Rebecca Klemm, Ph.D. Klemm Analysis Group, Inc. American Public Health.
Environmental standards A statistical approach Peter Guttorp National Research Center for Statistics and the Environment.
Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.
Measuring Asthma Indicators at New York City Public Schools J. Richmond-Bryant 1, C.J. Saganich 1,2, R. Kalin 3, L. Bukiewicz 3, M.L. Valentine 3, A. Burgie.
The Impact of School Opening on Hospital Admissions for Asthma in New York State Shao Lin, PhD; Xiu Liu, MS; Rena Jones, MS; and Syni-An Hwang, PhD Bureau.
N Engl J Med Jun 29;376(26): doi: 10
Data Book Review & Discussion Biometric & Spectral Analysis
Basic Estimation Techniques
Statistical Methods for Model Evaluation – Moving Beyond the Comparison of Matched Observations and Output for Model Grid Cells Kristen M. Foley1, Jenise.
How Can TEMPO Contribute to Air Pollution Health Effects Research
Confounding and Effect Modification
Predicting Future-Year Ozone Concentrations: Integrated Observational-Modeling Approach for Probabilistic Evaluation of the Efficacy of Emission Control.
Basic Estimation Techniques
Heather Simon, Kirk Baker, Norm Possiel, Pat Dolwick, Brian Timin
George Washington University
ERRORS, CONFOUNDING, and INTERACTION
J. Burke1, K. Wesson2, W. Appel1, A. Vette1, R. Williams1
Bart Ostro, Chief Air Pollution Epidemiology Unit
Modeling spatially-dependent, non-stationary bias in GEOS-CHEM
Lecturer Dr. Veronika Alhanaqtah
A Review of Time Integrated PM2.5 Monitoring Data in the United States
Presentation transcript:

Skill of Generalized Additive Model to Detect PM 2.5 Health Signal in the Presence of Confounding Variables Office of Research and Development Garmisch 2014 Valerie Garcia 1, P.S. Porter 2, E. Gego 2, S.T. Rao 3 1 US EPA, Office of Research and Development, Raleigh NC 2 Porter-Gego, Idaho Falls, ID, 3 NCSU, Raleigh, NC

1 Does Confounding Affect the Detection of Air Pollution Health Effect in New York City (NYC)? However, other related variables (temperature, relative humidity, other air pollutants) are also thought to affect respiratory health Fine particulate matter (< 2.5 microns in size; PM2.5) is known to affect respiratory health Source: modified, Air Pollution and the Health of New Yorkers, 2011 (Silverman and Ito, 2010) The risk of hospitalization for asthma increases with increases in daily levels of PM 2.5 and O 3 in New York City

2 Confounding: relationships between main health effect (independent variable) and health outcome (dependent variable) May result in spurious relationships (e.g., association between independent and dependent variable that doesn’t exist) Does Confounding Affect the Detection of Air Pollution Health Effect in NYC? Temperature Humidity Ozone Time PM2.5Respiratory- Related HA Confounding: Co-variables in model are related to main health effect (PM2.5) and health endpoint (hospital admissions)

1. Emperical health (hospital admissions; HA) and air quality data (PM) used to parameterize simulations of several realizations of PM2.5 concentrations and respiratory-related hospital admissions 2. Run Simulations ( 10 PM + ε) x [(5 HA 1-mo + 5 HA 3-mo) x 2 HA random] = 200 a.PM2.5 (24-hr avg; PM) estimated as a function of O 3 (O3), temperature (TEMP), relative humidity (RH) plus noise (ε) to ensure collinearity b.Hospital admissions (daily total; HA) in correspondence to PM2.5 simulated for 3 health effect values. 3. GAM systematically run with simulated datasets 3 Study Approach Overall approach: Simulated health response to PM2.5 in Generalized Additive Model (GAM) Framework

Spatial and temporal extent: daily values for NYC metropolitan area from Jan 1st 2000 to Dec 31 st 2006 Hospital Admissions: Daily total for asthma, chronic bronchitis, emphysema, and chronic obstructive pulmonary disease from NYS Department of Health Statewide Planning & Research Cooperative (SPARCS) 4 1. Actual health and air quality data used to parameterize simulations Met Data: Max. temperature and relative humidity averaged across 3 monitors (Bronx, Queens, Richland) Air Quality Data: US EPA AIRS database for NY (Region 2) 4 Kings Pop. 2,465,326 Queens Pop. 2,229,379 Bronx Pop. 1,332,650 New York Pop. 1,537,195

5 2. PM2.5 (24-hr avg) estimated as a function of O3, TEMP, RH plus noise to ensure collinearity 10 simulations of PM2.5 (24-hr avg) estimated as a function of O3, TEMP, RH plus noise (modified daily) to ensure collinearity Confounders kept stable for all 10 simulations, while timeseries (ts) for PM2.5 and HA were systematically modified

6 3. Hospital admissions (daily total) in correspondence to PM2.5 simulated for 3 health effect values  Y-intercept (β 0 = 4.74); baseline incidence rate (average considering all independent variables)  Slope (β PM = 0.0, or ); Increase in HA due to PM2.5; β PM = means increase in 10 ppb in PM2.5 causes a rise in HA of about 4%  ‘Flu’ variable (time seasonal = 1 month or 3 months); to account for a more persistent flu/allergy Ln(y) = Ln(B) + ß(PM) Incidence (log scale) PM concentration Ln(B) Epidemiology Study Source: N. Fann, 2009 CMAS presentation 10 simulations (5 for each ‘flu’ scenario) of hospital admissions corresponding to PM2.5 simulations; 3 parameters:

3. Hospital admissions (daily total) in correspondence to PM2.5 simulated for 3 health effect values

Examined the impact of collinearity between PM2.5 and 4 confounding variables on the relative risk HA ~ β 0 + β PM PM2.5 + s(time) + s(O3) + s(TEMP) + s(RH) family=quasi-poisson –Spline function (s) –Calendar time (time) –Ozone 8-hr daily max. concentration (O3) –Daily maximum temperature (TEMP) –Relative humidity (RH) 4. GAM systematically run with simulated datasets Series of 100 simulations (10 PM2.5 realizations x 10 HA [5 each 1- mo flu and 3-mo flu]) realizations Randomly selected 2 HA realizations for a total of 200 simulations Also examined effect of knots on ‘time’ variable as this variable has the strongest association with hospital admissions

Note on smoothing ‘time’ variable and number of knots 4. GAM systematically run with simulated datasets More knots equates to constraining the model more tightly In the example to the right, more knots required to characterize ‘seasonal’ than long-term ‘trend’ Source graphic: Stats.stackexchange.com

Results - Change in β PM (Health Effect) k = knots

Analysis of β PM parameter –GAM is successful at extracting/re-estimating health effect (HA) with three ‘true’ β PM values (0.000, and 0.004) –This is true with year-long data or when data for summers only used; summer-only information leads to less precision (likely from smaller sample size) –Narrowing of β PM interquartile ranges from top (k-10) to bottom (k-49) panels shows increasing number of knots in ‘time’ smooth function leads to less variability in β PM –GAM is skilled at detecting no signal as well as extracting the true health signal Results

Results - Change in Standard Error (Related to Statistical Relevance) Standard error of β PM

Results - Change in Correlation Coefficient (Goodness of Fit) Correlation Coefficient

Results – 4 confounders vs time-variant confounder only

Analysis of standard error (se) –Range of se for 3 different health effects similar for same number of knots indicating incremental HA variability caused by health signals is not a major factor in the overall variability of HA –Increasing smoothness of time confounder leads to smaller se –Loss in accuracy is seen with “summer only” dataset due to limiting data Analysis of correlation coefficient (goodness of fit) –Correlation coefficient increases with number of knots –Correlation coefficients characterizing 3 true slopes are similar; again indicating that the overall affect of PM pollution on HA is rather small in comparison to the other factors such as seasonality Results

16 Summary GAM detects the absence or the existence of a very weak signal (β PM ) effectively, regardless of confounding health effect covariates –Better detects signal (tighter IQR, less error) using year- round data (with time variant function) as opposed to summers only Relationship with time variant variable swamps more subtle pollution and meteorology relationships with health outcome Modifying the knots (or degrees of freedom) for this variable can drastically alter results Increasing knots resulted in better goodness of fit measures and tighter confidence intervals