1 Practical issues and tools for modeling temporal and spatio-temporal trends in atmospheric pollutant monitoring data Paul D. Sampson Department of Statistics.

Slides:



Advertisements
Similar presentations
Sampling Design, Spatial Allocation, and Proposed Analyses Don Stevens Department of Statistics Oregon State University.
Advertisements

Estimating health effects of air pollution: challenges ahead Francesca Dominici SAMSI Workshop September
SAMSI Workshop: September 15, 2009 Discussion on Spatial Epidemiology: with focus on Chronic Effects of Air Pollution Discussion on Spatial Epidemiology:
Spatial point patterns and Geostatistics an introduction
1 PM NAAQS: Update on Coarse Particle Monitoring and Research Efforts Lydia Wegman, Office of Air Quality Planning & Standards, EPA Presentation at the.
Heather Simon, Adam Reff, Benjamin Wells, Neil Frank Office of Air Quality Planning and Standards, US EPA Ozone Trends Across the United States over a.
NASA AQAST 6th Biannual Meeting January 15-17, 2014 Heather Simon Changes in Spatial and Temporal Ozone Patterns Resulting from Emissions Reductions: Implications.
Combining Information from Related Regressions Duke University Machine Learning Group Presented by Kai Ni Apr. 27, 2007 F. Dominici, G. Parmigiani, K.
Measuring the Health Impacts of Air Pollution in Toronto and Hamilton Murray M. Finkelstein PhD MD Associate Professor, Department of Family Medicine and.
Improving Understanding of Global and Regional Carbon Dioxide Flux Variability through Assimilation of in Situ and Remote Sensing Data in a Geostatistical.
Budapest May 27, 2008 Unifying mixed linear models and the MASH algorithm for breakpoint detection and correction Anders Grimvall, Sackmone Sirisack, Agne.
U.S. EPA Office of Research & Development October 30, 2013 Prakash V. Bhave, Mary K. McCabe, Valerie C. Garcia Atmospheric Modeling & Analysis Division.
Critical Issues of Exposure Assessment for Human Health Studies of Air Pollution Michelle L. Bell Yale University SAMSI September 15, 2009.
Spring INTRODUCTION There exists a lot of methods used for identifying high risk locations or sites that experience more crashes than one would.
x – independent variable (input)
Department of Engineering and Public Policy Carnegie Mellon University Integrated Assessment of Particulate Matter Exposure and Health Impacts Sonia Yeh.
Space-time statistics STAT 518 Sp08. Space-time processes Separable covariance structure: Cov(Z(x,t),Z(y,s))=C 1 (x,y)C 2 (s,t) Nonseparable alternatives.
USING BAYESIAN HIERARCHICAL MODELLING TO PRODUCE HIGH RESOLUTION MAPS OF AIR POLLUTION IN THE EU Gavin Shaddick University of Bath RSS Avon Local Group.
Statistics, data, and deterministic models NRCSE.
Deterministic Solutions Geostatistical Solutions
Human health applications of atmospheric remote sensing Simon Hales, Housing and Health Research Programme, University of Otago, Wellington, New Zealand.
Distribution Function Estimation in Small Areas for Aquatic Resources Spatial Ensemble Estimates of Temporal Trends in Acid Neutralizing Capacity Mark.
STAT 592A(UW) 526 (UBC-V) 890-4(SFU) Spatial Statistical Methods NRCSE.
Health Risks of Exposure to Chemical Composition of Fine Particulate Air Pollution Francesca Dominici Yeonseung Chung Michelle Bell Roger Peng Department.
Statistical Tools for Environmental Problems NRCSE.
Distribution Function Estimation in Small Areas for Aquatic Resources Spatial Ensemble Estimates of Temporal Trends in Acid Neutralizing Capacity Mark.
Maps of PM2.5 over the U.S. Derived from Regional PM2.5 and Surrogate Visibility and PM10 Monitoring Data Stefan R. Falke and Rudolf B. Husar Center for.
Personal Monitoring for Air Pollution Exposure Philip M. Fine, Ph.D. Atmospheric Measurements Manager South Coast Air Quality Management District CAPCOA.
Impact of climate change on PM 2.5 air quality inferred from a multi-model analysis of meteorological modes Loretta J. Mickley Co-Is: Amos P.K.A.
Jenny Stocker, Christina Hood, David Carruthers, Martin Seaton, Kate Johnson, Jimmy Fung The Development and Evaluation of an Automated System for Nesting.
1 Satellite Remote Sensing of Particulate Matter Air Quality ARSET Applied Remote Sensing Education and Training A project of NASA Applied Sciences Pawan.
TCEQ/NUATRC Air Toxics Workshop: Session V – Human Health Effects Nathan Pechacek, M.S. Toxicology Section Texas Commission on Environmental Quality
Intra-Urban Variation in Air Pollution – Implications for Nutritional Interventions Jane E. Clougherty, MSc, ScD Assistant Professor/ Director of Exposure.
Weed mapping tools and practical approaches – a review Prague February 2014 Weed mapping tools and practical approaches – a review Prague February 2014.
Air Quality Health Risk Assessment – Methodological Issues and Needs Presented to SAMSI September 19, 2007 Research Triangle Park, NC Anne E. Smith, Ph.D.
Earth System Sciences, LLC Suggested Analyses of WRAP Drilling Rig Databases Doug Blewitt, CCM 1.
Reinhard Mechler, Markus Amann, Wolfgang Schöpp International Institute for Applied Systems Analysis A methodology to estimate changes in statistical life.
Task Force on Health Recent results - Particulate matter Michal Krzyzanowski TFH Chair Head, Bonn Office European Centre for Environment and Health WHO.
Term 4, 2005BIO656 Multilevel Models1 Hierarchical Models for Pooling: A Case Study in Air Pollution Epidemiology Francesca Dominici.
1 Satellite Remote Sensing of Particulate Matter Air Quality ARSET Applied Remote Sensing Education and Training A project of NASA Applied Sciences Pawan.
Regional climate prediction comparisons via statistical upscaling and downscaling Peter Guttorp University of Washington Norwegian Computing Center
Why is Air Pollution a Global Public Health Problem? Daniel Krewski, PhD, MHA McLaughlin Centre for Population Health Risk Assessment NERAM Colloquium.
CMAS special session Oct 13, 2010 Air pollution exposure estimation: 1.what’s been done? 2.what’s wrong with that? 3.what can be done? 4.how and what to.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Health effects modeling Lianne Sheppard University of Washington.
Urban vs. Rural Atlanta An assessment of : 1)PM2.5 composition and trends 2)The Atlanta Urban Heat Island Effect.
Mixture Models, Monte Carlo, Bayesian Updating and Dynamic Models Mike West Computing Science and Statistics, Vol. 24, pp , 1993.
The Use of Source Apportionment for Air Quality Management and Health Assessments Philip K. Hopke Clarkson University Center for Air Resources Engineering.
Spatio-Temporal Surface Vector Wind Retrieval Error Models Ralph F. Milliff NWRA/CoRA Lucrezia Ricciardulli Remote Sensing Systems Deborah K. Smith Remote.
Spatial Interpolation III
Predicting Long-term Exposures for Health Effect Studies Lianne Sheppard Adam A. Szpiro, Johan Lindström, Paul D. Sampson and the MESA Air team University.
OBJECTIVES (i) An update of the national analysis a) to assess the confounding and modifying effect of community and neighbourhood level ecological.
Impact of Air Pollution on Public Health: Transportability of Risk Estimates Jonathan M. Samet, MD, MS NERAM V October 16, 2006 Vancouver, B.C. Department.
Local Prediction of a Spatio-Temporal Process with Application to Wet Sulfate Deposition Presented by Isin OZAKSOY.
Lecture 6: Point Interpolation
Exposure Assessment for Health Effect Studies: Insights from Air Pollution Epidemiology Lianne Sheppard University of Washington Special thanks to Sun-Young.
Opening Remarks -- Ozone and Particles: Policy and Science Recent Developments & Controversial Issues GERMAN-US WORKSHOP October 9, 2002 G. Foley *US EPA.
Technical Details of Network Assessment Methodology: Concentration Estimation Uncertainty Area of Station Sampling Zone Population in Station Sampling.
INTEGRATING SATELLITE AND MONITORING DATA TO RETROSPECTIVELY ESTIMATE MONTHLY PM 2.5 CONCENTRATIONS IN THE EASTERN U.S. Christopher J. Paciorek 1 and Yang.
PM Methods Update and Network Design Presentation for WESTAR San Diego, CA September 2005 Peter Tsirigotis Director Emissions, Monitoring, and Analysis.
Breakout Session 1 Air Quality Jack Fishman, Randy Kawa August 18.
Space-time processes NRCSE. Separability Separable covariance structure: Cov(Z(x,t),Z(y,s))=C S (x,y)C T (s,t) Nonseparable alternatives Temporally varying.
Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.
Study of PM10 Annual Arithmetic Mean in USA  Particulate matter is the term for solid or liquid particles found in the air  The smaller particles penetrate.
Satellite Data for Health & Air Quality Applications
N Engl J Med Jun 29;376(26): doi: 10
How Can TEMPO Contribute to Air Pollution Health Effects Research
Paul D. Sampson Peter Guttorp
Air Quality Assessment and Management
Characteristics of Study Subjects and Events, According to the Quintile of Exposure to Fine Particulate Matter (PM2.5)* - Part I Kristin A. Miller et al.
Presentation transcript:

1 Practical issues and tools for modeling temporal and spatio-temporal trends in atmospheric pollutant monitoring data Paul D. Sampson Department of Statistics University of Washington The International Environmetric Society Modelling Spatio-Temporal Trends Workshop 3 November 2003

2 Our experience in analysis of trends in atmospheric pollutants Part I: Meteorological adjustment and long-term temporal trends in ozone Meteorological adjustment of western Washington and northwest Oregon surface ozone observations with investigation of trends. Reynolds, Das, Sampson, Guttorp, NRCSE TRS #15 ( ) Meteorological adjustment of Chicago, Illinois, surface ozone observations with investigation of trends. Reynolds, Caccia, Sampson, Guttorp, NRCSE TRS #25 ( ) A review of statistical methods for the meteorological adjustment of tropospheric ozone. Thompson, Reynolds, Cox, Guttorp, Sampson, Atmospheric Environment 35, , Part II: Spatial trend for health effects studies Spatial estimation of ambient air concentrations for ozone, , for chronic health effects modeling in 83 counties in the U.S. Current contract with U.S. EPA. Spatio-temporal modeling and prediction of ambient PM2.5 concentrations for acute and chronic health effects modeling with the NIH/NHLBI cohort study, MESA (Multi-Ethnic Study of Atheroscloerosis). Current proposal and ongoing collaboration with colleagues at the Univ of Washington’s Northwest Center for Particulate Matter and Health.

3 Spatio-temporal modeling of ambient PM exposure for chronic health effect studies Paul D. Sampson Department of Statistics University of Washington Northwest Center for Particulate Matter & Health External Science Advisory Committee Meeting 12 November 2003

4 Major North American cohort studies of PM: single community- wide exposure/monitor to characterize a metropolitan area. Fails to address important local spatial variation of air pollutants known to exist within regions. Hoek et al.: 3-component regression model to predict exposure to air pollutants (black smoke and NO 2 ). Incorporates (a) regional background levels, (b) urban gradient (based on population density) and (c) proximity to heavily-trafficked roadways and other point sources. Build on this approach to combine in a spatial model average concentration data from fixed-site ambient monitors and spatial covariate information encoded in a GIS, including population density, proximity to roads, and traffic density. Motivation for fine(r)-scale spatial modeling of pollutant exposure for chronic health effect studies:

5 Aside: U.S. EPA currently funded Epidemiologic Research on Health Effects of Long-Term Exposure to Ambient P.M. and Other Air Pollutants (June 2003) Laden, Schwartz et al (Harvard): Chronic Exposure to Particulate Matter and Cardiopulmonary Disease. Nurses Health Study: Prospective cohort study of 121,700 women throughout U.S. Knutson, Beeson et al (Loma Linda): Relating Cardiovascular Disease Risk to Ambient Air Pollutants using GIS and Bayesian Neural Networks. AHSMOG study. Samet, Zeger, Dominici et al (Johns Hopkins): Chronic and Acute Exposure to Ambient Fine Particulate Matter and Other Air Pollutants: National Cohort Studies of Mortality and Morbidity. Data from Medicate beneficiary file and National Claims History File. Diez-Roux, Keeler, Samson, Lin (Michigan). Long-Term Exposure to Ambient PM and Subclincial Atherosclerosis. MESA Study.

6 EPA apparently mandated/directed that all these studies be concerned with computing exposure estimates from ambient monitoring data and GIS-based information on local traffic, pop density, …. Jon Samet (Johns Hopkins): “EPA should invest in drawing national maps of exposure as all our research groups are trying to do the same thing.”

7 Applications: “MESA Air”: NHLBI-funded Multi-Ethnic Study of Atherosclerosis: effects of ambient PM (and other pollutants) on subclinical cardiovascular function –8700 subjects, aged 50-89, from 9 communities, assessed prospectively, longitudinally. –Monitoring data and exposure assessment: Current AQS PM monitors (mostly 3-day sampling) Supplemental monitors, up to 5 per community (2 week integrated msmts of key pollutants) Mobile gradient monitoring (2 week integrated sampling) PLUS Distances to nearest major roadways with traffic volume and composition Distances to pollutant point source EVALUATION on PM2.5 and co-pollutants measured at 10% of homes Preliminary demonstration of spatio-temporal modeling using S. Calif ozone data.

8 sum of non-ambient (N) and ambient (A) components: Ambient exposure is ambient concentration times an ambient exposure attenuation factor reflecting time spent outside the home and particle infiltration into the home: Model for ambient concentration: trend + residual Personal PM exposure for subject I at time t:

9 Smoothly varying spatio-temporal trend is further decomposed: the 1 st term represents long-term mean concentration and will derive from a Bayesian analysis of a spatial regression model combining average concentration data from fixed- site ambient monitors and spatial covariate information encoded in a GIS. the 2 nd component represents mainly smooth seasonal temporal variation.

10 The variance model for the residual term represents the spatio-temporal variation considered primarily at the 2- week time scale of the fixed sites and mobile gradient monitors. Estimation of this component will be based on (extensions to) the Bayesian model for the Sampson- Guttorp spatial deformation approach to nonstationary spatial covariance as demonstrated in Damian et al. (2001, 2003). This modeling strategy accommodates the spatial varying effects of predominant meteorology, coast lines and topographic features that underlie the statistical relationship between time varying pollutant levels at different points in space.

11

12

13

14

15

16

17

18

19

20

21

22

23 Following Hoek and colleagues (2002 Atmos Env, 2003 Epidemiology), assume the regression model can be written Where represent pop density, proximity to roads, traffic density, and possibly local topographic and climatic wind patterns. The Bayesian analysis incorporates prior information on the parameters and on the spatial covariance structure of residuals from this regression model in a manner similar to that of our Bayesian framework for spatial estimation of the residual component (see (3) below). (1) Estimation of the long-term mean spatial field

24 Note that monitoring observations will be used directly in the estimation, not just in the specification or calibration of the regression model as in the work of Hoek et al. I.e., in Hoek et al., (long-term) exposure is estimated as: In our (geostatistical) approach, we will be estimating the space-time field, ; the long-term exposure at a point includes an estimated (“kriged”) spatial residual and can be written: Mean field

25 The spatial index in the 2 nd component allows for the possibility that the magnitude and precise details of the seasonal variation may vary from location to location over the spatial scale of the regional target communities. Preliminary analysis of PM 2.5 monitoring data in the Los Angeles county region suggests some spatial variation in seasonality, but in some regions we expect to find that this seasonal variation is homogeneous, permitting an additive (separable) decomposition of the spatio-temporal trend. (2) Smooth, spatially varying, temporal variation.

26 Characterize and estimate the seasonal structure of air pollutant concentrations in terms of a model written as: where the are temporal basis functions describing possible seasonal trend patterns, and represent spatially varying coefficients of these trend patterns. Example: O3 trend components. (What do we expect with PM more generally?) Trend decomposition

27 We compute trend components empirically as smoothed versions of the temporal singular vectors of the T  N data matrix (rather than assuming parametric forms such as trigonometric functions). Arbitrary amounts of missing data are accommodated in an EM-like iterative calculation of the SVD. The Bayesian spatial regression model can incorporate the coefficients of these trend components as spatial fields, and thus provide the basis for estimation of at target homes.

28

29

30

31

32

33

34

35

36

37

38 Final component: spatio-temporal variation at the (2-week) time scale of the fixed sites and mobile gradient monitors. Sampson-Guttorp spatial deformation approach (Damian et al. 2001, 2003), to model the nonstationary spatial covariance structure. Allows for spatially varying effects of predominant meteorology, coast lines and topographic features that underlie the statistical relationship between time varying pollutant levels at different points in space. Bayesian analysis provides a full posterior distribution for the model parameters, and thus a ready computation of multiple imputations of exposures for the health effects analysis. (3) Nonstationary residual spatio-temporal variation.

39

40

41 Observed vs Predicted ozone at 3 validation sites.

42

43

44

45 Conclusion: We can estimate/predict both the day-to-day deviations from the trend, and the seasonal shape of the trend quite well, but We sometimes miss the long-term mean. => need to incorporate extra local information to predict the mean concentration.

46

47 Technical details

48 Gaussian assumption after transformation Current AQS data sampling usually every 3 days; proposed sampling on 2-week intervals Conditional, hierarchical approach to estimating the parameters of our space-time models from this “incomplete data,” beginning with models estimated from the longer-term AQS data and them updating estimated model parameters with data from the new fixed and mobile monitors. Details, issues, and extensions

49 First stage of analysis: build separate models and estimates for the three major exposures of interest, PM 2.5, NO x, and O 3. Second stage: take advantage of the association between PM 2.5 and NO x in a multivariate (“co-kriging”) analysis that assumes only that spatial nonstationarity can be expressed in a common underlying deformed coordinate system.

50 or where we are writing C as an S  T (space-time) matrix of observations,  is an S  (J+1) matrix of coefficients multiplying the matrix F, (J+1)  T, with columns containing values of the basis functions evaluated at the S observation sites (i=1,…,S). Obvious calculation is an SVD of the concentration matrix C.

51 where the columns of the (truncated) matrix of right singular vectors is considered to represent the matrix of values of the J+1 temporal basis functions: F = Issues: Smoothness of the singular vectors as components of trend; computation with missing data.

52 Posterior sample

53 Site variances