SAMSI Workshop: September 15, 2009 Discussion on Spatial Epidemiology: with focus on Chronic Effects of Air Pollution Discussion on Spatial Epidemiology:

Slides:



Advertisements
Similar presentations
1 ESTIMATION IN THE PRESENCE OF TAX DATA IN BUSINESS SURVEYS David Haziza, Gordon Kuromi and Joana Bérubé Université de Montréal & Statistics Canada ICESIII.
Advertisements

Sampling Design, Spatial Allocation, and Proposed Analyses Don Stevens Department of Statistics Oregon State University.
Estimating health effects of air pollution: challenges ahead Francesca Dominici SAMSI Workshop September
Spatial point patterns and Geostatistics an introduction
Spatial point patterns and Geostatistics an introduction
Developing Event Reconstruction for CTA R D Parsons (Univ. of Leeds) J Hinton (Univ. of Leicester)
The Simple Linear Regression Model Specification and Estimation Hill et al Chs 3 and 4.
Karin Larsson. Exposure, Disease and Risk. Presentation of some case studies using geographical information systems (GIS) Occupational and Environmental.
Topic 12: Multiple Linear Regression
MCUAAAR: Methods & Measurement Core Workshop: Structural Equation Models for Longitudinal Analysis of Health Disparities Data April 11th, :00 to.
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
StatisticalDesign&ModelsValidation. Introduction.
1 Characterization of Spatial Heterogeneity for Scaling Non Linear Processes S. Garrigues 1, D. Allard 2, F. Baret 1. 1 INRA-CSE, Avignon, France 2 INRA-Biométrie,
Split Questionnaire Designs for Consumer Expenditure Survey Trivellore Raghunathan (Raghu) University of Michigan BLS Workshop December 8-9, 2010.
CONCEPTS UNDERLYING STUDY DESIGN
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
Halûk Özkaynak US EPA, Office of Research and Development National Exposure Research Laboratory, RTP, NC Presented at the CMAS Special Symposium on Air.
Critical Issues of Exposure Assessment for Human Health Studies of Air Pollution Michelle L. Bell Yale University SAMSI September 15, 2009.
Using the Maryland Biological Stream Survey Data to Test Spatial Statistical Models A Collaborative Approach to Analyzing Stream Network Data Andrew A.
Model and Variable Selections for Personalized Medicine Lu Tian (Northwestern University) Hajime Uno (Kitasato University) Tianxi Cai, Els Goetghebeur,
Predictive Automatic Relevance Determination by Expectation Propagation Yuan (Alan) Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani.
FOUR METHODS OF ESTIMATING PM 2.5 ANNUAL AVERAGES Yan Liu and Amy Nail Department of Statistics North Carolina State University EPA Office of Air Quality,
USING BAYESIAN HIERARCHICAL MODELLING TO PRODUCE HIGH RESOLUTION MAPS OF AIR POLLUTION IN THE EU Gavin Shaddick University of Bath RSS Avon Local Group.
“Ghost Chasing”: Demystifying Latent Variables and SEM
GIS in Spatial Epidemiology: small area studies of exposure- outcome relationships Robert Haining Department of Geography University of Cambridge.
Introduction to statistical estimation methods Finse Alpine Research Center, September 2010.
Human health applications of atmospheric remote sensing Simon Hales, Housing and Health Research Programme, University of Otago, Wellington, New Zealand.
FOUR METHODS OF ESTIMATING PM 2.5 ANNUAL AVERAGES Yan Liu and Amy Nail Department of Statistics North Carolina State University EPA Office of Air Quality,
1 Practical issues and tools for modeling temporal and spatio-temporal trends in atmospheric pollutant monitoring data Paul D. Sampson Department of Statistics.
W. James Gauderman, Edward Avol, Fred Lurmann, Nino Kuenzli, Frank Gilliland, John Peters, and Rob McConnell 1 Epidemiology, Volume 16, Number 6, 2005.
Hierarchical models for combining multiple data sources measured at individual and small area levels Chris Jackson With Nicky Best and Sylvia Richardson.
NERAM 2006 Matching the metric to need: modelling exposures to traffic- related air pollution for policy support David Briggs, Kees de Hoogh and John Gulliver.
Intra-Urban Variation in Air Pollution – Implications for Nutritional Interventions Jane E. Clougherty, MSc, ScD Assistant Professor/ Director of Exposure.
The Impact of Air Pollution on Human Health Fernando Holguin MD MPH Associate Professor of medicine and Pediatrics Director, Clinical and Translational.
Air Quality Health Risk Assessment – Methodological Issues and Needs Presented to SAMSI September 19, 2007 Research Triangle Park, NC Anne E. Smith, Ph.D.
TWO-STAGE CASE-CONTROL STUDIES USING EXPOSURE ESTIMATES FROM A GEOGRAPHICAL INFORMATION SYSTEM Jonas Björk 1 & Ulf Strömberg 2 1 Competence Center for.
Term 4, 2005BIO656 Multilevel Models1 Hierarchical Models for Pooling: A Case Study in Air Pollution Epidemiology Francesca Dominici.
Robust GW summary statistics & robust GW regression are used to investigate spatial variation and relationships in a freshwater acidification critical.
Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 3: Incomplete Data in Longitudinal Studies.
CMAS special session Oct 13, 2010 Air pollution exposure estimation: 1.what’s been done? 2.what’s wrong with that? 3.what can be done? 4.how and what to.
Health effects modeling Lianne Sheppard University of Washington.
Studying the effects of air pollution on children’s health Presented by Elizabeth Stanwyck with Dr. Bimal Sinha University of Maryland, Baltimore County.
Spatial Interpolation III
Modeling as an exposure estimation approach for use in epidemiologic studies Part 2: Example applications KL Dionisio 1, LK Baxter 1, V Isakov 1, SE Sarnat.
Predicting Long-term Exposures for Health Effect Studies Lianne Sheppard Adam A. Szpiro, Johan Lindström, Paul D. Sampson and the MESA Air team University.
Robust GW summary statistics & robust GW regression are used to investigate a freshwater acidification data set. Results show that data relationships can.
OBJECTIVES (i) An update of the national analysis a) to assess the confounding and modifying effect of community and neighbourhood level ecological.
CHAPTER 17 O PTIMAL D ESIGN FOR E XPERIMENTAL I NPUTS Organization of chapter in ISSO –Background Motivation Finite sample and asymptotic (continuous)
Lisa K. Baxter, Kathie L. Dionisio, Janet Burke, and Halûk Özkaynak National Exposure Research Laboratory, U.S. EPA Modeling as an exposure estimation.
GOING FROM 12-KM TO 250-M RESOLUTION Josephine Bates 1, Audrey Flak 2, Howard Chang 2, Heather Holmes 3, David Lavoue 1, Mitchel Klein 2, Matthew Strickland.
Exposure Assessment for Health Effect Studies: Insights from Air Pollution Epidemiology Lianne Sheppard University of Washington Special thanks to Sun-Young.
IRAS en het ILM Gerard Hoek Health effects of NO 2 : evidence from recent epidemiological studies Gerard Hoek Institute for Risk Assessment Sciences.
Designing Factorial Experiments with Binary Response Tel-Aviv University Faculty of Exact Sciences Department of Statistics and Operations Research Hovav.
Montserrat Fuentes Statistics Department NCSU Research directions in climate change SAMSI workshop, September 14, 2009.
Tutorial I: Missing Value Analysis
SGPP: Spatial Gaussian Predictive Process Models for Neuroimaging Data Yimei Li Department of Biostatistics St. Jude Children’s Research Hospital Joint.
INTEGRATING SATELLITE AND MONITORING DATA TO RETROSPECTIVELY ESTIMATE MONTHLY PM 2.5 CONCENTRATIONS IN THE EASTERN U.S. Christopher J. Paciorek 1 and Yang.
Matrix Models for Population Management & Conservation March 2014 Lecture 10 Uncertainty, Process Variance, and Retrospective Perturbation Analysis.
1. Objectives Novartis is developing a new triple fixed-dose combination product. As part of the clinical pharmacology program, pharmacokinetic (PK) drug-drug.
Modeling as an exposure estimation approach for use in epidemiologic studies Part 2: Example applications KL Dionisio 1, LK Baxter 1, V Isakov 1, SE Sarnat.
Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.
Satellite Data for Health & Air Quality Applications
N Engl J Med Jun 29;376(26): doi: 10
CH 5: Multivariate Methods
Workshop on Methods for Genomic Selection (El Batán, July 15, 2013) Paulino Pérez & Gustavo de los Campos.
How Can TEMPO Contribute to Air Pollution Health Effects Research
Department of Bioscience
OVERVIEW OF LINEAR MODELS
Crime survey Neuchatel, 7-8 July 2011
OVERVIEW OF LINEAR MODELS
Presentation transcript:

SAMSI Workshop: September 15, 2009 Discussion on Spatial Epidemiology: with focus on Chronic Effects of Air Pollution Discussion on Spatial Epidemiology: with focus on Chronic Effects of Air Pollution Kiros Berhane, Ph.D. (with Duncan Thomas, Jim Gauderman and the CHS Team) Department of Preventive Medicine Keck School of Medicine University of Southern California Los Angeles, CA, USA (

SAMSI Workshop: September 15, 2009 Outline Long Term Cohort Studies –The Childrens Health Study –The multi-Level modeling Paradigm Spatio-temporal Issues Integrated modeling Discussion points

SAMSI Workshop: September 15, 2009 Childrens Health Study Background Designed to take advantage of existing air monitoring data to choose optimal sites Exploits temporal, spatial, and individual comparisons Extensive exposure and health assessment to support all three levels of comparison Study Goal: To assess whether air pollution (regional and/or local) is associated with chronic health effects in children?

SAMSI Workshop: September 15, 2009 LLLL LMLHLMLH HMLMHMLM HHHH LHMHLHMH HHHL HMHLHMHL MMMM MLLL O 3, PM 10, NO 2, H + : L = low M = Medium H = High

SAMSI Workshop: September 15, 2009 –Level I: Between times (k) within subjects (i ) y cik = a ci + b ci t cik + z cik + (x cik –x ci ) 1 + e cik –Level II: Between subjects within community (c) b ci = B c + z ci + (x ci – X c ) 2 + e ci –Level III: Between communities B c = 0 + Z c + X c 3 + e c Fitted simultaneously as a mixed effects model Linear Multi-level Model Spatio-temporal effects could be assessed at any of the levels Berhane et al, Statist Sci 2004; 19:

SAMSI Workshop: September 15, 2009 Framework for Spatial Data Consider –individual data on outcomes (Y i ), covariates (Z i ), and locations (P i ) –ecologic data on communities (W c ) We wish to choose a sample S of individuals on whom to measure true exposure (X i ) We develop an exposure model to predict E(X i |Z i,P i,W c ) for subjects not in S We want a hierarchical spatial measurement error model for analysis of Y i | X i,Z i,P i,W c

SAMSI Workshop: September 15, 2009 Modeling Approaches Hierarchical random effects model for outcome i = E(Y i ) = A C i + Z i 1 + i 2 + s(P i ) E(A c ) = W c i = E(X i | Z i,P i,W C i ; ) cov(Y i,Y j )| C i =C i = exp( ||P i P j ||) cov(A c,A d ) = exp( ||P c P c ||) Likelihood for sampled data L (a,b) = i S Pr(Y,X |Z,S) x i S Pr(Y |Z,S)

SAMSI Workshop: September 15, 2009 Accounting for Intra- Community Variation Goals: To build a model for personal exposure combining spatio-temporal model for ambient concentrations with time-activity data from questionnaires and measurements To optimize the design of time/activity sampling

SAMSI Workshop: September 15, 2009 W Y Z X Traffic, Land Use Local Exposure Measurements HealthOutcome TrueExposure L Locations P RegionalBackground Molitor et al, AJE 2506;164:69-76 (nonspatial) Molitor et al, EHP 2507: (spatial) Bayesian Spatial Measurement Error Model Subsample S | Y, L, W

SAMSI Workshop: September 15, 2009 Spatial Regression Model Exposure model E(X i ) = W i W = land use covariates, dispersion model predictions cov(X i,X j ) = 2 I ij + 2 exp(– D ij ) MESA Air spatio-temporal model: x(s,t) = X 0 (s) + k X k (s) T k (t) Measurement modelE(Z i ) = X i Disease model g[E(Y i )] = X i Multivariate exposure model (co-kriging)

SAMSI Workshop: September 15, 2009 ASSIGNMENT OF LOCAL EXPOSURES For all homes in cohort, we can assign an estimated exposure based on fitted parameters Systematic component depends on community ambient level and traffic density Random component is weighted mean of measurements at other homes, using estimated covariance matrix E(x ci ) = Z ci ´ j i (x cj Z ci ´ ) C cij / C cii

SAMSI Workshop: September 15, 2009 Spatial Model: for Full Cohort Fit subsample data, regressing measurements Z on predictors W E(Z i ) = W i cov(Z i,Z j ) = 2 I ij + 2 exp(– D ij ) Impute exposures X to all subjects based on W and mean of residuals for neighbors X i = Z i + i N j (Z j – X j ) w ij Fit full cohort, regressing health outcomes Y on imputed X, weighted by uncertainties of imputations E(Y i ) = X i var(Y i ) = var(X i ) Thomas, LDA 2007; 13: ^ ^ ^ ^ ^ ^

SAMSI Workshop: September 15, 2009 Multivariate CAR Model Structured covariance matrix with submatrices for each pollutant (p,q) and their correlations cov(X pi,X qj ) = pq exp( pq D ij ) Hope is to incorporate atmospheric chemistry and dispersion theory in means and covariance models We have currently spatial measurements on samples of homes for NO 2 and O 3, but not the same homes Plans to measure NO 2, NO, and O 3 in a larger sample of homes

SAMSI Workshop: September 15, 2009 Sampling Strategies Case-control: choose S to be set of asthma cases and their town-matched controls Surrogate diversity: choose S that maximizes the variance of traffic density Spatial diversity: choose S that maximizes the geographic spread of measurements –Maximize total distance from all other points –Maximize minimum distance from nearest point –Maximize the informativeness of sample for predicting non- sample points Hybrid: First measure cases and controls; then add additional subjects that would be most informative for refining E(X |Z,P,W ) Thomas, LDA 2007; 13:

SAMSI Workshop: September 15, 2009 Spatial Sampling Plans RandomMaximize var(W) Maximize min(info) Maximize min(dist) Thomas, Lifetime Data Analysis 2007; 13:

SAMSI Workshop: September 15, 2009 Additional Substudies Personal exposure measurements Biomarkers of latent disease processes Time-activity data –Have usual times and subjective activity levels in various locations (home, school, playgrounds, in transit, etc.) –Plan to obtain GPS measurements of actual time- resolved locations on a subsample for short periods –Also plan to obtain step-counts and/or accelerometry on a subsample for short periods

SAMSI Workshop: September 15, 2009 Methodologic Research Framework in the CHS Exposure Determinant True Exposure Measurements Genes Asthma Lung Function Biomarker Oxidative Stress & Inflammation Aim 1 Aim 2 Aim 3 Aim 4

SAMSI Workshop: September 15, 2009 Further Extensions of the Integrated Research program

SAMSI Workshop: September 15, 2009 Details Spatio-temporal pollution field: x(s,t) = X(s) T(t) where X = CAR(W a,s x 2,t 2,r,D) Ambient measurements: Z(t) ~ LN(T(t), Z 2 ) Local measurements: z(s) ~ LN(x(s,t s ), z 2 ) True personal exposure: X i = m im X(s im ) e im Personal exposure measurements: Z i ~ LN(X i, X 2 ) Time in MEs: (p i, P i ) ~ MN( i, (n,N)), i ~ MN( ) Activity in MEs: (v im,V im ) ~ LN( im, v 2 ), im ~ LN( m, 2 ) Latent disease: L i ~ LN( X i + 2 G i + 3 X i G i, L 2 ) Biomarker measurements: B i ~ LN(L i, B 2 ) Disease: Y i ~ N( L i, Y 2 )

SAMSI Workshop: September 15, 2009 Discussion Points Issues with exposure modeling for Intra-community variation –Measurement error? –Implications of using snapshots in space/time to assess long term exposure? –Implications of sampling strategies? Differences in spatio-temporal resolution of data: Outcome vs. Exposure –Implications for health effects analysis? Integrated Modeling approaches vs. Compartmentalized modeling –Which way to go? Issues in Chronic vs. Acute effects analysis –Are they really different?

SAMSI Workshop: September 15, 2009

THANK YOU! Contact me at