Occurrence and timing of events depend on Exposure to the risk of an event exposure Risk depends on exposure.

Slides:



Advertisements
Similar presentations
If we use a logistic model, we do not have the problem of suggesting risks greater than 1 or less than 0 for some values of X: E[1{outcome = 1} ] = exp(a+bX)/
Advertisements

Logistic Regression Psy 524 Ainsworth.
Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
[Part 1] 1/15 Discrete Choice Modeling Econometric Methodology Discrete Choice Modeling William Greene Stern School of Business New York University 0Introduction.
Models of migration Observations and judgments In: Raymer and Willekens, 2008, International migration in Europe, Wiley.
Measure of disease frequency
Multinomial Logistic Regression
Cohort Studies.
1 G Lect 11M Binary outcomes in psychology Can Binary Outcomes Be Studied Using OLS Multiple Regression? Transforming the binary outcome Logistic.
Basic Statistical Concepts Donald E. Mercante, Ph.D. Biostatistics School of Public Health L S U - H S C.
Log-linear analysis Summary. Focus on data analysis Focus on underlying process Focus on model specification Focus on likelihood approach Focus on ‘complete-data.
Multivariate Probability Distributions. Multivariate Random Variables In many settings, we are interested in 2 or more characteristics observed in experiments.
Generalized Linear Models
1 B. The log-rate model Statistical analysis of occurrence-exposure rates.
C. Logit model, logistic regression, and log-linear model A comparison.
Log-linear modeling and missing data A short course Frans Willekens Boulder, July-August 1999.
Logistic regression for binary response variables.
Biology in Focus, HSC Course Glenda Childrawi, Margaret Robson and Stephanie Hollis A Search For Better Health Topic 11: Epidemiology.
Study Design and Analysis in Epidemiology: Where does modeling fit? Meaningful Modeling of Epidemiologic Data, 2010 AIMS, Muizenberg, South Africa Steve.
1 1. Observations and random experiments Observations are viewed as outcomes of a random experiment.
David Yens, Ph.D. NYCOM PASW-SPSS STATISTICS David P. Yens, Ph.D. New York College of Osteopathic Medicine, NYIT l PRESENTATION.
Multiple Choice Questions for discussion
Ratios,Proportions and Rates MAE Course Measures of frequency The basic tools to describe quantitatively the causes and patterns of disease, or.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Hypothesis Testing Field Epidemiology. Hypothesis Hypothesis testing is conducted in etiologic study designs such as the case-control or cohort as well.
Study Designs Afshin Ostovar Bushehr University of Medical Sciences Bushehr, /4/20151.
ECON ECON Health Economic Policy Lab Kem P. Krueger, Pharm.D., Ph.D. Anne Alexander, M.S., Ph.D. University of Wyoming.
Lecture 8: Generalized Linear Models for Longitudinal Data.
1 Rob Woodruff Battelle Memorial Institute, Health & Analytics Cynthia Ferre Centers for Disease Control and Prevention Conditional.
Logit model, logistic regression, and log-linear model A comparison.
Measures of Association
Module Overview Risk—what is it? How do we perceive risks-socioecological model? It’s all relative.. Where do health statistics come from? How can I learn.
ANALYTICAL STUDIES Prospective Studies COHORT Prepared by: Dr. Sahar Sabbour Community Medicine Department.
Research Study Design. Objective- To devise a study method that will clearly answer the study question with the least amount of time, energy, cost, and.
A. Analysis of count data
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
LESSON 9.5: TYPES OF STUDIES Module 9: Epidemiology Obj. 9.5: Compare & contrast different types of epidemiological studies.
Introduction to Logistic Regression Rachid Salmi, Jean-Claude Desenclos, Alain Moren, Thomas Grein.
Limited Dependent Variables Ciaran S. Phibbs May 30, 2012.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
Chapter 2 Nature of the evidence. Chapter overview Introduction What is epidemiology? Measuring physical activity and fitness in population studies Laboratory-based.
Rates, Ratios and Proportions and Measures of Disease Frequency
Epidemiology: Basic concepts and principles ENV
Business Intelligence and Decision Modeling
Limited Dependent Variables Ciaran S. Phibbs. Limited Dependent Variables 0-1, small number of options, small counts, etc. 0-1, small number of options,
Epidemiologic Measures Afshin Ostovar Bushehr University of Medical Sciences Bushehr, /4/20151.
Logistic regression. Recall the simple linear regression model: y =  0 +  1 x +  where we are trying to predict a continuous dependent variable y from.
Measures of Disease Frequency
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
Instructor Resource Chapter 17 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Logistic regression (when you have a binary response variable)
1 Introduction to Modeling Beyond the Basics (Chapter 7)
A short introduction to epidemiology Chapter 6: Precision Neil Pearce Centre for Public Health Research Massey University Wellington, New Zealand.
Basic Concepts of Epidemiology & Social Determinants of Health Prof. Supannee Promthet 27 Septmber 2013:
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Introdcution to Epidemiology for Medical Students Université Paris-Descartes Babak Khoshnood INSERM U1153, Equipe EPOPé (Dir. Pierre-Yves Ancel) Obstetric,
Chapter 2. **The frequency distribution is a table which displays how many people fall into each category of a variable such as age, income level, or.
Question 1 A new test to diagnose urinary tract infections (UTIs) is being evaluated. The sensitivity of the test is 70% and the specificity is 90%. In.
Instructional Objectives:
BINARY LOGISTIC REGRESSION
Statistical Modelling
Logistic Regression APKC – STATS AFAC (2016).
William Greene Stern School of Business New York University
Introduction to logistic regression a.k.a. Varbrul
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Introduction to Logistic Regression
Epidemiological Measurements of health
Effect Modifiers.
Presentation transcript:

Occurrence and timing of events depend on Exposure to the risk of an event exposure Risk depends on exposure

Age at first marriage and age at change in education: Person- years file Educ: 0 = not in school full-time 1 = secondary eduction 2 = postsecondary education Marriage [MS]: 0 = not married 1 = married Source: Yamaguchi, 1991, p. 22 All age periods prior to marriage and age at marriage are included.

Exposure: examples To risk of conception To risk of infection (e.g. malaria, HIV) To marriage To risk of divorce To risk of dying Health risk

Exposure to risk Whenever an event or act gives rise to gain or loss that cannot be predicted Risk of the unexpected Williams et al., 1995, Risk management and insurance, McGraw-Hill, New York, p. 16

Exposure analysis Being exposed or not If exposed, level of exposure (intensity) Factors affecting level of exposure (e.g. age, contacts, etc.) Interventions may affect level of exposure –Contraceptives and sterilisation are used to prevent unwanted pregnancies –Breastfeeding prolongs postpartum amenorrhoea (PPA) –Immunisation prevents (reduces) risk of infectious disease –Lifestyle reduces/increases risk of lung cancer Which mechanism(s) determines level of exposure –e.g. Breastfeeding stimulates production of prolactin hormone, which inhibits ovulation Hobcraft and Little, ??

Risk levels and differentials Risk measures Prediction of risk levels Determinants of differential risk levels Risk = potential variation in outcome

Count: Number of events during given period ( observation window ) Count data Probability: probability of an outcome: proportion of risk set experiencing a given outcome (event) at least once Basis = Risk set Risk set = all persons at risk at given point in time. Rate: number of events per time unit of exposure (person-time) Basis: duration of exposure (duration at risk) Rate (general) = change in one quantity per unit change in another quantity (usually time; other possible measures include space, miles travelled) (Objective) risk measures

Difference of probabilities: p 1 - p 2 (risk difference) Relative risk: ratio of probabilities (focus: risk factor) prob. of event in presence of risk factor/ prob. of event in absence of risk factor (control group; reference category): p 1 / p 2 Odds: odds on an outcome: ratio of favourable outcomes to unfavourable outcomes. Chance of one outcome rather than another: p 1 / (1-p 1 ) The odds are what matter when placing a bet on a given outcome, i.e. when something is at stake. Odds reflect the degree of belief in a given outcome. Relation odds and relative risk: Agresti, 1996, p. 25 Risk measures

Odds: two categories (binary data) Risk measures Parameters of logistic regression: ln(odds) and ln(odds ratio) In regression analysis,  is linear predictor:  =  0 +  1 x 1 +  2 x 2 +

Odds: multiple categories (polytomous data) Risk measures Parameters of logistic regression: ln(odds) and ln(odds ratio) Select category 3 as reference category

Odds ratio : ratio of odds (focus: risk indicator, covariate) odds in target group / odds in control group [reference category]: ratio of favourable outcomes in target group over ratio in control group. The odds ratio measures the ‘belief’ in a given outcome in two different populations or under two different conditions. If the odds ratio is one, the two populations or conditions are similar. Target group: k=1; Control group: k=2 Risk measures Parameters of logistic regression: ln(odds) and ln(odds ratio)

Risk measures in epidemiology Prevalence: proportion (refers to status) Incidence rate: rate at which events (new cases) occur over a defined time period [events per person-time]. Incidence rate is also referred to as incidence density (e.g. Young, 1998, p. 25; Goldhaber and Fireman, 1991). Case-fatality ratio: proportion of sick people who die of a disease (measure of severity of disease). Is not a rate!! (Young, 1998, p. 27) Confusion: Birth defect prevalence: proportion of live births having defects Birth defect incidence: rate of development of defects among all embryos over the period of gestation (Young, 1998, p. 48)

Risk measures in epidemiology Attributable risk (among the exposed): proportion of events (diseases) attributable to being exposed: [p 1 -p 2 ]/p 1 (since non-exposed can also develop disease)

Subjective probability: degree of belief about the outcome of a trial or process, or about the future. It is the perception of the probability of an outcome or event. ‘It is highly dependent on judgment’ ( Keynes, 1912, A treatise on probability, Macmillan, London ). Keynes regarded probability as a subjective concept: our judgment (intuition, gut feeling) about the likelihood of the outcome. –See also Value-expectancy theory: attractiveness of an alternative (option) depends on the subjective probability of an outcome and the value or utility of the outcome (Fishbein and Ajzen, 1975). (Subjective) risk measures

In case of multiple categories, select a reference category Reference category is coded 0 Various coding schemes!

Coding schemes Contrast coding: one category is reference category (simple contrast coding; dummy coding). Model parameters are deviations from reference category. Indicator variable coding: indicator (0,1) variables Cornered effect coding ( Wrigley, 1985, pp ) [0,1]) Effect coding: the mean is the reference. Model parameters are deviations from the mean. Centred effect coding (Wrigley, 1985, pp ) [-1,+1] Other types of coding: see e.g. SPSS Advanced Statistics, Appendix A Vermunt, 1997, p. 10

Coding schemes Categories are coded: – Binary: [0,1], [-1,+1], [1,2] – Multiple: [0,1,2,3,..], [set of binary] e.g. 3 categories:

Coding schemes Selection of reference category depends on research question

Example

Descriptive statistics

Reference categories: Late [  20], Males Odds on leaving home early (rather than late) Logit - Males: 74/178 = Females: 135/143 = Odds ratio (  ): 0.944/0.416 = (if we bet that a person leaves home early, we should bet on females; they are the ‘winners’ - leave home early) Var(  ) =  2 [1/135+1/143+1/74+1/178] = ln  = Var(ln  ) = 1/135+1/143+1/74+1/178 = Selvin, 1991, p. 345

Leaving home

Relation probabilities, odds and logit

Risk analysis: models Prediction of risk levels and differentials risk levels Probability models and regression models –Counts  Poisson r.v.  Poisson distribution  Poisson regression / log-linear model – Probabilities  binomial and multinomial r.v.  binomial and multinomial distribution  logistic regression / logit model (parameter p, probability of occurrence, is also called risk; e.g. Clayton and Hills, 1993, p. 7 ) – Rates  Occurrences/exposure  Poisson r.v.  log-rate model