Survival models without mortality Casting closed-population wildlife survey models as survival- or recurrent event models David Borchers Roland Langrock,

Slides:



Advertisements
Similar presentations
If we use a logistic model, we do not have the problem of suggesting risks greater than 1 or less than 0 for some values of X: E[1{outcome = 1} ] = exp(a+bX)/
Advertisements

Introduction to Regression with Measurement Error STA431: Spring 2015.
1 12. Principles of Parameter Estimation The purpose of this lecture is to illustrate the usefulness of the various concepts introduced and studied in.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #21.
Goodness of Fit of a Joint Model for Event Time and Nonignorable Missing Longitudinal Quality of Life Data – A Study by Sneh Gulati* *with Jean-Francois.
Maximum likelihood estimates What are they and why do we care? Relationship to AIC and other model selection criteria.
Species interaction models. Goal Determine whether a site is occupied by two different species and if they affect each others' detection and occupancy.
Parameter Estimation: Maximum Likelihood Estimation Chapter 3 (Duda et al.) – Sections CS479/679 Pattern Recognition Dr. George Bebis.
How Long Until …? Given a strike, how long will it last?
Maximum-Likelihood estimation Consider as usual a random sample x = x 1, …, x n from a distribution with p.d.f. f (x;  ) (and c.d.f. F(x;  ) ) The maximum.
1 Review Definition: Reliability is the probability that a component or system will perform a required function for a given period of time when used under.
July 3, Department of Computer and Information Science (IDA) Linköpings universitet, Sweden Minimal sufficient statistic.
Survival Analysis A Brief Introduction Survival Function, Hazard Function In many medical studies, the primary endpoint is time until an event.
Analysis of Complex Survey Data
Kaplan-Meier Estimation &Log-Rank Test Survival of Ventilated and Control Flies (Old Falmouth Line 107) R.Pearl and S.L. Parker (1922). “Experimental Studies.
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
Data Selection In Ad-Hoc Wireless Sensor Networks Olawoye Oyeyele 11/24/2003.
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
Institut für Statistik und Ökonometrie A Wildlife Simulation Package WWalter Zucchini 1, David Borchers 2, Stefan Kirchfeld 1, Martin Erdelmeier 1 Jon.
Markov Localization & Bayes Filtering
On Model Validation Techniques Alex Karagrigoriou University of Cyprus "Quality - Theory and Practice”, ORT Braude College of Engineering, Karmiel, May.
1 As we have seen in section 4 conditional probability density functions are useful to update the information about an event based on the knowledge about.
Software Reliability SEG3202 N. El Kadri.
Resource Selection Functions and Patch Occupancy Models: Similarities and Differences Lyman L. McDonald Senior Biometrician Western EcoSystems Technology,
Bayesian Analysis and Applications of A Cure Rate Model.
Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps).
Pro gradu –thesis Tuija Hevonkorpi.  Basic of survival analysis  Weibull model  Frailty models  Accelerated failure time model  Case study.
BIOL 4240 Field Ecology. How many? “Estimates of abundance themselves are not valuable, and a large book filled with estimates of the abundance of every.
Methodology Solving problems with known distributions 1.
Empirical Likelihood for Right Censored and Left Truncated data Jingyu (Julia) Luan University of Kentucky, Johns Hopkins University March 30, 2004.
The generalization of Bayes for continuous densities is that we have some density f(y|  ) where y and  are vectors of data and parameters with  being.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
01/20151 EPI 5344: Survival Analysis in Epidemiology Cox regression: Introduction March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health.
Satistics 2621 Statistics 262: Intermediate Biostatistics Jonathan Taylor and Kristin Cobb April 20, 2004: Introduction to Survival Analysis.
Multilevel and multifrailty models. Overview  Multifrailty versus multilevel Only one cluster, two frailties in cluster e.g., prognostic index (PI) analysis,
Machine Learning 5. Parametric Methods.
Stat 31, Section 1, Last Time Big Rules of Probability –The not rule –The or rule –The and rule P{A & B} = P{A|B}P{B} = P{B|A}P{A} Bayes Rule (turn around.
Visual Tracking by Cluster Analysis Arthur Pece Department of Computer Science University of Copenhagen
5.2 Distance Sampling Estimating abundance from counts when detection is imperfect.
Estimation of Animal Abundance and Density Miscellaneous Observation- Based Estimation Methods 5.2.
 1 Species Richness 5.19 UF Community-level Studies Many community-level studies collect occupancy-type data (species lists). Imperfect detection.
Pollock’s Robust Design: Extensions II. Quick overview 1.Separation of Recruitment Components in a single patch context (Source-Sink) 2.Separation of.
Multiple Detection Methods: Single-season Models.
Proportional Hazards Model Checking the adequacy of the Cox model: The functional form of a covariate The link function The validity of the proportional.
Hidden Markov Models. A Hidden Markov Model consists of 1.A sequence of states {X t |t  T } = {X 1, X 2,..., X T }, and 2.A sequence of observations.
Spatially Explicit Capture-recapture Models for Density Estimation 5.11 UF-2015.
R. Kass/W03 P416 Lecture 5 l Suppose we are trying to measure the true value of some quantity (x T ). u We make repeated measurements of this quantity.
Density Estimation with Closed CR Models 5.10 UF-2015.
Kevin Stevenson AST 4762/5765. What is MCMC?  Random sampling algorithm  Estimates model parameters and their uncertainty  Only samples regions of.
Joint Modelling of Accelerated Failure Time and Longitudinal Data By By Yi-Kuan Tseng Yi-Kuan Tseng Joint Work With Joint Work With Professor Jane-Ling.
Slide 16.1 Hazard Rate Models MathematicalMarketing Chapter Event Duration Models This chapter covers models of elapsed duration.  Customer Relationship.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Single Season Study Design. 2 Points for consideration Don’t forget; why, what and how. A well designed study will:  highlight gaps in current knowledge.
 Occupancy Model Extensions. Number of Patches or Sample Units Unknown, Single Season So far have assumed the number of sampling units in the population.
Estimating Animal Numbers
Fundamentals of Data Analysis Lecture 11 Methods of parametric estimation.
Some Wildlife Census Techniques
Prediction and Missing Data. Summarising Distributions ● Models are often large and complex ● Often only interested in some parameters – e.g. not so interested.
Capture-recapture Models for Open Populations Abundance, Recruitment and Growth Rate Modeling 6.15 UF-2015.
Trace-Contrast models for capture-recapture without capture histories Rachel Fewster Ben Stevenson, David Borchers St Andrews Helen Nathan UoA.
CS479/679 Pattern Recognition Dr. George Bebis
Ch3: Model Building through Regression
CHAPTER 18 SURVIVAL ANALYSIS Damodar Gujarati
Markov chain monte carlo
Estimating Population Size
Probability & Statistics Probability Theory Mathematical Probability Models Event Relationships Distributions of Random Variables Continuous Random.
Estimating mean abundance from repeated presence-absence surveys
11. Conditional Density Functions and Conditional Expected Values
11. Conditional Density Functions and Conditional Expected Values
Mathematical Foundations of BME Reza Shadmehr
Presentation transcript:

Survival models without mortality Casting closed-population wildlife survey models as survival- or recurrent event models David Borchers Roland Langrock, Greg Distiller, Ben Stevenson, Darren Kidney, Martin Cox

Closed-Population Methods 1.Removal methods 2.Distance Sampling Methods 3.Capture-Recapture Methods 4.Occupancy Methods

This is a discrete survival model with unknown number of censored subjects ( ) pdf of time of death: The Removal Method ( an example with mortality) “Survivor function” t N×F(t)

The Removal Method ( an example with mortality) t h is mortality hazard (per unit time) Continuous survival model with unknown number of censored subjects pdf of time of death: Removal models are survival models with unknown number of censored subjects.

Diagramatically: Time t T 0 Mortality hazard h f(t ;h) = S(t) h Survivor function Survival Model: Detection hazard

The Removal Method ( an example with mortality) Continuous time likelihood (with Poisson rather than Binomial/multinomial) pr(detect)

The Removal Method ( an example with mortality) Continuous time likelihood with individual random effect (with Poisson rather than Binomial/multinomial) Random effect distribution, conditional on detection Hazard depends on x … and hazard that changes with time: Hazard depends on x and t

Diagramatic Removal Model, for a given x : Time t x T 0 pr(detect |x) = 1-S(T |x) Mortality hazard at x: h(t |x) f(t |x) = S(t |x)h(t |x) Survival Model:

Diagramatic Removal Model, for a given x : Time t 0 x T pr(detect |x) = 1-S(T |x) Detection hazard at x: h(t |x) p(x) Line Transect models are survival models with unknown number of censored subjects, and individual random effects. f(t |x) = S(t |x)h(t |x) Survival Model: Diagramatic Line Transect

Line Transect Models Continuous time likelihood with individual random effect (with Poisson rather than Binomial/multinomial) Perpendicular distance distribution, conditional on detection A: Hayes and Buckland (1983) are to “blame” Q: Why is this ignored??

Hayes and Buckland are to blame Prior to Hayes & Buckland (1983), various models for 2-D distribution of detection functions were proposed. Some fitted the data in some situations, but none was robust (i.e. fitted in many situations). H&B (1983) proposed a hazard-rate formulation (effectively a survival model) and showed that marginalising over t resulted in robust forms for p(x), i.e. forms that fitted many cases. Distance sampling has been 1-D ever since.

Is time-to-detection any use? Fewster & Jupp (2013) showed that additional data improves asymptotic efficiency, even when it involves estimating additional nuisance parameters. There are other benefits too… 1.Removal method does not require p(0)=1 2.Removal method does not require known random effect distribution (π(x); uniform for line transects) 3.Can accommodate stochastic availability (i.e. overcome “availability bias”)

Proportion of population that is missed N×F(t) 1. p(0)<1

1. Time-to-detection enables you to estimate p(0) Observer Proportion of population at x=0 that is missed: = 1 – p(0) f(t|x=0): pdf of detection times for animals at peprendicular distance zero

1. Time-to-detection enables you to estimate p(0) Observer (Sometimes not so well) f(t|x=0): pdf of detection times for animals at peprendicular distance zero Removal method poor unless large fraction of population is removed. LT p(0) estimation from time-to-detection data is poor when p(0) is not “close” to 1.

2. Time-to-detection enables you to estimate π(x)

Forward distance

2. Forward distance enables you to estimate π(x)

3. Stochastic availability Time t T 0 Detection hazard at x, given availability: h(t |x) Detection hazard at x,t given availability: h(x,t) 2-State Markov-modulated Poisson Process (MMPP) in which State 1 = shallow diving: (high Poisson event rate) State 0 = deep diving: (low Poisson event rate)

3. Stochastic availability: Bowhead aerial survey

Recap: Is time-to-detection any use? Fewster & Jupp (2013) showed that additional data improves asymptotic efficiency, even when it involves estimating additional nuisance parameters. There are other benefits too… 1.Removal method does not require p(0)=1 2.Removal method does not require known random effect distribution (π(x); uniform for line transects) 3.Can accommodate stochastic availability (i.e. overcome “availability bias”)  (if p(0) not too small)  

Line Transect Method Distance Sampling Methods

Capture-recapture with camera traps Trap k

Spatially Explicit Capture-Recapture (SECR) Time 1 2 R t 11 t 21 t x Poisson Location of animal i’s activity center Number of times animal i is detected on camera k on occasion r Detection function parameters Density model parameters

Continuous-time Spatially Explicit Capture-Recapture Each animal can be detected multiple times, so not a “survival” model. Detection times modelled as Non-homogeneous Poisson Process (NHPP), with rate h k (t|x;θ) for trap k, given activity center at x For generality, allow detection hazard to depend on time Ignoring occasion for simplicity: Number of times animal i is detected on camera k

Continuous-time SECR Discrete-time Model

Continuous-time SECR Discrete-time Model Time-to-detection is NOT informative about density IF (a) so that (b) (then D factorises out of integral and product) ELSE time-to-detection IS informative about denstiy Aside: In case (a) above, continuous-time model is identical to discrete-time Poisson count model.

Continuous-time SECR Discrete-time Model

Continuous-time SECR Notes: 1.Mark-recapture distance sampling is a special case of SECR 2.Aside from independence issues (ask a question if you don’t now what I mean), there is no reason to impose occasions when you have detectors that sample continuously.

Time-to-detection in Occupancy Models F(t) (t) From Bischoff et al. (2014): Prob(detect | Presence) Kaplan-Meier Constant-hazard This is just the continuous-time removal method again (constant hazard, no individual random effects).

Time-to-detection in Occupancy Models: incorporating availability 2-State Markov-modulated Poisson Process (MMPP) : 3 different animals Constant hazard of detection, given pugmark Guillera-Arroita et al. (2012): tiger pugmarks along a transect Does this look familiar? l L 0 Distance

Recall: Line Transect with Stochastic availability Time t T 0 Detection hazard at x, given availability: h(t |x) Detection probability at x,t given availability: h(x,t) 2-State Markov-modulated Poisson Process (MMPP) in which State 1 = shallow diving: (high Poisson rate) State 0 = deep diving: (low Poisson rate) Line Transect with stochastic availability and multiple detections

Time-to-detection in Occupancy Models: incorporating availability Constant hazard of detection, given pugmark Gurutzeta et al. (2012): tiger pugmarks along a transect Same as Line Transect with stochastic availability, except: 1.Distance, not time and pugmarks, not whale blows 2.Constant detection hazard 3.Can’t distinguish between individuals (this adds lots of complication!) 4.Estimating presence, not abundance (this makes things little simpler) l L 0 Distance

Summary Time-to-detection is informative about density/occupancy Removal, Distance Sampling, SECR and Occupancy models share common underlying theory Fertile ground for further method development, each method borrowing from the other. My