Single Season Occupancy Modeling 5.13 UF-2015. 2 Occupancy Modeling State variable is proportion of patches that is occupied by a species of interest.

Slides:



Advertisements
Similar presentations
Sampling Design, Spatial Allocation, and Proposed Analyses Don Stevens Department of Statistics Oregon State University.
Advertisements

Empirical Estimator for GxE using imputed data Shuo Jiao.
SAMPLE DESIGN: HOW MANY WILL BE IN THE SAMPLE—DESCRIPTIVE STUDIES ?
Mean, Proportion, CLT Bootstrap
Krishna Pacifici Department of Applied Ecology NCSU January 10, 2014.
3.2 OLS Fitted Values and Residuals -after obtaining OLS estimates, we can then obtain fitted or predicted values for y: -given our actual and predicted.
Maximum likelihood estimates What are they and why do we care? Relationship to AIC and other model selection criteria.
Species interaction models. Goal Determine whether a site is occupied by two different species and if they affect each others' detection and occupancy.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 10: Hypothesis Tests for Two Means: Related & Independent Samples.
Evaluating Hypotheses
Statistical Background
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Using ranking and DCE data to value health states on the QALY scale using conventional and Bayesian methods Theresa Cain.
Inferences About Process Quality
Chapter 14 Inferential Data Analysis
Richard M. Jacobs, OSA, Ph.D.
Bootstrapping applied to t-tests
1 Psych 5500/6500 Statistics and Parameters Fall, 2008.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Sampling : Error and bias. Sampling definitions  Sampling universe  Sampling frame  Sampling unit  Basic sampling unit or elementary unit  Sampling.
Determining Sample Size
Verification & Validation
CLOSED CAPTURE-RECAPTURE
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
Estimating parameters in a statistical model Likelihood and Maximum likelihood estimation Bayesian point estimates Maximum a posteriori point.
Role of Statistics in Geography
Detecting trends in dragonfly data - Difficulties and opportunities - Arco van Strien Statistics Netherlands (CBS) Introduction.
Using historic data sources to calibrate and validate models of species’ range dynamics Giovanni Rapacciuolo University of California Berkeley
Resource Selection Functions and Patch Occupancy Models: Similarities and Differences Lyman L. McDonald Senior Biometrician Western EcoSystems Technology,
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Corinne Introduction/Overview & Examples (behavioral) Giorgia functional Brain Imaging Examples, Fixed Effects Analysis vs. Random Effects Analysis Models.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Maximum Likelihood - "Frequentist" inference x 1,x 2,....,x n ~ iid N( ,  2 ) Joint pdf for the whole random sample Maximum likelihood estimates.
Three Frameworks for Statistical Analysis. Sample Design Forest, N=6 Field, N=4 Count ant nests per quadrat.
Chapter 2 Statistical Background. 2.3 Random Variables and Probability Distributions A variable X is said to be a random variable (rv) if for every real.
Bootstraps and Jackknives Hal Whitehead BIOL4062/5062.
1 Module One: Measurements and Uncertainties No measurement can perfectly determine the value of the quantity being measured. The uncertainty of a measurement.
Human and Optimal Exploration and Exploitation in Bandit Problems Department of Cognitive Sciences, University of California. A Bayesian analysis of human.
The effect of variable sampling efficiency on reliability of the observation error as a measure of uncertainty in abundance indices from scientific surveys.
Introduction to Occupancy Models Key to in-class exercise are in blue
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
Matrix Models for Population Management & Conservation March 2014 Lecture 10 Uncertainty, Process Variance, and Retrospective Perturbation Analysis.
Copyright © Cengage Learning. All rights reserved. 5 Joint Probability Distributions and Random Samples.
Multiple Season Model Part I. 2 Outline  Data structure  Implicit dynamics  Explicit dynamics  Ecological and conservation applications.
Estimation of Animal Abundance and Density Miscellaneous Observation- Based Estimation Methods 5.2.
Statistical Methods. 2 Concepts and Notations Sample unit – the basic landscape unit at which we wish to establish the presence/absence of the species.
Capture-recapture Models for Open Populations “Single-age Models” 6.13 UF-2015.
 1 Species Richness 5.19 UF Community-level Studies Many community-level studies collect occupancy-type data (species lists). Imperfect detection.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Multiple Detection Methods: Single-season Models.
Hypothesis Testing. Statistical Inference – dealing with parameter and model uncertainty  Confidence Intervals (credible intervals)  Hypothesis Tests.
Monitoring and Estimating Species Richness Paul F. Doherty, Jr. Fishery and Wildlife Biology Department Colorado State University Fort Collins, CO.
Single Season Model Part I. 2 Basic Field Situation From a population of S sampling units, s are selected and surveyed for the species. Units are closed.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
1 Occupancy models extension: Species Co-occurrence.
REGRESSION MODEL FITTING & IDENTIFICATION OF PROGNOSTIC FACTORS BISMA FAROOQI.
 1 Modelling Occurrence of Multiple Species. 2 Motivation Often there may be a desire to model multiple species simultaneously.  Sparse data.  Compare/contrast.
 Multi-state Occupancy. Multiple Occupancy States Rather than just presence/absence of the species at a sampling unit, ‘occupancy’ could be categorized.
Review Statistical inference and test of significance.
Multiple Season Study Design. 2 Recap All of the issues discussed with respect to single season designs are still pertinent.  why, what and how  how.
Single Season Study Design. 2 Points for consideration Don’t forget; why, what and how. A well designed study will:  highlight gaps in current knowledge.
 Occupancy Model Extensions. Number of Patches or Sample Units Unknown, Single Season So far have assumed the number of sampling units in the population.
Multi-state Occupancy. Multiple Occupancy States Rather than just presence/absence of the species at a sampling unit, ‘occupancy’ could be categorized.
Model Comparison. Assessing alternative models We don’t ask “Is the model right or wrong?” We ask “Do the data support a model more than a competing model?”
Hierarchical models. Hierarchical with respect to Response being modeled – Outliers – Zeros Parameters in the model – Trends (Us) – Interactions (Bs)
Hierarchical Models. Conceptual: What are we talking about? – What makes a statistical model hierarchical? – How does that fit into population analysis?
Chloe Boynton & Kristen Walters February 22, 2017
Estimating Population Size
Estimating mean abundance from repeated presence-absence surveys
Presentation transcript:

Single Season Occupancy Modeling 5.13 UF-2015

2 Occupancy Modeling State variable is proportion of patches that is occupied by a species of interest In surveys, target species is detected or not Advantages over abundance Practical, esp. for large scale Metapopulations

3 Occupancy Modeling

4

Alldredge et al Playback experiment with skilled observers Detection ranged from 0.19 to 0.65 of individual songs 5

6 Occupancy Modeling Due to imperfect detection, naïve occupancy < occupancy 3 problems Distribution underestimated Covariate relationships underestimated Factors affecting detection generate spurious covariate relationships

Covariates Underestimated X Occupancy 0 1True p < 1

Covariates Underestimated 8 Tyre et al. 2003, “Improving Precision and Reducing Bias in Biological Surveys: Estimating False-Negative Error Rates”

Spurious Relationships 9

10 Single Season Model Various approaches to account for imperfect detection. (e.g., Geissler and Fuller 1987, Azuma et al. 1990, MacKenzie et al. 2002, Tyre et al. 2003, Wintle et al and Stauffer et al. 2004) MacKenzie et al. (2002) provide most general treatment of the problem. State space approach that accounts for both observation process and unobserved system state

11 Single Season Model Consider the data as 2 ‘layers’ 1. True presence/absence of the species. 2. Observed data which is conditional upon species distribution. Knowledge about the first layer is imperfect. Must account for the observation process to make reliable inferences about occurrence.

12 Single Season Model  = probability a unit is occupied. p j = probability species is detected at a unit in survey j, given presence.

13 Basic Field Situation From a population of S sampling units, s are selected and surveyed. Units are repeatedly surveyed within a season. Provides information about detection Units are closed to changes in occupancy during a common ‘season’.

Data Summary: Detection Histories A detection history (h i ) for each visited site or sample unit (i) 1 denotes detection 0 denotes nondetection Example detection history: h i = Denotes 4 visits to site Detection at visits 1 and 4

Detection histories

Detection histories High detection Low detection

Detection histories High detection Low detection

18 Model Development Biological Reality Field Observations

19 Likelihood Approach Consider all possible stochastic processes that may have resulted in observed detection histories. Take verbal description of the observed data and translate it into a mathematical equation.

20 Likelihood Approach For example, Verbal description: species is present at the unit, was detected in first and third survey, not detected in second survey. Mathematical translation:

21 Likelihood Approach For example, Verbal description: species is absent, OR species is present at the unit and was never detected. Mathematical translation:

22 Likelihood Approach Biological Reality Present Absent Observations

23 Likelihood Approach Biological Reality Present Absent Observations

24 Likelihood Approach Biological Reality Present Absent Observations

25 Likelihood Approach Model likelihood is the product of the probability statements. Likelihood can be maximized to obtain MLE’s

Likelihood Approach 26 Naive occupancy estimate = Number of parameters = 2 Number of function calls = 62 -2log(likelihood) = AIC = LikeNRSig=6 eps=0.01 ETA=1e-013 Individual Site estimates of Site estimate Std.err 95% conf. interval psi 1 0 : ============================================================ Individual Site estimates of Site estimate Std.err 95% conf. interval P[1] 1 0 : ============================================================ DERIVED parameter - Psi-conditional = [Pr(occ | detection history)] Site psi-cond Std.err 95% conf. interval

27 Pr(occupied|nondetection) “Given the species was not detected, is the unit occupied?” is sometimes of interest. Can be derived from modelling results using Bayes theorem.

28 Pr(occupied|nondetection) Standard errors can be derived with the delta method.

Likelihood Approach 29 Naive occupancy estimate = Number of parameters = 2 Number of function calls = 62 -2log(likelihood) = AIC = LikeNRSig=6 eps=0.01 ETA=1e-013 Individual Site estimates of Site estimate Std.err 95% conf. interval psi 1 0 : ============================================================ Individual Site estimates of Site estimate Std.err 95% conf. interval P[1] 1 0 : ============================================================ DERIVED parameter - Psi-conditional = [Pr(occ | detection history)] Site psi-cond Std.err 95% conf. interval

Hierarchical Bayesian Approach Goal is to estimate posterior probability distribution for probability of occupancy (and detection) at sampling units. Assume a model for species occurrence. Assume prior probabilities. Bayes formula

31 Hierarchical Bayesian Approach Implemented using software such as WinBUGS. → posterior distributions of model parameters and conditional occupancy state for each unit.

32 Hierarchical Bayesian Approach: WinBUGS Code Occupancy state Det. History (data) Prob occ Conditional det model { ### Priors psi ~ dunif(0,1) p0 ~ dunif(0,1) ### Likelihood for (i in 1:s) { z[i] ~ dbern(psi) for (j in 1:k) { h[i,j] ~ dbern(p.eff[i,j]) p.eff[i,j] <- z[i]*p0 }

33 Hierarchical Bayesian Approach If sample size large and priors vague, results identical to MLE

34 Hierarchical Bayesian Approach Advantages Can be used for finite/small populations. Easier to implement complicated models (e.g., spatially explicit models). Derived parameters Disadvantages Model comparisons may be difficult Run time is greater

35 Covariates Site-specific e.g., habitat type, patch size Survey-specific may vary between surveys. e.g., local environmental conditions, observers

36 Covariates Occupancy and detection probabilities may be functions of site-specific covariates (via logit link, say).

37 Covariates Detection probabilities may also be a function of survey-specific covariates.

38 Covariates

39 Covariates Categorical covariates with m categories should be represented with m – 1 indicator (dummy) variables. e.g., if 4 habitat types, use 3 indicator variables; HabA, HabB, HabC, with habitat D considered the ‘standard’. However, suggest all m indicator variables be included in data file.

40 ‘Missing’ Observations Implicit assumption that j th survey of all units are conducted at (approximately) the same time; possibly unlikely in practice. Weather or breakdowns may result in some units not being surveyed. Equal sampling effort may not be possible with available resources.

41 ‘Missing’ Observations Day Unit

42 ‘Missing’ Observations Survey-specific covariates can only be missing if associated detection survey is also missing. Site-specific covariates cannot be missing.

43 Model Assumptions Closure. Surveys are independent. No unmodelled heterogeneity. Species identified correctly (no false detections)

44 What if Occupancy Changes? If species physically occupies units at random within a season, ‘occupancy’ parameter relates to probability a unit is used by the species. Immigration/emigration only; ‘occupancy’ parameter relates to probability that species is present at a unit at end/beginning of season respectively. should allow detection probability to vary within a season. should pool some survey occasions.

45 What if Occupancy Changes? Closure implies 2 nd component = 1. When changes are random, ‘detection probability’ is the product of 2 nd and 3 rd components

46 What if Occupancy Changes? Some designs (2 observers at each survey, j) permit Direct estimation of component 2

47 What if Occupancy Changes? Other non-random changes may cause biases. Is a ‘season’ defined appropriately? Is time between surveys appropriate? Area of active research.

Staggered entry Staggered entry models (Kendall et al. 2013) Permits 1 entry and 1 exit in season Migratory spp 48

Staggered entry 2 new parameters e ij = Pr(species enters site i after survey j | site i is not used at survey j) d ij = Pr(species departs site i after survey j | site i is used at survey j) 49

Staggered entry 50

51 Lack of Independence Surveys are not independent if the outcome of survey A is dependent upon the outcome of survey B. Some forms of dependence may be accommodated with good designs or modelling. In some instances, parameter estimates may be OK, but standard errors too small.

52 Lack of Independence To account for ‘trap response’; define a site- and survey-specific covariate (x ij ) that equals 1 for all surveys after first detection at a site, 0 otherwise. X ij h

53 Accounting for Lack of Closure/Independence Correlation of spatial replicates (Hines et al. 2010, Ecol. Appl.) Motivating example: surveying tigers on trails in India Detection of signs in one spatial replicate indicates high probability of presence of signs on an adjacent spatial replicate

54 Accounting for Lack of Closure/Independence

55 Accounting for Lack of Closure/Independence

56 Accounting for Lack of Closure/Independence

57 Accounting for Lack of Closure/Independence Hines et al. (2010) approach could also possibly be applied to account for temporal correlation or non-random changes in occupancy within a season.

58 Unmodelled Heterogeneity In occupancy probabilities parameter estimates should still be valid as average values across the sites surveyed. In detection probabilities occupancy will be underestimated. covariates may account for some sources of variation.

59 Incorporating Heterogeneity Finite Mixtures Assume occupied sites consist of G groups. Each group has a different p. Group membership is unknown hence a site may belong to any of the G groups.

60 Incorporating Heterogeneity Random Effects Assume p i is a random value from a continuous probability distribution (e.g., beta distribution, logit-normal). Implement using WinBUGS.

61 Abundance Induced Heterogeneity in Detection Differences in the local abundance of the species between sites may induce heterogeneity in detection probability. Royle and Nichols (2003) suggested an extension of occupancy models to accommodate this.

62 Abundance Induced Heterogeneity Local abundance is unknown, but a spatial distribution could be assumed (e.g., Poisson).

63 Abundance Induced Heterogeneity

64 Abundance Induced Heterogeneity Occupancy is now a derived parameter: (1–e –  ). Implicit assumption that the number of animals at a unit is constant. “several hundred sites may be required”

65 Abundance Induced Heterogeneity A good approach for incorporating heterogeneity in detection for occupancy estimation, less reliable if inferences about abundance are desired. Can also use a 0-inflated Poisson.

66 Species Misidentification If species is falsely detected then occupancy could be overestimated. Can account for false detections if some surveys are free of misidentification (Miller et al. 2011). More later…

67 Assessing Model Fit MacKenzie and Bailey (2004) suggest a test based on the observed and expected number of sites with each possible detection history. The expected number is predicted by the model, which may include covariates.

68 Assessing Model Fit For example, for the history 101 : Site Total0.216

69 Assessing Model Fit Use parametric bootstrap to assess the evidence for lack of fit. used to adjust SE’s and AIC values.

70 Assessing Model Fit Test has been shown to perform well to identify poor model structure w.r.t. detection probabilities, but not occupancy probabilities. Found to have generally low-power, especially for the sample sizes expected in many applications ( s <100). Recommend ≥10,000 bootstraps.

71 Assessing Model Fit Test should be conducted on the most complicated model under consideration (the global model) and results applied to all models in candidate set. e.g., if from global model, this value is used to adjust AIC values and SE’s for all models.

72 Finite Population In some circumstances, the sample of s units may constitute a large fraction of the population of interest. Strictly speaking, the methods above estimate the probability of occupancy. an underlying characteristic of the population. The proportion of units occupied is a realization of this process.

73 Finite Population Distinction between them is of little practical consequence for ‘infinite’ (large) populations, but may be for ‘finite’ populations. SE’s will be too large if not accounted for.

74 Finite Population The proportion of occupied sites could be calculated as: SE derived from delta method

75 Finite Population Alternatively, easily implemented using the data augmentation approach. Presence/absence of the species is predicted for units where species not detected. Occupancy state can also be predicted for units that were never surveyed.

76 Finite Population Post. Distn.  mean = 0.56, sd = 0.10 {2.5, 50, 97.5} %iles = {0.37, 0.56, 0.78} Post. Distn.  * mean = 0.56, sd = 0.08 {2.5, 50, 97.5} %iles = {0.46, 0.54, 0.74}

77 Spatial Correlation Probability of occupancy at a unit may depend upon whether a ‘neighbouring’ unit is also occupied. Some forms of clustering may be well explained by covariates. May not be important to account for if interest is in an overall measure of occupancy. May be more important if interest is in maps or predictions of unit-level occupancy.

78 Spatial Correlation What fraction of cells is occupied? What would be the appropriate SE? Random sampling with standard occupancy estimators should work

79 Spatial Correlation Alternatively, could model occupancy with the autologistic function. essentially the logit link with an ‘effect’ related to the number of occupied neighboring units. g i is a function of the occupied ‘neighboring’ units But the exact occupancy state of units will often be unknown…

80 Spatial Correlation Predict the occupancy state for units where it is unknown (either due to nondetection or unsurveyed) and use the data augmentation approach.

81

82 Summary Historically, investigating patterns in occupancy has been the main focus of such studies. Useful for assessing a snapshot of a population, but not for understanding the underlying dynamics. A suite of flexible methods are now available that account for: detectability covariates unequal sampling effort heterogeneity finite populations spatial correlation