458 More on Model Building and Selection (Observation and process error; simulation testing and diagnostics) Fish 458, Lecture 15.

Slides:



Advertisements
Similar presentations
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
Advertisements

Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 27 Inferences for Regression.
458 Quantifying Uncertainty using Classical Methods (Likelihood Profile, Bootstrapping) Fish 458, Lecture 12.
Sheng-Ping Wang 1,2, Mark Maunder 2, and Alexandre Aires-Da-Silva 2 1.National Taiwan Ocean University 2.Inter-American Tropical Tuna Commission.
458 Delay-difference models Fish 458, Lecture 14.
Beyond Null Hypothesis Testing Supplementary Statistical Techniques.
What is MPC? Hypothesis testing.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
458 Model Uncertainty and Model Selection Fish 458, Lecture 13.
458 Fitting models to data – II (The Basics of Maximum Likelihood Estimation) Fish 458, Lecture 9.
458 Population Projections (policy analysis) Fish 458; Lecture 21.
Resampling techniques
458 Lumped population dynamics models Fish 458; Lecture 2.
Multispecies Models-II (Including Predation In Population Models)
Quantitative Methods – Week 6: Inductive Statistics I: Standard Errors and Confidence Intervals Roman Studer Nuffield College
Lecture 6: Multiple Regression
458 Estimating Extinction Risk (Population Viability Analysis) Fish 458; Lecture 25.
458 Fitting models to data – III (More on Maximum Likelihood Estimation) Fish 458, Lecture 10.
Hierarchical Bayesian Analysis of the Spiny Lobster Fishery in California Brian Kinlan, Steve Gaines, Deborah McArdle, Katherine Emery UCSB.
458 Fitting models to data – I (Sum of Squares) Fish 458, Lecture 7.
Business Statistics - QBM117 Statistical inference for regression.
Introduction to Regression Analysis, Chapter 13,
Ordinary Least Squares
Inference for regression - Simple linear regression
Hypothesis Testing in Linear Regression Analysis
Introductory Statistical Concepts. Disclaimer – I am not an expert SAS programmer. – Nothing that I say is confirmed or denied by Texas A&M University.
Inference for Regression
Inferences for Regression
Standardizing catch per unit effort data
Investigating the Accuracy and Robustness of the Icelandic Cod Assessment and Catch Control Rule A. Rosenberg, G. Kirkwood, M. Mangel, S. Hill and G. Parkes.
Montecarlo Simulation LAB NOV ECON Montecarlo Simulations Monte Carlo simulation is a method of analysis based on artificially recreating.
PARAMETRIC STATISTICAL INFERENCE
What is the MPC?. Learning Objectives 1.Use linear regression to establish the relationship between two variables 2.Show that the line is the line of.
Inferential Statistics 2 Maarten Buis January 11, 2006.
Ch4 Describing Relationships Between Variables. Pressure.
Surplus-Production Models
ASSESSMENT OF BIGEYE TUNA (THUNNUS OBESUS) IN THE EASTERN PACIFIC OCEAN January 1975 – December 2006.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
Regression. Population Covariance and Correlation.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Univariate Linear Regression Problem Model: Y=  0 +  1 X+  Test: H 0 : β 1 =0. Alternative: H 1 : β 1 >0. The distribution of Y is normal under both.
Scatterplots & Regression Week 3 Lecture MG461 Dr. Meredith Rolfe.
Limits to Statistical Theory Bootstrap analysis ESM April 2006.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Estimation Method of Moments (MM) Methods of Moment estimation is a general method where equations for estimating parameters are found by equating population.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 11: Models Marshall University Genomics Core Facility.
ANOVA, Regression and Multiple Regression March
Extending length-based models for data-limited fisheries into a state-space framework Merrill B. Rudd* and James T. Thorson *PhD Student, School of Aquatic.
Using distributions of likelihoods to diagnose parameter misspecification of integrated stock assessment models Jiangfeng Zhu * Shanghai Ocean University,
1 Probability and Statistics Confidence Intervals.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 17 Simple Linear Regression and Correlation.
Cointegration in Single Equations: Lecture 5
Hypothesis Testing. Statistical Inference – dealing with parameter and model uncertainty  Confidence Intervals (credible intervals)  Hypothesis Tests.
Data weighting and data conflicts in fishery stock assessments Chris Francis Wellington, New Zealand CAPAM workshop, “ Data conflict and weighting, likelihood.
732G21/732G28/732A35 Lecture 3. Properties of the model errors ε 4. ε are assumed to be normally distributed
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Data Screening. What is it? Data screening is very important to make sure you’ve met all your assumptions, outliers, and error problems. Each type of.
Bias-Variance Analysis in Regression  True function is y = f(x) +  where  is normally distributed with zero mean and standard deviation .  Given a.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 10: Comparing Models.
Quiz 2.
(Residuals and
Policy Evaluation II (Feedback strategies)
BA 275 Quantitative Business Methods
CHAPTER 29: Multiple Regression*
CHAPTER 26: Inference for Regression
Regression Assumptions
Regression Assumptions
Presentation transcript:

458 More on Model Building and Selection (Observation and process error; simulation testing and diagnostics) Fish 458, Lecture 15

458 Observation and Process Error Reminder: Process uncertainty impacts the dynamics of the population (e.g. recruitment variability, natural mortality variability, birth-death processes). Observation uncertainty impacts how we observe the population (e.g. CVs for abundance estimates).

458 Observation and Process Error (the Dynamic Schaefer model) Let us generalize the Schaefer model to allow for both observation and process error: determines the extent of process error, and determines the extent of observation error. Often we assume that one of the two types of error dominate and hence assume the other to be zero.

458 Process error only-I We assume here that and continue under the assume that v=0. Under this assumption : If we assume that w is normally distributed, the likelihood function becomes :

458 Process error only-II Issues to consider: A process error estimator can only estimate biomass for years for which index information is available. The choice of where to place the process error term in the dynamics equation is arbitrary (what “process” is really being modeled?) You need a continuous time-series of data to compute all the residuals. What do you do if there are two series of abundance estimates!

458 Observation error only - I We assume here that and continue under the assumption that w=0. Under this assumption : If we assume that v is normally distributed, the likelihood function becomes :

458 Observation error only - II Issues to consider: An observation error estimator can estimate biomass for all years. The choice of where to place the observation error term is fairly easy. There is no need for a continuous time-series of data and multiple series of abundance estimates can be handled straightforwardly! There is a need to estimate an additional parameter (the initial biomass - often we assume that ).

458 Comparing approaches (Cape Hake) Note that we can’t compare these models because the likelihood functions are different So what can we say about these two analyses

458 Comparing approaches (Residuals) The residuals about the fit of the process error estimator seem more correlated. Formally, a runs test could be conducted. Perhaps plot the residuals against the predicted values; look for a lack of normality

458 Comparing approaches (Retrospective analyses) We re-run the analysis leaving the last few CPUE data points out of the analysis – seems to be a pattern here! Analyses along these lines could have saved northern cod!

458 Simulation testing 1. Fit the model to the data. 2. Run the model forward with process error. 3. Add some observation noise to the predicted CPUE 4. Fit the observation and process error models. 5. Compare the estimates from the observation and the process error estimators with the true values. 6. Repeat steps 2-5 many times.

458 Simulation testing (Cape hake) The simulation testing framework was applied assuming a process error variation of 0.1 and an observation error variation of 0.2. The results were summarized by the distribution for the difference between the true and estimated current depletion (the ratio of current biomass to K). For this scenario, the observation error estimator is both more precise and less biased. Unless there is good evidence for high process error variability (there isn’t for Cape hake), we would therefore prefer the observation error estimator.

458 Simulation testing (Cape hake) Bias - average error isn’t zero

458 Simulation Testing - Recap The reasons for using simulation testing include: we know the correct answer for each generated data set – this is not the case in the real world; and there is no restriction on the types of models that can be compared (e.g. they need not use the same data). The results of simulation testing depend, of course, on the model assumed for the true situation.

458 Readings Haddon (2001); Chapter 10. Hilborn and Mangel (1997), Chapter 7. Polacheck et al. (1993).