Using distributions of likelihoods to diagnose parameter misspecification of integrated stock assessment models Jiangfeng Zhu * Shanghai Ocean University,

Slides:



Advertisements
Similar presentations
Modeling Recruitment in Stock Synthesis
Advertisements

Sardine Two-Stock Hypothesis: Results at the Posterior Mode SPSWG Meeting 28 th August 2013 Carryn de Moor Doug Butterworth Marine Resource Assessment.
An exploration of alternative methods to deal with time-varying selectivity in the stock assessment of YFT in the eastern Pacific Ocean CAPAM – Selectivity.
Are the apparent rapid declines in top pelagic predators real? Mark Maunder, Shelton Harley, Mike Hinton, and others IATTC.
458 Quantifying Uncertainty using Classical Methods (Likelihood Profile, Bootstrapping) Fish 458, Lecture 12.
Sheng-Ping Wang 1,2, Mark Maunder 2, and Alexandre Aires-Da-Silva 2 1.National Taiwan Ocean University 2.Inter-American Tropical Tuna Commission.
Gavin Using size increment data in age-structured stock assessment models CAPAM growth workshop: Nov 2014.
Modeling fisheries and stocks spatially for Pacific Northwest Chinook salmon Rishi Sharma, CRITFC Henry Yuen, USFWS Mark Maunder, IATTC.
An evaluation of alternative binning approaches for composition data in integrated stock assessments Cole Monnahan, Sean Anderson, Felipe Hurtado, Kotaro.
Growth in Age-Structured Stock Assessment Models R.I.C. Chris Francis CAPAM Growth Workshop, La Jolla, November 3-7, 2014.
Model Selection for Selectivity in Fisheries Stock Assessments André Punt, Felipe Hurtado-Ferro, Athol Whitten 13 March 2013; CAPAM Selectivity workshop.
Model time step and species biology considerations for growth estimation in integrated stock assessments P. R. Crone and J. L. Valero Southwest Fisheries.
FMSP stock assessment tools Training Workshop LFDA Theory.
The current status of fisheries stock assessment Mark Maunder Inter-American Tropical Tuna Commission (IATTC) Center for the Advancement of Population.
458 More on Model Building and Selection (Observation and process error; simulation testing and diagnostics) Fish 458, Lecture 15.
458 Fitting models to data – II (The Basics of Maximum Likelihood Estimation) Fish 458, Lecture 9.
Statistical Concepts (continued) Concepts to cover or review today: –Population parameter –Sample statistics –Mean –Standard deviation –Coefficient of.
Are pelagic fisheries managed well? A stock assessment scientists perspective Mark Maunder and Shelton Harley Inter-American Tropical Tuna Commission
Case Study - Dover Sole Range from Baja California to the Bering Sea. On mud or muddy-sand, at 35 to 1400 m depths. Feed on polychaete worms, shrimp, brittle.
Hui-Hua Lee 1, Kevin R. Piner 1, Mark N. Maunder 2 Evaluation of traditional versus conditional fitting of von Bertalanffy growth functions 1 NOAA Fisheries,
BPS - 3rd Ed. Chapter 131 Confidence intervals: the basics.
How many conditional age-at-length data are needed to estimate growth in stock assessment models? Xi He, John C. Field, Donald E. Pearson, and Lyndsey.
The (potential) value and use of empirical estimates of selectivity in integrated assessments John Walter, Brian Linton, Will Patterson and Clay Porch.
Time-Varying vs. Non-Time- Varying Growth in the Gulf of Mexico King Mackerel Stock Assessment: a Case Study Southeast Fisheries Science Center Jeff Isely,
R. Sharma*, A. Langley ** M. Herrera*, J. Geehan*
Maximum likelihood estimates of North Pacific albacore tuna ( Thunnus alalunga ) von Bertalanffy growth parameters using conditional-age-at-length data.
ASSESSMENT OF BIGEYE TUNA (THUNNUS OBESUS) IN THE EASTERN PACIFIC OCEAN January 1975 – December 2006.
Spatial issues in WCPO stock assessments (bigeye and yellowfin tuna) Simon Hoyle SPC.
Evaluation of a practical method to estimate the variance parameter of random effects for time varying selectivity Hui-Hua Lee, Mark Maunder, Alexandre.
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
The Stock Synthesis Approach Based on many of the ideas proposed in Fournier and Archibald (1982), Methot developed a stock assessment approach and computer.
BPS - 3rd Ed. Chapter 131 Confidence Intervals: The Basics.
Workshop on Stock Assessment Methods 7-11 November IATTC, La Jolla, CA, USA.
Simulated data sets Extracted from:. The data sets shared a common time period of 30 years and age range from 0 to 16 years. The data were provided to.
USING INDICATORS OF STOCK STATUS WHEN TRADITIONAL REFERENCE POINTS ARE NOT AVAILABLE: EVALUATION AND APPLICATION TO SKIPJACK TUNA IN THE EASTERN PACIFIC.
M.S.M. Siddeeka*, J. Zhenga, A.E. Puntb, and D. Pengillya
U.S. Department of Commerce | National Oceanic and Atmospheric Administration | NOAA Fisheries | Page 1 Model Misspecification and Diagnostics and the.
The effect of variable sampling efficiency on reliability of the observation error as a measure of uncertainty in abundance indices from scientific surveys.
Extending length-based models for data-limited fisheries into a state-space framework Merrill B. Rudd* and James T. Thorson *PhD Student, School of Aquatic.
1 Yi-Jay Chang 2 Brian Langseth 3 Mark Maunder 1 Felipe Carvalho Performance of a stock assessment model with misspecified time-varying growth 1 – JIMAR,
Estimation of growth within stock assessment models: implications when using length composition data Jiangfeng Zhu a, Mark N. Maunder b, Alexandre M. Aires-da-Silva.
ASSESSMENT OF BIGEYE TUNA (THUNNUS OBESUS) IN THE EASTERN PACIFIC OCEAN January 1975 – December 2005.
Simulation analysis using stock synthesis Maunder, Piner, and Lee.
Simulation of methods to account for spatial effects in the stock assessment of Pacific bluefin tuna Cast by: Hui-hua Lee (NOAA Fisheries, SWFSC) Kevin.
1 Chapter 8 Interval Estimation. 2 Chapter Outline  Population Mean: Known  Population Mean: Unknown  Population Proportion.
Estimation of selectivity in Stock Synthesis: lessons learned from the tuna stock assessment Shigehide Iwata* 1 Toshihde Kitakado* 2 Yukio Takeuchi* 1.
1 Probability and Statistics Confidence Intervals.
Matrix Models for Population Management & Conservation March 2014 Lecture 10 Uncertainty, Process Variance, and Retrospective Perturbation Analysis.
CAN DIAGNOSTIC TESTS HELP IDENTIFY WHAT MODEL STRUCTURE IS MISSPECIFIED? Felipe Carvalho 1, Mark N. Maunder 2,3, Yi-Jay Chang 1, Kevin R. Piner 4, Andre.
Yellowfin Tuna Major Changes Catch, effort, and length-frequency data for the surface fisheries have been updated to include new data for 2005.
Anders Nielsen Technical University of Denmark, DTU-Aqua Mark Maunder Inter-American Tropical Tuna Commission An Introduction.
Hypothesis Testing. Statistical Inference – dealing with parameter and model uncertainty  Confidence Intervals (credible intervals)  Hypothesis Tests.
Data weighting and data conflicts in fishery stock assessments Chris Francis Wellington, New Zealand CAPAM workshop, “ Data conflict and weighting, likelihood.
Influence of selectivity and size composition misfit on the scaling of population estimates and possible solutions: an example with north Pacific albacore.
Estimating Uncertainty. Estimating Uncertainty in ADMB Model parameters and derived quantities Normal approximation Profile likelihood Bayesian MCMC Bootstrap.
Day 4, Session 1 Abundance indices, CPUE, and CPUE standardisation
Recommended modeling approach Version 2.0. The law of conflicting data Axiom Data is true Implication Conflicting data implies model misspecification.
Day 3 Session 1 Model fitting. So, how do you assess a stock assessment model? There are five key questions you should ask yourself: 1.Assumptions What.
Population Dynamics and Stock Assessment of Red King Crab in Bristol Bay, Alaska Jie Zheng Alaska Department of Fish and Game Juneau, Alaska, USA.
Is down weighting composition data adequate to deal with model misspecification or do we need to fix the model? Sheng-Ping Wang, Mark N. Maunder National.
An Introduction to AD Model Builder PFRP
Pacific-Wide Assessment of Bigeye Tuna
Sardine Two-Stock Hypothesis: Results at the Posterior Mode
11/19/2018 Day 3 Session 3 Special Session – Uncertainty, the stock recruitment relationship and “steepness”
Current developments on steepness for tunas:
Evaluation of the 2004 Resolution on the Conservation of Tuna in the eastern Pacific Ocean (Resolution C-04-09)
SAFS Quantitative Seminar
JABBA-Select: Performance evaluation of JABBA-Select against and an age-structured simulator and estimation model Henning Winker* MARAM International.
Pribilof Island red king crab
U.S. NMFS contracts the CIE to review assessments
Presentation transcript:

Using distributions of likelihoods to diagnose parameter misspecification of integrated stock assessment models Jiangfeng Zhu * Shanghai Ocean University, Shanghai , China Mark N. Maunder, Alexandre M. Aires-da-Silva Inter-American Tropical Tuna Commission, La Jolla, CA 92037, USA * CAPAM workshop on Data conflict and weighting, likelihood functions, and process error La Jolla, CA, USA, October 19-23, 2015

Outline Background (motivation). Model and data. Preliminary results. Discussion and future improvements.

Background Commonly used method to diagnose model misspecification: — Analysis of residuals; — Retrospective analysis; — Likelihood profiling; — Goodness of fit; Potential to use simulation method (e.g., Piner et al. 2011);

Previous study detecting model misspecification using simulation: Figure from Piner et al. (2011) * *Piner, K., H-H. Lee, M. Maunder and R. Methot A simulation-based method to determine model misspecification: examples using natural mortality and population dynamics models. Mar. Coast. Fish. 3, 336–343

Idea for this study We argue that a misspecified parameter might lead the original likelihood value (derived from fitting to original data) to lie out of the range (e.g., 90% percentiles) of the distribution of likelihood derived from the simulated data if the data are informative about the parameter.

Method Develop and evaluate a simulation-based diagnostics using distributions of likelihood components to detect model (par)misspecification. Two key parameters, the steepness (h) of Beverton- Holt S-R relationship and the average length of the oldest individuals (L2), are selected to represent the model misspecification.

Model and data Simple SS3 Bigeye tuna in the eastern Pacific Ocean; Two fisheries: purse seine and longline (1975 through 2011); Selectivity: Asymptotic for purse seine (PS) and dome-shaped for longline (LL); The Beverton–Holt S–R relationship.

Model and data The model is conditioned on historical data by fitting to: Indices of relative abundance of PS ( ) and LL ( ), log-transformed S.D. = 0.4 and 0.15; and Length-composition data of PS ( , average sample size 16, fixed) and LL ( ; average sample size 8, fixed). Pars fixed: VBGF growth (mean and variation), age- specific M, h, maturity and weight-length curves; Pars estimated: B0, rec_devs, fishing mortalities, parameters for selectivity curves, and q.

Simulation - controlled experiment for h (same for L2) 1) Assuming a true h=0.9 for the model, and fit to the original data set (D 0 ) to get the model pars; 2) Run bootstrap to create a single data set (D). The data set D is considered to be free of model misspecification; 3) Refit the data set D by specifying h=0.9 (correctly specified), h = 0.8 (under-specified), and h=1.0 (over-specified); 4) Using pars for each h in (3) to create multiple (e.g. 100) data sets, and fit the data sets by fixing corresponding h to get distribution of likelihood component; and 5) Check the distribution of likelihood component and the likelihood from fitting to D in (3), and determine if the misspecification of h can be detected.

Simulation SS3 model Fit to original data set (D0), assuming true h Mis model Single bootstrap data set ( Da ) Perfect model Bootstrapping Fit to Da by assuming: True hLower h Higher h Mis model Bootstrapping BOOT i Likelihood (Like_ Da ) Likelihood distribution Repeat from this step, with Db, Dc Fit to Random recruitment deviations added (process error)

What we expect might like (e.g. length comp.) Like_Da, If par wrong Like_Da, If par correct Median likelihood (between 5th~95th percentile)

Low convergence rate for h over- assumed Preliminary results of simulation Test parameter h, base on data set: Da

Low convergence rate for h over- assumed Preliminary results Test parameter h, base on data set: Da

Preliminary results Test parameter h, base on data set: Db

Preliminary results Test parameter h, base on data set: Db

Low convergence rate for h over- assumed Test parameter h, base on data set: Dc Preliminary results

Low convergence rate for h over- assumed Test parameter h, base on data set: Dc Preliminary results

Test parameter L2, base on data set: Da Preliminary results

Test parameter L2, base on data set: Da Preliminary results

Test parameter L2, base on data set: Db Preliminary results

Test parameter L2, base on data set: Db Preliminary results

Test parameter L2, base on data set: Dc Preliminary results

Test parameter L2, base on data set: Dc Preliminary results

Discussion How do deal with non-converged runs? Is it indicative to misspecification ? The preliminary results are not what we expected, probably due to: Uncertainties seem not sufficiently covered when simulating data sets for the L2, as previous study indicated that the data is informative for L2;

Discussion Possible way to improve this work: Use calibrated simulation introduced by Besbeas and Morgan(2014), i.e., sample from pars distributions when simulating data.

Discussion If the par misspecification can be detected, implying the data is informative, then we can turn to estimate that parameter within the model; Even if we may not be able to precisely determine the correct par values, we can reduce the uncertainty (e.g. narrower plausible range) by the diagnostics; How to deal with multiple misspecifications (e.g. growth, selectivity, and q) using the simulation test? One by one ?

Comments and suggestions... Questions.