WP 2.4 Evaluation of NMFS Toolbox Assessment Models on Simulated Groundfish Data Sets Comparative Simulation Tests Overview Brooks, Legault, Nitschke,

Slides:



Advertisements
Similar presentations
Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
Advertisements

CHAPTER Basic Definitions and Properties  P opulation Characteristics = “Parameters”  S ample Characteristics = “Statistics”  R andom Variables.
Tim Smyth and Jamie Shutler Assessment of analysis and forecast skill Assessment using satellite data.
Sheng-Ping Wang 1,2, Mark Maunder 2, and Alexandre Aires-Da-Silva 2 1.National Taiwan Ocean University 2.Inter-American Tropical Tuna Commission.
Modeling fisheries and stocks spatially for Pacific Northwest Chinook salmon Rishi Sharma, CRITFC Henry Yuen, USFWS Mark Maunder, IATTC.
An evaluation of alternative binning approaches for composition data in integrated stock assessments Cole Monnahan, Sean Anderson, Felipe Hurtado, Kotaro.
George Bank Cod Base –Similar to GARM2005 assessment Fleet_dome Survey_dome Survey_and_Fleet_dome Purpose: test models ability to estimate correctly when.
Growth in Age-Structured Stock Assessment Models R.I.C. Chris Francis CAPAM Growth Workshop, La Jolla, November 3-7, 2014.
Model time step and species biology considerations for growth estimation in integrated stock assessments P. R. Crone and J. L. Valero Southwest Fisheries.
Black Sea Bass – Northern Stock Coastal-Pelagic/ASMFC Working Group Review June 15, 2010.
The current status of fisheries stock assessment Mark Maunder Inter-American Tropical Tuna Commission (IATTC) Center for the Advancement of Population.
© 2004 Prentice-Hall, Inc.Chap 1-1 Basic Business Statistics (9 th Edition) Chapter 1 Introduction and Data Collection.
458 More on Model Building and Selection (Observation and process error; simulation testing and diagnostics) Fish 458, Lecture 15.
Barents Sea fish modelling in Uncover Daniel Howell Marine Research Institute of Bergen.
An Overview of Today’s Class
Using Climate Information in Fisheries Stock Assessments (with a focus on Pacific Whiting) Ian Taylor SMA 550: Climate Impacts on the Pacific Northwest.
Stock-Recruit relationships in the F3 model Bruce Kendall July 2006.
Hui-Hua Lee 1, Kevin R. Piner 1, Mark N. Maunder 2 Evaluation of traditional versus conditional fitting of von Bertalanffy growth functions 1 NOAA Fisheries,
458 Fitting models to data – III (More on Maximum Likelihood Estimation) Fish 458, Lecture 10.
1 FLR: An Open-Source Framework for the Evaluation and Development of Management Strategies L.T. Kell, I. Mosqueira, P. Grosjean, J-M. Fromentin, D. Garcia,
RESEARCH A systematic quest for undiscovered truth A way of thinking
Chapter 1 Introduction and Data Collection
WP4: Models to predict & test recovery strategies Cefas: Laurence Kell & John Pinnegar Univ. Aberdeen: Tara Marshall & Bruce McAdam.
Estimation Bias, Standard Error and Sampling Distribution Estimation Bias, Standard Error and Sampling Distribution Topic 9.
Quantitative Skills 1: Graphing
Investigating the Accuracy and Robustness of the Icelandic Cod Assessment and Catch Control Rule A. Rosenberg, G. Kirkwood, M. Mangel, S. Hill and G. Parkes.
Copyright © 2010 Lumina Decision Systems, Inc. Monte Carlo Simulation Analytica User Group Modeling Uncertainty Series #3 13 May 2010 Lonnie Chrisman,
Pacific Hake Management Strategy Evaluation Joint Technical Committee Northwest Fisheries Science Center, NOAA Pacific Biological Station, DFO School of.
ASSESSMENT OF BIGEYE TUNA (THUNNUS OBESUS) IN THE EASTERN PACIFIC OCEAN January 1975 – December 2006.
Decisions from Data: The Role of Simulation Gail Burrill Gail Burrill Michigan State University
A retrospective investigation of selectivity for Pacific halibut CAPAM Selectivity workshop 14 March, 2013 Ian Stewart & Steve Martell.
. Assessment of the Icelandic cod stock Björn Ævarr Steinarsson Marine Research Institute.
Population and Sample The entire group of individuals that we want information about is called population. A sample is a part of the population that we.
ALADYM (Age-Length Based Dynamic Model): a stochastic simulation tool to predict population dynamics and management scenarios using fishery-independent.
The Stock Synthesis Approach Based on many of the ideas proposed in Fournier and Archibald (1982), Methot developed a stock assessment approach and computer.
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics.
Workshop on Stock Assessment Methods 7-11 November IATTC, La Jolla, CA, USA.
Effects of variability in fertiliser application on hill country production economics Allan Gillingham (Research Consultant) Alister Metherell.
1 Understanding and Measuring Uncertainty Associated with the Mid-Year Population Estimates Joanne Clements Ruth Fulton Alison Whitworth.
Simulated data sets Extracted from:. The data sets shared a common time period of 30 years and age range from 0 to 16 years. The data were provided to.
WP220 – Marine ecosystems and fisheries Tony Beeching Cefas.
Fisheries 101: Modeling and assessments to achieve sustainability Training Module July 2013.
Evaluation of harvest control rules (HCRs): simple vs. complex strategies Dorothy Housholder Harvest Control Rules Workshop Bergen, Norway September 14,
Management Procedures (Prof Ray Hilborn). Current Management Cycle Fishery: Actual Catches Data Collection Assessment Management Decision.
WP 3.1 Simulation Studies of Issues Associated with Filling Zeros in VPA Tuning Indices Chris Legault and Al Seaver The great appeal of the doctrine that.
The effect of variable sampling efficiency on reliability of the observation error as a measure of uncertainty in abundance indices from scientific surveys.
Chapter 13 Sampling distributions
Simulation analysis using stock synthesis Maunder, Piner, and Lee.
Using distributions of likelihoods to diagnose parameter misspecification of integrated stock assessment models Jiangfeng Zhu * Shanghai Ocean University,
WP 4.1 Report of the Retrospective Working Group It is a simple but sometimes forgotten truth that the greatest enemy to present joy and high hopes is.
CAN DIAGNOSTIC TESTS HELP IDENTIFY WHAT MODEL STRUCTURE IS MISSPECIFIED? Felipe Carvalho 1, Mark N. Maunder 2,3, Yi-Jay Chang 1, Kevin R. Piner 4, Andre.
1 Climate Change and Implications for Management of North Sea Cod (Gadus morhua) L.T. Kell, G.M. Pilling and C.M. O’Brien CEFAS, Lowestoft.
Lecture 10 review Spatial sampling design –Systematic sampling is generally better than random sampling if the sampling universe has large-scale structure.
A bit of history Fry 1940s: ”virtual population”, “catch curve”
UALG Statistical catch at age models Einar Hjörleifsson.
As a data user, it is imperative that you understand how the data has been generated and processed…
Statistical Concepts Basic Principles An Overview of Today’s Class What: Inductive inference on characterizing a population Why : How will doing this allow.
SAMPLING TECHNIQUES N. JAYAKUMAR Assistant professor Dept. of Fisheries Biology and Resource Management Fisheries college & Research Institute, Thoothukudi-8.
Day 3 Session 1 Model fitting. So, how do you assess a stock assessment model? There are five key questions you should ask yourself: 1.Assumptions What.
Uncertainty in Measurement How would you measure 9 ml most precisely? What is the volume being measured here? What is the uncertainty measurement? For.
Population Dynamics and Stock Assessment of Red King Crab in Bristol Bay, Alaska Jie Zheng Alaska Department of Fish and Game Juneau, Alaska, USA.
Is down weighting composition data adequate to deal with model misspecification or do we need to fix the model? Sheng-Ping Wang, Mark N. Maunder National.
Gulf of Maine/Georges Bank Atlantic Herring Assessment Update – 2009.
Conducting surveys and designing questionnaires. Aims Provide students with an understanding of the purposes of survey work Overview the stages involved.
Professor S K Dubey,VSM Amity School of Business
Selectivity.
SAFS Quantitative Seminar
Pribilof Island red king crab
MULTIFAN-CL implementation of deterministic and stochastic projections
Presentation transcript:

WP 2.4 Evaluation of NMFS Toolbox Assessment Models on Simulated Groundfish Data Sets Comparative Simulation Tests Overview Brooks, Legault, Nitschke, O’Brien, Sosebee, Rago, and Seaver Nothing gives rest but the sincere search for truth. Blaise Pascal (French philosopher)

What did we do? Evaluated 5 NFT stock assessment models for three stocks under 4 scenarios meant to examine potential difficulties in real assessments –AIM, ASPIC, SCALE, VPA, ASAP –GB yt (retro), GB cod (domes), white hake (ageing) PopSim used to generate true conditions and create 100 datasets with the same random errors for all 5 models Evaluated Accuracy and Precision of the 100 point estimates from the models –Did not examine precision of each of the 100 runs 60 scenarios 6000 assessments

Why? Test hypothesis that all models are impacted similarly when presented with the same underlying problem A priori know that some models will not perform well under the test conditions because limiting data to VPA years NOT trying to declare one model “winner” NOT trying to declare any model “bad”

PopSim Primer Age and Length Based Population Simulator User defines Dimensions (Years, Ages, Plus Group Age, Lengths) Initial NAA Recruitment time series (or SRR) Annual Fmult and selectivity Biological Characteristics –M, von B, L-W Fishery Sampling Surveys Sets Template for Stock Assessment Model

Surveys vs Indices Surveys –Are a property of the true population –Catchability defined for all ages and years –Uncertainty added to true values at age and length Indices –Are a property of the model –Sum values from surveys –Can be either number or biomass based –Can be limited age range or entire age range –Can be changed between models without impacting underlying truth

Growth Initial NAA distributed according to stdev1 Growth transfer matrices created for each age based on expected von B growth for age and stdev2 Fish not allowed to decrease in size Allows fishing to change distribution of length at age

Market Sampling Markets declared by user Sampling conducted per 100 mt of landings in each market each year

InputOutput

PopSim Limitations PopSim is not reality Annual Time Steps Does not contain spatial components Does not allow gender differences Does not allow density dependent effects No integrated management –Developing MSE wrapper to use PopSim, VPA, AgePro, and Control Rules

This Exercise Used utility to convert VPA run to PopSim –Gets Nyears, plus group age from VPA –Sets initial NAA and R from VPA –Sets annual Fmult from VPA –Estimates one logistic selectivity from VPA Length and biology stuff from user Market stuff from user Surveys and Indices defs from user Tuned markets, sampling, and surveys to represent actual assessments by lead

Farmed Out Assessments AIM – Rago ASPIC – Brooks SCALE – Nitschke VPA – Legault, O’Brien, Sosebee ASAP – Legault Used base case to get template settings reasonable Applied this base case to each of the test cases Some models did additional runs with modified templates to “fix” the problem

Results PopSim compares the distribution of 100 assessments with the known true values Exactly what is compared depends on model –E.g. VPA NAA & FAA, ASPIC B & F Many, many runs and scenarios –PopSim creates tables and graphs –R program to gather results and automatically create plots

Black Line True Circles and Grey Line Median Red dashed Lines 5 and 95%iles Started by looking at direct results

Decided Bias and CV Better

General Conclusions Given failure of all models tested (simple & complex), we suspect other models would also be vulnerable to “retrospective agents” Use of age-specific indices is robust to uncertainty in survey selectivity If ageing is uncertain, these simulations support using models w/o age or models which allow uncertainty in catch at age

General Conclusions (cont.) VPA and ASAP ‘failures’ were similar in pattern Magnitude of bias was less for ASAP Precision usually somewhat better for ASAP Given these similarities, we suggest that ASAP may offer some advantages to VPA (esp. in terms of flexibility)