Comparative Analysis of Statistical Tools To Identify Recruitment-Environment Relationships and Forecast Recruitment Strength Bernard A. Megrey Yong-Woo.

Slides:



Advertisements
Similar presentations
Differential Impacts of Climate Change on Spawning Populations of Atlantic cod in U.S. Waters Lisa Kerr, Steve Cadrin (UMass School for Marine Science.
Advertisements

The Multiple Regression Model.
Forecasting OPS 370.
ADVANCED STATISTICS FOR MEDICAL STUDIES Mwarumba Mwavita, Ph.D. School of Educational Studies Research Evaluation Measurement and Statistics (REMS) Oklahoma.
Hypothesis Testing Steps in Hypothesis Testing:
EPI 809/Spring Probability Distribution of Random Error.
Statistics for Business and Economics
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
Chapter 11 Multiple Regression.
Energy-efficient Self-adapting Online Linear Forecasting for Wireless Sensor Network Applications Jai-Jin Lim and Kang G. Shin Real-Time Computing Laboratory,
Chapter 7 Forecasting with Simple Regression
Chapter 14 Inferential Data Analysis
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Recruitment success and variability in marine fish populations: Does age-truncation matter? Sarah Ann Siedlak 1, John Wiedenmann 2 1 University of Miami,
1 Prediction of Software Reliability Using Neural Network and Fuzzy Logic Professor David Rine Seminar Notes.
AM Recitation 2/10/11.
Megan Stachura and Nathan Mantua University of Washington School of Aquatic and Fishery Sciences September 8, 2012.
Hypothesis Testing in Linear Regression Analysis
Gary D. Marty 1, Peter-John F. Hulson 2, Sara E. Miller 2, Terrance J. Quinn II 2, Steve D. Moffitt 3, Richard A. Merizon 3 1 School of Veterinary Medicine,
Comparing Means. Anova F-test can be used to determine whether the expected responses at the t levels of an experimental factor differ from each other.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 13 Experimental Design and Analysis of Variance nIntroduction to Experimental Design.
ITK-226 Statistika & Rancangan Percobaan Dicky Dermawan
© 2004 Prentice-Hall, Inc.Chap 15-1 Basic Business Statistics (9 th Edition) Chapter 15 Multiple Regression Model Building.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Hypothesis Testing PowerPoint Prepared by Alfred.
Outline What Neural Networks are and why they are desirable Historical background Applications Strengths neural networks and advantages Status N.N and.
1 1 Slide Simple Linear Regression Coefficient of Determination Chapter 14 BA 303 – Spring 2011.
© 2001 Prentice-Hall, Inc. Statistics for Business and Economics Simple Linear Regression Chapter 10.
Using Neural Networks to Predict Claim Duration in the Presence of Right Censoring and Covariates David Speights Senior Research Statistician HNC Insurance.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
Correlation & Regression
Physical and related biological variability in the large-scale North Atlantic, with implications for the NW Atlantic Ken Drinkwater Institute of Marine.
Biological and environmental factors influencing recruitment success of North Sea demersal and pelagic fish stocks Alan Sinclair Fisheries and Oceans Canada.
Applications of Neural Networks in Time-Series Analysis Adam Maus Computer Science Department Mentor: Doctor Sprott Physics Department.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Academic Research Academic Research Dr Kishor Bhanushali M
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
Model Selection and Validation. Model-Building Process 1. Data collection and preparation 2. Reduction of explanatory or predictor variables (for exploratory.
Correlation. Correlation Analysis Correlations tell us to the degree that two variables are similar or associated with each other. It is a measure of.
381 Correlation and Regression-I QSCI 381 – Lecture 36 (Larson and Farber, Sect 9.1)
Recruitment variation in Icelandic summer spawning herring: Is it best explained by ocean environment or by the spawning stock? Guðmundur J. Óskarsson.
By Jyh-haw Yeh Department of Computer Science Boise State University.
Downscaling Global Climate Model Forecasts by Using Neural Networks Mark Bailey, Becca Latto, Dr. Nabin Malakar, Dr. Barry Gross, Pedro Placido The City.
Empirical comparison of historical data and age- structured assessment models for Prince William Sound and Sitka Sound Pacific herring Peter-John F. Hulson,
Influence of selectivity and size composition misfit on the scaling of population estimates and possible solutions: an example with north Pacific albacore.
Data Mining: Neural Network Applications by Louise Francis CAS Convention, Nov 13, 2001 Francis Analytics and Actuarial Data Mining, Inc.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
LOAD FORECASTING. - ELECTRICAL LOAD FORECASTING IS THE ESTIMATION FOR FUTURE LOAD BY AN INDUSTRY OR UTILITY COMPANY - IT HAS MANY APPLICATIONS INCLUDING.
Consequences of changing climate for North Atlantic cod stocks and implications for fisheries management Keith Brander ICES/GLOBEC Coordinator.
Forecast 2 Linear trend Forecast error Seasonal demand.
Day 4, Session 1 Abundance indices, CPUE, and CPUE standardisation
Forecasting. Model with indicator variables The choice of a forecasting technique depends on the components identified in the time series. The techniques.
Lecture 9 Forecasting. Introduction to Forecasting * * * * * * * * o o o o o o o o Model 1Model 2 Which model performs better? There are many forecasting.
Advanced Data Analytics
Logic of Hypothesis Testing
Chapter 4 Basic Estimation Techniques
Estimation & Hypothesis Testing for Two Population Parameters
A Comparison of Profiling Float and XBT Representations of Upper Layer Temperature Structure of the Northwestern Subtropical North Atlantic Robert L.
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Quantitative Methods Simple Regression.
Luís Filipe Martinsª, Fernando Netoª,b. 
S519: Evaluation of Information Systems
Comparing Means.
The concept of population and unit stock
Role of Statistics in Climate Sciences
Kreshna GOPAL C. Prakash KHEDUN Anoop SOHUN
Chapter 10 – Part II Analysis of Variance
Generalized Additive Model
Presentation transcript:

Comparative Analysis of Statistical Tools To Identify Recruitment-Environment Relationships and Forecast Recruitment Strength Bernard A. Megrey Yong-Woo Lee S. Allen Macklin National Oceanic and Atmospheric Administration National Marine Fisheries Service Alaska Fisheries Science Center Seattle, WA USA

Overview Background and motivation Mechanics of testing procedures Results of application of 3 statistical tools to 2 data sets Concluding remarks and observations

Why Forecast Recruitment? Understand important bio-physical factors controlling the recruitment processes –The ultimate test of a model is its ability to predict Project future stock dynamics Evaluate management scenarios Provide reference points for fishery management Assist commercial fisheries decision making

The data we collect as it relates to recruitment variability and the factors that influence it probably will not change dramatically in the near future. “We should endeavor to treat the data differently in a statistical sense.” R.J.H Beverton 1989

What are the best statistical tools for estimating environment-recruitment relationships and forecasting future recruitment states? ? ? ? ? ? ?

Problems in Forecasting The complexity of recruitment forecasting often seems beyond the capabilities of traditional statistical analysis paradigms because…. Bio-physical relationships are inherently nonlinear Often there are limitations in theoretical development or standard models cannot deal with data pathologies Inability to meet required assumptions Time series of data are short Lack of degrees of freedom The need to partition already short time series into segments representing identified regimes

Objectives Test and compare several statistical methods to evaluate their ability –to identify recruitment-environment relationships –to forecast future recruitment In a real world setting we can never know the parameters and underlying relationships of actual data –simulate data with known properties and different levels of measurement error using Gulf of Alaska pollock Use methods on actual North Atlantic data –Norwegian spring spawn herring SB and R, Kola Line SST, and Index of NAO (Toresen and Ostvedt 2000) Environmental effects occur in birth year (i.e. no lags)

Simulated Data with Known Properties R = a·S·exp(-b·S+c·N+d·T+ ε ) R: Recruitment S: Spawning Biomass N: Wind Anomaly - No relationship T: Sea Surface Temperature ε : Measurement Error, N(0,σ 2 ), σ 2 was estimated from a Ricker fit to actual data. *** 3 Error levels: [no error] [½ σ 2 ] [σ 2 ] ^^

Summary of Simulated Data SBWindSST Relationship to Recruitment Nonlinear Ricker noneLinear Functional Relationship Exponential Nonlinear MeanLog Linear Probability Distribution GammaLognormalNormal

Herring Simulated

Tested Statistical Tools Recruitment on the absolute scale (billion fish) Nonlinear Regression (NLR) Generalized Additive Models (GAM) Artificial Neural Network (ANN) FISHERIES APPLICATIONS GAM Cury et al. 1995; Swartzman et al. 1995; Meyers et al. 1995; Jacobsen and MacCall 1995; Daskalov 1999 ANN Chen and Ware 1999

Neural Networks

General Additive Models

Comparisons Statistical Methods Parametric (NLR) vs. Non-parametric (GAM, ANN) Conventional (NLR) vs. Innovative (GAM, ANN) Model Free (GAM, ANN) vs. functional relationships specified a priori (NLR)

Time Series Partitioning 2 Data Segments Training segment used for parameter estimation Forecasting segment used for forecasting accuracy Simulated Data (n=42) Training segment (n=37) Forecasting segment (n=5) Herring Data (n=89) Training segment (n=79) Forecasting segment (n=10)

Simulated vs Predicted, for Error level = 0 R-square for TrainingMSE for Forecasting

Simulated Data Testing and Forecast Segment Comparison using MSE ERROR 2 ERROR 1 ERROR 0

Simulated Data ANN Relative Weight Comparison 3 variables 2 hidden neurons 10 parms 2 variables 2 hidden neurons 8 parms

Spurious Correlations We did see evidence of spurious correlations when analyzing the simulated data. The GAM model, Err = 2; R = SB + WIND + SST WIND was significant in NLR model, Err=3. When dealing with data with typical levels of variation, it is possible to conclude that unnecessary or irrelevant variables are significant. “Spurious correlations are the first enemy of recruitment biologists” Tyler (1992)

Herring Data Testing and Forecast Segment Comparison using MSE and R 2 ANN Relative Weight Comparison

GAM fit to Herring Data

Summary Need to be cautious when dealing with noisy data, because a wrong model or variable could be identified as influential to recruitment. We did see evidence of spurious correlations under very controlled data situations. It appears that ANNs forecast better than conventional parametric methods when data are noisy. Non-parametric methods (GAMs and ANNs) work well for suggesting functional relationships and forecasting future recruitment states –desirable property because real systems are highly non- linear and include complex interactions among the variables.

Summary (con’t) There is no one “best” method to address the environment- recruitment problem. ANNs are highly flexible and show promise for forecasting, thus using GAMs and ANNs together with more traditional methods should enhance analysis and forecasting. When considering estimation in conjunction with forecasting it is better to consider a balance between “best” models. Results underscore the need to build good conceptual models first, then guided by hypotheses regarding factors that control recruitment and their time and space scales of influence, judiciously apply a suite of statistical models to quality data sets. Data mining and “kitchen stew” correlation exercises are not appropriate.

Simulated Data

Norwegian Spring Spawn Herring