Potentielle forklarende variabler for udbytte i forskellige miljøer Hans Pinnschmidt Danmarks JorgbrugsForskning Afdeling for Plantebeskyttelse Forskning.

Slides:



Advertisements
Similar presentations
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Advertisements

Multiple Regression W&W, Chapter 13, 15(3-4). Introduction Multiple regression is an extension of bivariate regression to take into account more than.
Kin 304 Regression Linear Regression Least Sum of Squares
Chapter 8 Linear regression
Chapter 8 Linear regression
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Statistics for the Social Sciences
Multiple Regression Models Advantages of multiple regression Important preliminary analyses Parts of a multiple regression model & interpretation Differences.
L Berkley Davis Copyright 2009 MER301: Engineering Reliability Lecture 14 1 MER301: Engineering Reliability LECTURE 14: Chapter 7: Design of Engineering.
Basic Statistical Concepts Psych 231: Research Methods in Psychology.
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Multiple Linear Regression
AP Statistics Chapter 8: Linear Regression
Analysis of Variance & Multivariate Analysis of Variance
Chapter 14 Inferential Data Analysis
Richard M. Jacobs, OSA, Ph.D.
Transforming the data Modified from: Gotelli and Allison Chapter 8; Sokal and Rohlf 2000 Chapter 13.
Discriminant Analysis Testing latent variables as predictors of groups.
Classification and Prediction: Regression Analysis
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Understanding Research Results
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
@ 2012 Wadsworth, Cengage Learning Chapter 5 Description of Behavior Through Numerical 2012 Wadsworth, Cengage Learning.
Introduction to Regression Analysis. Two Purposes Explanation –Explain (or account for) the variance in a variable (e.g., explain why children’s test.
بسم الله الرحمن الرحیم.. Multivariate Analysis of Variance.
Agenda Review Association for Nominal/Ordinal Data –  2 Based Measures, PRE measures Introduce Association Measures for I-R data –Regression, Pearson’s.
CHAPTER 6, INDEXES, SCALES, AND TYPOLOGIES
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
UNDERSTANDING RESEARCH RESULTS: DESCRIPTION AND CORRELATION © 2012 The McGraw-Hill Companies, Inc.
METHODS IN BEHAVIORAL RESEARCH NINTH EDITION PAUL C. COZBY Copyright © 2007 The McGraw-Hill Companies, Inc.
Chapter 8 Curve Fitting.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Statistics PSY302 Quiz One Spring A _____ places an individual into one of several groups or categories. (p. 4) a. normal curve b. spread c.
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
Chapter 16 Data Analysis: Testing for Associations.
Analysis of Covariance Combines linear regression and ANOVA Can be used to compare g treatments, after controlling for quantitative factor believed to.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 Linear Regression (3)
Multivariate Data Analysis Chapter 1 - Introduction.
 Relationship between education level, income, and length of time out of school  Our new regression equation: is the predicted value of the dependent.
The Practice of Social Research Chapter 6 – Indexes, Scales, and Typologies.
PCB 3043L - General Ecology Data Analysis. PCB 3043L - General Ecology Data Analysis.
Ch14: Linear Least Squares 14.1: INTRO: Fitting a pth-order polynomial will require finding (p+1) coefficients from the data. Thus, a straight line (p=1)
Choosing an Investment Portfolio
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Chapter 10 Copyright © Allyn & Bacon 2008 This multimedia product and its contents are protected under copyright law. The following are prohibited by law:
Multiple Regression Learning Objectives n Explain the Linear Multiple Regression Model n Interpret Linear Multiple Regression Computer Output n Test.
Chapter 8 Relationships Among Variables. Outline What correlational research investigates Understanding the nature of correlation What the coefficient.
Multivariate Transformation. Multivariate Transformations  Started in statistics of psychology and sociology.  Also called multivariate analyses and.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 18 Multivariate Statistics.
REGRESSION MODEL FITTING & IDENTIFICATION OF PROGNOSTIC FACTORS BISMA FAROOQI.
Part II Exploring Relationships Between Variables.
Bivariate analysis. * Bivariate analysis studies the relation between 2 variables while assuming that other factors (other associated variables) would.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Nonequivalent Groups: Linear Methods Kolen, M. J., & Brennan, R. L. (2004). Test equating, scaling, and linking: Methods and practices (2 nd ed.). New.
multivariate genotype - environment data
Statistical tests for quantitative variables
Kin 304 Regression Linear Regression Least Sum of Squares
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Multiple Regression.
BPK 304W Regression Linear Regression Least Sum of Squares
Basic Statistical Terms
Simple Linear Regression
15.1 The Role of Statistics in the Research Process
Statistics PSY302 Review Quiz One Spring 2017
Regression Analysis.
Presentation transcript:

Potentielle forklarende variabler for udbytte i forskellige miljøer Hans Pinnschmidt Danmarks JorgbrugsForskning Afdeling for Plantebeskyttelse Forskning Center Flakkebjerg 4200 Slagelse

Background BAROF WP1 data: multivariate measurements on 86 spring barley genotypes in 10 environments (2 years: 2002 & 2003, 3 sites: Flakkebjerg, Foulum, Jyndevad, 2 production systems: ecological & conventional).

variables: yield 1000 grain weight grain protein contents culm length date of emergence growth duration mildew severity rust severity scald severity net blotch severity disease diversity weed cover broken panicles & culms lodging parameters: raw data mean/median/max./min. rank/relative values main effects interaction slopes raw data adjusted for E/G main effects/slopes (residuals) IPCA scores SD/variance factors: genotypeenvironment G 1 E 1....E j. G i variables: X 1(i,j)... X m(i,j) parameters X m(i)1... X m(i)p X m(j)1... X m(j)p } derive information on general properties, specificity, stability/variability

Objectives Multivariate characterisation of genotypes with emphasis on yield-related properties.

Statistical methods Non-linear Canonical Correlation Analysis (NCCA): an optimal scaling procedure suited for handling multivariate data of any kind of scaling (numerical/quantitative, ordinal, nominal). Multiple Regression Analysis (MRA)

Non-linear Canonical Correlation Analysis (NCCA) data treatment: quantitative variables (v m ) were converted into ordinal variables with n categories (v v 1n,..., v m1... v mn ).

Characterisation of environments based on data adjusted for G main effects (= residuals)

Flakkebjerg 2003: high yield, net blotch & panicle breakage; low mildew & lodging Flakkebjerg 2002: high rust & 1000 grain weight; late sowing Foulum 2002 conventional & Jyndevad 2003 ecological: high mildew & lodging; low yield % net blotch Jyndevad 2002 ecological: low yield, 1000 grain weight, weed infestation, protein content

Characterisation of genotypes based on data adjusted for E main effects (= residuals)

dimension 1 (sq. root) dimension 5 (sq. root) high yield & 1000 grain weight; low protein content & lodging low yield & 1000 grain weight high mildew; low net blotch & disease diversity low mildew

Characterisation of genotypes in individual environments based on: actual yield data disease main effects (ME) of G’s environmental disease variability (SD) of G’s (= standard deviation of E adjusted data)

Flakkebjerg 2003: high yield, net blotch & panicle breakage; low mildew & lodging

dimension 4 (sq. root) dimension 6 (sq. root) high yield; low net blotch ME & SD short straw high rust ME & SD long straw low yield; high net blotch ME & SD Flakkebjerg 2003: high yield, net blotch & panicle breakage; low mildew & lodging

Flakkebjerg 2003: high yield, net blotch & panicle breakage; low mildew & lodging Foulum 2002 conventional & Jyndevad 2003 ecological: high mildew & lodging; low yield & net blotch Jyndevad 2002 ecological: low yield, 1000 grain weight, weed infestation, protein content

dimension 1 (sq. root) dimension 5 (sq. root) low yield; high mildew & net blotch ME & SD low mildew ME & SD high yield Jyndevad 2003 ecological: high mildew & lodging; low yield & net blotch

Multiple Regression Analysis (MRA) dependent variables: yield (actual, E-adj. G mean & SD) independent variables: E-adj. G mean & SD of disease severity, weed infestation, growth duration, culm length criteria: Pin/out = 0.05/0.10; Fin/out = 3,84/2.71; tolerance =

Variables must pass both tolerance and minimum tolerance tests in order to enter and remain in a regression equation. Tolerance is the proportion of the variance of a variable in the equation that is not accounted for by other independent variables in the equation. The minimum tolerance of a variable not in the equation is the smallest tolerance any variable already in the equation would have if the variable being considered were included in the analysis. If a variable passes the tolerance criteria, it is eligible for inclusion based on the method in effect.

Mean versus standard deviation of environment-adjusted yield of spring barley genotypes; BAROF

Observed versus estimated mean environment-adjusted yield of spring barley genotypes; BAROF

Observed versus estimated standard deviation of environment- adjusted yield of spring barley genotypes; BAROF

Yield of spring barley genotypes versus main effect yield of the environment; BAROF

Yield of spring barley genotypes estimated based on yield main effect of environment and E-adjusted mean & standard deviation of genotype property variables (disease severity, weed infestation, culm length, growth duration); analysis across environments; BAROF

Yield of spring barley genotypes estimated based on E-adjusted mean & standard deviation of genotype property variables (disease severity, weed infestation, culm length, growth duration); analysis by environment; BAROF

Conclusions & outlook NCCA: “intuitive” method good for “visualising” the main features in multivariate data of various scales useful for obtaining an overall synoptic orientation of G properties and E characteristics  “soft systems approach” MRA:  “hard systems approach” synoptic view neglected Mildew & net blotch had highest yield-related effect, although not always functional (especially in MRA!)