Download presentation

Presentation is loading. Please wait.

1
Website http://www.mun.ca/biology/quant/ http://www.mun.ca/biology/quant/

2
Welcome to Biology 4605 / 7220 Model Based Statistics in Biology

3
Cookie Experiment Was there a preference? Chocolate chip Cinnamon Rolls Are they different? Use statistics – Binomial Test! = = χ 2 = p-value =

4
Are we feeding you a bunch of lies? Leonard Henry Courtney (1832-1918) Do statisticians use a bunch of fancy tests to bolster weak arguments? Are stats misused and misinterpreted? There are three kinds of lies; lies, damned lies and statistics. - Journal of the Royal Statistical Society, No. 59 (1896)

5
Problems: – Rare events – Zero-inflated – Mean is inappropriate Hypothetical example: Less than one endangered species was observed per transect (mean: 0.57 ind./transect). Proceed with development!

6
Statistics are Balderdash! Ernest Rutherford (1871-1937) If your experiment needs statistics, you ought to have done a better experiment Fair Enough…. Balance is important What about field studies?

7
No! Hypothesis testing is inevitable Every experiment may be said to exist only in order to give the facts a chance of disproving the null hypothesis R.A. Fisher (1890-1962)

8
Hypothesis testing is statistical flotsam Everyone will have his own pet assortment of flotsam; mine include most of the theory of significance testing, including multiple comparison tests, and non parametric statistics. John Nelder (1949-2010)

9
The trouble with significance testing Elementary statistics courses for biologists tend to lead to the use of a stereotyped set of tests: 1.Without critical attention to the underlying model involved; 2.Without due regard to the precise distribution of sampling errors; 3.With little concern for the scale of measurement; 4.Careless of dimensional homogeneity; 5.Without considering the ideal transformation; 6.Without any attempt at model simplification; 7.With too much emphasis on hypothesis testing and too little emphasis on parameter estimation. - M.J. Crawley 1993

10
So how should we analyse our data?! 1.Use Model Based Statistics 2.Dont let significance testing do the thinking for you You are always better off thinking about why a model could generate your data and then testing that model - L. Wilkinson et al. 1992 Model Plant height Time in sunlight Data

11
Classic approach Identify a test by name. Check its assumptions. Use automated routines provided in a package. Sort through the output for a p-value. Report whether p was less than 5%. Model Based approach What is the response variable? What are the explanatory variables? Write the model. Check the residuals. Model appropriate? Error structure correct? Take corrective action. Report the model, parameter values, and standard errors. X

12
In short: Write the model* and discard the search for tests Plant height Time in sunlight Data = Model + Residual Y = mX + b + Residual (Regression) *Dont panic…writing a model is easy

13
How to conceptualise a model Quick example Data Verbal GraphicalFormal

14
Data Verbal GraphicalFormal RM 10 125 20 250 325 40 4 450 50 525 575 5100 5150 5175 5200 625 650 675 6125 6150 6175 70 725 80 850 925 100 25 Continued… M = Catch of scallops (kg) R = Seabed roughness (acoustic values)

15
Data Verbal GraphicalFormal RM 10 125 20 250 325 40 4 450 50 525 575 5100 5150 5175 5200 625 650 675 6125 6150 6175 70 725 80 850 925 100 25 Continued… M = Catch of scallops (kg) R = Seabed roughness (acoustic values) Grab samples: 5&6 = Gravel 1-4 = Sand 7-10 = Cobble

16
Data Verbal GraphicalFormal Catch is higher in gravel than in finer (sand) or coarser (cobble) substrates

17
Data Verbal Graphical Formal Catch is higher in gravel than in finer (sand) or coarser (cobble) substrates No obvious linear trend Simplify – Two means model (gravel vs. other)

18
Data Verbal Graphical Formal Catch is higher in gravel than in finer (sand) or coarser (cobble) substrates Two mean model M = K 1 if R = 5 or 6 (gravel) M = K 2 if R not equal 5 or 6 Data = Model + Residual M = [K 1,K 2 ] + Residuals

19
The General Linear Model Data = Model + Normal Residual Data = [Two means] + Normal residual } t-test Data = [Several means] + Normal residual } Oneway ANOVA Data = [Two factors] + Normal residual } twoway ANOVA Data = [Line] + Normal residual } Regression Data = [Line + factors] + Normal residual } ANCOVA

20
Reasons for the model based approach 1.Statistics is modelling 2.Carryover: biological models statistics 3.Model approach leads to learning of concepts and principles

21
Testing models Let computers do the work ExcelMinitabSPSSSASR Spreadsheet visible L Pull down menus L Easily graph data Basic stats functions Randomise data General Linear Model ? Residual analysis Logistic regression Generalized Linear Model Easy to learn FREE

22
Course Goals 1.Introduce you to effective ways of thinking quantitatively about biological phenomena 2.Increase your skill and confidence in the application of quantitative methods 3.Develop your critical capacity, both for your own work and that of others

Similar presentations

OK

Tests of Significance for Regression & Correlation b* will equal the population parameter of the slope rather thanbecause beta has another meaning with.

Tests of Significance for Regression & Correlation b* will equal the population parameter of the slope rather thanbecause beta has another meaning with.

© 2017 SlidePlayer.com Inc.

All rights reserved.

Ads by Google

Ppt on aerobics steps Hospital management system ppt on the asp net Ppt on porter's five forces model example Ppt on indian primary resources Ppt on word association test free Ppt on save environment essay Ppt on history of internet free download Download ppt on mind controlled robotic arms and legs Ppt on current affairs 2014 Ppt on sea level rise map