Statistics for A2 Biology Standard deviation Student’s t-test Chi squared Spearman’s rank.

Slides:



Advertisements
Similar presentations
Are our results reliable enough to support a conclusion?
Advertisements

© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Inferential Stats for Two-Group Designs. Inferential Statistics Used to infer conclusions about the population based on data collected from sample Do.
Statistical Analysis I have all this data. Now what does it mean?
STATISTICS For Research. 1. Quantitatively describe and summarize data A Researcher Can:
STATISTICS For Research. Why Statistics? 1. Quantitatively describe and summarize data A Researcher Can:
Correlation.
Statistical Significance R.Raveendran. Heart rate (bpm) Mean ± SEM n In men ± In women ± The difference between means.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Statistical Analysis Mean, Standard deviation, Standard deviation of the sample means, t-test.
Statistical Analysis I have all this data. Now what does it mean?
Hypothesis of Association: Correlation
Chi Square Analysis  Use to see if the observed value varies from the expected value.  Null Hypothesis – There is no difference between the observed.
CONFIDENCE INTERVAL It is the interval or range of values which most likely encompasses the true population value. It is the extent that a particular.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
The Statistical Analysis of Data. Outline I. Types of Data A. Qualitative B. Quantitative C. Independent vs Dependent variables II. Descriptive Statistics.
Sampling  When we want to study populations.  We don’t need to count the whole population.  We take a sample that will REPRESENT the whole population.
Statistical analysis. Types of Analysis Mean Range Standard Deviation Error Bars.
T-test - paired Testing for a difference. What does it do? Tests for a difference in means Compares two cases (eg soil moisture content north & south.
Statistics in IB Biology Error bars, standard deviation, t-test and more.
Statistical Analysis. Null hypothesis: observed differences are due to chance (no causal relationship) Ex. If light intensity increases, then the rate.
Data Analysis.
Correlation – Spearman’s. What does it do? Measures rank correlation – whether highest value in the 1 st data set corresponds to highest in the 2 nd set.
+ Mortality. + Starter for 10…. In pairs write on a post it note: One statistic that we use to measure mortality On another post it note write down: A.
Advanced Higher STATISTICS Spearman’s Rank (Spearman’s rank correlation coefficient) Lesson Objectives 1. Explain why it is used. 2. List the advantages.
Testing of Hypothesis. Test of Significance A procedure to assess the significance of a statistics that a sample statistics would differ from a given.
From last lesson….
MEASURE OF LOCATION Mean is also known as average
Basic Statistics The Chi Square Test of Independence.
Statistical Analysis: Chi Square
Tests of hypothesis Contents: Tests of significance for small samples
Data measurement, probability and Spearman’s Rho
Spearman’s Rho Correlation
Learning Objectives: 1. Understand the use of significance levels. 2
STATISTICS FOR SCIENCE RESEARCH
Testing for a difference
Primary Succession Tropical Rainforest GPP ATP / NADPH Forest fire
Modify—use bio. IB book  IB Biology Topic 1: Statistical Analysis
PCB 3043L - General Ecology Data Analysis.
Lesson objectives the different types of variation
STEM Fair: Statistical Analysis
Learning Aims By the end of this session you are going to totally ‘get’ levels of significance and why we do statistical tests!
Data measurement, probability and statistical tests
Inferential Statistics
Spearman’s rho Chi-square (χ2)
STATISTICS For Research
10: Leisure at an International Scale: Sport
Inferential Statistics
AP Biology Intro to Statistics
Research Statistics Objective: Students will acquire knowledge related to research Statistics in order to identify how they are used to develop research.
Hypothesis Theory examples.
Chi-Squared test AP Biology.
Using Statistical techniques in Geography
Non-parametric tests, part A:
STATISTICS Topic 1 IB Biology Miss Werba.
Paired Samples and Blocks
Statistics.
Data measurement, probability and statistical tests
Standard Deviation & Standard Error
Correlation and the Pearson r
Some statistics questions answered:
Facts from figures Having obtained the results of an investigation, a scientist is faced with the prospect of trying to interpret them. In some cases the.
Statistical analysis.
How do you know if the variation in data is the result of random chance or environmental factors? O is the observed value E is the expected value.
Skills 5. Skills 5 Standard deviation What is it used for? This statistical test is used for measuring the degree of dispersion. It is another way.
Spearman’s Rank For relationship data.
One Way ANOVA test Determine whether there are any statistically significant differences between the means of three or more independent groups.
Quadrat sampling & the Chi-squared test
Quadrat sampling & the Chi-squared test
Statistics in Biology: Standard Error of the Mean & Error Bars
Presentation transcript:

Statistics for A2 Biology Standard deviation Student’s t-test Chi squared Spearman’s rank

Why? Statistical tests allow us to draw conclusions about data based on statistical significance. e.g. Is there a significant difference between the mean heights of students in different year groups? e.g. Is there significant correlation between the height and age of students in a school?

Standard Deviation A measure of how ‘spread out’ data is around a mean. Allows us to compare two or more sets of data to see whether the means are significantly different. Limitations: - doesn’t give you the range of data - can be affected by outliers/chance results

Calculating Standard Deviation 1.Subtract the mean of the data from each data point. 2.Square your answers. 3.Add them all together. 4.Divide your answer by the number of data points minus 1. 5.Take the square root. OR use the calculator method!

What next? Compare the mean+standard deviation for the data sets. If these ranges overlap, there is no significant difference between the means of the data sets. If the ranges do not overlap, there is a significant difference between the means.

Statistical Tests Student’s t-test – are two mean results significantly different? Spearman’s Rank Correlation Coefficient – is there significant correlation between data sets? X 2 (Chi-Squared) – is there a significant difference between observed and expected results?

Null hypothesis Results of an experiment could be due to random chance. Only way to support your hypothesis is to reject a null hypothesis. Null hypothesis states there is no link/correlation/difference between results. → Depends on statistical test used.

Statistics - State null hypothesis - Which test will you use? - Why? - Calculate test statistic - Interpret the test statistic in relation to your null hypothesis. Use the words probability and chance in your answer.

Probabilities We normally work at the 5% probability level (P=0.05). To reject the null hypothesis (and accept your own hypothesis), you must be sure that there is ≤5% probability that the results are due to chance.

William Gosset (aka ‘Student’) ( ) Worked in quality control at the Guinness brewery and could not publish under his own name. Former student of Karl Pearson

What can this test tell you? If there is a statistically significant difference between two means, when:  The sample size is less than 25.  The data is normally distributed BETTER THAN STANDARD DEVIATION! The t-test

t-test x 1 = mean of first sample x 2 = mean of second sample s 1 = standard deviation of first sample s 2 = standard deviation of second sample n 1 = number of measurements in first sample n 2 = number of measurements in second sample x 1 – x 2  (s 1 2 /n 1 ) + (s 2 2 /n 2 ) t = SD =  (x – x) 2 n – 1 

Worked example Does the pH of soil affects seed germination of a specific plant species? Group 1: eight pots with soil at pH 5.5 Group 2: eight pots with soil at pH seeds planted in each pot and the number that germinated in each pot was recorded.

H 0 = there is no statistically significant difference between the germination success of seeds in two soils of different pH H A = there is a significant difference between the germination of seeds in two soils of different pH If the value for t exceeds the critical value (P = 0.05), then you can reject the null hypothesis. What is the null hypothesis (H 0 )?

PotGroup 1 (pH5.5) (x – x) 2 Group 2 (pH7.0) (x – x) Mean  Construct the following table…

Calculate standard deviation for both groups SD =  (x – x) 2 n – 1  SD =  (x – x) 2 n – 1  Group 1: Group 2: = – 1  = – 1  = 2.36 = 3.42

Using your means and SDs, calculate value for t x 1 – x 2  (s 1 2 /n 1 ) + (s 2 2 /n 2 ) t = 39.1 – 43.5  ( /8) + ( /8) t = t = BUT we can ignore the - sign -4.4  =

Compare your value of t with the appropriate critical value: If your value is lower than the critical value: -there is no significant difference between data sets -accept null hypothesis ->5% probability the difference in results is due to chance. If your value is higher than the critical value: -there is a significant difference between data sets -reject null hypothesis -≤5% probability the difference in results is due to chance.

Compare our calculated value of r with the relevant critical value in the stats table of critical values Our value of t = 2.99 Degrees of freedom = n 1 + n 2 – 2 = 14 D.F.Critical Value (P = 0.05) Our value for t exceeds the critical value, so we can reject the null hypothesis. We can conclude that there is a significant difference between the two means, so pH does affect the germination rate for this plant.

Now try the examples on the sheet. Remember: 1.State your null hypothesis. 2.Calculate the mean and standard deviation. 3.Calculate the value of t. 4.Compare the value of t with the critical value. 5.Write a conclusion, stating: 1.Whether or not the confidence limits overlap 2.Whether you accept or reject the null hypothesis 3.What the probability is that the differences between means occurred by chance.

Why use Spearman’s rank?

What can this test tell you? If there is a statistically significant correlation between two sets of measurements from the same sample. What is the null hypothesis? There is no correlation between …………. Spearman’s Rank

Critical values. We use the 0.05 significance (probability) level – this is all you will be given in the EMPA.

Calculating Spearman’s Rank 1.State null hypothesis 2.Rank the data. 3.Calculate the correlation coefficient. 4.Compare r s with table of critical values.

Worked example The table shows the mass of nitrogen in fertiliser added to fields and the mean concentration of nitrates in nearby streams. Is there a significant correlation between the two variables? Mass of N on fields (kg ha -1 ) Concentration of nitrates in stream (mg dm -3 )

There is no statistically significant correlation between the mass of nitrogen in fertiliser added to fields and the mean concentration of nitrates in nearby streams. What is the null hypothesis?

Step 2: Ranking the data Mass of N (kg ha -1 ) Rank Conc. of nitrates (mg dm -3 ) Rank Difference in rank (D) D2D Make sure you rank both sets of data in the same direction (i.e. lowest to highest OR highest to lowest, NOT one each way). Always keep the data in its original pairs.

Step 3: Calculate r s Add up the D 2 values to give ∑D 2 : ∑D 2 = = 2.5 Calculate r s : r s is always between -1 and 1. = 1 – = 0.955

Step 4: Compare your value of r s with the appropriate critical value: Find the critical value for 7 pairs of measurements. Is your answer higher or lower than the critical value?

Step 4: Compare your value of r s with the appropriate critical value: 0.955>0.79, therefore we reject the null hypothesis. There is ≤5% probability that the correlation in results is due to chance. There is statistically significant positive correlation between the mass of nitrogen and the concentration of nitrates.

Step 4: Compare your value of r s with the appropriate critical value: If your value is lower than the critical value: -there is no significant correlation -accept null hypothesis ->5% probability the correlation in results is due to chance. If your value is higher than the critical value: -there is significant correlation -reject null hypothesis -≤5% probability the correlation in results is due to chance. r s values below 0 → negative correlation. r s values above 0 → positive correlation.

Calculating Spearman’s Rank 1.State null hypothesis 2.Rank the data. 3.Calculate the correlation coefficient. 4.Compare r s with table of critical values. 5.Write your conclusion, referring to the critical value, the null hypothesis, probability and chance.

Why use Χ 2 (chi squared)?

What can this test tell you? If there is a statistically significant difference between observed and expected results. What is the null hypothesis? There is no difference between observed and expected results. Χ2Χ2

Critical values. We use the 0.05 significance (probability) level – this is all you will be given in the EMPA.

Calculating Χ 2 1.State null hypothesis 2.Calculate Χ 2 3.Compare Χ 2 with table of critical values.

Worked example The table shows the number of people living on each side of a river who died from cancer in one year. Is there a significant difference in the death rates? Side of riverNorthSouth Death rate (per 100 people) 2612

There is no statistically significant difference between the death rates on each side of the river. What is the null hypothesis?

Step 2: Calculate X 2 Deaths from cancer in people living North of riverSouth of river Observed results (O) 2612 Expected results (E) O – E (O – E) 2 E Calculate expected results according to your null hypothesis. For this data, that means take the mean!

Step 2: Calculate X 2 Add up the values for (O – E) 2 E Σ (O – E) 2 E Χ 2 = = 5.2 =

Step 3: Compare your value of X 2 with the appropriate critical value: Number of degrees of freedom = n - 1 Is your answer higher or lower than the critical value?

Step 4: Compare your value of X 2 with the appropriate critical value: 5.2>3.84, therefore we reject the null hypothesis. There is ≤5% probability that the difference in results is due to chance. There is a statistically significant difference between the death rates on each side of the river.

Step 4: Compare your value of X 2 with the appropriate critical value: If your value is lower than the critical value: -there is no significant difference between results -accept null hypothesis ->5% probability the difference in results is due to chance. If your value is higher than the critical value: -there is a significant difference between results -reject null hypothesis -≤5% probability the difference in results is due to chance.

Calculating Χ 2 1.State null hypothesis 2.Calculate Χ 2 3.Compare Χ 2 with table of critical values. 4.Write your conclusion, referring to the critical value, the null hypothesis, probability and chance.

Assessment Solve the six problems on the sheet. Each problem is marked as follows: –Null hypothesis – 1 mark –Give your choice of test – 1 mark –Say why you’ll use that test – 1 mark –Calculate the test statistic – 1 mark –Interpret the test statistic in relation to your null hypothesis. Use the words probability and chance in your answer. – 2 marks