Presentation is loading. Please wait.

Presentation is loading. Please wait.

Statistics for A2 Biology Standard deviation Student’s t-test Chi squared Spearman’s rank.

Similar presentations


Presentation on theme: "Statistics for A2 Biology Standard deviation Student’s t-test Chi squared Spearman’s rank."— Presentation transcript:

1 Statistics for A2 Biology Standard deviation Student’s t-test Chi squared Spearman’s rank

2 Why? Statistical tests allow us to draw conclusions about data based on statistical significance. e.g. Is there a significant difference between the mean heights of students in different year groups? e.g. Is there significant correlation between the height and age of students in a school?

3 Standard Deviation A measure of how ‘spread out’ data is around a mean. Allows us to compare two or more sets of data to see whether the means are significantly different. Limitations: - doesn’t give you the range of data - can be affected by outliers/chance results

4 Calculating Standard Deviation 1.Subtract the mean of the data from each data point. 2.Square your answers. 3.Add them all together. 4.Divide your answer by the number of data points minus 1. 5.Take the square root. OR use the calculator method!

5 What next? Compare the mean+standard deviation for the data sets. If these ranges overlap, there is no significant difference between the means of the data sets. If the ranges do not overlap, there is a significant difference between the means.

6 Statistical Tests Student’s t-test – are two mean results significantly different? Spearman’s Rank Correlation Coefficient – is there significant correlation between data sets? X 2 (Chi-Squared) – is there a significant difference between observed and expected results?

7 Null hypothesis Results of an experiment could be due to random chance. Only way to support your hypothesis is to reject a null hypothesis. Null hypothesis states there is no link/correlation/difference between results. → Depends on statistical test used.

8 Statistics - State null hypothesis - Which test will you use? - Why? - Calculate test statistic - Interpret the test statistic in relation to your null hypothesis. Use the words probability and chance in your answer.

9 Probabilities We normally work at the 5% probability level (P=0.05). To reject the null hypothesis (and accept your own hypothesis), you must be sure that there is ≤5% probability that the results are due to chance.

10 William Gosset (aka ‘Student’) (1876-1937) Worked in quality control at the Guinness brewery and could not publish under his own name. Former student of Karl Pearson

11 What can this test tell you? If there is a statistically significant difference between two means, when:  The sample size is less than 25.  The data is normally distributed BETTER THAN STANDARD DEVIATION! The t-test

12 t-test x 1 = mean of first sample x 2 = mean of second sample s 1 = standard deviation of first sample s 2 = standard deviation of second sample n 1 = number of measurements in first sample n 2 = number of measurements in second sample x 1 – x 2  (s 1 2 /n 1 ) + (s 2 2 /n 2 ) t = SD =  (x – x) 2 n – 1 

13 Worked example Does the pH of soil affects seed germination of a specific plant species? Group 1: eight pots with soil at pH 5.5 Group 2: eight pots with soil at pH 7.0 50 seeds planted in each pot and the number that germinated in each pot was recorded.

14 H 0 = there is no statistically significant difference between the germination success of seeds in two soils of different pH H A = there is a significant difference between the germination of seeds in two soils of different pH If the value for t exceeds the critical value (P = 0.05), then you can reject the null hypothesis. What is the null hypothesis (H 0 )?

15 PotGroup 1 (pH5.5) (x – x) 2 Group 2 (pH7.0) (x – x) 2 1381.273920.25 2413.52452.25 34315.02416.25 4390.02466.25 5374.524820.25 6381.273920.25 7413.52466.25 8369.77440.25 Mean39.11.2743.520.25  38.8882.0 Construct the following table…

16 Calculate standard deviation for both groups SD =  (x – x) 2 n – 1  SD =  (x – x) 2 n – 1  Group 1: Group 2: = 38.88 8 – 1  = 82.0 8 – 1  = 2.36 = 3.42

17 Using your means and SDs, calculate value for t x 1 – x 2  (s 1 2 /n 1 ) + (s 2 2 /n 2 ) t = 39.1 – 43.5  (2.36 2 /8) + (3.42 2 /8) t = t = -2.99 BUT we can ignore the - sign -4.4  0.696 + 1.462 =

18 Compare your value of t with the appropriate critical value: If your value is lower than the critical value: -there is no significant difference between data sets -accept null hypothesis ->5% probability the difference in results is due to chance. If your value is higher than the critical value: -there is a significant difference between data sets -reject null hypothesis -≤5% probability the difference in results is due to chance.

19 Compare our calculated value of r with the relevant critical value in the stats table of critical values Our value of t = 2.99 Degrees of freedom = n 1 + n 2 – 2 = 14 D.F.Critical Value (P = 0.05) 142.15 152.13 162.12 172.11 182.10 Our value for t exceeds the critical value, so we can reject the null hypothesis. We can conclude that there is a significant difference between the two means, so pH does affect the germination rate for this plant.

20 Now try the examples on the sheet. Remember: 1.State your null hypothesis. 2.Calculate the mean and standard deviation. 3.Calculate the value of t. 4.Compare the value of t with the critical value. 5.Write a conclusion, stating: 1.Whether or not the confidence limits overlap 2.Whether you accept or reject the null hypothesis 3.What the probability is that the differences between means occurred by chance.

21 Why use Spearman’s rank?

22 What can this test tell you? If there is a statistically significant correlation between two sets of measurements from the same sample. What is the null hypothesis? There is no correlation between …………. Spearman’s Rank

23 Critical values. We use the 0.05 significance (probability) level – this is all you will be given in the EMPA.

24 Calculating Spearman’s Rank 1.State null hypothesis 2.Rank the data. 3.Calculate the correlation coefficient. 4.Compare r s with table of critical values.

25 Worked example The table shows the mass of nitrogen in fertiliser added to fields and the mean concentration of nitrates in nearby streams. Is there a significant correlation between the two variables? Mass of N on fields (kg ha -1 ) Concentration of nitrates in stream (mg dm -3 ) 411.2 411.3 511.5 561.8 631.6 691.9 722.0

26 There is no statistically significant correlation between the mass of nitrogen in fertiliser added to fields and the mean concentration of nitrates in nearby streams. What is the null hypothesis?

27 Step 2: Ranking the data Mass of N (kg ha -1 ) Rank Conc. of nitrates (mg dm -3 ) Rank Difference in rank (D) D2D2 411.2 411.3 511.5 561.8 631.6 691.9 722.0 1.5 3 4 5 6 7 12354671235467 0.5 -0.5 0 1 0 0.25 0 1 0 Make sure you rank both sets of data in the same direction (i.e. lowest to highest OR highest to lowest, NOT one each way). Always keep the data in its original pairs.

28 Step 3: Calculate r s Add up the D 2 values to give ∑D 2 : ∑D 2 = 0.25 + 0.25 + 1 + 1 = 2.5 Calculate r s : r s is always between -1 and 1. = 1 – 0.045 = 0.955

29 Step 4: Compare your value of r s with the appropriate critical value: Find the critical value for 7 pairs of measurements. Is your answer higher or lower than the critical value?

30 Step 4: Compare your value of r s with the appropriate critical value: 0.955>0.79, therefore we reject the null hypothesis. There is ≤5% probability that the correlation in results is due to chance. There is statistically significant positive correlation between the mass of nitrogen and the concentration of nitrates.

31 Step 4: Compare your value of r s with the appropriate critical value: If your value is lower than the critical value: -there is no significant correlation -accept null hypothesis ->5% probability the correlation in results is due to chance. If your value is higher than the critical value: -there is significant correlation -reject null hypothesis -≤5% probability the correlation in results is due to chance. r s values below 0 → negative correlation. r s values above 0 → positive correlation.

32 Calculating Spearman’s Rank 1.State null hypothesis 2.Rank the data. 3.Calculate the correlation coefficient. 4.Compare r s with table of critical values. 5.Write your conclusion, referring to the critical value, the null hypothesis, probability and chance.

33 Why use Χ 2 (chi squared)?

34 What can this test tell you? If there is a statistically significant difference between observed and expected results. What is the null hypothesis? There is no difference between observed and expected results. Χ2Χ2

35 Critical values. We use the 0.05 significance (probability) level – this is all you will be given in the EMPA.

36 Calculating Χ 2 1.State null hypothesis 2.Calculate Χ 2 3.Compare Χ 2 with table of critical values.

37 Worked example The table shows the number of people living on each side of a river who died from cancer in one year. Is there a significant difference in the death rates? Side of riverNorthSouth Death rate (per 100 people) 2612

38 There is no statistically significant difference between the death rates on each side of the river. What is the null hypothesis?

39 Step 2: Calculate X 2 Deaths from cancer in people living North of riverSouth of river Observed results (O) 2612 Expected results (E) O – E (O – E) 2 E Calculate expected results according to your null hypothesis. For this data, that means take the mean! 19 -7 7 49 2.6

40 Step 2: Calculate X 2 Add up the values for (O – E) 2 E Σ (O – E) 2 E Χ 2 = 2.6 + 2.6 = 5.2 =

41 Step 3: Compare your value of X 2 with the appropriate critical value: Number of degrees of freedom = n - 1 Is your answer higher or lower than the critical value?

42 Step 4: Compare your value of X 2 with the appropriate critical value: 5.2>3.84, therefore we reject the null hypothesis. There is ≤5% probability that the difference in results is due to chance. There is a statistically significant difference between the death rates on each side of the river.

43 Step 4: Compare your value of X 2 with the appropriate critical value: If your value is lower than the critical value: -there is no significant difference between results -accept null hypothesis ->5% probability the difference in results is due to chance. If your value is higher than the critical value: -there is a significant difference between results -reject null hypothesis -≤5% probability the difference in results is due to chance.

44 Calculating Χ 2 1.State null hypothesis 2.Calculate Χ 2 3.Compare Χ 2 with table of critical values. 4.Write your conclusion, referring to the critical value, the null hypothesis, probability and chance.

45 Assessment Solve the six problems on the sheet. Each problem is marked as follows: –Null hypothesis – 1 mark –Give your choice of test – 1 mark –Say why you’ll use that test – 1 mark –Calculate the test statistic – 1 mark –Interpret the test statistic in relation to your null hypothesis. Use the words probability and chance in your answer. – 2 marks


Download ppt "Statistics for A2 Biology Standard deviation Student’s t-test Chi squared Spearman’s rank."

Similar presentations


Ads by Google