Presentation is loading. Please wait.

Presentation is loading. Please wait.

Summarizing and Presenting Data. I’m a biologist! Why am I doing math?! We need math and statistics to answer questions like: 1.Does this medicine work?

Similar presentations


Presentation on theme: "Summarizing and Presenting Data. I’m a biologist! Why am I doing math?! We need math and statistics to answer questions like: 1.Does this medicine work?"— Presentation transcript:

1 Summarizing and Presenting Data

2 I’m a biologist! Why am I doing math?! We need math and statistics to answer questions like: 1.Does this medicine work? 2.Do no-fishing areas increase overall reef fish populations? 3.Are Drosophila on the Big Island genetically distinct from Drosophila on O’ahu? Statistics

3 1.Does this medicine work? a)Mrs. Hayashi said she felt better after taking drug A. b)90% of patients who took drug A lived 10 years longer than those on drug B. 2.Do no-fishing areas increase overall reef fish populations? a)My uncle caught nothing when he went fishing next to the reserve yesterday. b)Average size and abundance of commercial fish increased in the years following establishment of a reserve. 3.Are Drosophila on the Big Island genetically distinct from Drosophila on O’ahu? a)They look kinda different b)There are significant base pair differences in the cytochrome oxidase A gene locus Different ways of answering a question – which is better?

4 Statistics The scientific study of numerical data based upon variation in nature.

5 Descriptive Statistics: used to describe/summarize data set e.g., mean, median, range, variance, standard deviation, confidence intervals, frequency distribution, etc. Inferential Statistics: It allows you to make inferences about a population from a sample. It’s used to test the Ho. Ex. t-test, ANOVA, ANCOVA Two types of Statistics

6 Raw Data: the data you collect directly from the organisms or environment you are studying. Population (N): the total # of individuals in the population of interest. Sample (n): the number of observations or individuals measured. n = 3 Variable: any factor that can be controlled, changed, or measured in an experiment. Some Definitions

7 Descriptive Statistics Statistics of LocationStatistics of Location –Average (Mean) –Median –Mode Statistics of DispersionStatistics of Dispersion –Range –Standard Deviation Statistics of LocationStatistics of Location –Average (Mean) –Median –Mode Statistics of DispersionStatistics of Dispersion –Range –Standard Deviation

8 Mean: _ X =  1.2+3.0+0.5+2.3+1.5 5 = 1.7 m Median: Middle number in a data set. Order from smallest to largest. 0.5 1.2 1.5 2.3 3.0 Range: Difference between the largest and smallest data points in a sample. 3.0 - 0.5 = 2.5 m Fern Height (m) 1.2 3.0 0.5 2.3 1.5 Hand calculations

9 Sample Variance _ _ X X-X(X-X) 2 0.5-1.2 1.44 1.2-0.5 0.25 1.5-0.2 0.04 2.3 0.6 0.36 3.0 1.3 1.69 3.78 n = 5 _ X = 1.7 3.78 4 S 2 == 0.945 Standard Deviation S = 0.945 = 0.972

10 Presenting Tables AVERAGEAVERAGE DIET WEIGHTLENGTH DIET WEIGHTLENGTH TYPE (g) (cm) TYPE (g) (cm) high fat545.555.3 low fat346.753.2 TABLE I The Effects of High Fat Versus Low Fat Diets on Rat Weights and Lengths

11 Graphing Data

12 Presenting Graphs

13 Correlation & Regression Correlation: describe the strength of association between two variables. The correlation coefficient is designated as r. r can range from -1 to +1. +1 = a strong positive correlation 0 = no correlation -1 = a strong negative correlation r = +1 r = -1 r = 0

14 Presenting Graphs The independent variable is always plotted on the x-axis Does the weight cause the length or the length cause the weight?

15 This relationship can be represented by a regression equation. Equation for a straight line: y = bx + a y = DV, b = slope, x = IV, a = intercept Simple Linear Regression: used to detect a linear relationship between variables when we think x affects y (but not the reverse).

16 Simple Linear Regression: detect a linear relationship between variables when x affects y. Regression Equation for a straight line: y = bx + a y = Dependent variable, b = slope, x = Independent variable, a = intercept e.g. y = height, x = age -> Tree height = b*(tree age) + a How do we calculate a + b?

17 = p-value The p-value (since its less than 0.05) says that the slope of our line is significantly different from zero. Therefore, there’s a significant effect of age on height. intercept = a slope = b EXCEL output:

18 Tasks for Today’s Lab Calculate averages and standard deviations of your seed pod lengths, weights, and number of seeds per pod (use only the units originally measured, not converted units). Do the same for pooled class data (will need Excel files of raw data from you). Create a single table that presents the values (both your data & the pooled class data - include only the averages and SDs in this table).

19 Tasks for Today’s Lab Using only the class pooled data, prepare a frequency histogram for the number of seeds per pod.Using only the class pooled data, prepare a frequency histogram for the number of seeds per pod. Graph the relationship between pod weight and pod length.Graph the relationship between pod weight and pod length. Graph the relationship between number of seeds per pod and pod length.Graph the relationship between number of seeds per pod and pod length. Interpret that data.Interpret that data.


Download ppt "Summarizing and Presenting Data. I’m a biologist! Why am I doing math?! We need math and statistics to answer questions like: 1.Does this medicine work?"

Similar presentations


Ads by Google