AP Lab Skills Guide Data will fall into three categories:

Slides:



Advertisements
Similar presentations
Descriptive Measures MARE 250 Dr. Jason Turner.
Advertisements

1 Chapter 1: Sampling and Descriptive Statistics.
IB Math Studies – Topic 6 Statistics.
Statistical Tests Karen H. Hagglund, M.S.
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Standard Error for AP Biology
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 4: The Normal Distribution and Z-Scores.
Inference for regression - Simple linear regression
Programming in R Describing Univariate and Multivariate data.
Quantitative Skills: Data Analysis and Graphing.
Numerical Descriptive Techniques
Choosing and using statistics to test ecological hypotheses
Quantitative Skills: Data Analysis
Chapter 3 Averages and Variations
Quantitative Skills 1: Graphing
Statistics Recording the results from our studies.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
Chapter 2 Describing Data.
Descriptive statistics Petter Mostad Goal: Reduce data amount, keep ”information” Two uses: Data exploration: What you do for yourself when.
Numerical Measures of Variability
Understanding Your Data Set Statistics are used to describe data sets Gives us a metric in place of a graph What are some types of statistics used to describe.
Data Summary Using Descriptive Measures Sections 3.1 – 3.6, 3.8
Descriptive Statistics Unit 6. Variable Any characteristic (data) recorded for the subjects of a study ex. blood pressure, nesting orientation, phytoplankton.
STATISTICS Chapter 2 and and 2.2: Review of Basic Statistics Topics covered today:  Mean, Median, Mode  5 number summary and box plot  Interquartile.
Describing Quantitative Data with Numbers
A QUANTITATIVE RESEARCH PROJECT -
Introduction to Statistics
Descriptive Statistics ( )
Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis also helps distinguish among multiple.
Advanced Quantitative Techniques
Statistical analysis.
One-Variable Statistics
AP Biology Intro to Statistics
STAT 4030 – Programming in R STATISTICS MODULE: Basic Data Analysis
Standard Error for AP Biology
AP Biology Intro to Statistics
STATISTICS FOR SCIENCE RESEARCH
Descriptive Statistics
Correlation, Bivariate Regression, and Multiple Regression
TYPES OF GRAPHS There are many different graphs that people can use when collecting Data. Line graphs, Scatter plots, Histograms, Box plots, bar graphs.
2.5: Numerical Measures of Variability (Spread)
Data Mining: Concepts and Techniques
Statistical analysis.
Descriptive Statistics (Part 2)
Data Analysis-Descriptive Statistics
Introduction to Bio-Medical statistics
Quantitative Skills : Graphing
Standard Error for AP Biology
Basic Statistics Overview
AP Biology Intro to Statistics
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Laugh, and the world laughs with you. Weep and you weep alone
Standard Deviation and Standard Error of the Mean
AP Biology Intro to Statistics
Stats for AP Biology SLIDE SHOWS MODIFIED FROM:
DAY 3 Sections 1.2 and 1.3.
Topic 5: Exploring Quantitative data
HMI 7530– Programming in R STATISTICS MODULE: Basic Data Analysis
Unit 6B Measures of Variation Ms. Young.
How do we categorize and make sense of data?
Summary descriptive statistics: means and standard deviations:
Using Statistics in Biology
Using Statistics in Biology
Standard Error for AP Biology
Displaying and Summarizing Quantitative Data
When You See (This), You Think (That)
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Higher National Certificate in Engineering
Data Literacy Graphing and Statisitics
Presentation transcript:

AP Lab Skills Guide Data will fall into three categories: 1. Parametric (normal) data - Normal distribution around mean - Mean and SD can predict future observations Ex. Heart rate, plant height, body temp 2. Nonparametric data - Does not fit normal distribution - May include large “outliers” 3. Frequency or count data - Counting how many of an item fit into a category Ex) Doing a genetic cross (Aa x Aa) and counting how many offspring are AA, Aa and aa. Ex2) Data collected as percentages like the percentage of cells in interphase of a root tip…you are just counting.

AP Lab Skills Guide Ex2) Data collected as percentages like the percentage of cells in interphase of a root tip…you are just counting.

AP Lab Skills Guide Let’s make up a problem and build a histogram, calculate SD (standard deviation) and SEM (standard error of the mean) Let’s say I measured the heart rate of 10 people: Heart rate (bpm) 76 82 90 73 58 65 63 74 71 68 (Mean – data point)2 (72-76)2 (72-82)2 (72-90)2 (72-73)2 (72-58)2 (72-65)2 (72-63)2 (72-74)2 (72-71)2 (72-68)2 = 16 = 100 = 324 = 1 = 36 = 49 = 81 = 4 Now divide by the sample size minus one: 628/(9) = 69.8 Lastly, take the sqrt Sqrt 69.8 = 8.4 (this is the SD) 72 +/- 8.4 bpm Mean = 72 Add them up: 628

AP Lab Skills Guide Let’s make up a problem and build a histogram, calculate SD (standard deviation) and SEM (standard error of the mean) Number of individuals Let’s say I measured the heart rate of 10 people: Histogram of Resting Heart Rates Heart rate (bpm) 76 82 90 73 58 65 63 74 71 68 Mean = 72 Heart rate bins (bpm)

AP Lab Skills Guide What does the SD tell you? SD describes the predicted spread or variation of the measured variable in the ENTIRE population… Remember, you only measured a small sampling…what if you measured everyone? Ex) If the mean is 50 and the standard deviation is 10 (50+/- 10) then 68% of the population is predicted to be between 40 (50-10) and 60 (50+10). And… 95% of the population predicted to be between 2 standard deviations - 30 (50 minus 20) and 70 (50 plus 20).

AP Lab Skills Guide Let’s make up a problem and build a histogram, calculate SD (standard deviation) and SEM (standard error of the mean) Let’s say I measured the heart rate of 10 people: Heart rate (bpm) 76 82 90 73 58 65 63 74 71 68 (Mean – data point)2 (72-76)2 (72-82)2 (72-90)2 (72-73)2 (72-58)2 (72-65)2 (72-63)2 (72-74)2 (72-71)2 (72-68)2 = 16 = 100 = 324 = 1 = 36 = 49 = 81 = 4 72 +/- 8.4 bpm Now explain this data in words This says that the mean is 72 and that 68% of the entire population is predicted to have heart rates between 63.6 and 80.4, and that 95% of the population is between 55.2 and 88.8 Mean = 72 Add them up: 628

AP Lab Skills Guide 72 +/- 2.65bpm = 8.4/SQRT(10) = 2.65 bpm Now lets determine the SEM(SE): 72 +/- 2.65bpm = 8.4/SQRT(10) = 2.65 bpm Standard Error of the Mean (SEM or SE) Standard Deviation (S, SD, σ)

AP Lab Skills Guide What does the SEM tell you? Exactly what it says…it is the predicted error in the mean itself. It gives you a range over which the actual mean of the entire population is predicted to be. Again, remember that you only measured a small sampling…the mean you calculate is not likely the actual mean…what if you measured everyone? Ex) If the mean is 40 and the SEM is 4 then there is a 68% chance (CI) that the actual mean of the entire population is between 40+/-4 or between 36 and 44. There is a 95% chance (CI) that the mean is between 2SEMs in either direction of between 32 and 48. CI = confidence interval

AP Lab Skills Guide 72 +/- 2.65bpm = 8.4/SQRT(10) = 2.65 bpm Now lets determine the SEM(SE): 72 +/- 2.65bpm = 8.4/SQRT(10) = 2.65 bpm Now explain this data in words This says that the mean is 72 and that there is a 68% chance that the true mean of the population is between 69.35 and 74.65, and a 95% chance that it is between 66.7 and 77.3 bpm. Standard Error of the Mean (SEM or SE) Standard Deviation (S, SD, σ)

AP Lab Skills Guide MEAN SD SEM 72 7 2.65 Let’s sum up the data heart rate data… MEAN SD SEM 72 7 2.65 This tells us that the mean of the OBSERVED SAMPLE is 72 bpm. The descriptive stats tell us that if the entire population were to be measured then 68% is predicted to fall between 65 and 79, and 95% between 58 and 86. In addition, the true mean of the entire population has a 68% chance of being between 69.35 and 74.65, and a 95% chance that it is between 66.7 and 77.3 bpm.

AP Lab Skills Guide REVIEW Standard Deviation (SD, S, σ) describes the range of a particular variable that is predicted to include 68% of the population. Example – 70+/-7 bpm would imply that 68% of the total population would have heart rates between 63 and 77bpm. Standard Error of the Mean (SEM, SE) describes the range where the actual mean of the total population if predicted to be with 68% confidence. Example – 70+/-3 bpm would imply that there is a 68% chance that the actual mean of the total population is between 67 and 73bpm.

AP Lab Skills Guide Explain in words the data: The Shady leaf width has a mean of ~7.2 and there is a 68% chance that the true population mean is between 7.0 and 7.6 (looks like SEM is .3) Statistical significance Are these two groups (shady and sunny) significantly different statistically? Justify If the means are different and the error bars do not overlap then you would predict them to be significantly different.

AP Lab Skills Guide

AP Lab Skills Guide

AP Lab Skills Guide

AP Lab Skills Guide Scatterplots - Comparing two MEASURED VARIABLES - If a linear relationship is predicted, a linear regression can be performed (best fit line; Figure 3) - R2 (R-squared or coefficient of determination) Typically ranges from 0 to 1 Describes “goodness of fit” or how well the line drawn fits the points. R2 = 0 implies not relationship R2 = 1 implies prefect relationship (all points on line)

AP Lab Skills Guide Box-and-Whisker Plots (Boxplot) - Used with nonparametric data (data that is not assumed to follow a normal distribution). - Vertical lines indicate highest and lowest points in dataset - Top of box shows upper quartile and bottom shows lower quartile. 70 - Horizontal line represents the median Upper Quartile median Determining Lower (Q1) and Upper (Q3) Quartiles: Lower Quartile 32 (Q2) (Q1 - lower) (Q3 - upper) You are simply dividing the data into quarters by medians…the upper quartile is the median of the upper half of the data and vice versa…

AP Lab Skills Guide Box-and-Whisker Plots (Boxplot) Determine the upper and lower quartile of the sycamore and beech leaf data: Sycamore: 33 35 40 44 48 52 63 Lower Quartile(Q1) Equals 37.5 Median (Q2) Equals 42 Upper Quartile (Q3) Equals 50 Notice how the upper and lower quartile range give you a sense of the center of the data without the influence of outliers that might exist in nonparametric data!!

AP Lab Skills Guide Box-and-Whisker Plots (Boxplot) Determine the upper and lower quartile of the sycamore and beech leaf data: Beech: 11 15 19 21 26 32 34 Lower Quartile(Q1) Equals 37.5 Median (Q2) Equals 42 Upper Quartile (Q3) Equals 50 http://www.brainingcamp.com/resources/math/box-plots/questions.php

AP Lab Skills Guide Histograms Used to determine if a given set of measurements, like plant height from art. sel. lab, approximates a normal distribution (parametric) or if data is nonparametric. Histogram showing parametric data Histogram showing NONparametric data

AP Lab Skills Guide Warning

AP Lab Skills Guide

AP Lab Skills Guide

AP Lab Skills Guide