Understanding and Presenting Your Data OR What to Do with All Those Numbers You’re Recording.

Slides:



Advertisements
Similar presentations
Sampling Distributions
Advertisements

Confidence Intervals for Proportions
1 The Basics of Regression Regression is a statistical technique that can ultimately be used for forecasting.
1 Hypothesis Testing In this section I want to review a few things and then introduce hypothesis testing.
Lect 10b1 Histogram – (Frequency distribution) Used for continuous measures Statistical Analysis of Data ______________ statistics – summarize data.
Sampling Distributions
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
AP Biology Intro to Statistic
Standard Error for AP Biology
Chapter 19: Confidence Intervals for Proportions
Elec471 Embedded Computer Systems Chapter 4, Probability and Statistics By Prof. Tim Johnson, PE Wentworth Institute of Technology Boston, MA Theory and.
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
Sociology 5811: Lecture 7: Samples, Populations, The Sampling Distribution Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
The Normal Distribution The “Bell Curve” The “Normal Curve”
A Sampling Distribution
An Introduction to Making Inferences. descriptive statistics – summarize important characteristics of known population data inferential statistics – we.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Statistical Analysis Mean, Standard deviation, Standard deviation of the sample means, t-test.
Probability and Samples
Beak of the Finch Natural Selection Statistical Analysis.
By C. Kohn Waterford Agricultural Sciences.   A major concern in science is proving that what we have observed would occur again if we repeated the.
Scientific Inquiry & Skills
MATH IN THE FORM OF STATISTICS IS VERY COMMON IN AP BIOLOGY YOU WILL NEED TO BE ABLE TO CALCULATE USING THE FORMULA OR INTERPRET THE MEANING OF THE RESULTS.
Inference We want to know how often students in a medium-size college go to the mall in a given year. We interview an SRS of n = 10. If we interviewed.
QUANTITATIVE RESEARCH AND BASIC STATISTICS. TODAYS AGENDA Progress, challenges and support needed Response to TAP Check-in, Warm-up responses and TAP.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Chapter 7 Probability and Samples: The Distribution of Sample Means
Statistics in Biology. Histogram Shows continuous data – Data within a particular range.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
6.1 Inference for a Single Proportion  Statistical confidence  Confidence intervals  How confidence intervals behave.
Descriptive & Inferential Statistics Adopted from ;Merryellen Towey Schulz, Ph.D. College of Saint Mary EDU 496.
Chapter 10: Introduction to Statistical Inference.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Copyright © 2009 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
PCB 3043L - General Ecology Data Analysis. PCB 3043L - General Ecology Data Analysis.
PCB 3043L - General Ecology Data Analysis.
Week 6. Statistics etc. GRS LX 865 Topics in Linguistics.
Outline of Today’s Discussion 1.The Distribution of Means (DOM) 2.Hypothesis Testing With The DOM 3.Estimation & Confidence Intervals 4.Confidence Intervals.
MAKING MEANING OUT OF DATA Statistics for IB-SL Biology.
INFERENTIAL STATISTICS DOING STATS WITH CONFIDENCE.
Scientific Methodology: The Hypothetico-Deductive Approach, the Test of Hypothesis, and Null Hypotheses BIOL January 2016.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SPSS Problem and slides Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of times would result in.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
King Faisal University جامعة الملك فيصل Deanship of E-Learning and Distance Education عمادة التعلم الإلكتروني والتعليم عن بعد [ ] 1 جامعة الملك فيصل عمادة.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
The Data Collection and Statistical Analysis in IB Biology John Gasparini The Munich International School Part II – Basic Stats, Standard Deviation and.
PCB 3043L - General Ecology Data Analysis Organizing an ecological study What is the aim of the study? What is the main question being asked? What are.
SPSS Homework Practice The Neuroticism Measure = S = 6.24 n = 54 How many people likely have a neuroticism score between 29 and 34?
Chapter 16: Sample Size “See what kind of love the Father has given to us, that we should be called children of God; and so we are. The reason why the.
AP Biology Intro to Statistics
Standard Error for AP Biology
PCB 3043L - General Ecology Data Analysis.
SAMPLING DISTRIBUTION. Probability and Samples Sampling Distributions Central Limit Theorem Standard Error Probability of Sample Means.
Standard Error for AP Biology
Inferential statistics,
AP Biology Intro to Statistics
Statistics for IB-SL Biology
AP Biology Intro to Statistic
AP Biology Intro to Statistic
Using Statistics in Biology
AP Biology Intro to Statistic
Using Statistics in Biology
Standard Error for AP Biology
Reasoning in Psychology Using Statistics
Sampling Distributions
Presentation transcript:

Understanding and Presenting Your Data OR What to Do with All Those Numbers You’re Recording

 The primary question underlying many biology experiments is whether, on average, one condition (treatment) has a greater effect on a certain variable than another condition  This type of question is answered by comparing the mean (average) response of a group of organisms under two or more treatments  Does your experiment involve this kind of question? Comparison of Means

 If we could measure the variable of interest in every animal from our chosen population (for example, a species exposed to a certain treatment) we could calculate the variable’s “true” mean (average of all individuals) in that population  If we did this for each population we wanted to compare, we could tell for sure whether these populations differed by seeing if their “true” means differed for the variable in question Comparison of Means

 As you might guess, it would be impossible to measure each individual of each population involved in our hypothesis!  We solve this problem by measuring a very small, but representative, subset of each relevant population  This specially selected subset is called a sample  We use data from a sample to make inferences (predictions) about the population Comparison of Means

 Because a sample is so much smaller than the population from which it is taken, the values we calculate from the sample (for example, the sample mean) should be “taken with a grain of salt” when using them to predict population values  Statistics is a set of calculations and rules that tell us how probable it is that our sample-based predictions will hold true for the population  Now we’ll discuss an example where we use statistics to test a hypothesis about sea slugs

 Say that you have observed sea slugs for some time and noticed that these slow-moving animals signal their readiness to mate by performing a simple “head bob”  Your team wants to determine what factors have the greatest effect on how often this simple courtship display occurs in sea slugs Example - Sea Slugs

 Based on a journal article you’ve read, and some preliminary observations, your team predicts that: Example - Sea Slugs  Sea slugs living on a rocky substrate will show more head bobs/month than sea slugs living on a silty substrate.

 To test this hypothesis, you randomly select 5 sea slugs for each of two treatment tanks - one with a rocky substrate and one with a silty substrate  For 1 month, you record the number of head bobs that you see during observation periods for each sea slug and then calculate the number of head bobs/month for each slug Example - Sea Slugs

 Here are your raw data:  Each set (rocky or silty) of 5 values represents a specific sample of sea slugs from the two populations of sea slugs you are interested in (all sea slugs living on rocky substrates or all sea slugs living on silty substrates)  Remember, we use the sample data to make inferences about the population

 Raw data:  What would be your first steps in summarizing these data?

 The first step would be to calculate the mean (average) number of head bobs/month for each substrate treatment  The AVERAGE function in Excel will calculate this for you  The sample mean is a measure of the central tendency of a population ; i.e., where the center of the population of interest tends to be located for the variable in question

 The next step would be to calculate the standard deviation (SD) for each substrate treatment  The STDEV function in Excel will calculate this for you  The SD is a measure of the dispersion (spread) of your data; that is, a value that summarizes how far individual values are from the mean value

 Another important step is to calculate the standard error of the mean (SE) for each substrate treatment  The SE is a measure of how far the “true” (population) mean is likely to be from the calculated sample mean (remember the “grain of salt”)

 For small samples (such as ours), the range of values 2 standard errors (2*SE) on either side of the sample mean has about a 90% chance of containing the “true” (population) mean  Thus, for the rocky substrate sea slugs, the population mean has about a 90% chance of being between 2.0 and 8.8 head bobs/month 2.0 = ( *1.7) 8.8 = ( *1.7)

 Question for you: For the silty substrate sea slugs, what is the range within which the population mean has about a 90% chance of being located?  1.7 and 10.9 head bobs/month 1.7 = ( *2.3) 10.9 = ( *2.3)

 Once all these statistics (mean, SD, SE) have been calculated for your sample, the next step is to visually describe your data  This is done using a figure of the proper sort

 This column graph shows the value for each sea slug from each substrate tank  Can you tell on which substrate sea slugs show more head bobs per month?  What is the meaning of the sea slug # on the X-axis?

 What kind of graph would be a better way to visually summarize on which of the two substrates sea slugs do more head bobbing?

 This column graph shows the sample mean for each substrate group  Now can you tell on which substrate sea slugs show more head bobs per month?  Is the answer completely clear or could two reasonable people disagree?

 If you measured 5 other sea slugs in each of the two substrate tanks would the sample means be the same as in the first experiment?  What could you add to this graph to give a sense of how well these sample means predict the mean of the population from which they come?

 Now we’ve added error bars representing 1 SE on either side of the sample mean  Even though the means of these two samples differ, because the SE bars for the two groups overlap (the upper bar for rocky overlaps the lower bar for silty), we have no good evidence that the “true” means for the rocky and silty substrates actually differ

 In this case, we would have fairly certain evidence of a difference between groups -- that is, that the “true” means for the rocky and silty substrates differ  If our data looked like this instead, the SE bars of the 2 groups would not overlap by a substantial amount

Rules of thumb for using SE bars to judge significant diffs :  Two means will never be significantly different if:  their SE intervals overlap -- at all  the gap between the two SE intervals is < 1/3 the length of the shorter SE interval  When the gap between the two SE intervals is > 1/3 the length of the shorter SE interval, the two means may be significantly different (you will need to use a statistical test to know with more certainty)

 Thus, SE bars give us an accepted standard for judging how certain we are that two treatments produce different effects on the variable of interest  In other words, two reasonable people should now agree that substrate type does not produce a significant difference in the number of head bobs per month in sea slugs

Once you have collected your raw data:  calculate the mean, standard deviation (SD), and standard error of the mean (SE) for each treatment group sample  graph the mean values for each treatment group in a column graph, adding error bars above and below the mean equal to 1 SE  use the rules of thumb about SE interval overlap to determine how probable it is that any means you are comparing are actually different Recap

How to Get the PP Presentation  Website where PowerPoint file “Understanding and Presenting Your Data” can be downloaded: biology.nsf/pages/myersgb  Tutorial Written By:  Dr. Marcie J. Myers  College of St. Catherine  St. Paul, MN