STATISTICS!!! The science of data

Presentation transcript:

1 STATISTICS!!! The science of data

2 What is data? Information, in the form of facts or figures obtained from experiments or surveys, used as a basis for making calculations or drawing conclusions. (Encarta dictionary)

3 Statistics in Science Data can be collected about a population (surveys). Data can be collected about a process (experimentation).

4 2 types of Data: Qualitative, Quantitative

5 Qualitative Data Information that relates to characteristics or description (observable qualities). Information is often grouped by descriptive category. Examples: species of plant, type of insect, shades of color, rank of flavor in taste testing. Remember: qualitative data can be “scored” and evaluated numerically.

6 Qualitative data, manipulated numerically Survey results, teens and need for environmental action

7 Quantitative data Quantitative: measured using a naturally occurring numerical scale. Examples: chemical concentration, temperature, length, weight, etc.

8 Quantitation Measurements are often displayed graphically

9 Quantitation = Measurement In data collection for Biology, data must be measured carefully, using laboratory equipment (ex. timers, metersticks, pH meters, balances, pipettes, etc.). The limits of the equipment used add some uncertainty to the data collected. All equipment has a certain magnitude of uncertainty. For example, is a ruler that is mass-produced a good measure of 1 cm? 1 mm? 0.1 mm? For quantitative testing, you must indicate the level of uncertainty of the tool that you are using for measurement!!

10 How to determine uncertainty? Usually the instrument manufacturer will indicate this; read what is provided by the manufacturer. Be sure that the number of significant digits in the data table/graph reflects the precision of the instrument used (for example, if the manufacturer states that the accuracy of a balance is to 0.1 g and your average mass is 2.06 g, be sure to round the average to 2.1 g). Your data must be consistent with your measurement tool regarding significant figures.
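
A minimal Python sketch of the rounding rule on this slide, assuming the balance example above (0.1 g precision, 2.06 g average). The function name `round_to_precision` is made up for illustration; values are passed as strings so that floating-point error cannot affect the rounding.

```python
from decimal import Decimal, ROUND_HALF_UP


def round_to_precision(value: str, precision: str) -> Decimal:
    """Round a measurement to the instrument's stated precision.

    Both arguments are decimal strings (e.g. "2.06" and "0.1").
    """
    return Decimal(value).quantize(Decimal(precision), rounding=ROUND_HALF_UP)


# Balance accurate to 0.1 g, average mass of 2.06 g (the slide's example).
average_mass = round_to_precision("2.06", "0.1")
print(average_mass)  # 2.1
```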

11 Finding the limits As a “rule-of-thumb”, if not specified, use +/- 1/2 of the smallest measurement unit (ex. a metric ruler is lined to 1 mm, so the limit of uncertainty of the ruler is +/- 0.5 mm). If the room temperature is read as 25 degrees C, with a thermometer that is scored at 1 degree intervals, what is the range of possible temperatures for the room? (Answer: +/- 0.5 degrees Celsius. If you read 25 °C, it may in fact be anywhere from 24.5 to 25.5 degrees.)
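
The rule of thumb can be sketched in a few lines of Python. This is only an illustration; `reading_range` is a hypothetical helper name, and it assumes the uncertainty is exactly half of the smallest scale division.

```python
def reading_range(reading: float, smallest_division: float) -> tuple[float, float]:
    """Return the (low, high) range implied by +/- half the smallest division."""
    half = smallest_division / 2
    return (reading - half, reading + half)


# Thermometer scored at 1 degree intervals, reading 25 degrees C.
low, high = reading_range(25.0, 1.0)
print(low, high)  # 24.5 25.5
```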

12 Looking at Data How accurate is the data? (How close are the data to the “real” results?) A consistent departure from the real result is also called BIAS. How precise is the data? (All test systems have some uncertainty, due to limits of measurement.) Estimation of the limits of the experimental uncertainty is essential.

13

14

15 Comparing Averages Once the average is calculated for each of the 2 sets of data, the average values can be plotted together on a graph, to visualize the relationship between the 2.

16

17

18 Drawing error bars The simplest way to draw an error bar is to use the mean as the central point, and to use the distance of the measurement that is farthest from the average to set the endpoints of the error bar.
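
As a rough sketch of this "farthest measurement" method, the Python below finds the mean and the largest distance from it. The leaf lengths are made-up numbers and `max_deviation_error_bar` is an illustrative name, not something from the slides.

```python
from statistics import mean


def max_deviation_error_bar(values):
    """Return (center, half_width): the mean and the largest distance from it."""
    center = mean(values)
    half_width = max(abs(v - center) for v in values)
    return center, half_width


# Made-up leaf lengths in cm, for illustration only.
lengths_cm = [4.0, 5.0, 6.0, 7.0, 8.0]
center, half_width = max_deviation_error_bar(lengths_cm)
print(center, half_width)  # 6.0 2.0
```

The error bar would then run from `center - half_width` to `center + half_width`.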

19 (Figure labels: average value; value farthest from average; calculated distance)

20 What do error bars suggest? If the bars show extensive overlap, it is likely that there is not a significant difference between those values.
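
A small Python sketch of an overlap check, assuming each bar is described by a mean and a half-width. The name `bars_overlap` and the numbers are hypothetical. Note that this only detects whether the bars overlap at all; judging "extensive" overlap is still a visual call, and overlap is a rough guide rather than a formal significance test.

```python
def bars_overlap(mean_a, half_a, mean_b, half_b):
    """Return True if the two error-bar intervals share any values."""
    low_a, high_a = mean_a - half_a, mean_a + half_a
    low_b, high_b = mean_b - half_b, mean_b + half_b
    return low_a <= high_b and low_b <= high_a


# Made-up means and half-widths, for illustration only.
print(bars_overlap(6.0, 2.0, 7.0, 1.5))  # True  (4.0 to 8.0 vs 5.5 to 8.5)
print(bars_overlap(6.0, 0.5, 9.0, 1.0))  # False (5.5 to 6.5 vs 8.0 to 10.0)
```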

21

22 Quick Review: 3 measures of “Central Tendency” Mode: the value that appears most frequently. Median: when all data are listed from least to greatest, the value at which half of the observations are greater, and half are lesser. Mean: the most commonly used measure of central tendency, also called the arithmetic average (sum of data points divided by the number of points).
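
All three measures are available in Python's standard `statistics` module. The sketch below uses made-up leaf lengths, chosen so that the mean differs from the median and mode.

```python
from statistics import mean, median, mode

# Made-up leaf lengths in cm, for illustration only.
leaf_lengths_cm = [2.0, 3.0, 3.0, 5.0, 7.0]

print(mean(leaf_lengths_cm))    # 4.0  (20.0 / 5)
print(median(leaf_lengths_cm))  # 3.0  (middle value of the sorted list)
print(mode(leaf_lengths_cm))    # 3.0  (appears twice)
```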

23 How can leaf lengths be displayed graphically?

24 Simply measure the lengths of each and plot how many are of each length

25 If smoothed, the histogram data assumes this shape

26 This Shape? It is a classic bell-shaped curve, AKA a Gaussian distribution curve, AKA a Normal Distribution curve. Essentially it means that in many studies with an adequate number of datapoints (>30), most results tend to be near the mean. Fewer results are found farther from the mean.

27 Standard Deviation The standard deviation is a statistic that tells you how tightly all the various examples are clustered around the mean in a set of data.

28 Standard deviation The STANDARD DEVIATION is a more sophisticated indicator of the precision of a set of measurements. The standard deviation is like an average deviation of measurement values from the mean. In large studies, the standard deviation is used to draw error bars, instead of the maximum deviation.

29 A typical normal distribution curve

30 According to this curve: One standard deviation away from the mean in either direction on the horizontal axis (the red area on the preceding graph) accounts for somewhere around 68 percent of the data in this group. Two standard deviations away from the mean (the red and green areas) account for roughly 95 percent of the data.

31 Three Standard Deviations? Three standard deviations (the red, green and blue areas) account for about 99.7 percent of the data. (Axis labels: -3sd, -2sd, +/-1sd, +2sd, +3sd)
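
These percentages can be checked with `NormalDist` from Python's standard library (Python 3.8 or later). This is a sketch that assumes an ideal normal distribution; real datasets will only approximate these figures. `fraction_within` is an illustrative helper name.

```python
from statistics import NormalDist

# An ideal normal curve with mean 0 and standard deviation 1.
standard_normal = NormalDist(mu=0.0, sigma=1.0)


def fraction_within(k: float) -> float:
    """Fraction of the distribution within k standard deviations of the mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)


for k in (1, 2, 3):
    print(k, round(fraction_within(k) * 100, 1))
# 1 68.3
# 2 95.4
# 3 99.7
```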

32 How is Standard Deviation calculated? With this formula!
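
The formula image did not survive in this transcript. Assuming the slide showed the sample ("n-1") form, which is the method named on slide 34, it would read as follows, where x_i is each measurement, x-bar is the mean, and n is the number of measurements:

```latex
s = \sqrt{\frac{\sum_{i=1}^{n} \left( x_i - \bar{x} \right)^2}{n - 1}}
```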

33 AGHHH! MRS R- DO I NEED TO KNOW THIS FOR THE TEST?????

34 Not the formula! This can be calculated on a scientific calculator OR…. In Microsoft Excel, type the following code into the cell where you want the Standard Deviation result, using the "unbiased," or "n-1" method: =STDEV(A1:A30) (substitute the cell name of the first value in your dataset for A1, and the cell name of the last value for A30.)
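
For anyone working outside Excel, here is a hedged Python sketch of the same "n-1" calculation. The function `sample_sd` is written out by hand for illustration, and the masses are made-up values; `statistics.stdev` from the standard library uses the same n-1 method.

```python
from math import sqrt
from statistics import mean, stdev


def sample_sd(values):
    """Sample standard deviation using the "n-1" method."""
    m = mean(values)
    squared_deviations = sum((v - m) ** 2 for v in values)
    return sqrt(squared_deviations / (len(values) - 1))


# Made-up masses in grams, for illustration only.
masses_g = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

print(round(sample_sd(masses_g), 3))  # 2.138
print(round(stdev(masses_g), 3))      # 2.138 (library version, same method)
```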

35 You DO need to know the concept! Standard deviation is a statistic that tells how tightly all the various datapoints are clustered around the mean in a set of data. When the datapoints are tightly bunched together and the bell-shaped curve is steep, the standard deviation is small (precise results, smaller sd). When the datapoints are spread apart and the bell curve is relatively flat, a large standard deviation value suggests less precise results.
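
To see the concept in numbers, the sketch below compares two made-up datasets that share the same middle value (10) but differ in how tightly they cluster around it. The dataset names are illustrative only.

```python
from statistics import stdev

# Two made-up datasets centered on 10, for illustration only.
tight = [9.8, 9.9, 10.0, 10.1, 10.2]    # tightly bunched around 10
spread = [6.0, 8.0, 10.0, 12.0, 14.0]   # spread far from 10

sd_tight = stdev(tight)
sd_spread = stdev(spread)

# The tightly bunched dataset has the smaller standard deviation.
print(sd_tight < sd_spread)  # True
```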

36 THE END For today……….