BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise.

Slides:



Advertisements
Similar presentations
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Advertisements

Bivariate Analysis Cross-tabulation and chi-square.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
Chapter Goals After completing this chapter, you should be able to:
Independent t-Test CJ 526 Statistical Analysis in Criminal Justice.
BHS Methods in Behavioral Sciences I
Chi-square Test of Independence
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 10 Notes Class notes for ISE 201 San Jose State University.
T-Tests Lecture: Nov. 6, 2002.
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.
S519: Evaluation of Information Systems
 What is t test  Types of t test  TTEST function  T-test ToolPak 2.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Independent t-tests.  Use when:  You are examining differences between groups  Each participant is tested once  Comparing two groups only.
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
Chapter 8 BOT3015L Data analysis and interpretation Presentation created by Jean Burns All photos from Raven et al. Biology of Plants except when otherwise.
Statistics for the Social Sciences Psychology 340 Fall 2013 Thursday, November 21 Review for Exam #4.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
AM Recitation 2/10/11.
Statistical Analysis I have all this data. Now what does it mean?
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Chapter 13 – 1 Chapter 12: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Errors Testing the difference between two.
Statistical Analysis Statistical Analysis
Single-Sample T-Test Quantitative Methods in HPELS 440:210.
Choosing and using statistics to test ecological hypotheses
Comparing Means: t-tests Wednesday 22 February 2012/ Thursday 23 February 2012.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Chapter 9: Testing Hypotheses
Statistical Analysis Mean, Standard deviation, Standard deviation of the sample means, t-test.
Statistical Analysis I have all this data. Now what does it mean?
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Independent t-Test CJ 526 Statistical Analysis in Criminal Justice.
KNR 445 Statistics t-tests Slide 1 Introduction to Hypothesis Testing The z-test.
Statistical Analysis Topic – Math skills requirements.
Statistics in IB Biology Error bars, standard deviation, t-test and more.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
Introduction to Basic Statistical Tools for Research OCED 5443 Interpreting Research in OCED Dr. Ausburn OCED 5443 Interpreting Research in OCED Dr. Ausburn.
Chapter Eight: Using Statistics to Answer Questions.
CHAPTER 27: One-Way Analysis of Variance: Comparing Several Means
12/23/2015Slide 1 The chi-square test of independence is one of the most frequently used hypothesis tests in the social sciences because it can be used.
State the ‘null hypothesis’ State the ‘alternative hypothesis’ State either one-tailed or two-tailed test State the chosen statistical test with reasons.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
© Copyright McGraw-Hill 2004
DateGroup Project TaskDetails Feb 8, 9Article Analysis Due at start of class. See pgs Feb 15, 16In-class work day Discuss scientific writing,
Chapter 10 Section 5 Chi-squared Test for a Variance or Standard Deviation.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 4 Investigating the Difference in Scores.
CHAPTER 7: TESTING HYPOTHESES Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
S519: Evaluation of Information Systems Social Statistics Inferential Statistics Chapter 9: t test.
Guidelines for building a bar graph in Excel and using it in a laboratory report IB Biology (December 2012)
When the means of two groups are to be compared (where each group consists of subjects that are not related) then the excel two-sample t-test procedure.
Introduction to Marketing Research
Section 6-4 – Confidence Intervals for the Population Variance and Standard Deviation Estimating Population Parameters.
Statistical Analysis Determining the Significance of Data
Chapter 11 Hypothesis Tests and Estimation for Population Variances
Chapter 10 Analyzing the Association Between Categorical Variables
Statistical Analysis Chi-Square.
Reasoning in Psychology Using Statistics
Reasoning in Psychology Using Statistics
Chapter Nine: Using Statistics to Answer Questions
Quadrat sampling & the Chi-squared test
Quadrat sampling & the Chi-squared test
Presentation transcript:

BOT3015L Data analysis and interpretation Presentation created by Jean Burns and Sarah Tso All photos from Raven et al. Biology of Plants except when otherwise noted

Today Types of data Discrete, Continuous Independent, dependent Types of statistics Descriptive, Inferential Creating graphs in excel Doing a t-test or Chi Square Lab: create graphs and do statistics for the gas exchange experiment

Today Types of data Discrete, Continuous Independent, dependent Types of statistics Descriptive, Inferential Creating graphs in excel Doing a t-test Lab: create graphs and do statistics for the gas exchange experiment

Types of data 1. Discrete: Having categories (i.e. flowers present/flowers absent, large/medium/small)

Seed heteromorphism: a discrete character. Not hetermorphic Hetermorphic

Types of data 1. Discrete: Having categories (i.e. flowers present/flowers absent, large/medium/small) 2. Continuous: Having infinite possible values (i.e. age, growth rate)

Seed size: a continuous character Commelina benghalensis seed size variation

Types of data 1.Independent: Manipulated or selected with the hypothesis that it is causally linked to the dependent variable. Cause. 2.Dependent: Measured as a response to the independent variable. Effect.

Independent and dependent variables Independent: Treatment (CO2 concentration) Dependent: Number of open and closed stomata, or stomatal aperture Assumption: Changes in CO2 concentration will affect stomatal aperture.

Today Types of data Discrete, Continuous Independent, dependent Types of statistics Descriptive, Inferential Creating graphs in excel Doing a t-test Lab: create graphs and do statistics for the gas exchange experiment

Types of statistics 1. Descriptive: Summarize a set of data. 2. Inferential: Draw conclusions from a data set.

Types of statistics 1. Descriptive: Summarize a set of data. 2. Inferential: Draw conclusions from a data set.

Mean: a type of descriptive statistic Arithmetic mean

Mean: a type of descriptive statistic Measure of the central tendency of a data set. Frequency Value Mean = 2.9

Standard deviation: a type of descriptive statistic Standard deviation

Standard deviation: a type of descriptive statistic. Measure of spread of variability in a data set. Frequency Value Standard deviation = 0.25

Standard deviation: a type of descriptive statistic. Measure of spread of variability in a data set. Frequency Value Standard deviation = 0.58Standard deviation = 0.41 Value

Types of statistics 1. Descriptive: Summarize a set of data. 2. Inferential: Draw conclusions from a data set.

Pearson’s  2 : a type of inferential statistic Used on discrete response variable, when you have discrete treatments (independent variables). Example: The number of open and closed stomata in response to lower CO 2 concentration.

t-test: a type of inferential statistic Used on continuous response variable, when you have discrete treatments (independent variables). Example: Stomatal aperture response to lower CO 2 concentration.

Regression: a type of inferential statistic Used on continuous response variable, when you have continuous treatments (independent variables). Example: Stomatal aperture response to varied CO 2 concentration (when the CO 2 concentration is actually measured). *Talk to your TA if you want to know how to do this

Observation: both internal and external factors affect stomatal aperture Question: What is the effect of CO 2 concentration on stomatal aperture or the number of open and closed stomata?

Experimental Design Question: What is the effect of reducing CO 2 concentration on the number of open stomata? Treatment: Reduce CO 2 concentration using sodium hydroxide: CO 2 + NaOH => NaHCO 3 (sodium bicarbonate) Control: Ambient atmospheric CO 2 concentration Data: Count the number of open and closed stomata (are these data discrete or continuous?)

Hypothesis testing for discrete data Pearson’s Chi Square (  2 ): a test of association between to categorical variables. H o : Both treatments yield an equal number of open and closed stomata. H A1 : NaOH treatment results in fewer open stomata than the control. H A2 : NaOH treatment results in more open stomata than the control.

Step 1: Make a contingency table # open stomata # closed stomata NaOH515 Ambient CO This is a 2 x 2 contingency table, having two columns and two rows, but it can have other dimensions.

Step 2: Make a contingency table # open stomata # closed stomata Row Totals NaOH51520 Ambient CO Column Totals20 N = 40 Add the row and column totals and the grand total, N.

Step 3: Calculate expected values based on null hypothesis # open stomata # closed stomata Row Totals NaOH5 (10)15 (10)20 Ambient CO 2 15 (10)5 (10)20 Column Totals20 N = 40 H o : Both treatments yield an equal number of open and closed stomata. For each cell, the expected value is: Row total x column total/ N.

Step 4: Calculate the  2 and degrees of freedom  2 =  {(observed - expected) 2 / expected} d.f. = (# of columns - 1) x (# of rows - 1) # open stomata # closed stomata Row Totals NaOH5 (10)15 (10)20 Ambient CO 2 15 (10)5 (10)20 Column Totals20 N = 40  2 = (5 - 10) 2 / 10 + ( ) 2 /10 + ( ) 2 /10 + (5 - 10) 2 / 10 = 10 d.f. = (2 - 1) x (2 - 1) = 1

Step 4: Compare calculated  2 with the critical value from a Chi Square distribution table The critical value can be obtained from a table based on the degrees of freedom and the level of confidence, which is set at P =  2 calc = 10  2 crit = 3.84, d.f. = 1 If the calculated value exceeds the critical value, you can reject your H o

Hypothesis testing for continuous data H o : Both treatments yield the same stomatal aperture. H A1 : NaOH treatment results in narrower stomatal aperture. H A2 : NaOH treatment results in larger stomatal aperture.

Hypothesis testing for continuous data H o : Both treatments yield the same stomatal aperture. H A1 : Water treatment results in larger stomatal aperture. H A2 : NaOH treatment results in larger stomatal aperture. A t-test will distinguish between H o and H A, then you must look at the direction of the difference to interpret the results.

We will use a t-test for this example:

Question: is there a difference in the means between two treatments? Large overlap = not different.

Question: is there a difference in the means between two treatments? Large overlap = not different. small large t < ~2

Question: is there a difference in the means between two treatments? Large overlap = not different.

Question: is there a difference in the means between two treatments? Little overlap = different. larger large t > ~2

Question: is there a difference in the means between two treatments? Little overlap = different.

Question: is there a difference in the means between two treatments? Little overlap = different. large small t > ~2

What if the answer is not so obvious? This is why we need statistics.

Degrees of freedom DF = n 1 + n DF = number of independent categories in a statistical test. For example, in a t-test, we are estimating 2 parameters the mean and the variance. Thus we subtract 2 from the degrees of freedom, because 2 elements are no longer independent. DF is a measure of a test’s power. Larger sample sizes (and DF) result in more power to detect differences between the means.

t-value distribution t-value frequency 1. Get t crit from a table of t-values, for P = 0.05 and the correct DF. 2. If t observed > t crit, then the test is significant. 3. If P < 0.05, the means are different.

Factors influencing a difference between means Distance between means Variance in each sample (Standard Deviation, SD) T-value (means and SD) Number of samples (DF) Level of error we are willing to accept to consider two means different (P-value).

Today Types of data Discrete, Continuous Independent, dependent Types of statistics Descriptive, Inferential Creating graphs in excel Doing a t-test Lab: create graphs and do statistics for the gas exchange experiment

Creating graphs in excel 1.Open excel (Start/Applications/Microsoft Excel) 2.Enter the data in table format

Creating graphs in excel 1.Open excel (Start/Applications/Microsoft Excel) 2.Enter the data in table format 3.In the cells directly under treatment data:

Creating graphs in excel 1.Open excel (Start/Applications/Microsoft Excel) 2.Enter the data in table format 3.Calculate the mean and standard deviation Mean: enter formula =average(cells to calculate the mean from) Example: =AVERAGE(A2:A11)

Creating graphs in excel 1.Open excel (Start/Applications/Microsoft Excel) 2.Enter the data in table format 3.Calculate the mean and standard deviation Standard deviation: enter formula =stdev(cells to calculate the mean from) Example: =STDEV(A2:A11)

Creating graphs in excel 1.Open excel (Start/Applications/Microsoft Excel) 2.Enter the data in table format 3.Calculate the mean and standard deviation 4.Select the data you wish to graph Select these cells

Creating graphs in excel 1.Open excel (Start/Applications/Microsoft Excel) 2.Enter the data in table format 3.Calculate the mean and standard deviation 4.Select the data you wish to graph 5.Click the chart button or “Insert” “Chart…” Chart Button

Creating graphs in excel 1.Open excel (Start/Applications/Microsoft Excel) 2.Enter the data in table format 3.Calculate the mean and standard deviation 4.Select the data you wish to graph 5.Click the chart button 6.Chose your chart options: Column (next) Series/Category x-axis labels/highlight treatment labels (next) Titles/label axes including Units (next) Finish

Now your chart should look like this:

Creating graphs in excel 1.Open excel (Start/Applications/Microsoft Excel) 2.Enter the data in table format 3.Calculate the mean and standard deviation 4.Select the data you wish to graph 5.Click the chart button 6.Chose your chart options 7.Add error bars to your chart: Double click on the bar Y-error bars (at the top) Go to Custom Select the cells with the standard deviation *Note: you should only have error bars if the data are continuous.

Now your chart should look like this:

Today Types of data Discrete, Continuous Independent, dependent Types of statistics Descriptive, Inferential Creating graphs in excel Doing a t-test Lab: create graphs and do statistics for the gas exchange experiment

Performing a t-test In this course, we will demonstrate the use of Excel for statistics; however, more advanced software, designed specifically for statistical analyses, offer more detailed analyses. Use the software of your choice, being sure to indicate the software that is used.

t-test with Excel In excel: 1.In an empty cell, “Insert” a “Function” 2.Find “T-TEST” 3.“Array 1” is one set of values. Include each value (e.g. each aperture size under one condition) 4.“Array 2 is the other set of values (e.g. each aperture size under the other condition. 5.We will be performing a two-tailed distribution t-test. Enter “2” in “tails.” 6. We are assuming there is equal variance for the two samples, so enter “2” in “type.” 7.“OK” will return the probability (p) value. This is the probability that the difference between the sets of values is random.

Reminders Report submissions (paper and turnitin) refer to “organization of a lab report” in the beginning of your lab manual. Titles must be descriptive Methods must be complete Results should include descriptions (in your own words) not just graphs and tables (although those are also necessary). Discussion must demonstrate thought Submit copies of your references with your reports