Describing Samples Based on Chapter 3 of Gotelli & Ellison (2004) and Chapter 4 of D. Heath (1995). An Introduction to Experimental Design and Statistics.

Slides:



Advertisements
Similar presentations
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 4. Measuring Averages.
Advertisements

Introduction to Summary Statistics
Lecture 2 Describing Data II ©. Summarizing and Describing Data Frequency distribution and the shape of the distribution Frequency distribution and the.
Chapter 7 Introduction to Sampling Distributions
Calculating & Reporting Healthcare Statistics
Chapter 6 Introduction to Sampling Distributions
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Descriptive Statistics
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Created by Tom Wegleitner, Centreville, Virginia Section 3-1.
Introduction to Educational Statistics
Measures of Dispersion
Central Tendency and Variability
QUIZ CHAPTER Seven Psy302 Quantitative Methods. 1. A distribution of all sample means or sample variances that could be obtained in samples of a given.
Standard error of estimate & Confidence interval.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Chapter 6: Sampling Distributions
The Data Analysis Plan. The Overall Data Analysis Plan Purpose: To tell a story. To construct a coherent narrative that explains findings, argues against.
6 - 1 Basic Univariate Statistics Chapter Basic Statistics A statistic is a number, computed from sample data, such as a mean or variance. The.
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 2 NUMERICAL DATA REPRESENTATION.
Measurement Tools for Science Observation Hypothesis generation Hypothesis testing.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Statistics Class 4 February 11th , 2012.
AP Statistics Chapter 9 Notes.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
1.1 - Populations, Samples and Processes Pictorial and Tabular Methods in Descriptive Statistics Measures of Location Measures of Variability.
Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Descriptive Statistics, The Normal Distribution, and Standardization.
Chapter 6: Random Errors in Chemical Analysis CHE 321: Quantitative Chemical Analysis Dr. Jerome Williams, Ph.D. Saint Leo University.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
Distributions of the Sample Mean
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
Measures of Central Tendency: The Mean, Median, and Mode
Introduction to Statistics Santosh Kumar Director (iCISA)
Chapter SixteenChapter Sixteen. Figure 16.1 Relationship of Frequency Distribution, Hypothesis Testing and Cross-Tabulation to the Previous Chapters and.
CHAPTER Basic Definitions and Properties  P opulation Characteristics = “Parameters”  S ample Characteristics = “Statistics”  R andom Variables.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Introduction to Inference Sampling Distributions.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
CHAPTER 2: Basic Summary Statistics
From the population to the sample The sampling distribution FETP India.
Averages and Variability
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Sampling Distributions Chapter 9 First, a word from our textbook A statistic is a numerical value computed from a sample. EX. Mean, median, mode, etc.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Confidence Intervals. Point Estimate u A specific numerical value estimate of a parameter. u The best point estimate for the population mean is the sample.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
Chapter 6: Sampling Distributions
Chapter 7 Review.
Measures of Central Tendency
Chapter 6: Sampling Distributions
Central Tendency and Variability
Descriptive Statistics
Central Tendency.
Measures of Location Statistics of location Statistics of dispersion
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Numerical Descriptive Measures
CHAPTER 15 SUMMARY Chapter Specifics
Chapter Three Numerically Summarizing Data
CHAPTER 2: Basic Summary Statistics
Numerical Descriptive Measures
Presentation transcript:

Describing Samples Based on Chapter 3 of Gotelli & Ellison (2004) and Chapter 4 of D. Heath (1995). An Introduction to Experimental Design and Statistics for Biology. CRC Press.

The basic output of any scientific investigation is a collection of observations or data. (Ex. If Y is a random variable, then we use Y i to denote the ith observation in our sample.) Often, we will use our sample data to estimate unknown population parameters (Ex. We can use the sample mean,, to estimate the population mean, μ) The construction of frequency distributions is usually the first step in summarizing data

Small, short-lived perennial herb Narrowly endemic and endangered Flowers are small and bisexual Hypericum cumulicola:

Histogram of plant height (1995)

Measures of location It is useful to identify a “typical value” to summarize our observations (i.e., an “average”) Examples include: 1.Mean 2.Median 3.Mode

The Arithmetic Mean The arithmetic mean (or simply the mean) of a list of numbers is the sum of all the observations (Y i ) in the list divided by the number of the observations (n):

The Arithmetic Mean Remember the formula for the expected value of a discrete random variable? Since we assume, for our sample, that the Y i are the values of a random variable and that p i = 1/n for all Y i, we get:

The Arithmetic Mean The arithmetic mean of the observations in our sample ( ) is an unbiased estimator of the population mean (μ) if 3 conditions are met: 1.Observation are made on randomly selected individuals 2.Observations in the sample are independent 3.Observations are drawn from a larger population that is distributed as a normal random variable

The Law of Large Numbers As the sample size n increases, the arithmetic mean of Y i approaches the expected value of Y

The Median The value of a set of ordered observations that has an equal number of observations above and below it.

The Median Estimation: –For an odd number of observations, the median is the middle observation of the set. –Ex. Median of {1, 2, 3, 4, 5} = 3 –For an even number of observations, the median is the average of the two middle observations of the set. –Ex. Median of {1, 2, 3, 4, 5, 6} = (3+4)/2 = 3.5

The sample mean and the median height of Hypericum cumulicola (ADULTS ONLY) The normal distribution with the observed sample mean and variance

The Mode The value of the observations that occurs most frequently in the sample. This will be the peak of the frequency distribution in a histogram

The distribution of height of Hypericum cumulicola is bimodal. Could you suggest why?

Plotting seedlings and adults separately

Final Comments on Measures of Location When the underlying distribution is symmetrical (or nearly so), the mean, median, and mode are all similar in value, BUT… …when there are extreme observations, the median or mode may better describe the location of the data

Measures of variability It is never sufficient to just state the mean or other measure of location of our data! Because there is variability in nature, variability due to our sampling, etc., we also need to estimate the spread of our observations around the average value Examples include: The range, the variance, and the standard deviation

The sample variance An individual value is called a deviation from the mean. The sum of the squared deviations is called the sum of squares (SS). We divide SS by one less than the sample size to get the sample variance (s 2 ), which is an unbiased estimator of the population variance (σ 2 ).

The sample standard deviation The units in which the variance is expressed are (original units) 2, which is conceptually awkward. To get around this, the sample variance is converted to the sample standard deviation (s), by simple taking the square root:

15.87 % % Mean + One standard deviation

The Standard Error of the Mean Remember the Central Limit Theorem: if the Y i are independent random observations and the sample size is “reasonably large”, the sample mean ( ) is approximately normally distributed with mean E[Y] and variance σ 2 (Y)/n Thus, we can calculate the standard error of the mean as follows: