From the population to the sample The sampling distribution FETP India.

Slides:



Advertisements
Similar presentations
Sampling Distributions Suppose I throw a dice times and count the number of times each face turns up: Each score has a similar frequency (uniform.
Advertisements

Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
Sampling Distributions (§ )
Introduction to Statistics
Chapter 10: Sampling and Sampling Distributions
Chapter 6 Introduction to Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Sampling Distributions
1 Basic statistics Week 10 Lecture 1. Thursday, May 20, 2004 ISYS3015 Analytic methods for IS professionals School of IT, University of Sydney 2 Meanings.
Chapter 7: Variation in repeated samples – Sampling distributions
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
Chapter 7 Probability and Samples: The Distribution of Sample Means
Central Tendency and Variability
Chapter 11: Random Sampling and Sampling Distributions
Chapter 5 DESCRIBING DATA WITH Z-SCORES AND THE NORMAL CURVE.
Probability and the Sampling Distribution Quantitative Methods in HPELS 440:210.
Chapter 6: Sampling Distributions
Quiz 2 Measures of central tendency Measures of variability.
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
Sociology 5811: Lecture 7: Samples, Populations, The Sampling Distribution Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Sampling Distributions
AP Statistics Chapter 9 Notes.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Introduction to Summary Statistics
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
Agresti/Franklin Statistics, 1e, 1 of 139  Section 6.4 How Likely Are the Possible Values of a Statistic? The Sampling Distribution.
Rule of sample proportions IF:1.There is a population proportion of interest 2.We have a random sample from the population 3.The sample is large enough.
KNR 445 Statistics t-tests Slide 1 Variability Measures of dispersion or spread 1.
Sampling Distributions Chapter 7. The Concept of a Sampling Distribution Repeated samples of the same size are selected from the same population. Repeated.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Chapter 7 Probability and Samples: The Distribution of Sample Means
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
The Central Limit Theorem and the Normal Distribution.
Chapter 7 Probability and Samples: The Distribution of Sample Means.
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
Distributions of the Sample Mean
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Biostatistics Unit 5 – Samples. Sampling distributions Sampling distributions are important in the understanding of statistical inference. Probability.
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
6.3 THE CENTRAL LIMIT THEOREM. DISTRIBUTION OF SAMPLE MEANS  A sampling distribution of sample means is a distribution using the means computed from.
What does Statistics Mean? Descriptive statistics –Number of people –Trends in employment –Data Inferential statistics –Make an inference about a population.
Basic Statistical Terms: Statistics: refers to the sample A means by which a set of data may be described and interpreted in a meaningful way. A method.
Chapter 10: Introduction to Statistical Inference.
Introduction to Basic Statistical Tools for Research OCED 5443 Interpreting Research in OCED Dr. Ausburn OCED 5443 Interpreting Research in OCED Dr. Ausburn.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Chapter 7: Sampling Distributions Section 7.1 How Likely Are the Possible Values of a Statistic? The Sampling Distribution.
Review of Statistical Terms Population Sample Parameter Statistic.
Describing Samples Based on Chapter 3 of Gotelli & Ellison (2004) and Chapter 4 of D. Heath (1995). An Introduction to Experimental Design and Statistics.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 5. Measuring Dispersion or Spread in a Distribution of Scores.
Chapter Eleven Sample Size Determination Chapter Eleven.
Chapter 6: Random Errors in Chemical Analysis. 6A The nature of random errors Random, or indeterminate, errors can never be totally eliminated and are.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
THE CENTRAL LIMIT THEOREM. Sampling Distribution of Sample Means Definition: A distribution obtained by using the means computed from random samples of.
Sampling and Sampling Distributions
Chapter 7 Review.
Introductory Statistics
Distribution of the Sample Means
Summary descriptive statistics: means and standard deviations:
Probability and the Sampling Distribution
Sampling Distribution
Sampling Distribution
How do we categorize and make sense of data?
Summary descriptive statistics: means and standard deviations:
Statistics PSY302 Review Quiz One Spring 2017
Sampling Distributions (§ )
IRCCS San Raffaele Pisana, Rome, Italy, 28 February - 2 March 2018
Presentation transcript:

From the population to the sample The sampling distribution FETP India

Competency to be gained from this lecture Use the properties of the sampling distribution to calculate standard error to the mean

Key issues Population parameters versus sample statistics Sampling distribution and its properties Mean and standard error of the sampling distribution

Things we already know Mean  Arithmetic sum of data divided by number of observations Standard deviation  Index of variability (spread) of data about the mean Z-score  Distance from mean in standard deviation units z = (x-mean)/sd Normal curve  Bell-shaped curve that relates probability to z-scores Parameters and statistics

Population parameters A population parameter is a numerical descriptive measure of a population Examples:  Population mean (µ)  Standard deviation (  ) Parameters and statistics

A statistic A statistic is a numerical descriptive measure of a sample Examples:  Sample mean x  Sample standard deviation s Parameters and statistics

Inference The parameter is fixed The sample statistics varies from sample to sample We try to infer what happens in the population from what we see in the sample Parameters and statistics

Sample mean: A typical situation A sample might be taken The mean and standard deviation are computed From this data, one will want to infer that the population values are identical or at least similar In other words, it is hoped that the sample data reflects the population data Sampling distribution

Sample mean: Another approach Change your thinking from a single sample Consider the situation where you:  Take many samples  Calculate a mean and standard deviation for each sample Sampling distribution

Taking many samples from a population Consider a population of 1,000 individuals with various heights If we take 10 samples of 100 persons from the population, each of the 10 samples will have a specific frequency distribution with:  A specific mean  A specific standard deviation In each sample, each data point is a height Sampling distribution

Looking at the means of the samples We can look at the frequency distribution of the means of each of the 10 samples In this case:  The data points are no longer the heights  The data points are the means Sampling distribution

Intuitive observation If we take iterative samples from a population, we are unlikely to sample extreme values every time:  Values close to the mean are common  Extreme values are less common Thus, when we compare the distribution of the heights and the distribution of the means, we observe:  More variation in the distribution of individual heights  Less variation in the distribution of the means Sampling distribution

Taking many samples from the population If we take many samples, we can plot a complete frequency distribution of the means of the samples Each sample produces a statistic (mean) The distribution of statistics (means) is called a sampling distribution Sampling distribution

Multiple sample means

Important properties of the sampling distribution 1.The sampling distribution is normally distributed 2.The mean of the sampling distribution is equal to the mean of the population Sampling distribution

Standard deviation of the sampling distribution If the standard deviation of the population is  The standard deviation of the sampling distribution will be  / (√ n) n is the sample size Sampling distribution

Terminology The mean of the sampling distribution continues to be called the mean The standard deviation of the sampling distribution is the standard error Standard error

Distribution of sample means One could obtain a standard deviation of sample means which would describe the variability and the spread of sample means about the true population mean In a practical situation:  There is only one sample mean  One hopes this sample mean is near the real population mean Wouldn't it be nice to have an estimate of the standard deviation of sample means which describe the spread of sample means? Standard error

Standard error of the mean Divide the standard deviation by the square root of the number of observations The resulting estimate of the standard deviation of sample means is called the standard error of means It can be interpreted in a manner similar to the standard deviation of raw scores  For example, the probability of obtaining a sample mean which is outside the to range is 5 out of 100 Standard error

Central limit theorem If x possesses any distribution with mean µ and standard deviation SD Then the sample mean x based on a random sample of size n will have a distribution that approaches the distribution of a normal random variable  Mean µ  Standard deviation SD/square root of n as n increases without limit. Special case:  If x is normally distributed, the result is true for any sample size Standard error

Simple example Let the population be 1,2,3,4,5  Mean = 15/5 = 3 = µ Let’s take a sample of two elements The 25 possible samples are: 1,11,21,31,41,5 2,12,22,32,42,5 3,13,23,33,43,5 4,14,24,34,44,5 5,15,25,35,45,5 Standard error

The frequency distribution of the population is not normal Values Frequency Standard error

Standard deviation of the population Standard error

Looking at the mean of the samples The 25 means of the 25 samples are: Mean of sample means = 75/25 = 3 Same as population mean Standard error

The sampling distribution tends to be normal Values Frequency Even if the population is not normally distributed, the sampling distribution will tend to be normal Standard error

Standard deviation of the sample Standard error

Standard deviation in the population and standard error Standard deviation in the population:  1.4 Sample size:  2 Square root of the sample size:  1.4 Standard deviation / square root of the sample size:  1.4 / 1.4 = 1  = Standard error Standard error

Applying the standard error: Male's serum uric acid levels (1/2) Population mean :  5.4 mg per 100 ml Standard deviation is:  1 Take 100 samples of 25 men in each sample Compute 100 sample means How many of those means would you expect to fall within the range 5.4-(1.96x1) to 5.4+(1.96x1)? The answer is 95! Standard error

Applying the standard error: Male's serum uric acid levels (2/2) One sample Mean serum uric acid level of 8.2 Would you assume this was "significantly" different from the population mean?  Yes, because a mean of that magnitude could occur less than 5 times in 100 Standard error

Key messages While population parameters are fixed, samples provide estimates (statistics) that fluctuate The distribution of a statistic for all possible samples of given size ‘n’ is called the sampling distribution.  For large ‘n’, the sampling distribution is ‘normal’, even if the original distribution is not.  If the original distribution is normal, the result is true even for small ‘n’. The mean of the sampling distribution is the population mean and the standard deviation (standard error) is the population SD/ sq.root n