Basic Statistics Probability and Sampling Distributions.

Slides:



Advertisements
Similar presentations
Chapter 9 Introduction to the t-statistic
Advertisements

Probability and Samples: The Distribution of Sample Means
Sampling Distributions
Statistics: Purpose, Approach, Method. The Basic Approach The basic principle behind the use of statistical tests of significance can be stated as: Compare.
Sampling Distributions (§ )
Chapter 18 Sampling Distribution Models
Chapter 10: Sampling and Sampling Distributions
Central Limit Theorem.
Topics: Inferential Statistics
Distribution of Sample Means 2011, 10, 20. Today ’ s Topics What is distribution of sample means?** Properties of distribution of sample means* How to.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Sampling Distributions
1 Hypothesis Testing In this section I want to review a few things and then introduce hypothesis testing.
Sampling Distributions
PROBABILITY AND SAMPLES: THE DISTRIBUTION OF SAMPLE MEANS.
Probability (cont.). Assigning Probabilities A probability is a value between 0 and 1 and is written either as a fraction or as a proportion. For the.
Chapter 7 ~ Sample Variability
Probability and the Sampling Distribution Quantitative Methods in HPELS 440:210.
Chapter 6: Sampling Distributions
From Last week.
1 Ch6. Sampling distribution Dr. Deshi Ye
Probability Quantitative Methods in HPELS HPELS 6210.
1 1 Slide © 2005 Thomson/South-Western Chapter 7, Part A Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction.
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
1 Introduction to Estimation Chapter Concepts of Estimation The objective of estimation is to determine the value of a population parameter on the.
Introduction to Statistical Inference Chapter 11 Announcement: Read chapter 12 to page 299.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Agresti/Franklin Statistics, 1e, 1 of 139  Section 6.4 How Likely Are the Possible Values of a Statistic? The Sampling Distribution.
Chapter 11 – 1 Chapter 7: Sampling and Sampling Distributions Aims of Sampling Basic Principles of Probability Types of Random Samples Sampling Distributions.
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal
Introduction to Behavioral Statistics Probability, The Binomial Distribution and the Normal Curve.
Chapter 7: Sampling and Sampling Distributions
Chapter 7: Sample Variability Empirical Distribution of Sample Means.
Anthony J Greene1 Where We Left Off What is the probability of randomly selecting a sample of three individuals, all of whom have an I.Q. of 135 or more?
TobiasEcon 472 Law of Large Numbers (LLN) and Central Limit Theorem (CLT)
Distribution of the Sample Means
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 7 - Sampling Distribution of Means.
Chapter 7 Probability and Samples: The Distribution of Sample Means.
Distributions of the Sample Mean
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
Sampling Error SAMPLING ERROR-SINGLE MEAN The difference between a value (a statistic) computed from a sample and the corresponding value (a parameter)
Introduction to Statistics Chapter 6 Feb 11-16, 2010 Classes #8-9
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Chapter 10: Introduction to Statistical Inference.
Confidence Interval Estimation For statistical inference in decision making:
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
Review of Statistical Terms Population Sample Parameter Statistic.
Introduction to Inference Sampling Distributions.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Sampling Distributions 8.
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 5 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
INFERENTIAL STATISTICS DOING STATS WITH CONFIDENCE.
From the population to the sample The sampling distribution FETP India.
Distributions of Sample Means. z-scores for Samples  What do I mean by a “z-score” for a sample? This score would describe how a specific sample is.
m/sampling_dist/index.html.
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
THE CENTRAL LIMIT THEOREM. Sampling Distribution of Sample Means Definition: A distribution obtained by using the means computed from random samples of.
Sampling Distributions
Chapter 6: Sampling Distributions
Distribution of the Sample Means

Sampling Distributions and The Central Limit Theorem
Probability and the Sampling Distribution
Sampling Distributions
Sampling Distributions (§ )
Sampling Distributions and The Central Limit Theorem
Chapter 4 (cont.) The Sampling Distribution
How Confident Are You?.
Presentation transcript:

Basic Statistics Probability and Sampling Distributions

STRUCTURE OF STATISTICS TABULAR DESCRIPTIVE GRAPHICAL NUMERICAL STATISTICS ESTIMATION INFERENTIAL TESTS OF HYPOTHESIS

? Inferential Statistics EXAMPLE: Population parameters statistics Inferential statistics describes a population of data using the information contained in a sample ? EXAMPLE: Population parameters Estimating Predicting (inferring) Sampling Sample statistics A sample is a portion, or part, of the population of interest

Everyone makes inferences, why Statistical Inference? I predict rain today!!! The difference between a fortune teller and a statistician is a “Statement of Goodness”

A statement of goodness is a statement indicating the chance or probability that an inference is wrong. Probability is a statement of one’s belief that an event will happen. * What are the chances of a head appearing when you toss a coin? * What are the chances of selecting an Ace of Diamonds from a deck of cards?

Probability Has a Basis in Mathematics Consider a coin-toss experiment: where k = number of heads and N = number of tosses 1/2

Relative Frequency Probability P(A) = Number of Events of Interest (A) divided by the Total Number of Events. Probability of a Head on the flip of an honest coin = 1/2 (as we saw on the last slide). Probability of drawing the Ace of Diamonds = 1/52 (since there are 52 cards in the deck and only 1 ace of diamonds). There are other elements of probability discussed in the text but we will not be using those portions for this class.

We can put probability in the context of a relative frequency distribution. Recall the example of 10 students who took a 5-point quiz with the following results. It turns out that the area under the curve also represents the probability of an event. For example, what is the probability that a student picked at random scored 4 on the quiz? f X r .4 -- .3 -- .2 -- .1 -- .0 -- 1 2 3 4 5 .1 The probability would be .2, the same as the area under the curve! .1 .1 .1 .1 .1 .1 .1 .1 .1

This idea transfers directly to a normal distribution This idea transfers directly to a normal distribution. What is the probability that a randomly selected person scores at least 650 on the Verbal portion of the GRE? First, we must compute the person’s z-score. From Table A, Column C we find that .0668 of the area is in the tail. Thus, the probability is about .07. z 1.5

Some Notation Characteristic Sample Population Mean Standard Deviation (Necessary to Distinguish Between Sample and Population) Characteristic Sample Population Mean Standard Deviation Sample Size In descriptive statistics, the differentiation is not important as the sample and population numerical measures are the same.

Sampling A key to statistical inference is the assumption that the sample is representative of the population from which it was drawn. Random Sampling ensures that each possible sample has an equal chance of being selected and all members of the population have an equal chance of being selected into the sample. We will assume that all samples are selected in a random fashion. Please note that bigger is not better, unless it is representative.

Introduction to Inferential Statistics How do we get from a sample to a prediction about a population? In statistics, we use a sampling distribution to infer the characteristics of the population.

Some Definitions Sampling Distributions: A distribution of statistics obtained by selecting all possible samples of a specific size from a population. Sampling Error: The discrepancy between the statistic obtained from the sample and the parameter for the population. Standard Error: Provides an estimate of exactly how much error, on average, should exist between the statistic and the parameter. It is a measure of chance and is the standard deviation of the sampling distribution.

What is the shape of the sampling distribution and can we describe it in terms of mean and standard deviation (standard error)? YES! The answer is the Central Limit Theorem (CLT).

Central Limit Theorem Applied to Means For any population with mean m and standard deviation s, the distribution of sample means for sample size n (n >30) will have a mean of m and a standard deviation (standard error) of , and will approach a normal distribution as n approaches infinity.

The Central Limit Theorem Recap Regardless of the mean of the population, the mean of the distribution of sample means (sampling distribution) will be the same. Regardless of the SD of the population, the SD of the sampling distribution will be the same divided by the square root of the sample size. Regardless of the shape of the population, the shape of the sampling distribution will be approximately normal.

An Example of a Sampling Distribution of the Means First, assume that we have a Population consisting of only four numbers N =4, (2, 4, 6, 8). Next, we will take all possible samples from this population of size n = 2. We will calculate the mean of each sample that we obtain. Finally, we will plot the means in a frequency histogram.

Our Population Parameters = 20 / 4 = 5 = = 2.236

All Possible Samples (n = 2) from our Population First Pick Second Pick Mean 2 6 4 3 5 8 7 These are all possible samples (16) of size n = 2 and the means of those samples that can be taken from our population of N = 4 objects.

Frequency Histogram of Means All 16 means plotted

Calculate the Mean of the Means and Standard Deviation of the Means

The Central Limit Theorem Recall that the Central Limit Theorem states that the mean of the sampling distribution of the means would be equal to m. From the previous slide we calculated the mean of the sampling distribution of our 16 means to be 5, which is the population mean we calculated earlier. Also, recall that the Central Limit Theorem states that the standard deviation (standard error) would be equal to If we divide the standard deviation we calculated on our population (s = 2.236) and divide it by the square root of our sample size ( n= 2) we would obtain 2.236 / 1.4142 = 1.58; which is exactly what we calculated our standard deviation to be using our sample data.

Summary of Central Limit Theorem = 20 / 4 = 5 = 2.236 = 2.236 / 2 = 1.58 Note the symbols used to denote the mean of the means and the standard error of the mean and

Inferential Statistics We will use our knowledge of the sampling distribution of the means {(m), ( )} given by the Central Limit Theorem as the basis for inferential statistics. We also will use our ability to locate a single score in a distribution using z scores in hypothesis testing.