Introduction to Formal Statistical Inference

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Chapter 6 Confidence Intervals.
Chapter 6 Sampling and Sampling Distributions
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
Estimation in Sampling
Sampling: Final and Initial Sample Size Determination
CHAPTER 7.1 Confidence Intervals for the mean. ESTIMATION  Estimation is one aspect of inferential statistics. It is the process of estimating the value.
Chapter 7 Introduction to Sampling Distributions
Math 161 Spring 2008 What Is a Confidence Interval?
Chapter 7 Introduction to Sampling Distributions
Ch 6 Introduction to Formal Statistical Inference.
Chapter 6 Introduction to Sampling Distributions
Chapter 7 Sampling and Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Introduction to Statistics Chapter 7 Sampling Distributions.
Business Statistics for Managerial Decision
Part III: Inference Topic 6 Sampling and Sampling Distributions
Chapter 6: Introduction to Formal Statistical Inference November 19, 2008.
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 10 Introduction to Estimation.
BCOR 1020 Business Statistics
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Chapter 6: Sampling Distributions
Review of normal distribution. Exercise Solution.
1 CHAPTER 7 Homework:5,7,9,11,17,22,23,25,29,33,37,41,45,51, 59,65,77,79 : The U.S. Bureau of Census publishes annual price figures for new mobile homes.
Chapter 7 Estimation: Single Population
Confidence Intervals Chapter 6. § 6.1 Confidence Intervals for the Mean (Large Samples)
Chapter 6 Confidence Intervals.
Chapter 6 Confidence Intervals 1 Larson/Farber 4th ed.
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
STA291 Statistical Methods Lecture 16. Lecture 15 Review Assume that a school district has 10,000 6th graders. In this district, the average weight of.
6 Chapter Confidence Intervals © 2012 Pearson Education, Inc.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
LECTURE 16 TUESDAY, 31 March STA 291 Spring
Estimates and Sample Sizes Lecture – 7.4
Chap 6-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 6 Introduction to Sampling.
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
Estimating a Population Proportion
Chapter 10 Introduction to Estimation Sir Naseer Shahzada.
Estimation This is our introduction to the field of inferential statistics. We already know why we want to study samples instead of entire populations,
Confidence Intervals: The Basics BPS chapter 14 © 2006 W.H. Freeman and Company.
Determination of Sample Size: A Review of Statistical Theory
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
Estimation Chapter 8. Estimating µ When σ Is Known.
Unit 6 Confidence Intervals If you arrive late (or leave early) please do not announce it to everyone as we get side tracked, instead send me an .
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-1 Review and Preview.
Copyright © 2015, 2012, and 2009 Pearson Education, Inc. 1 Section 6.1 Confidence Intervals for the Mean (  Known)
Confidence Intervals (Dr. Monticino). Assignment Sheet  Read Chapter 21  Assignment # 14 (Due Monday May 2 nd )  Chapter 21 Exercise Set A: 1,2,3,7.
Confidence Interval Estimation For statistical inference in decision making:
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
1 CHAPTER 6 Sampling Distributions Homework: 1abcd,3acd,9,15,19,25,29,33,43 Sec 6.0: Introduction Parameter The "unknown" numerical values that is used.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
CHAPTER 9 Inference: Estimation The essential nature of inferential statistics, as verses descriptive statistics is one of knowledge. In descriptive statistics,
Copyright © 2009 Pearson Education, Inc. 8.1 Sampling Distributions LEARNING GOAL Understand the fundamental ideas of sampling distributions and how the.
Lecture 5 Introduction to Sampling Distributions.
Ex St 801 Statistical Methods Inference about a Single Population Mean (CI)
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
Chapter 6 Sampling and Sampling Distributions
Confidence Intervals.
Chapter 6: Sampling Distributions
Confidence Intervals and Sample Size
Sampling Distributions
Sampling Distributions
Chapter 6: Sampling Distributions
Estimates and Sample Sizes Lecture – 7.4
How Confident Are You?.
Presentation transcript:

Introduction to Formal Statistical Inference Chapter 6 Introduction to Formal Statistical Inference

Inferential Statistics Two areas of statistics: Descriptive Statistics Inferential Statistics

Some Terminology Quantities of a population are called parameters and are typically denoted by Greek letters Quantities obtained from a sample are called statistics and are typically denoted by Roman letters µ is a parameter, is a statistic

Example As a means of trying to estimate the mean GPA of Bucknell students, a sample of 100 students yielded an average of 3.12. The parameter of interest is the population mean GPA (µ) The statistic is the sample mean GPA of 3.12. ( )

Parameters For every parameter of interest, there are typically a number of statistics that can be used for estimation purposes If one is interested in the population mean, the sample mean or sample median can be used If one is interested in the population variance, the sample variance, sample range, or sample IQR can be used

Sampling Distributions The sampling distribution for a sample statistic is the probability distribution of the statistic Sampling distributions are just like the probability distributions discussed earlier (i.e., sampling distributions have a mean and variance, usually dependent upon the sample size)

Central Limit Theorem If X1, X2, …Xn are iid random variables (with mean µ and variance σ2), then for large n, the variable is approximately normally distributed. That is, approximate probabilities can be calculated using the normal distribution with mean µ and variance σ2/n.

Z value for sample mean

Properties of Sampling Distributions A sample statistic used to estimate a population parameter is called a point estimate (or point estimator) There are 2 properties that are desired for point estimators: The mean of the sampling distribution of the point estimator is equal to the population parameter that it is intending to estimate (i.e., the point estimator is an unbiased estimator) The point estimator has minimum variance among all other point estimators

Sampling Distribution of Mean is always and unbiased point estimator of µ There are 2 things that are always true about the sampling distribution of :

Applications If the population is Normally distributed, then

Example - The weights of the jars of baby food are Normally distributed with a mean of 137.2 g and a standard deviation of 1.6 g. What is the probability that if one jar was selected at random, its weight would be more than 140 grams?

Example What is the probability that if nine jars were selected at random, their average weight would be more than 140 grams?

What if it’s not Normal? If we don’t know the shape or the distribution or if we know that it is not Normal, we can apply the Central Limit Theorem to find out something about the distribution. For sufficiently large samples, the sampling distribution of will be approximately Normal. Typically, a sample size of 25 or 30 is “sufficiently large” The necessary sample size depends on the skewness of the distribution of the population The larger the sample size, the better the normality

Example A soft-drink bottler purchases glass bottles from a vendor. The bottles are required to have an internal pressure strength of at least 150 pounds per square inch (psi). A prospective bottle vendor claims that its production process yields bottles with a mean internal strength of 157 psi and a standard deviation of 3 psi. The bottler strikes an agreement with the vendor that permits the bottler to sample from the vendor’s production process to verify the vendor’s claim. The bottler randomly selects 40 bottles from the last 10,000 produced, measures the internal pressure of each and fins that the mean pressure for the sample to be 1.3 psi below the process mean cited by the vendor. Assuming the vendor’s claim to be true, what is the probability of obtaining a sample mean this far or farther below the process mean? What does your answer suggest about the validity of the vendor’s claim?

Estimation Continued Goals of Confidence Interval Estimation Identify an interval of values likely to contain an unknown parameter Quantify how likely the interval is to contain the correct value

Confidence Interval A confidence interval for a parameter is a data-based interval of numbers thought likely to contain the parameter possessing a stated probability-based confidence or reliability

A Large-n Confidence Interval for µ Involving σ point estimate ± margin of error

Gallup Poll www.gallup.com Results are based on telephone interviews with 825 likely voters, aged 18 and older, conducted Oct. 10-12, 2008. For results based on the total sample of likely voters, one can say with 95% confidence that the maximum margin of sampling error is ±4 percentage points. Interviews are conducted with respondents on land-line telephones (for respondents with a land-line telephone) and cellular phones (for respondents who are cell-phone only). In addition to sampling error, question wording and practical difficulties in conducting surveys can introduce error or bias into the findings of public opinion polls.

Back to Baby Food Jars Suppose we want to estimate the actual mean weight of all baby food jars produced at the plant How can we do this?

What do we know? Given that σ = 1.6 grams Suppose we take a sample of 50 jars and finds that their average weight is 142.7 grams.

Point estimate ± margin of error Formula Point estimate ± margin of error

Z’s for Confidence Intervals Desired Confidence z 80% 1.28 90% 1.645 95% 1.96 98% 2.33 99% 2.58