Consolidation & Review

Slides:



Advertisements
Similar presentations
There are two statistical tests for mean: 1) z test – Used for large samples (n ≥ 30) 1) t test – Used for small samples (n < 30)
Advertisements

“Students” t-test.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 9 Inferences Based on Two Samples.
Sampling: Final and Initial Sample Size Determination
Sampling Distributions (§ )
1 Midterm Review Econ 240A. 2 The Big Picture The Classical Statistical Trail Descriptive Statistics Inferential Statistics Probability Discrete Random.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
1 Econ 240A Power Four Last Time Probability.
Final Jeopardy $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 LosingConfidenceLosingConfidenceTesting.
1 Econ 240A Power 6. 2 Interval Estimation and Hypothesis Testing.
1 Econ 240A Power 6. 2 The Challenger Disaster l sjoly/RB-intro.html sjoly/RB-intro.html.
OMS 201 Review. Range The range of a data set is the difference between the largest and smallest data values. It is the simplest measure of dispersion.
Inferences About Process Quality
Statistical Inference Lab Three. Bernoulli to Normal Through Binomial One flip Fair coin Heads Tails Random Variable: k, # of heads p=0.5 1-p=0.5 For.
BCOR 1020 Business Statistics Lecture 18 – March 20, 2008.
1 Econ 240A Power Four Last Time Probability.
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
One Sample  M ean μ, Variance σ 2, Proportion π Two Samples  M eans, Variances, Proportions μ1 vs. μ2 σ12 vs. σ22 π1 vs. π Multiple.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Normal Distributions Z Transformations Central Limit Theorem Standard Normal Distribution Z Distribution Table Confidence Intervals Levels of Significance.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Determination of Sample Size: A Review of Statistical Theory
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
1.State your research hypothesis in the form of a relation between two variables. 2. Find a statistic to summarize your sample data and convert the above.
Estimation Chapter 8. Estimating µ When σ Is Known.
CHAPTER SEVEN ESTIMATION. 7.1 A Point Estimate: A point estimate of some population parameter is a single value of a statistic (parameter space). For.
Machine Learning Chapter 5. Evaluating Hypotheses
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Mystery 1Mystery 2Mystery 3.
© Copyright McGraw-Hill 2004
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
 Confidence Intervals  Around a proportion  Significance Tests  Not Every Difference Counts  Difference in Proportions  Difference in Means.
 Normal Curves  The family of normal curves  The rule of  The Central Limit Theorem  Confidence Intervals  Around a Mean  Around a Proportion.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 8: Inferences Based on a Single Sample: Tests of Hypotheses
Chapter 6: Sampling Distributions
Sampling and Sampling Distributions
Statistical Inference
Introduction For inference on the difference between the means of two populations, we need samples from both populations. The basic assumptions.
Chapter 9 Hypothesis Testing.
ESTIMATION.
Chapter 8: Inferences Based on a Single Sample: Tests of Hypotheses
LECTURE 33: STATISTICAL SIGNIFICANCE AND CONFIDENCE (CONT.)
Chapter 4. Inference about Process Quality
Sampling distribution
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Chapter 6: Sampling Distributions
Data Mining 2016/2017 Fall MIS 331 Chapter 2 Sampliing Distribution
Sampling Distribution of Sample Means Open Minitab
Econ Roadmap Focus Midterm 1 Focus Midterm 2 Focus Final Intro
Hypothesis Tests for a Population Mean in Practice
Chapter 9 Hypothesis Testing.
CI for μ When σ is Unknown
Problems: Q&A chapter 6, problems Chapter 6:
CHAPTER 22: Inference about a Population Proportion
Hypothesis Tests for Proportions
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
CHAPTER 6 Statistical Inference & Hypothesis Testing
CHAPTER 15 SUMMARY Chapter Specifics
Intro to Confidence Intervals Introduction to Inference
Sampling Distributions (§ )
Statistical Inference for the Mean: t-test
How Confident Are You?.
Presentation transcript:

Consolidation & Review Inference

Sample Proportions Column of zeros and ones : estimated proportion or fraction equals # of successes/ # of observations = k/n Where k= Binomial[mean=np, variance=p*(1-p)n] E[ } = E[k/n] =E[(1/n)*k] = 1/n E[k] =( 1/n)np=p Var[ ] = Var[k/n] = (1/n)2 Var k =(1/n)2 np(1-p) Var[ ] = p(1-p)/n is a point estimate of p The estimate of the Var of is *(1 - )/n

Sample Proportions example from Lab 3 one of the ten columns of 50 observations of ones and zeros with the smallest proportion of 0.32 Var = 0.32*0.68/50 =0.004352 with square root, i.e standard deviation of 0.066 A 95 % confidence interval for an estimate of p from this sample is: Prob [-1.96≤(0.32-p)/0.066≤1.96]=0.95 Prob[-1.96*0.066≤(0.32-p)≤1.96*0.066]=0.95 Prob[-0.13≤(0.32-p)≤0.13]=0.95 Prob[0.13≥(p-0.32)≥-0.13]=0.95 Prob[0.45≥p≥ 0.19]=0.95

Note: This 95 % confidence interval does not include p=0.5, the population parameter chosen for the simulation, illustrating that 5 % of the time the 95% confidence interval will not include the true value! The Prob [-1.96≤(0.32-p)/0.066≤1.96]=0.95 Is the same as Prob[-1.96≤z≤1.96]=0.95, where, ( - p)/ = z = (0.32 – p)/0.066, in this example We can use the normal distribution approximation to the binomial in this example since n*p = 50*0.32 ≥ 5 and n*(1-p)≥5

a Z value of 1.96 leads to an area of 0.475, leaving 0.025 in the Upper tail

Interval Estimation The conventional approach is to choose a probability for the interval such as 95% or 99%

So z values of -1.96 and 1.96 leave 2.5% in each tail

1.96 -1.96 2.5% 2.5%

Application of Sample Proportions

Oct 13 Rasmussen Poll

Field Poll Margin of Error Variance of = *(1 - )/n = 0.47* 0.53/599 = 0.000416 = √0.000416 = 0.0204 So two standard deviations is about 0.041 0r 4.1%, i.e the margin of error is plus or minus 4.1 percentage points

Inferring the unknown population mean from a sample mean Example from Lab 3: simulate the population as uniform, with random variable x, 0≤x≤1, and density f(x) =1 Note: Note the expected value of x, E[x]= Var[x] = E[x-E(x)]2 = E[x – E(x)]2 =E{x2 -2xE[x]+E[x]2} Var[x] = {E(x2) – 2E[x]*E[x] + E[x]2 } = E[x2] –[Ex]2 E[x2] = = 1/3 Var[x] = E[x2] – E[x]2 = 1/3 –[1/2]2 = 1/3-1/4 = (4 -3)/12 =1/12 X~ Uniform(mean=1/2, Variance=1/12)

In lab 3 we drew a random sample of size 50 from this uniform distribution and calculated the sample mean: From the central limit theorem, we know, and we saw in lab 3, that the sample mean is distributed normally

Central tendency and dispersion of sample mean Where μ is the population mean. In the simulation from Lab 3 using the uniform distribution, we knew that μ = 0.5 Where σ2 is the variance of x. In the simulation from Lab 3 using the uniform distribution, we knew the σ2 =1/12.

Hypothesis testing Example from Lab 3 for sample proportions Step one: formulate the hypotheses Null hypothesis, H0: p = 0.5 Alternative hypothesis, HA : p<0.5 Step two: Identify a test statistic Where the value for p is from the null hypothesis, so z= (0.32 - 0.5)/0.066 = 0.18/0.066 = 2.73 If the null hypothesis were true, what is the probability of getting a test statistic of this size

Hypothesis Testing: 4 Steps Formulate all the hypotheses Identify a test statistic If the null hypothesis were true, what is the probability of getting a test statistic this large? Compare this probability to a chosen critical level of significance, e.g. 5% 19

20 a Z value Of 2.73 leads to an area of 0.4968, leaving 0.0032 in the Upper tail, and Hence 0.0032 In the lower tail. If you choose a risk level of .05, i.e. α = 0.05 for The probability A type I error, Then reject H0 20

0.0032 -2.73

0.050 -1.645