Chapter 6 INFERENTIAL STATISTICS I: Foundations and sampling distribution.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Estimation in Sampling
Sampling: Final and Initial Sample Size Determination
Sampling Distributions
QUANTITATIVE DATA ANALYSIS
Topics: Inferential Statistics
Sampling Distributions
Chapter Sampling Distributions and Hypothesis Testing.
PROBABILITY AND SAMPLES: THE DISTRIBUTION OF SAMPLE MEANS.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview Central Limit Theorem The Normal Distribution The Standardised Normal.
Chapter 7 Probability and Samples: The Distribution of Sample Means
Standard error of estimate & Confidence interval.
Review of normal distribution. Exercise Solution.
Essentials of Marketing Research
Understanding Inferential Statistics—Estimation
STA291 Statistical Methods Lecture 16. Lecture 15 Review Assume that a school district has 10,000 6th graders. In this district, the average weight of.
Estimation of Statistical Parameters
Topic 5 Statistical inference: point and interval estimate
1 Introduction to Estimation Chapter Concepts of Estimation The objective of estimation is to determine the value of a population parameter on the.
© 2003 Prentice-Hall, Inc.Chap 6-1 Business Statistics: A First Course (3 rd Edition) Chapter 6 Sampling Distributions and Confidence Interval Estimation.
LECTURE 16 TUESDAY, 31 March STA 291 Spring
Estimates and Sample Sizes Lecture – 7.4
Random Sampling, Point Estimation and Maximum Likelihood.
Continuous Probability Distributions Continuous random variable –Values from interval of numbers –Absence of gaps Continuous probability distribution –Distribution.
Sampling W&W, Chapter 6. Rules for Expectation Examples Mean: E(X) =  xp(x) Variance: E(X-  ) 2 =  (x-  ) 2 p(x) Covariance: E(X-  x )(Y-  y ) =
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
The Normal Curve Theoretical Symmetrical Known Areas For Each Standard Deviation or Z-score FOR EACH SIDE:  34.13% of scores in distribution are b/t the.
Determination of Sample Size: A Review of Statistical Theory
Chapter 7 Probability and Samples: The Distribution of Sample Means.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
CHAPTER SEVEN ESTIMATION. 7.1 A Point Estimate: A point estimate of some population parameter is a single value of a statistic (parameter space). For.
Confidence Intervals (Dr. Monticino). Assignment Sheet  Read Chapter 21  Assignment # 14 (Due Monday May 2 nd )  Chapter 21 Exercise Set A: 1,2,3,7.
What is a Confidence Interval?. Sampling Distribution of the Sample Mean The statistic estimates the population mean We want the sampling distribution.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 5 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
© 2002 Prentice-Hall, Inc.Chap 8-1 Basic Business Statistics (8 th Edition) Chapter 8 Confidence Interval Estimation.
Sampling Distributions Sampling Distributions. Sampling Distribution Introduction In real life calculating parameters of populations is prohibitive because.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Sampling: Distribution of the Sample Mean (Sigma Known) o If a population follows the normal distribution o Population is represented by X 1,X 2,…,X N.
Design and Data Analysis in Psychology I English group (A) Salvador Chacón Moscoso Susana Sanduvete Chaves Milagrosa Sánchez Martín School of Psychology.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Introduction to Marketing Research
Sampling and Sampling Distributions
Intro to Research Methods
Continuous Probability Distributions
Confidence Intervals and Sample Size
Sampling Distributions
Understanding Sampling Distributions: Statistics as Random Variables
ESTIMATION.
LECTURE 24 TUESDAY, 17 November
Sampling Distributions and Estimation
Chapter 6 Confidence Intervals.
Chapter 9: Inferences Involving One Population
Combining Random Variables
Elementary Statistics
Chapter 9 Hypothesis Testing.
Chapter 6 Confidence Intervals.
Arithmetic Mean This represents the most probable value of the measured variable. The more readings you take, the more accurate result you will get.
Warmup To check the accuracy of a scale, a weight is weighed repeatedly. The scale readings are normally distributed with a standard deviation of
Calculating Probabilities for Any Normal Variable
Estimates and Sample Sizes Lecture – 7.4
BUSINESS MARKET RESEARCH
ESTIMATION.
Objectives 6.1 Estimating with confidence Statistical confidence
Objectives 6.1 Estimating with confidence Statistical confidence
Presentation transcript:

Chapter 6 INFERENTIAL STATISTICS I: Foundations and sampling distribution

INFERENTIAL STATISTICS STATISTICAL PARAMETERS Introduction

 Statistical estimation theory: Find a value for the index in the sample level, the goal is to infer the value of the index in the population. Punctual estimation: if it provides a single value. Estimation by intervals: if it facilitates a range of values ​ whose limits we expected to be the population mean.  Statistical decision theory : Procedure to make decisions in the field of statistical inference. We’ll see in chapter 8.

Phases of the inferential process 1  A sample is obtained, randomly, and calculated the corresponding statistics:

2  We wonder: What would have happened if we had worked with the entire population?

3  Know the probabilistic model of possible outcomes: * normal law, binomial, * Ji-square, * Student-Fisher’ t, * Snedecor’ F Etc... It is known because we have information about similar situations or because it is deductible.

4  Construction of the STATISTIC SAMPLING DISTRIBUTION (means or proportions). That is, construct the distribution of all possible outcomes.

5  Knowing the sampling distribution and the underlying probability model, we just can make probability judgments about the statistics.

9 Sampling error theory

 As larger the sample size and better sampling procedure performed, easier would be that the statistical value is close to the parameter value satisfactorily.  But, it is expected that there is some discrepancy between them (sampling error).  Solution: We renounce to know the precise and specific sampling error. We can have some confidence that this error does not exceed a limit amount. Thanks to the inference, this amount is known with a certain confidence.

1º) We selected a sample by a random system (MAS = simple random sampling) and obtain the main statistics. Sample Population Statistics Parameters (latin letters) (greek letters)

 2º) Calculate the SAMPLING ERROR, "the difference between a statistic and its corresponding parameter“.  E m

 Ex. The mean in the sample is 13.2 and there is a 0.05 probability of being wrong in asserting that the population mean differs from 13.2 in 2 units, plus or minus.

It is not known, but we can have some confidence that this error does not exceed a certain limit amount. Thanks to the inference, this amount is known with a certain confidence (translated to a specific value of probability) ERROR SAMPLE ACCURACY RELIABILITY How can we measure the sampling error?

 "The precision with which a statistic represents the parameter. " a) Accuracy  = 50

 "The measure of the constancy of a statistic when you get several samples of the same type and size. " b) Reliability

 We select large samples (n> 30) and obtain the means:  If they vary little among themselves, as in the example, we could say they are very reliable, if not they will be unreliable and we will not trust them.  This is an indirect indication of the accuracy. Example

 Knowledge about the context in which the inference is being made ​ allows us to conclude at the population level, with a degree of certain security or certainty.  To obtain this probability measure, it’s necessary to know how rare or expected is to find what we have found. We need to know a sampling distribution and a probability model associated with it.

 To arrive to the knowledge of sampling distributions should follow a process of construction in 3 phases: SAMPLING DISTRIBUTION

1ª FHASE  Obtaining sample areas of the population.  I.e.: Collection of all samples of the same size "n", extracted randomly from the population under study.

M 1 M 2 M 3 M... M n Population If in each of the samples we calculate the mean we can see that does not always take the same value but varies its value from sample to sample.

2ª PHASE  Get all the means of each of these samples. M 1 M 2 M 3 M... M n

3ª PHASE  Grouping these measures in a new distribution called: Sample distribution of means

Mathematical expectation or expected value Standard error Parameters of the Sample Distribution of means sigma mŷ

Population distribution and sample distribution of means

CARACTERISTICS of sample distributions of means  1ª) The statistics obtained in the samples are grouped around the population parameter.  2ª) As you increase n, the statistics will be more grouped around the parameter.  3ª) If the samples are large, the graphic representation of the sampling distribution, we can observe that:

 a) The graphic representation is SYMMETRICAL about the central vertical axis that is the parameter ().  b) Bell-shaped more narrow when higher is "n".

 c) Takes the form of the normal curve.  d) The mean of the sampling distribution of means matches with the real mean in the population.  e) The distribution is more or less variable. If the sampling distribution changes little, i.e., has a very small sigma, means differ little among themselves, and it’s be very reliable.

The standard error of the mean =  Depends on the value taken by the standard deviation of the sampling distribution of means.  This value is known as typical error of the mean.  Symbolically: typical error of the mean =

STANDARDIZATION

 In D.M. we do not work directly with theoretical scores, but typical scores.  Typify the D.M. allows us to calculate probabilities (if you also know the probability model that has the distribution). We can consider Normal Distribution if: n≥30 in Distrib. of means Πn ≥5 y (1- Π)n ≥5 in Distrib. of probability

Means Based on σ Based on S N = ∞ N ≠ ∞ Characteristics of the sampling distribution of means in terms of population size and population and sample variances

EXAMPLE 1 (suppose we know variance) Sampling Distribution of Means

We have applied a test of a population and we obtained a mean ( ) of 18 points, with a standard deviation ( ) of 3 points. Assuming that the variable is normally distributed in the population, calculate: A) Between what values will the central 95% of the subjects of that population be? B) Between what values ​ will the central 99% of the average scores in samples of size n = 225, drawn at random from this population be?

A)Between what values will the central 95% of the subjects of that population be?

Calculations

The central 95% of the subjects will be obtained between and points

% B) Between what values ​ will the central 99% of the average scores in samples of size n = 225, drawn at random from this population be?

Calculations

17,48418,516 The central 99% of the mean scores ranging between and %

EXAMPLE 2 Calculate the probability of extracting from a population whose mean () is 40 and standard deviation () is 9 -, a sample of size n = 81, whose average is equal to or less than 42 points.

Calculations

EXAMPLE 3 In one sampling distribution of means with samples of 49 subjects, central 90% of samples means are between 47 and 53 points: Which scores delimit the central 95% of means? Which is the σ, related to the origin samples population? Which scores delimit the central 95% of means, if n is 81 subjects?

A)¿Which scores delimit the central 95% of means? % Because n is 49, > than 30 = DN Mean= /2= 50 SD 1.64 = 53 – 50 / = 1.83

¿? 95% = X – 50 / = X – 50 / 1.83 X = X = 53.59

B) ¿Which is the σ, related to the origin samples population? = 1.83 = σ / √49; σ = 12.81

c) Which scores delimit the central 95% of means, if n is 81 subjects? ¿? 95% = X – 50 / = / √81 = = X – 50 / = X – 50 / 1.42 X = X= 52.78

Sampling Distribution of Proportions EXAMPLE 1 In a given population, the proportion of smokers was If we choose for this population a sample of n = 200 subjects; What is the probability that in that sample we find 130 or fewer smokers?

CONDITIONS OF APPLICATION

Parameters of the sample distribution proportions

Standardization process

CALCULATIONS

Proportions Based on σ Based on S N = ∞ N ≠ ∞ Characteristics of the sampling distribution of means in terms of population size and population and sample variances

EXAMPLE 2  In the elections in a particular university, to elect president, a candidate obtained 45% of the vote.  If you will choose randomly and independently a sample of 100 voters, what is the probability that the candidate receives more than 50% of the vote?

EXAMPLE 2 We know that 30% of seville students pass one concrete test. Extracting samples of 100 students from this population: Which values delimit the central 99% from proportions of these samples? Which samples % will have a proportion equal or higher than 0,35 respect to students that pass the test?

A) Which values delimit the central 99% from proportions of these samples? 99%

A) Which values delimit the central 99% from proportions of these samples? 99%

A)Which samples % will have a proportion equal or higher than 0,35 respect to students that pass the test? ≥0.35