Estimators and estimates: An estimator is a mathematical formula. An estimate is a number obtained by applying this formula to a set of sample data. 1.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
General Linear Model Introduction to ANOVA.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 10) Slideshow: introduction to maximum likelihood estimation Original citation: Dougherty,
Statistical Estimation and Sampling Distributions
POINT ESTIMATION AND INTERVAL ESTIMATION
Chap 8-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 8 Estimation: Single Population Statistics for Business and Economics.
Lecture 3 Today: Statistical Review cont’d:
1 XX X1X1 XX X Random variable X with unknown population mean  X function of X probability density Sample of n observations X 1, X 2,..., X n : potential.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: asymptotic properties of estimators: plims and consistency Original.
Econ 140 Lecture 61 Inference about a Mean Lecture 6.
Lecture 2 Today: Statistical Review cont’d:
ASYMPTOTIC PROPERTIES OF ESTIMATORS: PLIMS AND CONSISTENCY
Point estimation, interval estimation
Topic 7 Sampling And Sampling Distributions. The term Population represents everything we want to study, bearing in mind that the population is ever changing.
SUMS OF RANDOM VARIABLES Changfei Chen. Sums of Random Variables Let be a sequence of random variables, and let be their sum:
Evaluating Hypotheses
Visual Recognition Tutorial
Statistical Background
Chapter 8 Estimation: Single Population
Continuous Random Variables and Probability Distributions
Topic4 Ordinary Least Squares. Suppose that X is a non-random variable Y is a random variable that is affected by X in a linear fashion and by the random.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction.
QUIZ CHAPTER Seven Psy302 Quantitative Methods. 1. A distribution of all sample means or sample variances that could be obtained in samples of a given.
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
EC220 - Introduction to econometrics (review chapter)
1 UNBIASEDNESS AND EFFICIENCY Much of the analysis in this course will be concerned with three properties of estimators: unbiasedness, efficiency, and.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: sampling and estimators Original citation: Dougherty, C. (2012)
MEASURES OF CENTRAL TENDENCY & DISPERSION Research Methods.
Econ 482 Lecture 1 I. Administration: Introduction Syllabus Thursday, Jan 16 th, “Lab” class is from 5-6pm in Savery 117 II. Material: Start of Statistical.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: conflicts between unbiasedness and minimum variance Original citation:
Chapter 6: Sampling Distributions
CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE
Chapter 7 Estimation: Single Population
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
1 G Lect 3b G Lecture 3b Why are means and variances so useful? Recap of random variables and expectations with examples Further consideration.
Statistical Methods Introduction to Estimation noha hussein elkhidir16/04/35.
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
Chapter 3 Numerically Summarizing Data 3.2 Measures of Dispersion.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 9 Samples.
Properties of OLS How Reliable is OLS?. Learning Objectives 1.Review of the idea that the OLS estimator is a random variable 2.How do we judge the quality.
© Christopher Dougherty 1999–2006 The denominator has been rewritten a little more carefully, making it explicit that the summation of the squared deviations.
Simple regression model: Y =  1 +  2 X + u 1 We have seen that the regression coefficients b 1 and b 2 are random variables. They provide point estimates.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
Distributions of the Sample Mean
Chapter 2 Statistical Background. 2.3 Random Variables and Probability Distributions A variable X is said to be a random variable (rv) if for every real.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
1 We will now look at the properties of the OLS regression estimators with the assumptions of Model B. We will do this within the context of the simple.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
1 ASYMPTOTIC PROPERTIES OF ESTIMATORS: THE USE OF SIMULATION In practice we deal with finite samples, not infinite ones. So why should we be interested.
Continuous Random Variables and Probability Distributions
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: exercise 2.11 Original citation: Dougherty, C. (2012) EC220 - Introduction.
1 INSTRUMENTAL VARIABLE ESTIMATION OF SIMULTANEOUS EQUATIONS In the previous sequence it was asserted that the reduced form equations have two important.
Week 21 Order Statistics The order statistics of a set of random variables X 1, X 2,…, X n are the same random variables arranged in increasing order.
Ex St 801 Statistical Methods Inference about a Single Population Mean (CI)
This represents the most probable value of the measured variable. The more readings you take, the more accurate result you will get.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Chapter 7, part D. VII. Sampling Distribution of The sampling distribution of is the probability distribution of all possible values of the sample proportion.
Chapter 4. The Normality Assumption: CLassical Normal Linear Regression Model (CNLRM)
Nvmrao Three useful results: GOODNESS OF FIT This sequence explains measures of goodness of fit in regression analysis. It is convenient to start by demonstrating.
Chapter 6: Sampling Distributions
Lecture 3 Today: Statistical Review cont’d:
Measures of Dispersion
Some General Concepts of Point Estimation
Point and interval estimations of parameters of the normally up-diffused sign. Concept of statistical evaluation.
Chapter 4. Inference about Process Quality
Chapter 6: Sampling Distributions
Sampling Distributions
Introduction to Econometrics, 5th edition
Presentation transcript:

Estimators and estimates: An estimator is a mathematical formula. An estimate is a number obtained by applying this formula to a set of sample data. 1 ESTIMATORS It is important to distinguish between estimators and estimates. Definitions are given above. EMU, ECON 503, M. Balcılar

Population characteristicEstimator Mean:  X 4 ESTIMATORS A common example of an estimator is the sample mean, which is the usual estimator of the population mean. EMU, ECON 503, M. Balcılar

Population characteristicEstimator Mean:  X 4 ESTIMATORS Here it is defined for a random variable X and a sample of n observations. EMU, ECON 503, M. Balcılar

Population characteristicEstimator Mean:  X Population variance: 4 ESTIMATORS Another common estimator is s 2, defined above. It is used to estimate the population variance,  X 2. EMU, ECON 503, M. Balcılar

Estimators are random variables 8 ESTIMATORS An estimator is a special kind of random variable. We will demonstrate this in the case of the sample mean. EMU, ECON 503, M. Balcılar

Estimators are random variables 8 ESTIMATORS We saw in the previous sequence that each observation on X can be decomposed into a fixed component and a random component. EMU, ECON 503, M. Balcılar

Estimators are random variables 8 ESTIMATORS So the sample mean is the average of n fixed components and n random components. EMU, ECON 503, M. Balcılar

Estimators are random variables 8 ESTIMATORS It thus has a fixed component  X and a random component u, the average of the random components in the observations in the sample. EMU, ECON 503, M. Balcılar

10 ESTIMATORS probability density function of X XX X XX X probability density function of X The graph compares the probability density functions of X and X. As we have seen, they have the same fixed component. However the distribution of the sample mean is more concentrated. EMU, ECON 503, M. Balcılar

10 ESTIMATORS Its random component tends to be smaller than that of X because it is the average of the random components in all the observations, and these tend to cancel each other out. probability density function of X XX X XX X probability density function of X EMU, ECON 503, M. Balcılar

Unbiasedness of X: 1 UNBIASEDNESS AND EFFICIENCY Suppose that you wish to estimate the population mean  X of a random variable X given a sample of observations. We will demonstrate that the sample mean is an unbiased estimator, but not the only one. EMU, ECON 503, M. Balcılar

Unbiasedness of X: 2 UNBIASEDNESS AND EFFICIENCY We use the second expected value rule to take the (1/n) factor out of the expectation expression. EMU, ECON 503, M. Balcılar

Unbiasedness of X: 3 UNBIASEDNESS AND EFFICIENCY Next we use the first expected value rule to break up the expression into the sum of the expectations of the observations. EMU, ECON 503, M. Balcılar

Unbiasedness of X: 4 UNBIASEDNESS AND EFFICIENCY Each expectation is equal to  X, and hence the expected value of the sample mean is  X. EMU, ECON 503, M. Balcılar

probability density function XX estimator B How do we choose among them? The answer is to use the most efficient estimator, the one with the smallest population variance, because it will tend to be the most accurate. UNBIASEDNESS AND EFFICIENCY estimator A 12 EMU, ECON 503, M. Balcılar

probability density function estimator B In the diagram, A and B are both unbiased estimators but B is superior because it is more efficient. UNBIASEDNESS AND EFFICIENCY estimator A 13 XX EMU, ECON 503, M. Balcılar

1 CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE Suppose that you have alternative estimators of a population characteristic , one unbiased, the other biased but with a smaller population variance. How do you choose between them? probability density function  estimator B estimator A EMU, ECON 503, M. Balcılar

CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE One way is to define a loss function which reflects the cost to you of making errors, positive or negative, of different sizes. 2 error (positive)error (negative) loss EMU, ECON 503, M. Balcılar

3 CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE A widely-used loss function is the mean square error of the estimator, defined as the expected value of the square of the deviation of the estimator about the true value of the population characteristic. probability density function  estimator B EMU, ECON 503, M. Balcılar

4 CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE The mean square error involves a trade-off between the population variance of the estimator and its bias. Suppose you have a biased estimator like estimator B above, with expected value  Z. probability density function ZZ bias estimator B EMU, ECON 503, M. Balcılar

5 CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE The mean square error can be shown to be equal to the sum of the population variance of the estimator and the square of the bias. probability density function ZZ bias estimator B EMU, ECON 503, M. Balcılar

6 CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE To demonstrate this, we start by subtracting and adding  Z. EMU, ECON 503, M. Balcılar

7 CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE We expand the quadratic using the rule (a + b) 2 = a 2 + b 2 + 2ab, where a = Z -  Z and b =  Z - . EMU, ECON 503, M. Balcılar

8 CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE We use the first expected value rule to break up the expectation into its three components. EMU, ECON 503, M. Balcılar

9 CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE The first term in the expression is by definition the population variance of Z. EMU, ECON 503, M. Balcılar

10 CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE (  Z -  ) is a constant, so the second term is a constant. EMU, ECON 503, M. Balcılar

11 CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE In the third term, (  Z -  ) may be brought out of the expectation, again because it is a constant, using the second expected value rule. EMU, ECON 503, M. Balcılar

12 CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE Now E(Z) is  Z, and E(-  Z ) is -  Z. EMU, ECON 503, M. Balcılar

13 CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE Hence the third term is zero and the mean square error of Z is shown be the sum of the population variance of Z and the bias squared. EMU, ECON 503, M. Balcılar

14 CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE In the case of the estimators shown, estimator B is probably a little better than estimator A according to the MSE criterion. probability density function  estimator B estimator A EMU, ECON 503, M. Balcılar

n The sample mean is the usual estimator of a population mean, for reasons discussed in the previous sequence. In this sequence we will see how its properties are affected by the sample size. probability density function of X n = 1 EFFECT OF INCREASING THE SAMPLE SIZE ON THE DISTRIBUTION OF x EMU, ECON 503, M. Balcılar

n Suppose that a random variable X has population mean 100 and standard deviation 50, as in the diagram. Suppose that we do not know the population mean and we are using the sample mean to estimate it n = 1 EFFECT OF INCREASING THE SAMPLE SIZE ON THE DISTRIBUTION OF x probability density function of X EMU, ECON 503, M. Balcılar

n The sample mean will have the same population mean as X, but its standard deviation will be 50/, where n is the number of observations in the sample n = 1 EFFECT OF INCREASING THE SAMPLE SIZE ON THE DISTRIBUTION OF x probability density function of X EMU, ECON 503, M. Balcılar

n The larger is the sample, the smaller will be the standard deviation of the sample mean n = 1 EFFECT OF INCREASING THE SAMPLE SIZE ON THE DISTRIBUTION OF x probability density function of X EMU, ECON 503, M. Balcılar

n If n is equal to 1, the sample consists of a single observation. X is the same as X and its standard deviation is n = 1 EFFECT OF INCREASING THE SAMPLE SIZE ON THE DISTRIBUTION OF x probability density function of X EMU, ECON 503, M. Balcılar

n We will see how the shape of the distribution changes as the sample size is increased n = 4 EFFECT OF INCREASING THE SAMPLE SIZE ON THE DISTRIBUTION OF x probability density function of X EMU, ECON 503, M. Balcılar

n The distribution becomes more concentrated about the population mean n = 25 EFFECT OF INCREASING THE SAMPLE SIZE ON THE DISTRIBUTION OF x probability density function of X EMU, ECON 503, M. Balcılar

n To see what happens for n greater than 100, we will have to change the vertical scale n = 100 EFFECT OF INCREASING THE SAMPLE SIZE ON THE DISTRIBUTION OF x probability density function of X EMU, ECON 503, M. Balcılar

n We have increased the vertical scale by a factor of n = 100 EFFECT OF INCREASING THE SAMPLE SIZE ON THE DISTRIBUTION OF x probability density function of X EMU, ECON 503, M. Balcılar

n The distribution continues to contract about the population mean n = 1000 EFFECT OF INCREASING THE SAMPLE SIZE ON THE DISTRIBUTION OF x probability density function of X EMU, ECON 503, M. Balcılar

n In the limit, the variance of the distribution tends to zero. The distribution collapses to a spike at the true value. The sample mean is therefore a consistent estimator of the population mean n = 5000 EFFECT OF INCREASING THE SAMPLE SIZE ON THE DISTRIBUTION OF x probability density function of X EMU, ECON 503, M. Balcılar