1 Introduction to Biostatistics (PUBHLTH 540) Estimating Parameters Which estimator is best? Study possible samples, determine Expected values, bias, variance,

Slides:



Advertisements
Similar presentations
Chapter 23: Inferences About Means
Advertisements

Chapter 6 Sampling and Sampling Distributions
Sampling: Final and Initial Sample Size Determination
Chap 8-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 8 Estimation: Single Population Statistics for Business and Economics.
Statistics for Business and Economics
Chapter 7 Introduction to Sampling Distributions
1 Point and Interval Estimates Examples with z and t distributions Single sample; two samples Result: Sums (and differences) of normally distributed RV.
1 Introduction to Biostatistics (PUBHLTH 540) Hypothesis Testing General Idea How unusual is the result? Test statistics Type I error (alpha level) p-value.
Class notes for ISE 201 San Jose State University
Chapter 7 Sampling and Sampling Distributions
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 10 th Edition.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Introduction to Statistics: Chapter 8 Estimation.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Introduction to Statistics Chapter 7 Sampling Distributions.
1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 22 = More of Chapter “Confidence Interval Estimation”
Chapter 8 Estimation: Single Population
Chapter 7 Estimation: Single Population
Modular 13 Ch 8.1 to 8.2.
Let sample from N(μ, σ), μ unknown, σ known.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
Standard error of estimate & Confidence interval.
Confidence Intervals for the Mean (σ Unknown) (Small Samples)
Confidence Interval A confidence interval (or interval estimate) is a range (or an interval) of values used to estimate the true value of a population.
Review of normal distribution. Exercise Solution.
1 Terminating Statistical Analysis By Dr. Jason Merrick.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Confidence Interval Estimation
Sampling distributions, Point Estimation Week 3: Lectures 3 Sampling Distributions Central limit theorem-sample mean Point estimators-bias,efficiency Random.
Dan Piett STAT West Virginia University
Topic 5 Statistical inference: point and interval estimate
© 2003 Prentice-Hall, Inc.Chap 6-1 Business Statistics: A First Course (3 rd Edition) Chapter 6 Sampling Distributions and Confidence Interval Estimation.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Confidence Intervals for Means. point estimate – using a single value (or point) to approximate a population parameter. –the sample mean is the best point.
Continuous Probability Distributions Continuous random variable –Values from interval of numbers –Absence of gaps Continuous probability distribution –Distribution.
Chap 6-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 6 Introduction to Sampling.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Normal Distributions Z Transformations Central Limit Theorem Standard Normal Distribution Z Distribution Table Confidence Intervals Levels of Significance.
Estimation This is our introduction to the field of inferential statistics. We already know why we want to study samples instead of entire populations,
Determination of Sample Size: A Review of Statistical Theory
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc.. Chap 7-1 Developing a Sampling Distribution Assume there is a population … Population size N=4.
Sample Variability Consider the small population of integers {0, 2, 4, 6, 8} It is clear that the mean, μ = 4. Suppose we did not know the population mean.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Confidence Intervals for the Mean (Small Samples) 1 Larson/Farber 4th ed.
Chapter 7 Statistical Inference: Estimating a Population Mean.
Mystery 1Mystery 2Mystery 3.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Point Estimates point estimate A point estimate is a single number determined from a sample that is used to estimate the corresponding population parameter.
Confidence Intervals for a Population Mean, Standard Deviation Unknown.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Sampling Theory and Some Important Sampling Distributions.
Chapter 9: One- and Two-Sample Estimation Problems: 9.1 Introduction: · Suppose we have a population with some unknown parameter(s). Example: Normal( ,
Lecture 5 Introduction to Sampling Distributions.
8.1 Estimating µ with large samples Large sample: n > 30 Error of estimate – the magnitude of the difference between the point estimate and the true parameter.
Copyright © 2015, 2012, and 2009 Pearson Education, Inc. 1 Section 6.2 Confidence Intervals for the Mean (  Unknown)
Section 6.2 Confidence Intervals for the Mean (Small Samples) Larson/Farber 4th ed.
Ex St 801 Statistical Methods Inference about a Single Population Mean (CI)
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
1 Day 3 QMIM Confidence Interval for a Population mean of Large and Small Samples by Binam Ghimire.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Confidence Intervals Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
Section 6.2 Confidence Intervals for the Mean (Small Samples) © 2012 Pearson Education, Inc. All rights reserved. 1 of 83.
Probability & Statistics Review I 1. Normal Distribution 2. Sampling Distribution 3. Inference - Confidence Interval.
ESTIMATION.
Chapter 6 Confidence Intervals.
Chapter 6 Confidence Intervals.
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
Interval Estimation Download this presentation.
Presentation transcript:

1 Introduction to Biostatistics (PUBHLTH 540) Estimating Parameters Which estimator is best? Study possible samples, determine Expected values, bias, variance, MSE –with replacement example –without replacement example (Exam 1) Estimate population mean –point estimator (sample mean) –interval estimator (95% central width) Central Limit theorem Interval estimators based on a sample –estimating the standard error –determining the multiplier (normal and t-distributions)

2 Sampling with replacement Program ejs09b540p19.sas –uses Arrays, Outputs, and Transpose –Select SRS w rep from N=5 with n=3 –Uniform random number generator Program ejs09b540p20.sas –Replaces sample size, pop size, and trials with macro variables (gives flexibility) –Uses functions of arrays to get mean, var, min, max –Select SRS w rep from N=5 with n=3

3 SRS without Replacement Program ejs09b540p21.sas –Process of selecting subjects without replacement –Do loops, shifting indices etc. Program ejs09b540p22.sas –Implementable version with macro variables Program ejs09b540p23.sas –Check that all sample sets have equal probability –n=3 from N=3 with functions to get sets

4 Which Estimator of Population Median is Best? Program ejs09b540p24.sas –Add data from population, and link response for sample subject sets –Evaluate sample median, mean, (min+max)/2 Program ejs09b540p25.sas –Summarize results of samples- using expected value, variance, MSE of estimators –Use PROC MEANS options for VARDEF=N, and MAXDEC=2 –Sample mean has smallest MSE –Is this always true?

5 Estimate Pop Median Age in Seasons Study Data Program ejs09b540p26.sas –use basev2.sas7bdat with “Age” –include histograms of distribution of estimator over possible samples –best estimator is not the mean!- BEST depends on the population… Program ejs09b540p27.sas –estimate Pop Mean using sample mean from SRS w/o rep. of n=25 –How does var of sample means relate to the population variance?

6 Relating Population Variance to the Variance of the Sample Means Population Variance Variance of Sample Mean (without replacement: with T=10,000 trials…

7 Interval Estimate idea is to place an interval around an estimate to approximate the width of the estimators sampling distribution usually, the width is the central 95% of the estimators sampling distribution How wide is this? –measure width in terms of stderr of mean

8 How good is Approximation? Program ejs09b540p28.sas –SRS w/o rep of n=5 to estimate Mean LDL cholesterol from the Seasons study using the sample mean, 10 samples. –determine the 2.5 th percentile and 97.5 th percentile of the distribution of sample means. –Determine how many multiples of stderror of mean the percentiles are from the population mean

9 Example of 95% Width Program ejs09b540p28.sas Change number of samples to Determine multiples for standard error –Lower 2.5% multiplier is –Upper 97.5 multiplier is 2.02 –Standard Deviation of sample means = se(Mean)=15.94 Program ejs09b540p30.sas –select srs w/o rep of n=5, estimate mean sample mean=166.7 Low= (15.94) High= (15.94)

10 Example of Triglycerides- Seasons Study

11 Example of Triglycerides- Seasons Study Take 10,000 SRS w/o replacement of size n=5 (program ejs09b540p31.sas) Population: SourceSim Source ejs09b540p31.sas Multiplier of for 2.5 %ile Multiplier of for 97.5 %ile

12 Example of Triglycerides- Seasons Study

13 Example of Triglycerides- Seasons Study

14 Example of Triglycerides- Seasons Study

15 Example of Triglycerides- Seasons Study

16 Example of Triglycerides- Seasons Study

17 Example of Triglycerides- Seasons Study

18 Conclusions With larger sample size, distribution of sample means is more bell shaped (i.e. ‘normal’) (Central Limit Theorem) Central 95% of distribution is around + or - 2 standard errors from true population mean In practice we don’t know the SE In practice we don’t know the multiplier Solution: Estimate SE from sample Solution: Approximate multipler assuming a distribution (Normal if known or t-distribution if not known)

19 Normal Distribution With larger sample sizes, the distribution of SRS means is normal: Standard Normal Distribution

20 Transforming a Random Variable Standardization is an example of transforming a random variable. Suppose we have a random variable: What is the expected value and variance of X=a+bY?

21 Transforming a Random Variable Variance of X=a+bY?

22 Transforming a Random Variable Application for Standardizing

23 Conclusions- Practical Assume Central Limit Theorem holds (usually if n>30) Use multiplier based on centered distribution of standard normal (if is known) see Table A3 in Text –central 60%-0.84 to –central 80% to 1.28 –central 90% to 1.64 –central 95%-1.96 to 1.96 –central 99%-2.56 to 2.56

24 Conclusions- Practical In practice we don’t know Estimate using Use a t-distribution with (n-1) degrees of freedom for multiplies (see table A4 in text). –assumes underlying normal distribution and srs

25 Conclusions- Practical t-distribution examples for 95% interval estimator (Confidence interval): –n=2df=1-4.3 to 4.3 –n=5df= to –n=10df= to –n=20df= to –n=30df= to –n=50df= to –n=120df= to 1.98 –n=500df= to 1.96