Presentation on theme: "12 - 1 Module 12: Populations, Samples and Sampling Distributions This module provides basic information about the statistical concepts of populations."— Presentation transcript:
Module 12: Populations, Samples and Sampling Distributions This module provides basic information about the statistical concepts of populations and samples, selecting samples from population and the critical issue of sampling distributions. Reviewed 05 May 05 / MODULE 12
Population: The entire group about which information is desired. Sample: A proportion or part of the population - usually the proportion from which information is gathered. Populations and Samples
Target Population The participants to whom the answer to the question pertains. The target population definition has two aspects: Conceptual Operational
Population Definition Adults and children years of age residing in four census tracts in Richfield, a suburb of Minneapolis Adults years of age residing in Cedar County, Iowa and certain rural townships in neighboring counties on July 1, 1973 Employees of Pacific Northwest Bell Telephone Company working in King County A population definition gives a clear statement of those included. The following are some examples:
Population Person Population of Cholesterol values (mg/dl)
Population of Cholesterol values
Sampling In its broadest sense, sampling is a procedure by which one or more members of a population are picked from the population. The objective is to make certain observations upon the members of the sample and then, on the basis of these results, to draw conclusions about the characteristics of the entire population.
Selecting a Sample Haphazard Sample: Haphazard samples are constructed by arbitrarily selecting individual sample members. Random Sample: There are several methods for constructing random samples—we consider only simple random samples. This process operates so that each member of the population has an equal chance of being selected into the sample.
Selecting a Sample The selection process: Assign to each member of the population the equivalent of sequential ID number; Use a random number table or computer generated numbers; For computer generated numbers, generate one for each ID number, sort the ID numbers in order according to the random number and take the first on the list up to the point when you have the sample size you need For a table, haphazardly select a starting point and then Ignore numbers that are too large Ignore a number after it appears the first time
Fundamental and Important Concept We now begin the discussion of perhaps the most important concept in biostatistics. It is fundamental to understanding and thus interpreting correctly the use of the many statistical tools we will cover in this course. The concept is not complex, in fact, it is rather simple. It does require, however, thinking about issues in a manner that may initially appear somewhat different and unusual.
Looking at the Process When we randomly select a sample from a population, we can use the mean for the sample as an estimate or guess as to the value for the mean of the population. This should bring up the question as to how good is this sample mean or sample statistic as a guess for the value of the population mean or population parameter. The essence of this question has to do with how well this process works—the process of using a sample to make guesses about the population.
Understanding the Process Two important aspects of this fundamental process: FIRST: It is critical to recognize that it is a process SECOND:It is important to understand how and how well the process works
How Good is a Sample Mean The essential question is “How good is a sample mean as an estimate of the population mean?” One way to examine this question is to understand that we used a process that involved randomly selecting a sample from the population and then calculating the mean for the values of the observations in the sample. We can repeat this process as many times as we wish and examine what it produces.
Sample with n = … Sample of 5 weights Population of weights
Ten Different Samples, n = 5 SamplenMeans2s2 s Average
Standard Error of the Mean The population that includes all possible samples of size n is a long list of numbers and the variance for these numbers can, in theory, be calculated. The square root of this variance is called the standard error of the mean. It is simply the standard deviation for this population of means.
Sample with n = Sample of 20 weights
Ten Different Samples, n = 20 SamplenMeans2s2 s Average
Sampling Distributions Individual observations Means for n = 5 Means for n = µ = 150 lbs 2 = 100lbs = 10 lbs