3. Populations and samples Yun, Hyuk Jin. Populations Objects, or events, or procedures, or observations A population is thus an aggregate of creatures,

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
© 2011 Pearson Education, Inc
Sampling: Final and Initial Sample Size Determination
Inference for a population mean BPS chapter 18 © 2006 W. H. Freeman and Company.
Sampling Distributions (§ )
Introduction to Statistics
Chapter 10: Sampling and Sampling Distributions
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Chapter 10 Sampling and Sampling Distributions
BPS - 5th Ed. Chapter 111 Sampling Distributions.
Evaluating Hypotheses
Chapter 7: Variation in repeated samples – Sampling distributions
11 Populations and Samples.
Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.
Statistical inference Population - collection of all subjects or objects of interest (not necessarily people) Sample - subset of the population used to.
Chapter 5: Descriptive Research Describe patterns of behavior, thoughts, and emotions among a group of individuals. Provide information about characteristics.
Review of normal distribution. Exercise Solution.
Introduction Parameters are numerical descriptive measures for populations. For the normal distribution, the location and shape are described by  and.
Sampling for Research. Types of Research Quantitative – the collection & analysis of data to describe, explain, predict, or control phenomena of interest.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
Sampling Techniques LEARNING OBJECTIVES : After studying this module, participants will be able to : 1. Identify and define the population to be studied.
 The situation in a statistical problem is that there is a population of interest, and a quantity or aspect of that population that is of interest. This.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
AP Statistics Chapter 9 Notes.
ESTIMATION. STATISTICAL INFERENCE It is the procedure where inference about a population is made on the basis of the results obtained from a sample drawn.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Introduction to Inferential Statistics. Introduction  Researchers most often have a population that is too large to test, so have to draw a sample from.
Part III Gathering Data.
Basic Sampling & Review of Statistics. Basic Sampling What is a sample?  Selection of a subset of elements from a larger group of objects Why use a sample?
1 Chapter 7 Sampling and Sampling Distributions Simple Random Sampling Point Estimation Introduction to Sampling Distributions Sampling Distribution of.
MATB344 Applied Statistics Chapter 7 Sampling Distributions I.Sampling Plans and Experimental Designs II.Statistics and Sampling Distributions III.“The.
9.3: Sample Means.
Chapter 7: Sampling and Sampling Distributions
STANDARD ERROR Standard error is the standard deviation of the means of different samples of population. Standard error of the mean S.E. is a measure.
Chapter 7: Sample Variability Empirical Distribution of Sample Means.
1 Chapter Two: Sampling Methods §know the reasons of sampling §use the table of random numbers §perform Simple Random, Systematic, Stratified, Cluster,
Relative Values. Statistical Terms n Mean:  the average of the data  sensitive to outlying data n Median:  the middle of the data  not sensitive to.
Chapter 10: Introduction to Statistical Inference.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Confidence Interval Estimation For statistical inference in decision making:
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
Ka-fu Wong © 2003 Chap 6- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. PPSS The situation in a statistical problem is that there is a population of interest, and a quantity or.
1 CHAPTER 6 Sampling Distributions Homework: 1abcd,3acd,9,15,19,25,29,33,43 Sec 6.0: Introduction Parameter The "unknown" numerical values that is used.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Introduction to Inference Sampling Distributions.
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 5 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
Design of Clinical Research Studies ASAP Session by: Robert McCarter, ScD Dir. Biostatistics and Informatics, CNMC
Chapter 7 Data for Decisions. Population vs Sample A Population in a statistical study is the entire group of individuals about which we want information.
Warsaw Summer School 2015, OSU Study Abroad Program Normal Distribution.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
Sampling Concepts Nursing Research. Population  Population the group you are ultimately interested in knowing more about “entire aggregation of cases.
Confidence Intervals Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
Review Design of experiments, histograms, average and standard deviation, normal approximation, measurement error, and probability.
Chapter 9 Sampling Distributions 9.1 Sampling Distributions.
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
Sampling Sampling Distributions. Sample is subset of population used to infer something about the population. Probability – know the likelihood of selection.
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Sampling Distributions
ESTIMATION.
Lecture 15 Sections 7.3 – 7.5 Objectives:
Sampling Distributions (§ )
Chapter 7 Sampling and Sampling Distributions
Chapter 5: Sampling Distributions
Presentation transcript:

3. Populations and samples Yun, Hyuk Jin

Populations Objects, or events, or procedures, or observations A population is thus an aggregate of creatures, things, cases and so on. Not be able to enumerate it exactly. Needs precise information on such matters to draw valid inferences from the sample that was studied to the population being considered. μ : population mean, σ : standard deviation

Samples A population commonly contains too many individuals to study conveniently, so an investigation is often restricted to one or more samples drawn from it. A well chosen sample will contain most of the information about a particular population parameter, but the relation between the sample and the population must be such as to allow true inferences to be made about a population from that sample.

Samples Drawn must have a known non-zero chance Natural suggestion is that these chances should be equal Independent Use a table of random numbers The word "random" does not describe the sample as such but the way in which it is selected.

Samples Can occasionally coincide by chance with some unforeseen regularity in the presentation of the material for study. As susceptibility to disease generally varies in relation to age, sex, occupation, family history, and many other genetic or environmental factors. It is advisable to examine samples when drawn to see whether they are, on average, comparable in these respects.

Samples The random process of selection is intended to make them so, but sometimes it can by chance lead to disparities. To guard against this possibility the sampling may be stratified. - Stratified random sampling This means that a framework is laid down initially, and the patients or objects of the study in a random sample are then allotted to the compartments of the framework.

Samples But they will not reflect the distribution of these categories in the population from which the sample is drawn unless the compartments in the framework have been designed with that in mind. For taking a sample from a long list a compromise between strict theory and practicalities is known as a systematic random sample. In this case we choose subjects a fixed interval apart on the list, say every tenth subject, but we choose the starting point within the first interval at random.

Unbiasedness and precision When we say that a measurement is unbiased we mean that the average of a large set of unbiased measurements will be close to the true value. When we say it is precise we mean that it is repeatable. Repeated measurements will be close to one another, but not necessarily close to the true value. An estimate of a parameter taken from a random sample is known to be unbiased. As the sample size increases, it gets more precise.

Randomisation Another use of random number tables is to randomise the allocation of treatments to patients in a clinical trial. This ensures that there is no bias in treatment allocation and, in the long run, the subjects in each treatment group are comparable in both known and unknown prognostic factors. A common method is to use blocked randomisation. This is to ensure that at regular intervals there are equal numbers in the two groups. Usual sizes for blocks are two, four, six, eight, and ten.

Variation between samples The variation between samples depends partly on the amount of variation in the population from which they are drawn. It is a matter of common observation that a small sample is a much less certain guide to the population from which it was drawn than a large sample. Convenience sample - we are not in a position to take a random sample; our sample is simply those subjects available for study.

Standard error of the mean Normal distribution - Central Limit Theorem The variation depends on the variation of the population and the size of the sample. We do not know the variation in the population so we use the variation in the sample as an estimate of it. This is expressed in the standard deviation. If we now divide the standard deviation by the square root of the number of observations in the sample we have an estimate of the standard error of the mean,

Standard error of the mean Diastolic blood pressure of men aged differs between the printers and the farm workers. For this purpose she has obtained a random sample of 72 printers and 48 farm workers and calculated the mean and standard deviations.

Standard error of a proportion or a percentage The size of the sample will affect the size of the standard error but the amount of variation is determined by the value of the percentage or proportion in the population itself, and so we do not need an estimate of the standard deviation. Acute appendicitis in people aged 65 and over. As a preliminary study he examines the hospital case notes over the previous 10 years and finds that of 120 patients in this age group with a diagnosis confirmed at operation 73 (60.8%) were women and 47 (39.2%) were men.

Problems with non-random samples In general we do not have the luxury of a random sample; we have to make do with what is available, a "convenience sample". the patients available are not typical. Common biases are: –hospital patients are not the same as ones seen in the community; –volunteers are not typical of non-volunteers; –patients who return questionnaires are different from those who do not.

Problems with non-random samples In order to persuade the reader that the patients included are typical it is important to give as much detail as possible at the beginning of a report of the selection process and some demographic data such as age, sex, social class and response rate.

Exercises Exercise 3.1 –The mean urinary lead concentration in 140 children was 2.18 mol/24 h, with standard deviation What is the standard error of the mean? Exercise 3.2 –In Table F (Appendix), what is the distribution of the digits, and what are the mean and standard deviation? Exercise 3.3 –For the first column of five digits in Table F take the mean value of the five digits and do this for all rows of five digits in the column. –What would you expect a histogram of the means to look like? –What would you expect the mean and standard deviation to be?