
Differences Between Populations and Samples

Introduction In medicine, business, sports, science, and other fields, important decisions are based on statistical information drawn from samples. A sample is a subset of the population. The wise selection of samples often determines the success of those who use the information. One sample may be reliable enough to predict an election or justify a new medical procedure, while other samples are simply not reliable. Some conclusions based on statistical samples are little more than guesses, and some are reckless conclusions in life-or-death matters; in many cases, it all comes down to whether the sample selected is genuinely random.

Key Concepts The word statistics has two different but related meanings. On a basic level, a statistic is a measure of a sample that is used to estimate a corresponding measure of a population (all of the people, objects, or phenomena of interest in an investigation). A statistic is a number used to summarize, describe, or represent something about a sample drawn from a larger population; the statistic allows us to make predictions about that population. A measure of the population that we are interested in is a parameter, a numerical value that represents the data in a set.

Key Concepts, continued We use different notation for sample statistics and population parameters. For example, the symbol for the mean of a population is μ, the Greek letter mu, whereas the symbol for the mean of a sample is x̄, pronounced “x bar.” The symbol for the standard deviation of a population is σ, the lowercase version of the Greek letter sigma; the symbol for the standard deviation of a sample is s.

Key Concepts, continued Though the formulas for the mean of a population and the mean of a sample are essentially the same, the formula for the standard deviation of a sample is slightly different from the formula for the standard deviation of a population.

Key Concepts, continued For a population, the formula is $\sigma = \sqrt{\frac{\sum_{i=1}^{n} (x_i - \mu)^2}{n}}$, with σ representing the standard deviation of the population and μ representing the mean of the population. For a sample, the formula is $s = \sqrt{\frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}}$, with s representing the standard deviation of the sample and x̄ representing the mean of the sample. In both formulas, n is the number of data values.
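The difference between the two standard deviation measures can be sketched in a few lines of Python. This is only an illustration: the data values and the function names `population_sd` and `sample_sd` are made up for the example and are not part of the lesson. The population version divides the sum of squared deviations by n, while the sample version divides by n - 1.

```python
import math
import statistics

# Hypothetical data values, chosen only for illustration.
data = [2, 4, 4, 4, 5, 5, 7, 9]

def population_sd(values):
    """Standard deviation when the values are the entire population (divide by n)."""
    mu = sum(values) / len(values)
    return math.sqrt(sum((x - mu) ** 2 for x in values) / len(values))

def sample_sd(values):
    """Standard deviation when the values are a sample (divide by n - 1)."""
    x_bar = sum(values) / len(values)
    return math.sqrt(sum((x - x_bar) ** 2 for x in values) / (len(values) - 1))

sigma = population_sd(data)
s = sample_sd(data)

# Cross-check against the standard library: pstdev is the population
# measure and stdev is the sample measure.
print(sigma, statistics.pstdev(data))
print(s, statistics.stdev(data))
```

For the same data, the sample measure is always a little larger than the population measure, which is why it matters which one is selected on a calculator.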

Key Concepts, continued When using a graphing calculator to find standard deviations of data sets, it is important to recognize whether the data set is a population or a sample so that the proper measure of standard deviation is selected. On a higher level, the field of statistics concerns the science and mathematics of describing and making inferences about a population from a sample. An inference is a conclusion reached upon the basis of evidence and reasoning.

Key Concepts, continued How well a statistic computed from a sample describes a population depends greatly upon the quality of the sampling method(s) used. First, the sample must be representative of the population. A representative sample is a sample in which the characteristics of the people, objects, or items in the sample are similar to the characteristics of the population.

Key Concepts, continued Samples that represent a population well can provide valuable information about that population. In research, it may be impractical to gather information about an entire population because of time, money, availability, privacy, and many other issues. In these cases, representative sampling may provide researchers with an efficient way to gather information and make decisions.

Key Concepts, continued In addition to the need for sampling to be representative, it must also produce reliable measures. Reliability refers to the degree to which a study or experiment performed many times would have similar results. When small samples are used, there is often great variability and little consistency among the statistics that are found.

Key Concepts, continued By increasing sample size, the variability in many sample statistics (such as means, standard deviations, and proportions) can be reduced, resulting in improved reliability and greater consistency of results. Statistical reasoning often involves making decisions based on limited information. In particular, when a population of interest is too large or expensive to study, a carefully chosen sample is used.
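The effect of sample size on variability can be seen in a small simulation. The sketch below is an illustration under stated assumptions: the population is invented (10,000 values from a made-up distribution), and the function name `spread_of_sample_means` is hypothetical. It draws many random samples of a given size and reports how much the sample means vary from one sample to the next.

```python
import random
import statistics

random.seed(0)  # fixed seed so the sketch gives the same result each run

# Hypothetical population: 10,000 invented values.
population = [random.gauss(50, 10) for _ in range(10_000)]

def spread_of_sample_means(sample_size, repetitions=500):
    """Draw many random samples and return the standard deviation of their means."""
    means = [
        statistics.mean(random.sample(population, sample_size))
        for _ in range(repetitions)
    ]
    return statistics.stdev(means)

small = spread_of_sample_means(5)    # small samples: means vary widely
large = spread_of_sample_means(100)  # larger samples: means are more consistent
print(small, large)
```

The means of the larger samples cluster much more tightly, which is the improved reliability described above.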

Key Concepts, continued One of the most important things that a researcher should understand about a population is the amount of chance variation, or sampling error, that is present in the measures of interest in that population. Chance variation is a measure showing how precisely a sample reflects the population, with smaller sampling errors resulting from larger samples and/or from data that cluster closely around the mean.

Key Concepts, continued If a population is small enough, then parameters (such as measures of average, variation, or proportions) can be measured directly. There is no need for sampling in these cases. For example, if a teacher wants to know the mean grade for a recent test, he can calculate the mean of the entire class. If a population is large, or if it is impractical to measure all members of a population, then estimates are made from samples. The accuracy and reliability of the estimates depend on the quality of the sampling procedures used.

Key Concepts, continued In general, estimates of a population based on data from large samples are more reliable than estimates from small samples. In estimating the mean of a population, a sample size greater than 30 is recommended. In some cases, the sample size is much larger. In estimating proportions, a larger sample is desirable.

Key Concepts, continued Validity is the degree to which the results obtained from a sample measure what they are intended to measure. The validity of inferences made about a population depends greatly on the amount of bias, or lack of neutrality, in sampling procedures. A biased sample is a sample in which some members of the population have a better chance of inclusion in the sample than others.

Key Concepts, continued An estimate made using a sample that is biased is likely to be inaccurate even if a large sample is used. For example, if a publisher wants to determine the percent of readers who prefer printed books to e-books, interviewing 100 people shopping at a bookstore may yield biased results, since those people are more likely to be deliberately seeking out printed books instead of e-books for a variety of reasons (they prefer printed books, they don’t own e-readers, they lack Internet access, etc.).
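The bookstore example can be imitated with a simulation. Every number in this sketch is invented for illustration (the 40% preference rate, the chances of shopping at a bookstore, and the population size), and the name `percent_print` is hypothetical. The point is only that a large biased sample can still miss badly, while a random sample of the same size lands close to the true value.

```python
import random

random.seed(1)  # fixed seed so the sketch gives the same result each run

# Hypothetical population of 100,000 readers; all rates below are invented.
# Each reader is stored as (prefers_print, shops_at_bookstore).
population = []
for _ in range(100_000):
    prefers_print = random.random() < 0.40
    # Assumption: readers who prefer print are far more likely to be in a bookstore.
    shops_at_bookstore = random.random() < (0.50 if prefers_print else 0.10)
    population.append((prefers_print, shops_at_bookstore))

def percent_print(readers):
    """Percent of the given readers who prefer printed books."""
    return 100 * sum(1 for prefers, _ in readers if prefers) / len(readers)

true_percent = percent_print(population)

# Every reader has the same chance of being chosen.
random_sample = random.sample(population, 1000)

# Only bookstore shoppers can be chosen, so print fans are overrepresented.
bookstore_shoppers = [reader for reader in population if reader[1]]
biased_sample = random.sample(bookstore_shoppers, 1000)

print(true_percent, percent_print(random_sample), percent_print(biased_sample))
```

Both samples contain 1,000 readers, so the difference in accuracy comes from how the samples were chosen, not from their size.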

Common Errors/Misconceptions
- not recognizing that results from an experiment or an observational study with a small sample size are unreliable
- not recognizing that samples that are biased can lead to misleading results even if numerical calculations are accurate
- not understanding that some of the variation in samples can be attributed to chance variation/sampling error

Guided Practice Example 2 High levels of blood glucose are a strong predictor for developing diabetes. Blood glucose is typically tested after fasting overnight, and the test result is called a fasting glucose level. A doctor wants to determine the percentage of his patients who have high glucose levels. He reviewed the glucose test results for 25 patients to determine how many of them had a fasting glucose level greater than 100 mg/dL (milligrams per deciliter). He recorded each patient’s fasting glucose level in a table, shown on the next slide.

Guided Practice: Example 2, continued Identify the population, parameter, sample, and statistic of interest in this situation, and then calculate the percent of patients in the sample with a fasting glucose level above 100 mg/dL.
[Table: Patient glucose levels in mg/dL]

Guided Practice: Example 2, continued 1. Identify the population in this situation. The population is all patients of this doctor.

Guided Practice: Example 2, continued 2. Identify the parameter in this situation. The parameter is the percent of patients with a fasting glucose level greater than 100 mg/dL.

Guided Practice: Example 2, continued 3. Identify the sample in this situation. The sample is the 25 patients whose blood tests the doctor reviewed.

Guided Practice: Example 2, continued 4. Identify the statistic of interest in this situation. The statistic of interest is the percent of patients in the sample with a fasting glucose level greater than 100 mg/dL.

Guided Practice: Example 2, continued 5. Calculate the statistic of interest. To calculate the percent of patients in the sample with a fasting glucose level greater than 100 mg/dL, use the fraction $\frac{x}{n}$, where x represents the number of patients with a fasting glucose level greater than 100 mg/dL and n represents the number of patients in the sample.

Guided Practice: Example 2, continued From the table, it can be seen that 7 of the values are greater than 100, so x = 7. The total number of patients in the sample is 25, so n = 25. Substitute 7 for x and 25 for n, and then solve: $\frac{x}{n} = \frac{7}{25} = 0.28$.

Guided Practice: Example 2, continued Of the patients in the sample, 0.28 or 28% had a fasting glucose level greater than 100 mg/dL. Note: It is important to recognize that this may be an inaccurate estimate because the patients in the sample may not be representative of the entire population of the doctor’s patients.
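The calculation in Example 2 can be checked with a short Python sketch. The function name `sample_percent` is hypothetical; the values x = 7 and n = 25 come from the example itself.

```python
def sample_percent(x, n):
    """Return the percent of a sample of size n that has the trait counted by x."""
    if n <= 0:
        raise ValueError("sample size must be positive")
    return 100 * x / n

# x = 7 patients above 100 mg/dL, n = 25 patients in the sample.
percent_high = sample_percent(7, 25)
print(percent_high)  # 28.0
```

The result is a statistic: it describes the 25 patients in the sample and only estimates the parameter for all of the doctor's patients.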

Guided Practice Example 3 Data collected by the National Climatic Data Center from 1971 to 2000 was used to determine the average total yearly precipitation for each state. The table on the next slide shows the mean yearly precipitation for a random sample of 10 states and each state’s ranking in relation to the rest of the states, where a ranking that’s closer to 1 indicates a higher mean yearly precipitation. Use the sample data to estimate the total rainfall in all 50 states for the 30-year period from 1971 to 2000. Identify the population, parameter, sample, and statistic of interest in this situation.

Guided Practice: Example 3, continued
Ranking  State         Mean yearly precipitation (in inches)
5        Florida       54.5
8        Arkansas      [...]
[...]    Kentucky      [...]
[...]    Ohio          [...]
[...]    Kansas        [...]
[...]    Nebraska      [...]
[...]    Alaska        [...]
[...]    South Dakota  [...]
[...]    North Dakota  [...]
[...]    New Mexico    14.6

Guided Practice: Example 3, continued 1. Identify the population in this situation. The population is all 50 states.

Guided Practice: Example 3, continued 2. Identify the parameter in this situation. The parameter is the total rainfall from 1971 to 2000.

Guided Practice: Example 3, continued 3. Identify the sample in this situation. The sample is the 10 randomly selected states.

Guided Practice: Example 3, continued 4. Identify the statistic of interest in this situation. The statistic of interest is the mean yearly precipitation for the sample.

Guided Practice: Example 3, continued 5. Calculate the statistic of interest. To calculate the mean yearly precipitation for the sample, first find the total mean yearly precipitation in the sample states. To do this, find the sum of the mean yearly precipitation of each state. The total mean yearly precipitation for the 10 sample states is 320.6 inches.

Guided Practice: Example 3, continued Next, use this value to estimate the total precipitation in all 50 states for 1 year. Create a proportion, as shown on the next slide; then, solve it for the unknown value.

Guided Practice: Example 3, continued
$\frac{320.6}{10} = \frac{x}{50}$    Substitute known values.
$10x = 50(320.6)$    Cross-multiply to solve for x.
$10x = 16{,}030$    Simplify.
$x = 1{,}603$

Guided Practice: Example 3, continued Based on the data from 1971 to 2000, the estimated total precipitation in all 50 states for 1 year during this time frame is 1,603 inches per year. Use this value to estimate the total precipitation in all 50 states for this period of 30 years.

Guided Practice: Example 3, continued Multiply the estimated precipitation in all 50 states for 1 year by 30: 1,603(30) = 48,090. The estimated total rainfall in all 50 states for the 30-year period from 1971 to 2000 is 48,090 inches.
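The scaling steps in Example 3 can be retraced in Python. This is a sketch of the arithmetic only; the variable names are chosen for illustration, and the sample total of 320.6 inches is taken from the example rather than recomputed from the table.

```python
sample_total = 320.6  # sum of mean yearly precipitation for the sample states (inches)
sample_states = 10    # states in the random sample
all_states = 50       # states in the population
years = 30            # 1971 to 2000

# Scale the sample total up to all 50 states (same as solving the proportion for x).
one_year_total = sample_total * all_states / sample_states

# Scale the 1-year estimate up to the full 30-year period.
thirty_year_total = one_year_total * years

# Rounding guards against tiny floating-point differences.
print(round(one_year_total, 1), round(thirty_year_total, 1))
```

The estimate depends entirely on the 10 sampled states, so a different random sample would likely give a somewhat different total.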