Inferential Statistics A Closer Look. Analyze Phase2 Nature of Inference in·fer·ence (n.) “The act or process of deriving logical conclusions from premises.

Slides:



Advertisements
Similar presentations
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Advertisements

Chapter 6 Sampling and Sampling Distributions
A Sampling Distribution
Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Chapter 10: Estimating with Confidence
Sampling Distributions (§ )
Chapter 7 Sampling and Sampling Distributions
Sampling Distributions
Topic 2: Statistical Concepts and Market Returns
Inference.ppt - © Aki Taanila1 Sampling Probability sample Non probability sample Statistical inference Sampling error.
Part III: Inference Topic 6 Sampling and Sampling Distributions
Chapter 10: Estimating with Confidence
Probability Population:
Inferential Statistics
Standard error of estimate & Confidence interval.
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
1 Ch6. Sampling distribution Dr. Deshi Ye
 The situation in a statistical problem is that there is a population of interest, and a quantity or aspect of that population that is of interest. This.
Essentials of Marketing Research
Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
Chapter 11: Estimation Estimation Defined Confidence Levels
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Sample Size Determination CHAPTER Eleven.
AP Statistics Chapter 9 Notes.
Topic 5 Statistical inference: point and interval estimate
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Introduction to Statistical Inference Chapter 11 Announcement: Read chapter 12 to page 299.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Review: Two Main Uses of Statistics 1)Descriptive : To describe or summarize a collection of data points The data set in hand = all the data points of.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
PARAMETRIC STATISTICAL INFERENCE
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Sample Size Determination CHAPTER thirteen.
Inferential Statistics 2 Maarten Buis January 11, 2006.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
9.3: Sample Means.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Chapter Thirteen Copyright © 2004 John Wiley & Sons, Inc. Sample Size Determination.
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Confidence Interval Estimation For statistical inference in decision making:
Inferences from sample data Confidence Intervals Hypothesis Testing Regression Model.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Inferential Statistics Introduction. If both variables are categorical, build tables... Convention: Each value of the independent (causal) variable has.
1 Sampling Distribution of Arithmetic Mean Dr. T. T. Kachwala.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. PPSS The situation in a statistical problem is that there is a population of interest, and a quantity or.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Review of Statistical Terms Population Sample Parameter Statistic.
Sampling Theory and Some Important Sampling Distributions.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 5 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Chapter 7 Introduction to Sampling Distributions Business Statistics: QMIS 220, by Dr. M. Zainal.
Lecture 8: Measurement Errors 1. Objectives List some sources of measurement errors. Classify measurement errors into systematic and random errors. Study.
Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
Chapter 6 Sampling and Sampling Distributions
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Analyze Phase Inferential Statistics
Making inferences from collected data involve two possible tasks:
Six Sigma Green Belt Training
Sampling Distributions (§ )
Presentation transcript:

Inferential Statistics A Closer Look

Analyze Phase2 Nature of Inference in·fer·ence (n.) “The act or process of deriving logical conclusions from premises known or assumed to be true. The act of reasoning from factual knowledge or evidence.” 1 1. Dictionary.com Inferential Statistics – To draw inferences about the process or population being studied by modeling patterns of data in a way that account for randomness and uncertainty in the observations Wikipedia.com

Analyze Phase3 5 Step Approach to Inferential Statistics So many questions….? 1. What do you want to know? 2. What tool will give you that information? 5. How confident are you with your data summaries? 4. How will you collect the data? 3. What kind of data does that tool require?

Analyze Phase4 Types of Error 1.Error in sampling Error due to differences among samples drawn at random from the population (luck of the draw). This is the only source of error that statistics can accommodate. 2.Bias in sampling Error due to lack of independence among random samples or due to systematic sampling procedures (height of horse jockeys only). 3.Error in measurement Error in the measurement of the samples (MSA/GR&R) 4.Lack of measurement validity Error in the measurement does not actually measure what it intends to measure (placing a probe in the wrong slot measuring temperature with a thermometer that is just next to a furnace).

Analyze Phase5 Population, Sample, Observation Population –EVERY data point that has ever been or ever will be generated from a given characteristic. Sample –A portion (or subset) of the population, either at one time or over time. Observation –An individual measurement. X X X X X X

Analyze Phase6 Significance Significance is all about differences. In general, larger differences (or deltas) are considered to be “more significant.” Practical difference and significance is: The amount of difference, change, or improvement that will be of practical, economic, or technical value to you. The amount of improvement required to pay for the cost of making the improvement. Statistical difference and significance is: The magnitude of difference or change required to distinguish between a true difference, change, or improvement and one that could have occurred by chance. Six Sigma decisions will ultimately have a return on resource investment (RORI)* element associated with them. The key question of interest for our decisions “is the benefit of making a change worth the cost and risk of making it?” * RORI includes not only dollars and assets but the time and participation of your teams.

Analyze Phase7 The Mission Your mission, which you have chosen to accept, is to reduce cycle time, reduce the error rate, reduce costs, reduce investment, improve service level, improve throughput, reduce lead time, increase productivity… change the output metric of some process, etc… In statistical terms, this translates to the need to move the process mean and/or reduce the process standard deviation You’ll be making decisions about how to adjust key process input variables based on sample data, not population data - that means you are taking some risks. How will you know your key process output variable really changed, and is not just an unlikely sample? The Central Limit Theorem helps us understand the risk we are taking and is the basis for using sampling to estimate population parameters. Mean Shift Variation Reduction Both

Analyze Phase8 A Distribution of Sample Means Imagine you have some population, the individual values of this population form some distribution. Take a sample of some of the individual values and calculate the sample mean. Keep taking samples and calculating sample means. Plot a new distribution of these sample means. The central limit theorem says that as the sample size becomes large, this new distribution (the sample mean distribution) will form a normal distribution, no matter what the shape of the population distribution of individuals.

Analyze Phase9 Sampling Distributions—The Foundation of Statistics Population Sample 1Sample 2Sample Samples from the population, each with five observations: In this example, we have taken three samples out of the population, each with five observations in it. We computed a mean for each sample. Note that the means are not the same! Why not? What would happen if we kept taking more samples?

Analyze Phase10 Central Limit Theorem If all possible random samples, each of size n, are taken from any population with a mean μ and standard deviation σ, the distribution of sample means will: have a mean have a stddev and be normally distributed when the parent population is normally distributed, or will be approximately normal for samples of size 30 or more when the parent population is not normally distributed. This improves with samples of larger size. Bigger is Better!

Analyze Phase11 So What? So how does this theorem help me understand the risk I am taking when I use sample data, instead of population data? Recall that 95% of normally distributed data is within ± 2 standard deviations from the mean. Therefore, the probability is 95% that my sample mean is within 2 standard errors of the true population mean.

Analyze Phase12 A Practical Example Let’s say your project is to reduce the setup time for a large casting Based on a sample of 20 setups, you learn that your baseline average is 45 minutes, with a standard deviation of 10 minutes. Because this is just a sample, the 45 minute average is just an estimate of the true average. Using the central limit theorem, there is 95% probability that the true average is somewhere between 40.5 and 49.5 minutes. Therefore, don’t get too excited if you made a process change that resulted in a reduction of only 2 minutes.

Analyze Phase13 Sample Size and the Mean When taking a sample we have only estimated the true mean All we know is that the true mean lies somewhere within the theoretical distribution of sample means or the t-distribution which are analyzed using t-tests. T-tests measure the significance of differences between means. Distribution of individuals in the population Theoretical distribution of sample means for n = 10 Theoretical distribution of sample means for n = 2

Analyze Phase14 Standard Error of the Mean The standard deviation for the distribution of means is called the standard error of the mean and is defined as: This formula shows that the mean is more stable than a single observation by a factor of the square root of the sample size.

Analyze Phase15 Standard Error The rate of change in the standard error approaches zero at about 30 samples. This is why 30 samples is often recommended when generating summary statistics such as the mean and standard deviation. This is also the point at which the t and Z distributions become nearly equivalent Sample Size Standard Error 5

End of Presentation SSD Global University