Manijeh Keshtgary Chapter 13.  How to report the performance as a single number? Is specifying the mean the correct way?  How to report the variability.

Slides:



Advertisements
Similar presentations
Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
Advertisements

McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Confidence Intervals Chapter 8.
Sampling: Final and Initial Sample Size Determination
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Chapter 8 Estimation: Additional Topics
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
Sample size computations Petter Mostad
Chapter 7 Sampling and Sampling Distributions
Comparing Systems Using Sample Data
Chap 9-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 9 Estimation: Additional Topics Statistics for Business and Economics.
k r Factorial Designs with Replications r replications of 2 k Experiments –2 k r observations. –Allows estimation of experimental errors Model:
Experimental Evaluation
Chapter 11: Inference for Distributions
Chapter 9 Hypothesis Testing.
Hypothesis Testing Using The One-Sample t-Test
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
AM Recitation 2/10/11.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Chapter 13 – 1 Chapter 12: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Errors Testing the difference between two.
Hypothesis Testing.
Comparing Systems Using Sample Data Andy Wang CIS Computer Systems Performance Analysis.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Section 8.2 Estimating  When  is Unknown
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
Topic 5 Statistical inference: point and interval estimate
Chapter 8: Confidence Intervals
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 10. Hypothesis Testing II: Single-Sample Hypothesis Tests: Establishing the Representativeness.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
1 10 Statistical Inference for Two Samples 10-1 Inference on the Difference in Means of Two Normal Distributions, Variances Known Hypothesis tests.
HYPOTHESIS TESTING. Statistical Methods Estimation Hypothesis Testing Inferential Statistics Descriptive Statistics Statistical Methods.
CPSC 531: Data Analysis1 CPSC 531: Output Data Analysis Instructor: Anirban Mahanti Office: ICT Class Location: TRB.
Determination of Sample Size: A Review of Statistical Theory
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved. Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and.
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
CPE 619 Comparing Systems Using Sample Data Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of.
Inferential Statistics 4 Maarten Buis 18/01/2006.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 8. Parameter Estimation Using Confidence Intervals.
Mystery 1Mystery 2Mystery 3.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
© Copyright McGraw-Hill 2004
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
Comparing Systems Using Sample Data Andy Wang CIS Computer Systems Performance Analysis.
MATB344 Applied Statistics I. Experimental Designs for Small Samples II. Statistical Tests of Significance III. Small Sample Test Statistics Chapter 10.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Chapter 7: The Distribution of Sample Means
Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.
CHAPTER 7: TESTING HYPOTHESES Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
CHAPTER 8 (4 TH EDITION) ESTIMATING WITH CONFIDENCE CORRESPONDS TO 10.1, 11.1 AND 12.1 IN YOUR BOOK.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Chapter 9 Introduction to the t Statistic
Confidence Intervals Cont.
Comparing Systems Using Sample Data
ESTIMATION.
Chapter 4. Inference about Process Quality
Statistics in Applied Science and Technology
Chapter 9 Hypothesis Testing.
Econ 3790: Business and Economics Statistics
Warmup To check the accuracy of a scale, a weight is weighed repeatedly. The scale readings are normally distributed with a standard deviation of
LESSON 18: CONFIDENCE INTERVAL ESTIMATION
Determination of Sample Size
Statistical Inference for the Mean: t-test
Presentation transcript:

Manijeh Keshtgary Chapter 13

 How to report the performance as a single number? Is specifying the mean the correct way?  How to report the variability of measured quantities? What are the alternatives to variance and when are they appropriate?  How to interpret the variability?  How much confidence can you put on data with a large variability?  How many measurements are required to get a desired level of statistical confidence?  How to summarize the results of several different workloads on a single computer system?  How to compare two or more computer systems using several different workloads? Is comparing the mean sufficient? 2

 Sample Versus Population  Confidence Interval for The Mean  Approximate Visual Test  One Sided Confidence Intervals  Sample Size for Determining Mean 3

 Old French word `essample'  `sample' and `example'  One example  theory  One sample  Definite statement 4

5  Generate several million random numbers with mean  and standard deviation   Draw a sample of n observations: {x 1, x 2, …, x n } ◦ Sample mean (x)  population mean (  )  Parameters: population characteristics ◦ Unknown, Use Greek letters (   Statistics: Sample estimates ◦ Random, Use English letters (x, s)

 it is not possible to get a perfect estimate of the population mean from any finite number of finite size samples.  The best we can do is to get probabilistic bounds. Thus, we may be able to get two bounds, for instance, c1 and c2,

 k samples  k Sample means ◦ Can't get a single estimate of  ◦ Use bounds c 1 and c 2 :  Probability{c 1    c 2 } = 1-  (  is very small)  Confidence interval: [(c 1, c 2 )]  Significance level:   Confidence level: 100(1-  )  Confidence coefficient: 1-  7  c1c1 c2c2

 Use 5-percentile and 95-percentile of the sample means to get 90% Confidence interval  Need many samples (n > 30)  Central limit theorem: Sample mean of independent and identically distributed observations: Where  = population mean,  = population standard deviation  Standard Error: Standard deviation of the sample mean  100(1-  )% confidence interval for  : z 1-  /2 = (1-  /2)-quantile of N(0,1) 8  -z 1-  /2 z 1-  /2

 For example, for a two–sided confidence interval at 95%, α= 0.05 and p = 1 – α/2 = The entry in the row labeled 0.97 and column labeled gives z p =

 x = 3.90, s = 0.95 and n = 32  A 90% confidence interval for the mean =  We can state with 90% confidence that the population mean is between 3.62 and The chance of error in this statement is 10%. 10

 If we take 100 samples and construct confidence interval for each sample, the interval would include the population mean in 90 cases. 11  c1c1 c2c2 Total yes > 100(1-  )

 The preceding confidence interval applies only for large samples, that is, for samples of size greater than 30.  For smaller samples, confidence intervals can be constructed only if the observations come from a normally distributed population  100(1-  ) % confidence interval for n < 30

◦ Note: can be constructed only if observations come from a normally distributed population  t [1-  /2; n-1] = (1-  /2)-quantile of a t-variate with n-1 degrees of freedom ◦ Listed in Table A.4 in the Appendix 13

 Table A.4 lists t [p;n]. For example, the t [0.95;13] required for a two–sided 90% confidence interval of the mean of a sample of 14 observation is 1.771

 Sample ◦ -0.04, -0.19, 0.14, -0.09, -0.14, 0.19, 0.04, and  Mean = 0, Sample standard deviation =  For 90% interval: t [0.95;7] =  Confidence interval for the mean 15

16

 Difference in processor times: {1.5, 2.6, -1.8, 1.3, -0.5, 1.7, 2.4}  Question: Can we say with 99% confidence that one is superior to the other? ◦ Sample size = n = 7 ◦ Mean = 7.20/7 = 1.03 ◦ Sample variance = ( *7.20/7)/6 = 2.57 ◦ Sample standard deviation = = 1.60 t [0.995; 6] =  99% confidence interval = (-1.21, 3.27) 17

 Opposite signs  we cannot say with 99% confidence that the mean difference is significantly different from zero  Answer: They are same  Answer: The difference is zero 18

 Difference in processor times ◦ {1.5, 2.6, -1.8, 1.3, -0.5, 1.7, 2.4}.  Question: Is the difference 1?  99% Confidence interval = (-1.21, 3.27) ◦ The confidence interval includes 1 =>  Yes: The difference is 1 with 99% of confidence 19

 Paired: one-to-one correspondence between the i th test of system A and the i th test on system B ◦ Example: Performance on i th workload ◦ Straightforward analysis: the two samples are treated as one sample of n pairs ◦ Use confidence interval of the difference  Unpaired: No correspondence ◦ Example: n people on System A, n on System B  Need more sophisticated method ◦ t-test procedure 20

 Performance: {(5.4, 19.1), (16.6, 3.5), (0.6, 3.4), (1.4, 2.5), (0.6, 3.6), (7.3, 1.7)}. Is one system better?  Differences: {-13.7, 13.1, -2.8, -1.1, -3.0, 5.6}.  Answer: No. They are not different (the confidence interval includes zero) 21

 1. Compute the sample means  2. Compute the sample standard deviations 22

 3. Compute the mean difference  4. Compute the standard deviation of the mean difference  5. Compute the effective number of degrees of freedom  6. Compute the confidence interval for the mean difference  7. If the confidence interval includes zero, the difference is not significant 23

 Times on System A: {5.36, 16.57, 0.62, 1.41, 0.64, 7.26} Times on system B: {19.12, 3.52, 3.38, 2.50, 3.60, 1.74}  Question: Are the two systems significantly different?  For system A:  For System B: 24

 The confidence interval includes zero  the two systems are not different 25

26

 Times on System A: {5.36, 16.57, 0.62, 1.41, 0.64, 7.26} Times on system B: {19.12, 3.52, 3.38, 2.50, 3.60, 1.74} t [0.95, 5] =  The 90% confidence interval for the mean of A = 5.31  (2.015) = (0.24, 10.38)  The 90% confidence interval for the mean of B = 5.64  (2.015) = (0.18, 11.10)  Confidence intervals overlap and the mean of one falls in the confidence interval for the other ◦ Two systems are not different at this level of confidence 27

 Need not always be 90% or 95% or 99%  Based on the loss that you would sustain if the parameter is outside the range and the gain you would have if the parameter is inside the range  Low loss  Low confidence level is fine ◦ E.g., lottery of 5 Million, one dollar ticket cost, with probability of winning (one in 10 million) ◦ 90% confidence  buy 9 million tickets (and pay $9M) ◦ 0.01% confidence level is fine  50% confidence level may or may not be too low  99% confidence level may or may not be too high 28

 Time between crashes was measured for two systems A and B. The mean and standard deviations of the time are listed in Table To check if System A is more susceptible to failures than System B,

 Larger sample  Narrower confidence interval resulting in higher confidence  Question: How many observations n to get an accuracy of § r% and a confidence level of 100(1-  )%?  We know that for a sample of size n, the 100(1 - α)% confidence interval of the population mean is  r% accuracy implies that confidence interval should be 33

 Sample mean of the response time = 20 seconds Sample standard deviation = 5 Question: How many repetitions are needed to get the response time accurate within 1 second at 95% confidence?  Required accuracy = 1 in 20 = 5% Here, = 20, s= 5, z= 1.960, and r=5, n = A total of 97 observations are needed. 34

 All statistics based on a sample are random and should be specified with a confidence interval  If the confidence interval includes zero, the hypothesis that the population mean is zero cannot be rejected  Paired observations  Test the difference for zero mean  Unpaired observations  More sophisticated t-test 35