POOLED DATA DISTRIBUTIONS GRAPHICAL AND STATISTICAL TOOLS FOR EXAMINING COMPARISON REFERENCE VALUES Alan Steele, Ken Hill, and Rob Douglas National Research.

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
Forecasting Using the Simple Linear Regression Model and Correlation
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Chapter 14 Comparing two groups Dr Richard Bußmann.
Confidence Interval and Hypothesis Testing for:
Chapter 8 Estimation: Additional Topics
Copyright © 2010, 2007, 2004 Pearson Education, Inc. *Chapter 29 Multiple Regression.
BA 555 Practical Business Analysis
Chapter Topics Types of Regression Models
Chap 9-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 9 Estimation: Additional Topics Statistics for Business and Economics.
Chapter 2 Simple Comparative Experiments
Linear Regression Example Data
Inferences About Process Quality
Copyright © 2010 Pearson Education, Inc. Chapter 24 Comparing Means.
8-5 Testing a Claim About a Standard Deviation or Variance This section introduces methods for testing a claim made about a population standard deviation.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Statistical Comparison of Two Learning Algorithms Presented by: Payam Refaeilzadeh.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Hypothesis Testing Using The One-Sample t-Test
CHAPTER 19: Two-Sample Problems
Introduction to Regression Analysis, Chapter 13,
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Statistical Inference for Two Samples
Chapter 24: Comparing Means.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 23, Slide 1 Chapter 23 Comparing Means.
User’s Guide to the ‘QDE Toolkit Pro’ National ResearchConseil national Council Canadade recherches Excel Tools for Presenting Metrological Comparisons.
More About Significance Tests
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Student’s t-distributions. Student’s t-Model: Family of distributions similar to the Normal model but changes based on degrees-of- freedom. Degrees-of-freedom.
CHAPTER 18: Inference about a Population Mean
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 24 Comparing Means.
User’s Guide to the ‘QDE Toolkit Pro’ National ResearchConseil national Council Canadade recherches Excel Tools for Presenting Metrological Comparisons.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #7 Jose M. Cruz Assistant Professor.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
AP Statistics Chapter 24 Comparing Means.
Chapter 6: Analyzing and Interpreting Quantitative Data
1 Module One: Measurements and Uncertainties No measurement can perfectly determine the value of the quantity being measured. The uncertainty of a measurement.
Comparing Means Chapter 24. Plot the Data The natural display for comparing two groups is boxplots of the data for the two groups, placed side-by-side.
Course Review. Distributions What are the important aspects needed to describe a distribution of one variable? List three types of graphs that could be.
AP Statistics Chapter 24 Notes “Comparing Two Sample Means”
User’s Guide to the ‘QDE Toolkit Pro’ National ResearchConseil national Council Canadade recherches Excel Tools for Presenting Metrological Comparisons.
Statistics 24 Comparing Means. Plot the Data The natural display for comparing two groups is boxplots of the data for the two groups, placed side-by-side.
AP Statistics Chapter 25 Paired Samples and Blocks.
Simulation-based inference beyond the introductory course Beth Chance Department of Statistics Cal Poly – San Luis Obispo
Putting Confidence Into Your Lab’s Results Alan Steele, Barry Wood & Rob Douglas National Research Council Ottawa, CANADA National.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Tests of hypothesis Contents: Tests of significance for small samples
Slides by JOHN LOUCKS St. Edward’s University.
Computer aided teaching of statistics: advantages and disadvantages
Paired Samples and Blocks
Chapter 24 Comparing Means.
Chapter 23 Comparing Means.
Chapter 23 Comparing Means.
Chapter 9: Inferences Involving One Population
User’s Guide to the ‘QDE Toolkit Pro’
Psychology 202a Advanced Psychological Statistics
Chapter 2 Simple Comparative Experiments
Statistical Methods For Engineers
Chapter 23 Comparing Means.
BA 275 Quantitative Business Methods
Chapter 24 Comparing Means Copyright © 2009 Pearson Education, Inc.
(-4)*(-7)= Agenda Bell Ringer Bell Ringer
DESIGN OF EXPERIMENT (DOE)
Presentation transcript:

POOLED DATA DISTRIBUTIONS GRAPHICAL AND STATISTICAL TOOLS FOR EXAMINING COMPARISON REFERENCE VALUES Alan Steele, Ken Hill, and Rob Douglas National Research Council of Canada National ResearchConseil national Council Canadade recherches Measurement comparison data sets are generally summarized using a simple statistical reference value calculated from the pool of the participants’ results. Consideration of the comparison data sets, particularly with regard to the consequences and implications of such data pooling, can allow informed decisions regarding the appropriateness of choosing a simple statistical reference value. Graphs of the relevant distributions provide insight to this problem.

Steele, Hill, and Douglas: Pooled Data Distributions 2 Introduction Comparison data collection and analysis continues to grow in importance among the tasks of international metrology Sample distributions and populations are routinely considered when preparing the summary of the comparison Reference values (KCRVs) are often calculated from the measurement data supplied by the participants We believe that graphical techniques are an aid to understanding and communication in this field

Steele, Hill, and Douglas: Pooled Data Distributions 3 The Normal Approach Generally, initial implicit assumption is to consider that all of the participants’ data, as x i /u i, represent individual samples from a single (normal) population A coherent picture of the population mean and standard deviation can be built from the comparison data set that is fully consistent with the reported values and uncertainties Most outlier-test protocols rely on this assumption to identify when and if a given laboratory result should be excluded, since its inclusion would violate this internal consistency

Steele, Hill, and Douglas: Pooled Data Distributions 4 Pooled Data Distributions Creating pooled data distributions tackles this problem from the opposite direction The independent distributions reported by each participant (through their value and uncertainty) are summed directly Result is taken as representative of the underlying population as revealed in the comparison measurements Monte Carlo methods are useful when calculations involve Student distributions or medians rather than means

Steele, Hill, and Douglas: Pooled Data Distributions 5 Monte Carlo Calculations High quality linear congruent uniform random number generators are easy to find Transformation from uniform to any distribution done via cumulative distribution Example shows Student distribution transform Our Excel Toolkit includes an external DLL for doing fast Monte Carlo simulations with multiple large arrays

Steele, Hill, and Douglas: Pooled Data Distributions 6 Dealing with Student Distributions Student Cumulative Distribution Functions for different Degrees of Freedom (  = 2…10) Note that the line at 97.5% cumulative probability crosses each curve at the coverage factor, k, appropriate for a 95% confidence interval

Steele, Hill, and Douglas: Pooled Data Distributions 7 Example Data From KCDB Recent results for CCAUV.U-K1 Low power, 1.9 MHz: 5 Labs Finite degrees of freedom specified for all participants Data failed consistency check using weighted mean Median chosen as KCRV

Steele, Hill, and Douglas: Pooled Data Distributions 8 Statistical Distributions Results of Monte Carlo simulation: –lab distributions used to resample comparison –pooled data histogram incremented once for each lab per event –mean, weighted mean, and median calculated for each event Population revealed by measurement is multi-modal and evidently not normal

Steele, Hill, and Douglas: Pooled Data Distributions 9 Statistical Distributions Results of Monte Carlo simulation: –lab distributions used to resample comparison –pooled data histogram incremented once for each lab per event –mean, weighted mean, and median calculated for each event Population revealed by measurement is multi-modal and evidently not normal

Steele, Hill, and Douglas: Pooled Data Distributions 10 Advantages of Monte Carlo Technique is simple to implement Allows calculation of confidence intervals for statistics Covariances can be accommodated in straightforward manner Possible to include outlier rejection schemes Easy to track quantities of interest, such as probability of a given participant being median laboratory Can consider other candidate reference values

Steele, Hill, and Douglas: Pooled Data Distributions 11 Example: CCT-K3 Argon Point Another example from KCDB CCT-K3 Argon Triple Point Large variation in reported values Large variation in stated uncertainties No KCRV was assigned, based on data pooling analysis

Steele, Hill, and Douglas: Pooled Data Distributions 12 Algorithmic Reference Values Linear combinations of simple estimators can be used as robust estimators of location For CCT-K3, proposal to use simple average of mean, weighted mean, and median Evaluation of any such algorithmic estimator is easy to do with Monte Carlo

Steele, Hill, and Douglas: Pooled Data Distributions 13 Quantifying the Comparison Calculating a reference value – typically the variance-weighted mean or the median - is a routine part of reporting comparisons The suitability of these statistics for representing the data set can be checked using chi-squared testing It is also possible to perform such tests without invoking a reference value by considering the data in pair wise fashion Advantages of pair-statistics –Always works, even before choosing a reference value –More rigorous, since can handle correlations exactly –Explicit, following metrological chains of inference

Steele, Hill, and Douglas: Pooled Data Distributions 14 Pair-Difference Distributions Similar to exclusive statistics Consider difference between one lab and “rest of world” Sum of per-lab differences is the all-pairs-difference (APD) distribution; this is symmetric Width of APD is a measure of “global” quality assurance for independent calibration of an artifact by two different labs chosen at random

Steele, Hill, and Douglas: Pooled Data Distributions 15 Reduced Chi-Squared Testing Normalizing the pair differences by the pair uncertainties allows us to build tests of the measurement capability claims This is still independent of any chosen reference value This All Pairs Difference reduced  2 has N-1 degrees of freedom If a data set fails the APD  2 test, it will fail for every possible KCRV APD

Steele, Hill, and Douglas: Pooled Data Distributions 16 Conclusions Monte Carlo technique is fast and simple to implement Graphs provide a powerful tool for visual consideration of: –Pooled data (sum distribution) –Simple Estimators (mean, weighted mean, median) –Other Estimators (any algorithm can be used) All-pairs reduced chi-squared statistic is egalitarian over participants, and independent of choice of KCRV No single choice of KCRV can adequately represent a comparison that fails the all-pairs-difference chi-squared test