Statistics Presentation Ch En 475 Unit Operations.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Design of Experiments Lecture I
Welcome to PHYS 225a Lab Introduction, class rules, error analysis Julia Velkovska.
General Statistics Ch En 475 Unit Operations. Quantifying variables (i.e. answering a question with a number) 1. Directly measure the variable. - referred.
Materials for Lecture 11 Chapters 3 and 6 Chapter 16 Section 4.0 and 5.0 Lecture 11 Pseudo Random LHC.xls Lecture 11 Validation Tests.xls Next 4 slides.
Sampling: Final and Initial Sample Size Determination
Sampling Distributions (§ )
Design of Experiments and Data Analysis. Let’s Work an Example Data obtained from MS Thesis Studied the “bioavailability” of metals in sediment cores.
Propagation of Error Ch En 475 Unit Operations. Quantifying variables (i.e. answering a question with a number) 1. Directly measure the variable. - referred.
Introduction to Inference Estimating with Confidence Chapter 6.1.
Statistics: Data Analysis and Presentation Fr Clinic II.
Quality Control Procedures put into place to monitor the performance of a laboratory test with regard to accuracy and precision.
Ch. 6 The Normal Distribution
Types of Errors Difference between measured result and true value. u Illegitimate errors u Blunders resulting from mistakes in procedure. You must be careful.
Inferences About Process Quality
BCOR 1020 Business Statistics
Relationships Among Variables
Objectives (BPS chapter 14)
Chapter 7 Confidence Intervals and Sample Sizes
Regression and Correlation Methods Judy Zhong Ph.D.
Physics 114: Lecture 15 Probability Tests & Linear Fitting Dale E. Gary NJIT Physics Department.
Inference for regression - Simple linear regression
LINEAR REGRESSION Introduction Section 0 Lecture 1 Slide 1 Lecture 5 Slide 1 INTRODUCTION TO Modern Physics PHYX 2710 Fall 2004 Intermediate 3870 Fall.
Handling Data and Figures of Merit Data comes in different formats time Histograms Lists But…. Can contain the same information about quality What is meant.
Development of An ERROR ESTIMATE P M V Subbarao Professor Mechanical Engineering Department A Tolerance to Error Generates New Information….
Statistical Methods For UO Lab — Part 1 Calvin H. Bartholomew Chemical Engineering Brigham Young University.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 9 Section 1 – Slide 1 of 39 Chapter 9 Section 1 The Logic in Constructing Confidence Intervals.
Topic 5 Statistical inference: point and interval estimate
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
PARAMETRIC STATISTICAL INFERENCE
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 Part 4 Curve Fitting.
Measurement Uncertainties Physics 161 University Physics Lab I Fall 2007.
General Statistics Ch En 475 Unit Operations. Quantifying variables (i.e. answering a question with a number) Each has some error or uncertainty.
Basic Probability (Chapter 2, W.J.Decoursey, 2003) Objectives: -Define probability and its relationship to relative frequency of an event. -Learn the basic.
LECTURER PROF.Dr. DEMIR BAYKA AUTOMOTIVE ENGINEERING LABORATORY I.
L Berkley Davis Copyright 2009 MER301: Engineering Reliability Lecture 16 1 MER301: Engineering Reliability LECTURE 16: Measurement System Analysis and.
Uncertainty & Error “Science is what we have learned about how to keep from fooling ourselves.” ― Richard P. FeynmanRichard P. Feynman.
General Statistics Ch En 475 Unit Operations. Quantifying variables (i.e. answering a question with a number) 1. Directly measure the variable. - referred.
Ch 6-1 © 2004 Pearson Education, Inc. Pearson Prentice Hall, Pearson Education, Upper Saddle River, NJ Ostwald and McLaren / Cost Analysis and Estimating.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
A Process Control Screen for Multiple Stream Processes An Operator Friendly Approach Richard E. Clark Process & Product Analysis.
Confidence intervals: The basics BPS chapter 14 © 2006 W.H. Freeman and Company.
Propagation of Error Ch En 475 Unit Operations. Quantifying variables (i.e. answering a question with a number) 1. Directly measure the variable. - referred.
Statistics Presentation Ch En 475 Unit Operations.
CHEMISTRY ANALYTICAL CHEMISTRY Fall Lecture 6.
LECTURE 3: ANALYSIS OF EXPERIMENTAL DATA
Experimental Data Analysis Prof. Terry A. Ring, Ph. D. Dept. Chemical & Fuels Engineering University of Utah
Statistical Data Analysis
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Chapter 10: Confidence Intervals
NON-LINEAR REGRESSION Introduction Section 0 Lecture 1 Slide 1 Lecture 6 Slide 1 INTRODUCTION TO Modern Physics PHYX 2710 Fall 2004 Intermediate 3870 Fall.
Analysis of Experimental Data; Introduction
BME 353 – BIOMEDICAL MEASUREMENTS AND INSTRUMENTATION MEASUREMENT PRINCIPLES.
Class 5 Estimating  Confidence Intervals. Estimation of  Imagine that we do not know what  is, so we would like to estimate it. In order to get a point.
1 Chapter 8 Interval Estimation. 2 Chapter Outline  Population Mean: Known  Population Mean: Unknown  Population Proportion.
Introduction to inference Estimating with confidence IPS chapter 6.1 © 2006 W.H. Freeman and Company.
Introduction to Engineering Calculations Chapter 2.
Chapter 7: The Distribution of Sample Means
Basic Estimation Techniques
Modify—use bio. IB book  IB Biology Topic 1: Statistical Analysis
Introduction, class rules, error analysis Julia Velkovska
Materials for Lecture 18 Chapters 3 and 6
Statistics Presentation
Basic Estimation Techniques
Statistical Methods For Engineers
Statistics Review ChE 477 Winter 2018 Dr. Harding.
Introduction to Instrumentation Engineering
Statistical Data Analysis
Objectives 6.1 Estimating with confidence Statistical confidence
Presentation transcript:

Statistics Presentation Ch En 475 Unit Operations

Quantifying variables (i.e. answering a question with a number) 1. Directly measure the variable. - referred to as “measured” variable ex. Temperature measured with thermocouple 2. Calculate variable from “measured” or “tabulated” variables - referred to as “calculated” variable ex. Flow rate m =  A v (measured or tabulated) Each has some error or uncertainty

Some definitions: x = sample mean s = sample standard deviation  = exact mean  = exact standard deviation As the sampling becomes larger: x   s    t chart z chart  not valid if bias exists (i.e. calibration is off) A. Error of Measured Variable Several measurements are obtained for a single variable (i.e. T). What is the true value? How confident are you? Is the value different on different days? Questions

Let’s assume “normal” Gaussian distribution For small sampling: s is known For large sampling:  is assumed How do you determine the error? small large (n>30) we’ll pursue this approach Use z tables for this approach Use t tables for this approach Don’t often have this much data

Example nTemp

Standard Deviation Summary (normal distribution) 40.9 ± (3.27) 1s: 68.3% of data are within this range 40.9 ± (3.27x2) 2s: 95.4% of data are within this range 40.9 ± (3.27x3) 3s: 99.7% of data are within this range If normal distribution is questionable, use Chebyshev's inequality:Chebyshev's inequality At least 50% of the data are within 1.4 s from the mean. At least 75% of the data are within 2 s from the mean. At least 89% of the data are within 3 s from the mean. The above ranges don’t state how accurate the mean is - only the % of data within the given range

Student t-test (gives confidence of where  (not data) is located)  =1- probability r = n-1 = 6Prob.t+-90% % % % t true mean measured mean 2-tail

T-test Summary 41 ± 2 90% confident  is somewhere in this range 41 ± 3 95% confident  is somewhere in this range 41 ± 5 99% confident  is somewhere in this range  = exact mean 40.9 is sample mean

Comparing averages of measured variables Day 1: Day 2: What is your confidence that  x ≠  y (i.e. they are different)  n x +n y  confident different  confident same Larger t: More likely different 1-tail

Example 1. Calculate average and s for both sets of data 2. Find range in which 95.4% of the data falls (for each set). 3. Determine range for  for each set at 95% probability 4. At what confidence are pressures different each day? Data points Pressure Day 1 Pressure Day

Example: You take measurements of , A, v to determine m =  Av. What is the range of m and its associated uncertainty? Calculate variable from multiple input (measured, tabulated, …) variables (i.e. m =  Av) What is the uncertainty of your “calculated” value? Each input variable has its own error B. Uncertainty of Calculated Variable Details provided in Applied Engineering Statistics, Chapters 8 and 14, R.M. Bethea and R.R. Rhinehart, 1991).

To obtain uncertainty of “calculated” variable DO NOT just calculate variable for each set of data and then average and take standard deviation DO calculate uncertainty using error from input variables: use uncertainty for “calculated” variables and error for input variables Plan: obtain max error (  ) for each input variable then obtain uncertainty of calculated variable Method 1: Propagation of max error - brute force Method 2: Propagation of max error - analytical Method 3: Propagation of variance - analytical Method 4: Propagation of variance - brute force - Monte Carlo simulation

Value and Uncertainty Value used to make decisions - need to know uncertainty of value Potential ethical and societal impact How do you determine the uncertainty of the value? Sources of uncertainty ( from Rhinehart, Applied Engineering Statistics, 1991 ): 1. Estimation - we guess! 2. Discrimination - device accuracy (single data point) 3. Calibration - may not be exact (error of curve fit) 4. Technique - i.e. measure ID rather than OD 5. Constants and data - not always exact! 6. Noise - which reading do we take? 7. Model and equations - i.e. ideal gas law vs. real gas 8. Humans - transposing, …

Estimates of Error (  ) for input variables (  ’s are propagated to  find uncertainty) 1. Measured: measure multiple times; obtain s;  ≈ 2.5s Reason: 99% of data is within ± 2.5s Example: s = 2.3 ºC for thermocouple,  = 5.8 ºC 2. Tabulated :  ≈ 2.5 times last reported significant digit (with 1) Reason: Assumes last digit is ± 2.5 (± 0 assumes perfect, ± 5 assumes next left digit is fuzzy) Example:  = 1.3 g/ml at 0º C,  = 0.25 g/ml Example: People = 127,000  = 2500 people

Estimates of Error (  ) for input variables 3. Manufacturer spec or calibration accuracy: use given spec or accuracy data Example: Pump spec is ± 1 ml/min,  = 1 ml/min 4.Variable from regression (i.e. calibration curve):  ≈ 2.5*standard error (std error is stdev of residual) Example: Velocity is slope with std error = 2 m/s 5. Judgment for a variable: use judgment for  Example: Read pressure to ± 1 psi,  = 1 psi

Estimates of Error (  ) for input variables If none of the above rules apply, give your best guess Example: Data from a computer show that the flow rate is 562 ml/min ± 3 ml/min (stdev of computer noise). Your calibration shows 510 ml/min ± 8 ml/min (stdev). What flow rate do you use and what is  ? In the following propagation methods, it’s assumed that there is no bias in the values used - let’s assume this for all lab projects.

 Brute force method: obtain upper and lower limits of all input variables (from maximum errors); plug into equation to get uncertainty of calculated variable (y).  Uncertainty of y is between y min and y max.  This method works for both symmetry and asymmetry in errors ( i.e. 10 psi + 3 psi or - 2 psi ) Method 1: Propagation of max error- brute force

Example: Propagation of max error- brute force m =  A v  = 2.0 g/cm 3 (table) A = 3.4 cm 2 (measured avg) v = 2 cm/s (slope of graph) s A = 0.1 cm 2 std. error (v) = 0.1 cm/s maxmin  A v Brute force method: m min < m < m max All combinations Additional information: What is  for each input variable?

Method 2: Propagation of max error- analytical Propagation of error: Utilizes maximum error of input variable (  ) to estimate uncertainty range of calculated variable (y) Uncertainty of y: y = y avg ±  y Assumptions: input errors are symmetric input errors are independent of each other equation is linear (works o.k. for non-linear equations if input errors are relatively small) * Remember to take the absolute value!!

Example: Propagation of max error- analytical m =  A v y x 1 x 2 x 3 = (3.4)(2)(0.25) = 0.39 (4.4) m = m avg ±  m =  Av ±  m = 13.6 ± 4.4 g/s Av  v  A  = 2.0 g/cm 3 (table) A = 3.4 cm 2 (measured avg) v = 2 cm/s (slope of graph) s A = 0.1 cm 2 std. error (v) = 0.1 cm/s Additional information: f error,  

Propagation of max error If linear equation, symmetric errors, and input errors are independent  brute force and analytical are same If non-linear equation, symmetric errors, and input errors are independent  brute force and analytical are close if errors are small. If large errors (i.e. >10% or more than order of magnitude), brute force is more accurate. Must use brute force if errors are dependant on each other and/or asymmetric. Analytical method is easier to assess if lots of inputs. Also gives info on % contribution from each error.

Method 3: Propagation of variance- analytical 1. Maximum error can be calculated from max errors of input variables as shown previously: a) Brute force b) Analytical 2. Probable error is more realistic Errors are independent (some may be “+” and some “-”). Not all will be in same direction. Errors are not always at their largest value. Thus, propagate variance rather than max error You need variance (   ) of each input to propagate variance. If  (stdev) is unknown, estimate  =  /2.5

Method 3: Propagation of variance- analytical y = y avg ± 1.96 SQRT(   y ) 95% y = y avg ± 2.57 SQRT(   y ) 99% gives propagated variance of y or (stdev) 2 gives probable error of y and associated confidence error should be <10% (linear approximation) use propagation of max error if not much data, use propagation of variance if lots of data

Method 4: Monte Carlo Simulation (propagation of variance – brute force) Choose N (N is very large, e.g. 100,000) random ±δ i from a normal distribution of standard deviation σ i for each variable and add to the mean to obtain N values with errors: Choose N (N is very large, e.g. 100,000) random ±δ i from a normal distribution of standard deviation σ i for each variable and add to the mean to obtain N values with errors: rnorm(N,μ,σ) in Mathcad generates N random numbers from a normal distribution with mean μ and std dev σrnorm(N,μ,σ) in Mathcad generates N random numbers from a normal distribution with mean μ and std dev σ Find N values of the calculated variable using the generated x’ i values. Find N values of the calculated variable using the generated x’ i values. Determine mean and standard deviation of the N calculated variables. Determine mean and standard deviation of the N calculated variables.

Monte Carlo Simulation Example Estimate the uncertainty in the critical compressibility factor of a fluid if Tc = 514 ± 2 K, Pc = ± 0.6 bar, and Vc = ± m 3 /kmol? Estimate the uncertainty in the critical compressibility factor of a fluid if Tc = 514 ± 2 K, Pc = ± 0.6 bar, and Vc = ± m 3 /kmol?

Example: Propagation of variance Calculate  and its 95% probable error All independent variables were measured multiple times (Rule 1); averages and s are given M = 5.0 kg s = 0.05 kg L = 0.75 m s = 0.01 m D = 0.14 m s = m

Propagation of Errors

Monte Carlo

Overall Summary measured variables: use average, std dev (data range), and student t-test (mean range and mean comparison) calculated variable: determine uncertainty -- Max error: propagating error with brute force -- Max error: propagating error analytically -- Probable error: propagating variance analytically -- Probable error: propagating variance with brute force (Monte Carlo)

Data and Statistical Expectations 1. Summary of raw data (table format) 2. Sample calculations– including statistical calculations 3. Summary of all calculations- table format is helpful 4. If measured variable: average and standard deviation for all, confidence of mean for at least one variable 5. if calculated variable: 1 of the 4 methods. Please state in report. If messy equation, you may show 1 of 4 methods for small part and then just average (with std dev.) the value (although not the best method).