/ department of mathematics and computer science 1212 2DS01 Statistics 2 for Chemical Engineering

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

CHAPTER 2 Building Empirical Model. Basic Statistical Concepts Consider this situation: The tension bond strength of portland cement mortar is an important.
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
Chapter 4 Randomized Blocks, Latin Squares, and Related Designs
Analysis of Variance Outlines: Designing Engineering Experiments
Model Adequacy Checking in the ANOVA Text reference, Section 3-4, pg
/k 2DS00 Statistics 1 for Chemical Engineering lecture 4.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
ANOVA: Analysis of Variation
Chapter 13 Multiple Regression
Probability & Statistical Inference Lecture 8 MSc in Computing (Data Analytics)
ANOVA notes NR 245 Austin Troy
Design of Engineering Experiments - Experiments with Random Factors
Chapter 5 Introduction to Factorial Designs
2DS00 Statistics 1 for Chemical Engineering Lecture 3.
BA 555 Practical Business Analysis
Chapter 12 Multiple Regression
/k 2DS00 Statistics 1 for Chemical Engineering /k Lecturers Dr. A. Di Bucchianico – Department of Mathematics, – Statistics group –HG 9.24 – phone (040)
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
The Simple Regression Model
Chapter 3 Experiments with a Single Factor: The Analysis of Variance
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
1 Chapter 5 Introduction to Factorial Designs Basic Definitions and Principles Study the effects of two or more factors. Factorial designs Crossed:
Analysis of Variance Chapter 3Design & Analysis of Experiments 7E 2009 Montgomery 1.
13-1 Designing Engineering Experiments Every experiment involves a sequence of activities: Conjecture – the original hypothesis that motivates the.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Correlation and Regression Analysis
13 Design and Analysis of Single-Factor Experiments:
1 14 Design of Experiments with Several Factors 14-1 Introduction 14-2 Factorial Experiments 14-3 Two-Factor Factorial Experiments Statistical analysis.
Lecture 15 Basics of Regression Analysis
Inference for regression - Simple linear regression
5-1 Introduction 5-2 Inference on the Means of Two Populations, Variances Known Assumptions.
ITK-226 Statistika & Rancangan Percobaan Dicky Dermawan
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
M23- Residuals & Minitab 1  Department of ISM, University of Alabama, ResidualsResiduals A continuation of regression analysis.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #9 Jose M. Cruz Assistant Professor.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Multiple regression - Inference for multiple regression - A case study IPS chapters 11.1 and 11.2 © 2006 W.H. Freeman and Company.
MBP1010H – Lecture 4: March 26, Multiple regression 2.Survival analysis Reading: Introduction to the Practice of Statistics: Chapters 2, 10 and 11.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Diploma in Statistics Design and Analysis of Experiments Lecture 2.11 Design and Analysis of Experiments Lecture Review of Lecture Randomised.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
DOX 6E Montgomery1 Design of Engineering Experiments Part 9 – Experiments with Random Factors Text reference, Chapter 13, Pg. 484 Previous chapters have.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
1 Analysis of Variance & One Factor Designs Y= DEPENDENT VARIABLE (“yield”) (“response variable”) (“quality indicator”) X = INDEPENDENT VARIABLE (A possibly.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
VI. Regression Analysis A. Simple Linear Regression 1. Scatter Plots Regression analysis is best taught via an example. Pencil lead is a ceramic material.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
ETM U 1 Analysis of Variance (ANOVA) Suppose we want to compare more than two means? For example, suppose a manufacturer of paper used for grocery.
Chapter 13 Design of Experiments. Introduction “Listening” or passive statistical tools: control charts. “Conversational” or active tools: Experimental.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 11: Models Marshall University Genomics Core Facility.
IE241: Introduction to Design of Experiments. Last term we talked about testing the difference between two independent means. For means from a normal.
ChE 452 Lecture 07 Statistical Tests Of Rate Equations 1.
1/54 Statistics Analysis of Variance. 2/54 Statistics in practice Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality.
ENGR 610 Applied Statistics Fall Week 11 Marshall University CITE Jack Smith.
1 Chapter 5.8 What if We Have More Than Two Samples?
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 10: Comparing Models.
CHAPTER 13 Design and Analysis of Single-Factor Experiments:
Statistics Analysis of Variance.
Chapter 5 Introduction to Factorial Designs
CHAPTER 29: Multiple Regression*
The Analysis of Variance
ENM 310 Design of Experiments and Regression Analysis Chapter 3
14 Design of Experiments with Several Factors CHAPTER OUTLINE
Presentation transcript:

/ department of mathematics and computer science DS01 Statistics 2 for Chemical Engineering

/ department of mathematics and computer science 1212 Lecturers Marko Boon Dr. A. Di Bucchianico Ir. G.D. Mooiweer Drs. C.M.J. Rusch – Groot (

/ department of mathematics and computer science 1212 Important to remember Web site for this course: No textbook, but handouts + Powerpoint sheets through web site Bring notebook to fourth lecture (12 th of April) and self-study Software: –Statgraphics (version 5.1). If not installed, install through –Java (at least version 1.4). Install through Java is needed to run Statlab ( Important: In order to run Statlab during the exams, security settings have to be adjusted!

/ department of mathematics and computer science 1212 Goals of this course teach students need for statistical basis of experimentation teach students statistical tools for experimentation –design of experiments (factorial designs, optimal designs) –analysis of experiments (ANOVA) –use of statistical software give students short introduction to recent developments

/ department of mathematics and computer science 1212 Week schedule Week 1: Introduction to Analysis of Variance (ANOVA) Week 2: Factorial designs: screening Week 3: Factorial designs: optimisation Week 4: Optimal experimental design and mixture designs (by A. Di Bucchianico – Bring your laptop!)

/ department of mathematics and computer science 1212 Detailed contents of week 1 statistics and experimentation short recapitulation of regression analysis one-way ANOVA one-way ANOVA with blocks multiple comparisons

/ department of mathematics and computer science 1212 Statistics and experimentation Chemical experiments often depend on several factors (pressure, catalyst, temperature, reaction time,...) Two important questions: which factors are really important? what are optimal settings for important factors?

/ department of mathematics and computer science 1212 Use of statistical experimentation in chemical engineering Chemical synthesis (synthetic steps; work up and separation; reagents, solvents, catalysts; structure, reactivity and properties,...) Biotech industry (drug design, analytical biochemistry, process optimization – fermentation, purification,...) Process industry (process optimization and control -yield, purity, through put time, pollution, energy consumption; product quality and performance - material strength, warp, color, taste, odour;...)...

/ department of mathematics and computer science 1212 Short history of statistics and experimentation 1920’s -... introduction of statistical methods in agriculture by Fisher and co-workers 1950’s -... introduction in chemical engineering (Box,...) 1980’s -... introduction in Western industry of Japanese approach (Taguchi, robust design) 1990’s -... combinatorial chemistry, high througput processing

/ department of mathematics and computer science 1212 Link to Statistics 1 for Chemical Engineering introduction to measurements –data analysis –error propagation regression analysis use of statistical software (Statgraphics)

/ department of mathematics and computer science 1212 Types of regression analysis Linear means linear in coefficients, not linear functions! Simple linear regression Multiple linear regression Non-linear regression

/ department of mathematics and computer science 1212 Model:  ssumptions: the model is linear (+ enough terms) the  i 's are normally distributed with  =0 and variance  2 the  i 's are independent. Linear regression

/ department of mathematics and computer science 1212 Specific warmth specific warmth of vapour at constant pressure as function of temperature data set from Perry’s Chemical Engineers’ Handbook thermodynamic theories say that quadratic relation between temperature and specific warmth usually suffices:

/ department of mathematics and computer science 1212 Scatter plot of specific warmth data

/ department of mathematics and computer science 1212 Regression output specific warmth data Polynomial Regression Analysis Dependent variable: Cp Standard T Parameter Estimate Error Statistic P-Value CONSTANT T T^ Analysis of Variance Source Sum of Squares Df Mean Square F-Ratio P-Value Model Residual Total (Corr.) R-squared = percent R-squared (adjusted for d.f.) = percent Standard Error of Est. = Mean absolute error = Durbin-Watson statistic = (P=0.0000) Lag 1 residual autocorrelation =

/ department of mathematics and computer science 1212 Issues in regression output significance of model significance of individual regression parameters residual plots: –normality (density trace, normal probability plot) –constant variance (against predicted values + each independent variable) –model adequacy (against predicted values) –outliers –independence influential points

/ department of mathematics and computer science 1212 Residual plot specific warmth data This behaviour is visible in plot of fitted line only after rescaling!

/ department of mathematics and computer science 1212 Plot of fitted quadratic model for specific warmth data

/ department of mathematics and computer science 1212 Conclusion regression models for specific warmth data we need third order model (polynomial of degree 3) careful with extrapolation original data set contains influential points original data set contains potential outliers

/ department of mathematics and computer science 1212 Analysis of variance name refers to mathematical technique, not to goal comparison of means (!!) using variances (extension of t- test to more than 2 samples) samples usually are groups of measurements with constant factor settings

/ department of mathematics and computer science 1212 Example: ANOVA production of yarns: influence of fibre composition on breaking tension simplification: one factor: % cotton fixed factor levels: 15%, 20%, 25%, 30%, 35% experimental design: produce on the same machine 5 threads of each type of fibre composition in random order

/ department of mathematics and computer science 1212 Statistical setting Basis model: Y ij =  +  i +  ij influence factor levels i=1,2,…k error term: normal  =0,  2 independent replications j=1,2,…,n Basis hypotheses: H 0 :  i = 0 for all i H 1 :  i  0 for at least one i overall mean

/ department of mathematics and computer science 1212 Expectation under H 0 (= no effect of factor level) spread observations with respect to group means spread group means with respect to overall mean chance

/ department of mathematics and computer science 1212 Expectation under H 1 spread observations with respect to group means chance systematic spread group means with respect to overall mean

/ department of mathematics and computer science 1212 Illustration of group means

/ department of mathematics and computer science 1212 Group means versus overall mean

/ department of mathematics and computer science 1212 Conclusion Comparison of both spreads yields indication for H 0 vs H 1. total treatment: between groups rest: within groups =+

/ department of mathematics and computer science 1212 Conclusion Comparison of both spreads yields indication for H 0 vs H 1. total treatment: between groups rest: within groups =+ Spreads are converted into sums of squares:

/ department of mathematics and computer science 1212 Mean Sums of Squares sums of squares differ with respect to number of contributions. for fair comparison: divide by degrees of freedom. we expect under H 0 : MS between  MS within we expect under H 1 : MS between >> MS within summary in ANOVA table

/ department of mathematics and computer science 1212 Completely Randomized One-factor Design Experiment, in which one factor varies on k levels. At each level n measurements are taken. The order of all measurements is random.

/ department of mathematics and computer science 1212 Multiple comparisons ANOVA only indicates whether there are significantly different group means ANOVA does not indicate which groups have different means (although we may construct confidence intervals for differences) various methods exist for correctly performing pairwise comparisons: –LSD (Least Significant Difference) method –HSD (Honestly Significant Difference) method –Duncan –Newman – Keuls –...

/ department of mathematics and computer science 1212 Randomized one-factor block design In each block all treatments occur equally often; randomization within blocks Experiment with one factor and observations in blocks Blocks are levels of noise factor.

/ department of mathematics and computer science 1212 Example testing method for material hardness : force pressure pin/tip strip testing material practical problem: 4 types of pressure pins  do these yield the same results?

/ department of mathematics and computer science 1212 Experimental design pin 1 pin 2 pin 4pin 3 testing strips Problem: if the measurements of strips 5 through 8 differ, is this caused by the strips or by pin 2?

/ department of mathematics and computer science 1212 Experimental design 2 Take 4 strips on which you measure (in random order) each pressure pin once : strip 1strip 2strip 4strip 3 pressure pins

/ department of mathematics and computer science 1212 Blocking Advantage of blocked experimental design 2: differences between strips are filtered out Model: Y ij =  +  i +  j +  ij Primary goal: reduction error term factor pressure pin block effect strip error term

/ department of mathematics and computer science 1212 Summary completely randomized design randomized block design multiple comparisons Reading material: Statgraphics lecture notes section 4.1 through 4.3