Replicated Binary Designs

Slides:



Advertisements
Similar presentations
Copyright 2004 David J. Lilja1 Comparing Two Alternatives Use confidence intervals for Before-and-after comparisons Noncorresponding measurements.
Advertisements

Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
ANOVA: Analysis of Variation
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Chapter 10 Simple Regression.
BA 555 Practical Business Analysis
Statistics CSE 807.
SIMPLE LINEAR REGRESSION
Lecture 4 Page 1 CS 239, Spring 2007 Models and Linear Regression CS 239 Experimental Methodologies for System Software Peter Reiher April 12, 2007.
Chapter 11 Multiple Regression.
k r Factorial Designs with Replications r replications of 2 k Experiments –2 k r observations. –Allows estimation of experimental errors Model:
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Correlation and Regression Analysis
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Inference for regression - Simple linear regression
One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.
QNT 531 Advanced Problems in Statistics and Research Methods
CPE 619 Simple Linear Regression Models Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Simple Linear Regression Models
Ratio Games and Designing Experiments Andy Wang CIS Computer Systems Performance Analysis.
© 1998, Geoff Kuenning Linear Regression Models What is a (good) model? Estimating model parameters Allocating variation Confidence intervals for regressions.
© 1998, Geoff Kuenning General 2 k Factorial Designs Used to explain the effects of k factors, each with two alternatives or levels 2 2 factorial designs.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
A Really Bad Graph. For Discussion Today Project Proposal 1.Statement of hypothesis 2.Workload decisions 3.Metrics to be used 4.Method.
Lecture 8 Page 1 CS 239, Spring 2007 Experiment Design CS 239 Experimental Methodologies for System Software Peter Reiher May 1, 2007.
Other Regression Models Andy Wang CIS Computer Systems Performance Analysis.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Lecture 10 Page 1 CS 239, Spring 2007 Experiment Designs for Categorical Parameters CS 239 Experimental Methodologies for System Software Peter Reiher.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
CPE 619 Two-Factor Full Factorial Design With Replications Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The.
CPE 619 One Factor Experiments Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama in.
Experiment Design Overview Number of factors 1 2 k levels 2:min/max n - cat num regression models2k2k repl interactions & errors 2 k-p weak interactions.
Linear Regression Models Andy Wang CIS Computer Systems Performance Analysis.
Multiple Linear Regression
F73DA2 INTRODUCTORY DATA ANALYSIS ANALYSIS OF VARIANCE.
Stats Methods at IC Lecture 3: Regression.
ANOVA: Analysis of Variation
ANOVA: Analysis of Variation
The simple linear regression model and parameter estimation
Chapter 4: Basic Estimation Techniques
ANOVA: Analysis of Variation
EXCEL: Multiple Regression
Chapter 14 Inference on the Least-Squares Regression Model and Multiple Regression.
Regression Analysis AGEC 784.
ANOVA: Analysis of Variation
Two-Way Analysis of Variance Chapter 11.
Ratio Games and Designing Experiments
Other Regression Models
Chapter 10: Analysis of Variance: Comparing More Than Two Means
Two-Factor Full Factorial Designs
Linear Regression Models
Statistical Methods For Engineers
BA 275 Quantitative Business Methods
CHAPTER 29: Multiple Regression*
Chapter 11 Analysis of Variance
Undergraduated Econometrics
SIMPLE LINEAR REGRESSION
Regression Assumptions
SIMPLE LINEAR REGRESSION
ENM 310 Design of Experiments and Regression Analysis Chapter 3
One-Factor Experiments
For Discussion Today Survey your proceedings for just one paper in which factorial design has been used or, if none, one in which it could have been used.
CS533 Modeling and Performance Evalua ion of Network and Computer Systems Experimental Design (Chapters 16-17)
Regression Assumptions
Presentation transcript:

Replicated Binary Designs Andy Wang CIS 5930-03 Computer Systems Performance Analysis

2k Factorial Designs With Replications 2k factorial designs do not allow for estimation of experimental error No experiment is ever repeated Error is usually present And usually important Handle issue by replicating experiments But which to replicate, and how often?

2kr Factorial Designs Replicate each experiment r times Allows quantifying experimental error Again, easiest to first look at case of only 2 factors

22r Factorial Designs 2 factors, 2 levels each, with r replications at each of the four combinations y = q0 + qAxA + qBxB + qABxAxB + e Now we need to compute effects, estimate errors, and allocate variation Can also produce confidence intervals for effects and predicted responses

Computing Effects for 22r Factorial Experiments We can use sign table, as before But instead of single observations, regress off mean of the r observations Compute errors for each replication using similar tabular method Similar methods used for allocation of variance and calculating confidence intervals

Example of 22r Factorial Design With Replications Same parallel system as before, but with 4 replications at each point (r =4) No DLM, 8 nodes: 820, 822, 813, 809 DLM, 8 nodes: 776, 798, 750, 755 No DLM, 64 nodes: 217, 228, 215, 221 DLM, 64 nodes: 197, 180, 220, 185

22r Factorial Example Analysis Matrix I A B AB y Mean 1 -1 -1 1 (820,822,813,809) 816 1 1 -1 -1 (217,228,215,221) 220.25 1 -1 1 -1 (776,798,750,755) 769.75 1 1 1 1 (197,180,220,185) 195.5 2001.5 -1170 -71 21.5 Total 500.4 -292.5 -17.75 5.4 Total/4 q0= 500.4 qA= -292.5 qB= -17.75 qAB= 5.4

Estimation of Errors for 22r Factorial Example Figure differences between predicted and observed values for each replication: Now calculate SSE

Allocating Variation We can determine percentage of variation due to each factor’s impact Just like 2k designs without replication But we can also isolate variation due to experimental errors Methods are similar to other regression techniques for allocating variation

Variation Allocation in Example We’ve already figured SSE We also need SST, SSA, SSB, and SSAB Also, SST = SSA + SSB + SSAB + SSE Use same formulae as before for SSA, SSB, and SSAB

Sums of Squares for Example SST = SSY - SS0 = 1,377,009.75 SSA = 1,368,900 SSB = 5041 SSAB = 462.25 Percentage of variation for A is 99.4% Percentage of variation for B is 0.4% Percentage of variation for A/B interaction is 0.03% And 0.2% (approx.) is due to experimental errors

Confidence Intervals for Effects Computed effects are random variables Thus would like to specify how confident we are that they are correct Usual confidence-interval methods First, must figure Mean Square of Errors r - 1 is because errors add up to zero

Calculating Variances of Effects Standard deviation (due to errors) of all effects is the same: In calculations, use t- or z-value for 22(r-1) degrees of freedom

Calculating Confidence Intervals for Example At 90% level, using t-value for 12 degrees of freedom, 1.782 Standard deviation of effects is 3.68 Confidence intervals are qi(1.782)(3.68) q0 is (493.8,506.9) qA is (-299.1,-285.9) qB is (-24.3,-11.2) qAB is (-1.2,11.9)

Predicted Responses We already have predicted all the means we can predict from this kind of model We measured four, we can “predict” four However, we can predict how close we would get to true sample mean if we ran m more experiments

Formula for Predicted Means For m future experiments, predicted mean is Where Also, neff = n/(1 + DoF in y_hat)

Example of Predicted Means What would we predict as confidence interval of response for no dynamic load management at 8 nodes for 7 more tests? 90% confidence interval is (798.3,833.7) We’re 90% confident that mean would be in this range

Visual Tests for Verifying Assumptions What assumptions have we been making? Model errors are statistically independent Model errors are additive Errors are normally distributed Errors have constant standard deviation Effects of errors are additive All boils down to independent, normally distributed observations with constant variance

Testing for Independent Errors Compute residuals and make scatter plot Trends indicate dependence of errors on factor levels But if residuals order of magnitude below predicted response, trends can be ignored Usually good idea to plot residuals vs. experiment number

Example Plot of Residuals vs. Predicted Response

Example Plot of Residuals vs. Experiment Number

Testing for Normally Distributed Errors As usual, do quantile-quantile chart against normal distribution If close to linear, normality assumption is good

Quantile-Quantile Plot for Example

Assumption of Constant Variance Checking homoscedasticity Go back to scatter plot and check for even spread

The Scatter Plot, Again

Multiplicative Models for 22r Experiments Assumptions of additive models Example of a multiplicative situation Handling a multiplicative model When to choose multiplicative model Multiplicative example

Assumptions of Additive Models Last time’s analysis used additive model: yij = q0+ qAxA+ qBxB+ qABxAxB+ eij Assumes all effects are additive: Factors Interactions Errors This assumption must be validated!

Example of a Multiplicative Situation Testing processors with different workloads Most common multiplicative case Consider 2 processors, 2 workloads Use 22r design Response is time to execute wj instructions on processor that does vi instructions/second Without interactions, time is yij = viwj

Handling a Multiplicative Model Take logarithm of both sides: yij = viwj so log(yij) = log(vi) + log(wj) Now easy to solve using previous methods Resulting model is:

Meaning of a Multiplicative Model Model is Here, A = is ratio of MIPS ratings of processors, B = is ratio of workload size Antilog of q0 is geometric mean of responses: where n = 22r

When to Choose a Multiplicative Model? Physical considerations (see previous slides) Range of y is large Making arithmetic mean unreasonable Calling for log transformation Plot of residuals shows large values and increasing spread Quantile-quantile plot doesn’t look like normal distribution

Multiplicative Example Consider additive model of processors A1 & A2 running benchmarks B1 and B2: Note large range of y values

Error Scatter of Additive Model

Quantile-Quantile Plot of Additive Model

Multiplicative Model Taking logs of everything, the model is:

Error Residuals of Multiplicative Model

Quantile-Quantile Plot for Multiplicative Model

Summary of the Two Models

General 2kr Factorial Design Simple extension of 22r See Box 18.1 for summary Always do visual tests Remember to consider multiplicative model as alternative

Example of 2kr Factorial Design Consider a 233 design:

ANOVA for 233 Design Percent variation explained: 90% confidence intervals

Error Residuals for 233 Design

Quantile-Quantile Plot for 233 Design

Alternative Quantile-Quantile Plot for 233

White Slide