Copyright 2004 David J. Lilja

Slide 1: Comparing Two Alternatives
Use confidence intervals for
- Before-and-after comparisons
- Noncorresponding measurements

Slide 2: Comparing More Than Two Alternatives
ANOVA (Analysis of Variance)
Partitions total variation in a set of measurements into
- Variation due to real differences in alternatives
- Variation due to errors

Slide 3: Comparing Two Alternatives
1. Before-and-after: Did a change to the system have a statistically significant impact on performance?
2. Non-corresponding measurements: Is there a statistically significant difference between two different systems?

Slide 4: Before-and-After Comparison
Assumptions
- Before-and-after measurements are not independent
- Variances in the two sets of measurements may not be equal
→ Measurements are related
- Form obvious corresponding pairs
- Find mean of differences

Slide 10: 95% Confidence Interval for Mean of Differences
c_1,2 = [-5.36, 3.36]
Interval includes 0 → With 95% confidence, there is no statistically significant difference between the two systems.
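As a minimal sketch of how such an interval could be computed, the Python below pairs the measurements, takes their differences, and builds a t-based interval around the mean difference. The function name, the sample data, and the tabulated value 2.571 (t for 97.5% with 5 degrees of freedom) are assumptions made for illustration; the data were chosen so the result lands near the interval quoted above and are not taken from the slides.

```python
import math
import statistics


def paired_difference_ci(before, after, t_critical):
    """Confidence interval for the mean of paired differences (before - after).

    t_critical is the two-sided t-table value for n - 1 degrees of freedom,
    supplied by the caller (assumption: looked up in a printed table).
    """
    if len(before) != len(after):
        raise ValueError("before and after must form corresponding pairs")
    diffs = [b - a for b, a in zip(before, after)]
    n = len(diffs)
    mean_d = statistics.mean(diffs)
    s_d = statistics.stdev(diffs)  # sample standard deviation (n - 1 divisor)
    half_width = t_critical * s_d / math.sqrt(n)
    return mean_d - half_width, mean_d + half_width


# Hypothetical measurements, six corresponding pairs.
before = [85, 83, 94, 90, 88, 87]
after = [86, 88, 90, 95, 91, 83]
T_975_5DOF = 2.571  # assumed table value: t[0.975; 5]

low, high = paired_difference_ci(before, after, T_975_5DOF)
significant = not (low <= 0 <= high)
print(f"95% CI: [{low:.2f}, {high:.2f}], significant: {significant}")
```

Because the interval spans zero, the sketch reports no statistically significant difference, which is the same reading the slide gives.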

Slide 12: Confidence Interval for Difference of Means
1. Compute means
2. Compute difference of means
3. Compute standard deviation of difference of means
4. Find confidence interval for this difference
5. No statistically significant difference between systems if interval includes 0
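The five steps above can be sketched in Python as follows. This is an illustration under stated assumptions, not the lecture's own code: the function name and the generated sample data are hypothetical, the standard deviation of the difference is taken as sqrt(s1^2/n1 + s2^2/n2), and the critical value 1.96 assumes both samples are large enough (roughly 30 or more measurements) for a normal approximation at 95% confidence.

```python
import math
import statistics


def diff_of_means_ci(sample1, sample2, critical_value):
    """Confidence interval for the difference of two sample means."""
    # Steps 1 and 2: means and their difference.
    diff = statistics.mean(sample1) - statistics.mean(sample2)
    # Step 3: standard deviation of the difference of means.
    s_diff = math.sqrt(
        statistics.variance(sample1) / len(sample1)
        + statistics.variance(sample2) / len(sample2)
    )
    # Step 4: interval around the difference.
    half_width = critical_value * s_diff
    return diff - half_width, diff + half_width


# Hypothetical noncorresponding measurements (35 and 40 values).
system1 = [100 + (i % 7) for i in range(35)]
system2 = [101 + (i % 5) for i in range(40)]
Z_95 = 1.96  # assumed normal critical value for 95% confidence

low, high = diff_of_means_ci(system1, system2, Z_95)
# Step 5: an interval that includes 0 means no significant difference.
no_difference = low <= 0 <= high
print(f"95% CI: [{low:.3f}, {high:.3f}], includes zero: {no_difference}")
```

Passing the critical value in as a parameter keeps the sketch free of any particular statistics library; for small samples the caller would substitute a t-table value instead.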

Slide 19: A Special Case
If n1 < ≈30 or n2 < ≈30
- And errors are normally distributed
- And s1 = s2 (standard deviations are equal)
OR
If n1 = n2
- And errors are normally distributed
- Even if s1 is not equal to s2
Then special case applies…

Slide 24: OS Example (cont.)
Initial operating system
- n1 = 1,300,203 interrupts (3.5 hours)
- m1 = 142,892 interrupts occurred in OS code
- p1 = 0.1099, or 11% of time executing in OS
Upgraded OS
- n2 = 999,382
- m2 = 84,876
- p2 = 0.0849, or 8.5% of time executing in OS
Statistically significant improvement?
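One way to answer the question is a confidence interval for the difference of the two proportions. The sketch below uses the counts from this slide; the function name, the normal approximation to the binomial for the standard deviation, and the choice of a 95% level (critical value 1.96) are assumptions made here, since the slide itself does not state them.

```python
import math


def proportion_difference_ci(m1, n1, m2, n2, z_critical):
    """Confidence interval for p1 - p2, where p = m / n.

    Uses the normal approximation: each proportion has variance p(1 - p)/n.
    """
    p1 = m1 / n1
    p2 = m2 / n2
    s_diff = math.sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    diff = p1 - p2
    return diff - z_critical * s_diff, diff + z_critical * s_diff


# Counts from the slide.
n1, m1 = 1_300_203, 142_892  # initial OS: total interrupts, interrupts in OS code
n2, m2 = 999_382, 84_876     # upgraded OS
Z_95 = 1.96                  # assumed critical value for 95% confidence

low, high = proportion_difference_ci(m1, n1, m2, n2, Z_95)
significant = not (low <= 0 <= high)
print(f"p1 = {m1 / n1:.4f}, p2 = {m2 / n2:.4f}")
print(f"95% CI for p1 - p2: [{low:.4f}, {high:.4f}], significant: {significant}")
```

With this many samples the interval is narrow and lies entirely above zero, so under these assumptions the reduction in time spent in OS code would count as statistically significant.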

Slide 26: Important Points
Use confidence intervals to determine if there are statistically significant differences
- Before-and-after comparisons: find interval for mean of differences
- Noncorresponding measurements: find interval for difference of means
- Proportions
If interval includes zero → No statistically significant difference

Slide 28: One-Factor Analysis of Variance (ANOVA)
- Very general technique
  - Look at total variation in a set of measurements
  - Divide into meaningful components
- Also called
  - One-way classification
  - One-factor experimental design
- Introduce basic concept with one-factor ANOVA
- Generalize later with design of experiments

Slide 29: One-Factor Analysis of Variance (ANOVA)
Separates total variation observed in a set of measurements into:
1. Variation within one system
   - Due to random measurement errors
2. Variation between systems
   - Due to real differences + random error
Is variation (2) statistically > variation (1)?

Slide 31: Measurements for All Alternatives
                Alternatives
Measurements    1      2      ...   j      ...   k
1               y_11   y_12   ...   y_1j   ...   y_1k
2               y_21   y_22   ...   y_2j   ...   y_2k
...             ...    ...    ...   ...    ...   ...
i               y_i1   y_i2   ...   y_ij   ...   y_ik
...             ...    ...    ...   ...    ...   ...
n               y_n1   y_n2   ...   y_nj   ...   y_nk
Col mean        y.1    y.2    ...   y.j    ...   y.k
Effect          α_1    α_2    ...   α_j    ...   α_k

Slide 33: Column Means
(Same measurements table as on the "Measurements for All Alternatives" slide; see the "Col mean" row, y.1 … y.k.)

Slide 35: Error = Deviation From Column Mean
(Same measurements table as on the "Measurements for All Alternatives" slide.)

Slide 37: Overall Mean
(Same measurements table as on the "Measurements for All Alternatives" slide.)

Slide 39: Effect = Deviation From Overall Mean
(Same measurements table as on the "Measurements for All Alternatives" slide; see the "Effect" row, α_1 … α_k.)

Slide 40: Effects and Errors
- Effect is distance from overall mean
  - Horizontally across alternatives
- Error is distance from column mean
  - Vertically within one alternative
  - Error across alternatives, too
- Individual measurements are then: y_ij = y.. + α_j + e_ij
  (y.. is the overall mean, α_j the effect of alternative j, and e_ij the error of measurement i in column j)

Slide 45: Sum of Squares of Differences
- SST = differences between each measurement and overall mean
- SSA = variation due to effects of alternatives
- SSE = variation due to errors in measurements
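For reference, the three sums can be written out in the usual one-factor form. The block below is a sketch in standard notation rather than a copy of the slide's equations: it assumes k alternatives with n measurements each, and writes the column mean as \bar{y}_{.j}, the overall mean as \bar{y}_{..}, and the effect as \alpha_j = \bar{y}_{.j} - \bar{y}_{..}.

```latex
\begin{align*}
SST &= \sum_{j=1}^{k} \sum_{i=1}^{n} \left( y_{ij} - \bar{y}_{..} \right)^2 \\
SSA &= n \sum_{j=1}^{k} \left( \bar{y}_{.j} - \bar{y}_{..} \right)^2
     = n \sum_{j=1}^{k} \alpha_j^2 \\
SSE &= \sum_{j=1}^{k} \sum_{i=1}^{n} \left( y_{ij} - \bar{y}_{.j} \right)^2 \\
SST &= SSA + SSE
\end{align*}
```

The last line is what makes the partition useful: every bit of total variation is attributed either to the alternatives or to measurement error.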

Slide 46: ANOVA - Fundamental Idea
Separates variation in measured values into:
1. Variation due to effects of alternatives
   - SSA: variation across columns
2. Variation due to errors
   - SSE: variation within a single column
If differences among alternatives are due to real differences, SSA should be statistically > SSE

Slide 47: Comparing SSE and SSA
Simple approach
- SSA / SST = fraction of total variation explained by differences among alternatives
- SSE / SST = fraction of total variation due to experimental error
But is it statistically significant?

Slide 50: Degrees of Freedom for Effects
(Same measurements table as on the "Measurements for All Alternatives" slide.)

Slide 51: Degrees of Freedom for Errors
(Same measurements table as on the "Measurements for All Alternatives" slide.)

Slide 52: Degrees of Freedom for Errors
(Same measurements table as on the "Measurements for All Alternatives" slide.)

Slide 55: F-test
If F_computed > F_table → We have (1 - α) × 100% confidence that variation due to actual differences in alternatives, SSA, is statistically greater than variation due to errors, SSE.
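The slides that define the computed statistic are not part of this transcript, so the following is the standard one-factor form, given as an assumption: each sum of squares is divided by its degrees of freedom (k - 1 for the effects, k(n - 1) for the errors, with k alternatives and n measurements per alternative) and the ratio is compared with the tabulated value.

```latex
\begin{align*}
s_a^2 &= \frac{SSA}{k - 1}, \qquad
s_e^2 = \frac{SSE}{k(n - 1)} \\
F_{\text{computed}} &= \frac{s_a^2}{s_e^2}, \qquad
F_{\text{table}} = F_{[1-\alpha;\; k-1,\; k(n-1)]}
\end{align*}
```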

Slide 57: ANOVA Example
                Alternatives
Measurements    1         2         3         Overall mean
1               0.0972    0.1382    0.7966
2               0.0971    0.1432    0.5300
3               0.0969    0.1382    0.5152
4               0.1954    0.1730    0.6675
5               0.0974    0.1383    0.5298
Column mean     0.1168    0.1462    0.6078    0.2903
Effects         -0.1735   -0.1441   0.3175

Slide 59: Conclusions from Example
- SSA/SST = 0.7585/0.8270 = 0.917 → 91.7% of total variation in measurements is due to differences among alternatives
- SSE/SST = 0.0685/0.8270 = 0.083 → 8.3% of total variation in measurements is due to noise in measurements
- Computed F statistic > tabulated F statistic → 95% confidence that differences among alternatives are statistically significant.
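The figures quoted above can be checked with a few lines of Python. The sketch below recomputes the sums of squares and the F statistic from the fifteen measurements in the ANOVA example; the function name and the returned dictionary are hypothetical, the F formula is the standard one-factor form, and the tabulated value 3.89 for F[0.95; 2, 12] is an assumption taken from a standard F table rather than from the slides.

```python
def one_factor_anova(columns):
    """One-factor ANOVA for k alternatives with n measurements each.

    columns: list of k lists, one per alternative, all of length n.
    """
    k = len(columns)
    n = len(columns[0])
    col_means = [sum(col) / n for col in columns]
    overall_mean = sum(sum(col) for col in columns) / (k * n)
    effects = [m - overall_mean for m in col_means]

    # Variation across columns (alternatives) and within columns (errors).
    ssa = n * sum(a * a for a in effects)
    sse = sum((y - m) ** 2 for col, m in zip(columns, col_means) for y in col)
    sst = sum((y - overall_mean) ** 2 for col in columns for y in col)

    # Mean squares use k - 1 and k(n - 1) degrees of freedom.
    f_computed = (ssa / (k - 1)) / (sse / (k * (n - 1)))
    return {"SSA": ssa, "SSE": sse, "SST": sst, "F": f_computed}


# Measurements from the ANOVA example slide, one list per alternative.
alternatives = [
    [0.0972, 0.0971, 0.0969, 0.1954, 0.0974],
    [0.1382, 0.1432, 0.1382, 0.1730, 0.1383],
    [0.7966, 0.5300, 0.5152, 0.6675, 0.5298],
]
F_TABLE_95_2_12 = 3.89  # assumed tabulated value F[0.95; 2, 12]

result = one_factor_anova(alternatives)
print(f"SSA/SST = {result['SSA'] / result['SST']:.3f}")
print(f"SSE/SST = {result['SSE'] / result['SST']:.3f}")
print(f"F computed = {result['F']:.1f}, significant: {result['F'] > F_TABLE_95_2_12}")
```

Computing SST directly, rather than as SSA + SSE, gives a cheap consistency check: the two should agree up to floating-point rounding.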

Slide 60: Contrasts
- ANOVA tells us that there is a statistically significant difference among alternatives
- But it does not tell us where the difference is
- Use method of contrasts to compare subsets of alternatives
  - A vs B
  - {A, B} vs {C}
  - Etc.

Slide 63: Construct Confidence Interval for Contrasts
- Need
  - Estimate of variance
  - Appropriate value from t table
- Compute confidence interval as before
- If interval includes 0, then no statistically significant difference exists between the alternatives included in the contrast

Slide 68: Important Points
Use one-factor ANOVA to separate total variation into:
- Variation within one system
  - Due to random errors
- Variation between systems
  - Due to real differences (+ random error)
Is the variation due to real differences statistically greater than the variation due to errors?
