1111. 2222  Will help you gain knowledge in: ◦ Improving performance characteristics ◦ Reducing costs ◦ Understand regression analysis ◦ Understand relationships.

Slides:



Advertisements
Similar presentations
Rubric Unit Plan Univariate Bivariate Examples Resources Curriculum Statistically Thinking A study of univariate and bivariate statistics.
Advertisements

The Normal Distribution
Planning for a Successful Design of Experiment Jim Akers SIU ASQ Spring Conference 2013.
AP Statistics.  Least Squares regression is a way of finding a line that summarizes the relationship between two variables.
Chapter 13 Conducting & Reading Research Baumgartner et al Data Analysis.
Ch. 5 The Normal Curve Comparing histograms to the normal curve Interpretation of SDs and areas.
Statistical Relationship Between Quantitative Variables
Simple Linear Regression Analysis
FACTOR THE FOLLOWING: Opener. 2-5 Scatter Plots and Lines of Regression 1. Bivariate Data – data with two variables 2. Scatter Plot – graph of bivariate.
REGRESSION Predict future scores on Y based on measured scores on X Predictions are based on a correlation from a sample where both X and Y were measured.
1. An Overview of the Data Analysis and Probability Standard for School Mathematics? 2.
CORPORATE FINANCE V ESCP-EAP - European Executive MBA Dec. 2005, London Risk, Return, Diversification and CAPM I. Ertürk Senior Fellow in Banking.
A P STATISTICS LESSON 3 – 2 CORRELATION.
CHAPTER 7: Exploring Data: Part I Review
Scatter plots and Regression Algebra II. Linear Regression  Linear regression is the relationship between two variables when the equation is linear.
Sec 1.5 Scatter Plots and Least Squares Lines Come in & plot your height (x-axis) and shoe size (y-axis) on the graph. Add your coordinate point to the.
Part IV Significantly Different: Using Inferential Statistics
Warm-up Ch.11 Inference for Linear Regression Day 2 1. Which of the following are true statements? I. By the Law of Large Numbers, the mean of a random.
Statistics with TI-Nspire™ Technology Module E. Lesson 2: Properties Statistics with TI-Nspire™ Technology Module E.
3.2 - Least- Squares Regression. Where else have we seen “residuals?” Sx = data point - mean (observed - predicted) z-scores = observed - expected * note.
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
Exam Review Day 6 Chapters 2 and 3 Statistics of One Variable and Statistics of Two Variable.
Regression and Least Squares The need for a mathematical construct… Insert fig 3.8.
2.6 Scatter Diagrams. Scatter Diagrams A relation is a correspondence between two sets of data X is the independent variable Y is the dependent variable.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
Essential Statistics Chapter 51 Least Squares Regression Line u Regression line equation: y = a + bx ^ –x is the value of the explanatory variable –“y-hat”
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
SWBAT: Calculate and interpret the residual plot for a line of regression Do Now: Do heavier cars really use more gasoline? In the following data set,
Basic Statistics Linear Regression. X Y Simple Linear Regression.
A. Write an equation in slope-intercept form that passes through (2,3) and is parallel to.
STATISTICS 1040 TERM PROJECT SPRING THE QUESTION Is a student’s Grade Point Average (GPA) correlated with their age?
Section 1.6 Fitting Linear Functions to Data. Consider the set of points {(3,1), (4,3), (6,6), (8,12)} Plot these points on a graph –This is called a.
The coefficient of determination, r 2, is The fraction of the variation in the value of y that is explained by the regression line and the explanatory.
Method 3: Least squares regression. Another method for finding the equation of a straight line which is fitted to data is known as the method of least-squares.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
 Will help you gain knowledge in: ◦ Improving performance characteristics ◦ Reducing costs ◦ Understand regression analysis ◦ Understand relationships.
Chapter 12: Correlation and Linear Regression 1.
Pearson’s Correlation The Pearson correlation coefficient is the most widely used for summarizing the relation ship between two variables that have a straight.
Analysis and Interpretation: Multiple Variables Simultaneously
Statistics PSY302 Regression Quiz
Questions from lectures
CHAPTER 3 Describing Relationships
EXPLORATORY DATA ANALYSIS and DESCRIPTIVE STATISTICS
LEAST – SQUARES REGRESSION
Random Variables and Probability Distribution (2)
Quantitative Methods PSY302 Quiz Chapter Five-Correlation
Political Science 30: Political Inquiry
Cautions about Correlation and Regression
Chapter 12: Regression Diagnostics
An Example of {AND, OR, Given that} Using a Normal Distribution
عمل الطالبة : هايدى محمد عبد المنعم حسين
2-7 Curve Fitting with Linear Models Holt Algebra 2.
Correlation and Regression
A P STATISTICS LESSON 3 – 2 CORRELATION.
BA 275 Quantitative Business Methods
STEM Fair Graphs.
Unit 4 Mathematics Created by Educational Technology Network
Суури мэдлэг Basic Knowledge
A P STATISTICS LESSON 3 – 2 CORRELATION.
Unit 3 Review (Calculator)
Quantitative Methods PSY302 Quiz Chapter Five
Figure 6.1 Risk as Function of Number of Stocks in Portfolio
Correlation and Covariance
Adequacy of Linear Regression Models
Residuals (resids).
Inference for Regression
Calculate 9 x 81 = x 3 3 x 3 x 3 x 3 3 x 3 x 3 x 3 x 3 x 3 x =
Regression and Correlation of Data
Presentation transcript:

1111

2222  Will help you gain knowledge in: ◦ Improving performance characteristics ◦ Reducing costs ◦ Understand regression analysis ◦ Understand relationships between variables ◦ Understand correlation ◦ Understand how to optimize processes  So you can: ◦ Recognize opportunities ◦ Understand terminology ◦ Know when to get help X-2

3333 Plot a histogram and calculate the average and standard deviation Fuel Economy to <66 to <1212 to <1818 to <2424 to <3030 to <3636 to <4242 to <4848 to <5454 to <=60 mgp Number of Cars X-2

4444  Experimental design (a.k.a. DOE) is about discovering and quantifying the magnitude of cause and effect relationships.  We need DOE because intuition can be misleading.... but we’ll get to that in a minute.  Regression can be used to explain how we can model data experimentally. METHOD MOTHER NATURE MEASUREMENT MANPOWERMACHINE MATERIAL X-2

5555  Let’s take a look at the mileage data and see if there’s a factor that might explain some of the variation.  Draw a scatter diagram for the following data: Y=f(X) XY X-2

6666  If you draw a best fit line and figure out an equation for that line, you would have a ‘model’ that represents the data. Y=f(X) X-2

7777 There are basically three ways to understand a process you are working on.  Classical 1FAT experiments ◦ One factor at a time (1FAT) focuses on one variable at two or three levels and attempts to hold everything else constant (which is impossible to do in a complicated process).  Mathematical model ◦ Express the system with a mathematical equation.  DOE ◦ When properly constructed, it can focus on a wide range of key input factors and will determine the optimum levels of each of the factors. Each have their advantages and disadvantages. We’ll talk about each. X-2

8888  Let’s consider how two known (based on years of experience) factors affect gas mileage, tire size (T) and fuel type (F). Fuel TypeTire size F1F1 T1T1 F2F2 T2T2 Y=f(X) T( 1,2 ) Y F( 1,2 ) X-2

9999 Step 1: Select two levels of tire size and two kinds of fuels. Step 2: Holding fuel type constant (and everything else), test the car at both tire sizes. Fuel TypeTire sizeMpg F1F1 T1T1 20 F1F1 T2T2 30 X-2

10 Since we want to maximize mpg the more desirable response happened with T2 Step 3: Holding tire size at T2, test the car at both fuel types. Fuel TypeTire sizeMpg F1F1 T2T2 30 F2F2 T2T2 40 X-2

11  Looks like the ideal setting is F2 and T2 at 40mpg.  This is a common experimental method. Fuel TypeTire sizeMpg F1F1 T2T2 30 F2F2 T2T2 40 What about the possible interaction effect of tire size and fuel type. F 2 T 1 X-2

12  Suppose that the untested combination F 2 T 1 would produce the results below.  There is a different slope so there appears to be an interaction. A more appropriate design would be to test all four combinations. X-2

13  We need a way to ◦ investigate the relationship(s) between variables ◦ distinguish the effects of variables from each other (and maybe tell if they interact with each other) ◦ quantify the effects... ...So we can predict, control, and optimize processes. X-2

14 We can see some problems with 1FAT. Now let’s go back and talk about the statapult. We can do a mathematical model or we could do a DOE. DOE will build a ‘model’ - a mathematical representation of the behavior of measurements. or… You could build a “mathematical model” without DOE and it might look something like... X-2

15 X-2

16 DOE uses purposeful changes of the inputs (factors) in order to observe corresponding changes to the outputs (response). Remember the IPO’s we did – they are real important here. X-3

17  Set objectives (Charter) ◦ Comparative  Determine what factor is significant ◦ Screening  Determine what factors will be studied ◦ Model – response surface method  Determine interactions and optimize  Select process variables (C&E) and levels you will test at  Select an experimental design  Execute the design  CONFIRM the model!! Check that the data are consistent with the experimental assumptions  Analyze and interpret the results  Use/present the results X-4

18 X-8

19 X-8

20  To ‘design’ an experiment, means to pick the points that you’ll use for a scatter diagram.  See DOE terms X-9 through X-15 RunAB In tabular form, it would look like: High (+) Low (-) Factor B Settings Factor A Settings High (+)Low (-) (-,+) (+,-) (+,+) (-,-) Y A B X1X1 X2X2 X-9

21  A full factorial is an experimental design which contains all levels of all factors. No possible treatments are omitted. ◦ The preferred (ultimate) design ◦ Best for modeling  A fractional factorial is a balanced experimental design which contains fewer than all combinations of all levels of all factors. ◦ The preferred design when a full factorial cannot be performed due to lack of resources ◦ Okay for some modeling ◦ Good for screening X-16

22  Full factorial ◦ 2 level ◦ 3 factors ◦ 8 runs ◦ Balanced (orthogonal)  Fractional factorial ◦ 2 level ◦ 3 factors ◦ 4 runs - Half fraction ◦ Balanced (orthogonal) X-16

23 Average Y when A was set ‘high’ Average Y when A was set ‘low’  The difference in the average Y when A was ‘high’ from the average Y when A was ‘low’ is the ‘factor effect’  The differences are calculated for every factor in the experiment X-16

24 When the effect of one factor changes due to the effect of another factor, the two factors are said to ‘interact.’ more than two factors can interact at the same time, but it is thought to be rare outside of chemical reactions. Response - Y Factor A LowHigh B = High B = Low Slight Response - Y Factor A LowHigh B = HighB = Low Strong Response - Y Factor A LowHigh B = High B = Low None X-16

25  Using the statapult, we will experiment with some factors to “model” the process.  We will perform a confirmation run to determine if the model will help us predict the proper settings required to achieve a desired output. What design should we use? Y A B X1X1 X2X2 C D X3X3 X4X4 Y=f(X 1, X 2, X 3, X 4 ) X-17

26  Too much variation in the response  Measurement error  Poor experimental discipline  Aliases (confounded) effects  Inadequate model  Something changed - And: - There may not be a true cause-and-effect relationship. There may not be a true cause-and-effect relationship. X-17

27 X-17

28 FactorABCDResponse #1 Row #A -B -C -D -Y1Y2Y X-17 Confirmation runs

29 X-17

30 X-17

31 X-17

32 X-17

33  Full factorial ◦ 3 level ◦ 3 factors ◦ 27 runs ◦ Balanced (orthogonal) ◦ Used when it is expected the response in non-linear X-18

34  Useful to see how factors effect the response and to determine what other settings provide the same response X-30

35  Helpful in reaching the optimal result X-30