Chapter 3 concepts/objectives Define and describe density curves Measure position using percentiles Measure position using z-scores Describe Normal distributions.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

Scatterplots and Correlation
Correlation and Regression
Chapter 6: Exploring Data: Relationships Lesson Plan
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
CHAPTER 4: Scatterplots and Correlation. Chapter 4 Concepts 2  Explanatory and Response Variables  Displaying Relationships: Scatterplots  Interpreting.
CHAPTER 4: Scatterplots and Correlation
CHAPTER 3 Describing Relationships
Ch 2 and 9.1 Relationships Between 2 Variables
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.1 Scatterplots and Correlation.
Chapter 6: Exploring Data: Relationships Chi-Kwong Li Displaying Relationships: Scatterplots Regression Lines Correlation Least-Squares Regression Interpreting.
CHAPTER 4: Scatterplots and Correlation ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Stat 1510: Statistical Thinking and Concepts Scatterplots and Correlation.
Chapter 6: Exploring Data: Relationships Lesson Plan Displaying Relationships: Scatterplots Making Predictions: Regression Line Correlation Least-Squares.
Ch 3 – Examining Relationships YMS – 3.1
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
CHAPTER 7: Exploring Data: Part I Review
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
CHAPTER 4: Scatterplots and Correlation ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Lecture Presentation Slides SEVENTH EDITION STATISTICS Moore / McCabe / Craig Introduction to the Practice of Chapter 2 Looking at Data: Relationships.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Regression and Least Squares The need for a mathematical construct… Insert fig 3.8.
4.2 Correlation The Correlation Coefficient r Properties of r 1.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
Business Statistics for Managerial Decision Making
^ y = a + bx Stats Chapter 5 - Least Squares Regression
CHAPTER 3 Describing Relationships
Notes Chapter 7 Bivariate Data. Relationships between two (or more) variables. The response variable measures an outcome of a study. The explanatory variable.
CHAPTER 5: Regression ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
AP Statistics Review Day 1 Chapters 1-4. AP Exam Exploring Data accounts for 20%-30% of the material covered on the AP Exam. “Exploratory analysis of.
Describing Relationships. Least-Squares Regression  A method for finding a line that summarizes the relationship between two variables Only in a specific.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Midterm Review IN CLASS. Chapter 1: The Art and Science of Data 1.Recognize individuals and variables in a statistical study. 2.Distinguish between categorical.
CHAPTER 3 Describing Relationships
Chapter 2: Modeling Distributions of Data
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 6: Exploring Data: Relationships Lesson Plan
Chapter 2: Modeling Distributions of Data
Chapter 2 Looking at Data— Relationships
Chapter 6: Exploring Data: Relationships Lesson Plan
Chapter 2: Modeling Distributions of Data
CHAPTER 3 Describing Relationships
Chapter 2 Looking at Data— Relationships
Chapter 2: Modeling Distributions of Data
CHAPTER 3 Describing Relationships
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 2: Modeling Distributions of Data
CHAPTER 3 Describing Relationships
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapters Important Concepts and Terms
Chapter 2: Modeling Distributions of Data
9/27/ A Least-Squares Regression.
Mean and Median.
Correlation Coefficient
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Presentation transcript:

Chapter 3 concepts/objectives Define and describe density curves Measure position using percentiles Measure position using z-scores Describe Normal distributions Describe and apply the Rule Describe the standard Normal distribution Perform Normal calculations 1

2 The Rule In the Normal distribution with mean µ and standard deviation σ: Approximately 68% of the observations fall within σ of µ. Approximately 95% of the observations fall within 2σ of µ. Approximately 99.7% of the observations fall within 3σ of µ. The Rule In the Normal distribution with mean µ and standard deviation σ: Approximately 68% of the observations fall within σ of µ. Approximately 95% of the observations fall within 2σ of µ. Approximately 99.7% of the observations fall within 3σ of µ. σ

3 The Standard Normal Distribution  All Normal distributions are the same if we measure in units of size σ from the mean µ as center. The standard Normal distribution is the Normal distribution with mean 0 and standard deviation 1. If a variable x has any Normal distribution N(µ,σ) with mean µ and standard deviation σ, then the standardized variable has the standard Normal distribution, N(0,1). The standard Normal distribution is the Normal distribution with mean 0 and standard deviation 1. If a variable x has any Normal distribution N(µ,σ) with mean µ and standard deviation σ, then the standardized variable has the standard Normal distribution, N(0,1).

4 Normal Calculations Find the proportion of observations from the standard Normal distribution that are between and Can you find the same proportion using a different approach? 1 – ( ) = 1 – =

5 State: Express the problem in terms of the observed variable x. Plan: Draw a picture of the distribution and shade the area of interest under the curve. Solve: Perform calculations. Standardize x to restate the problem in terms of a standard Normal variable z. Use Table A and the fact that the total area under the curve is 1 to find the required area under the standard Normal curve. Conclude: State the conclusion in the context. State: Express the problem in terms of the observed variable x. Plan: Draw a picture of the distribution and shade the area of interest under the curve. Solve: Perform calculations. Standardize x to restate the problem in terms of a standard Normal variable z. Use Table A and the fact that the total area under the curve is 1 to find the required area under the standard Normal curve. Conclude: State the conclusion in the context. How to Solve Problems Involving Normal Distributions Normal Calculations

Chapter 4 concepts/objectives Explanatory and Response Variables Displaying Relationships: Scatterplots Interpreting Scatterplots Measuring Linear Association: Correlation Facts About Correlation 6

The Standard Normal Distribution All Normal distributions are the same if we measure in units of size σ from the mean µ as center. The standard Normal distribution is the Normal distribution with mean 0 and standard deviation 1. If a variable x has any Normal distribution N(µ,σ) with mean µ and standard deviation σ, then the standardized variable. has the standard Normal distribution, N(0,1).

Measuring Linear Association A scatterplot displays the strength, direction, and form of the relationship between two quantitative variables The correlation r measures the strength of the linear relationship between two quantitative variables. r is always a number between -1 and 1. r > 0 indicates a positive association. r < 0 indicates a negative association. Values of r near 0 indicate a very weak linear relationship. The strength of the linear relationship increases as r moves away from 0 toward -1 or 1. The extreme values r = -1 and r = 1 occur only in the case of a perfect linear relationship.

Facts About Correlation 1.Correlation makes no distinction between explanatory and response variables. 2. r has no units and does not change when we change the units of measurement of x, y, or both. 3. Positive r indicates positive association between the variables, and negative r indicates negative association. 4. The correlation r is always a number between -1 and 1. Cautions: Correlation requires that both variables be quantitative. Correlation does not describe curved relationships between variables, no matter how strong the relationship is. Correlation is not resistant. r is strongly affected by a few outlying observations. Correlation is not a complete summary of two-variable data.

10 Regression equation: y = a + bx ^  x is the value of the explanatory variable.  “y-hat” is the predicted value of the response variable for a given value of x ( based on the line of best fit ).  b is the slope, the amount by which y changes for each one-unit increase in x.  a is the intercept, the value of y when x = 0. Chapter 5 -- Regression Line

11 Least Squares Regression Line To predict y, we want the regression line to be as close as possible to the data points in the y (vertical) direction. Least Squares Regression Line (LSRL): The line that minimizes the sum of the squares of the vertical distances of the data points from the line. For LSRL, the constants a (intercept) and b (slope) are calculated and inserted in the regression line. Regression equation: y = a + bx Calculate b from: Calculate a from: where s x and s y are the standard deviations of the two variables x and y, and r is their correlation. Least Squares Regression Line (LSRL): The line that minimizes the sum of the squares of the vertical distances of the data points from the line. For LSRL, the constants a (intercept) and b (slope) are calculated and inserted in the regression line. Regression equation: y = a + bx Calculate b from: Calculate a from: where s x and s y are the standard deviations of the two variables x and y, and r is their correlation. ^

12 An outlier is an observation that lies far away from the other observations. –Outliers in the y direction have large residuals. –Outliers in the x direction are often influential for the least-squares regression line, meaning that the removal of such points would markedly change the equation of the line. Outliers and Influential Points

13 Chapter 6 --Two-Way Table, Example What are the variables described by this two-way table? (Hint: Number of columns?) How many young adults were surveyed? (Hint: It is one of the totals in bottom row.)

14 Chap 6, Marginal Distribution The Marginal Distribution of one of the categorical variables in a two-way table of counts is the distribution of values of that variable among all individuals described by the table. Note: Percents are often more informative than counts, especially when comparing groups of different sizes. To examine a marginal distribution: 1.Use the data in the table to calculate the marginal distribution (in percents) of the row or column totals. 2. Make a graph to display the marginal distribution.

15 Chap Conditional Distribution Marginal distributions tell us nothing about the relationship between two variables. A Conditional Distribution of a variable describes the values of that variable among individuals who have a specific value of another variable. To examine or compare conditional distributions: 1.Select the row(s) or column(s) of interest. 2.Use the data in the table to calculate the conditional distribution (in percents) of the row(s) or column(s). 3.Make a graph to display the conditional distribution. Use a side-by-side bar graph or segmented bar graph to compare distributions. 15