Z-scores & Shifting Data

Slides:



Advertisements
Similar presentations
Chapter 6: The Standard Deviation as a Ruler and the Normal Model
Advertisements

The Standard Deviation as a Ruler + The Normal Model
Copyright © 2010 Pearson Education, Inc. Slide The boxplots shown above summarize two data sets, I and II. Based on the boxplots, which of the following.
Copyright © 2010 Pearson Education, Inc. Slide The number of sweatshirts a vendor sells daily has the following probability distribution. Num of.
The Standard Deviation as a Ruler and the Normal Model.
Chapter 6 The Standard Deviation as a Ruler and the Normal Model
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 6- 1.
D O YOU THINK YOU ARE NORMAL ? Slide Yes 2. No 3. I’m not average, but I’m probably within 2 standard deviations.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
The Standard Deviation as a Ruler and the Normal Model
Chapter 6: The Standard Deviation as a Ruler and the Normal Model
Chapter 6: The Standard Deviation as a Ruler and the Normal Model
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 PROBABILITIES FOR CONTINUOUS RANDOM VARIABLES THE NORMAL DISTRIBUTION CHAPTER 8_B.
Describing Location in a Distribution. Measuring Position: Percentiles Here are the scores of 25 students in Mr. Pryor’s statistics class on their first.
Copyright © 2010 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Slide 6-1 Copyright © 2004 Pearson Education, Inc.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Copyright © 2009 Pearson Education, Inc. Chapter 6 The Standard Deviation As A Ruler And The Normal Model.
Copyright © 2010 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Copyright © 2009 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Chapter 7 Lesson 7.6 Random Variables and Probability Distributions 7.6: Normal Distributions.
The Standard Deviation as a Ruler and the Normal Model
Chapter 6 The Standard Deviation as a Ruler and the Normal Model Math2200.
Find out where you can find rand and randInt in your calculator. Write down the keystrokes.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 5, Slide 1 Chapter 5 The Standard Deviation as a Ruler and the Normal Model.
Slide Chapter 2d Describing Quantitative Data – The Normal Distribution Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley.
Chapter 2 Modeling Distributions of Data Objectives SWBAT: 1)Find and interpret the percentile of an individual value within a distribution of data. 2)Find.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
UNIT #1 CHAPTERS BY JEREMY GREEN, ADAM PAQUETTEY, AND MATT STAUB.
Standard Deviation and the Normal Model PART 1 RESHIFTING DATA, RESCALING DATA, Z-SCORE.
Warm-up O Make sure to use a ruler and proper scaling for the box-plots. O This will be taken up for a grade! O Today we start the last chapter before.
Chapter 5 The Standard Deviation as a Ruler and the Normal Model.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
1 Copyright © 2014, 2012, 2009 Pearson Education, Inc. Chapter 5 The Standard Deviation as a Ruler and the Normal Model.
Chapter 6 (part 2) WHEN IS A Z-SCORE BIG? NORMAL MODELS A Very Useful Model for Data.
Copyright © 2009 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
The Normal Model Statistics 6. Update Box plots Our book has the whiskers extend to 1.5 times the IQR when there is an outlier. Most other resources have.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 6- 1.
Descriptive Statistics ( )
The Standard Deviation as a Ruler and the Normal Model
Relative Cumulative Frequency Graph
CHAPTER 2 Modeling Distributions of Data
The Standard Deviation as a Ruler and the Normal Model
CHAPTER 2 Modeling Distributions of Data
Describing Distributions Numerically
The Standard Deviation as a Ruler and the Normal Model
Good Afternoon! Agenda: Knight’s Charge-please get started Good things
Objective: Given a data set, compute measures of center and spread.
The Standard Deviation as a Ruler and the Normal Model
CHAPTER 2 Modeling Distributions of Data
Box and Whisker Plots Algebra 2.
Warm-up We are going to collect some data and determine if it is “normal” Roll each pair of dice 10 times and record the SUM of the two digits in your.
CHAPTER 2 Modeling Distributions of Data
Displaying and Summarizing Quantitative Data
The Standard Deviation as a Ruler and the Normal Model
CHAPTER 2 Modeling Distributions of Data
Displaying and Summarizing Quantitative Data
Normal Distribution Z-distribution.
Chapter 1 Warm Up .
Honors Statistics The Standard Deviation as a Ruler and the Normal Model Chapter 6 Part 3.
Summary (Week 1) Categorical vs. Quantitative Variables
The Standard Deviation as a Ruler and the Normal Model
Summary (Week 1) Categorical vs. Quantitative Variables
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Standard Deviation and the Normal Model
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Presentation transcript:

Z-scores & Shifting Data Chapter 6 Z-scores & Shifting Data Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley

The Standard Deviation as a Ruler The trick in comparing very different-looking values is to use ____________ _____________ as our rulers. The standard deviation tells us how the whole collection of values _________, so it’s a natural ruler for comparing an ___________ to a ______. As the most common measure of ___________, the standard deviation plays a crucial role in how we look at data.

Standardizing with z-scores We compare individual data values to their mean, relative to their standard deviation using the following formula: We call the resulting values _______________ values, denoted as z. They can also be called _____________.

Standardizing with z-scores (cont.) Standardized values have ___ _______. z-scores measure the ___________ of each data value from the ________ in ____________ ______________. A negative z-score tells us that the data value is ________ the mean, while a positive z-score tells us that the data value is _________ the mean.

Benefits of Standardizing Standardized values have been converted from their original units to the standard statistical unit of _____________________________________. Thus, we can compare values that are measured on _________________, with ______________, or from _______________________.

Ex. 1 Consider the three athletes’ performances shown below in a three event competition. Note that each placed first, second, and third in an event. Who gets the gold medal? Who turned in the most remarkable performance of the competition?

Shifting Data Shifting data: Adding (or subtracting) a __________ amount to each value just adds (or subtracts) the same _________ to (from) the mean. This is true for the median and other measures of position too. In general, adding a constant to every data value adds the same constant to measures of __________ and percentiles, but leaves measures of __________ unchanged.

Shifting Data (cont.) The following histograms show a shift from men’s actual weights to kilograms above recommended weight:

Rescaling Data Rescaling data: When we divide or multiply all the data values by any constant value, both measures of _____________ (e.g., mean and median) and measures of ___________ (e.g., range, IQR, standard deviation) are divided and multiplied by the same value.

Rescaling Data (cont.) The men’s weight data set measured weights in kilograms. If we want to think about these weights in pounds, we would rescale the data:

Ex. 2 Suppose the class took a 40-point quiz Ex. 2 Suppose the class took a 40-point quiz. Results show a mean score of 30, median 32, IQR 8, SD 6, min 12, and Q1 27. (Suppose YOU got a 35.) What happens to each of the statistics if… • I decide to weight the quiz as 50 points, and will add 10 points to every score. Your score is now 45.

Results show a mean score of 30, median 32, IQR 8, SD 6, min 12, and Q1 27. (Suppose YOU got a 35.) What happens to each of the statistics if… • I decide to weight the quiz as 80 points, and double each score. Your score is now 70.

Results show a mean score of 30, median 32, IQR 8, SD 6, min 12, and Q1 27. (Suppose YOU got a 35.) What happens to each of the statistics if… • I decide to count the quiz as 100 points; I’ll double each score and add 20 points. Your score is now 90.

Back to z-scores Standardizing data into z-scores shifts the data by subtracting the mean and rescales the values by dividing by their standard deviation. Standardizing into z-scores does not change the _________ of the distribution. Standardizing into z-scores changes the _________ by making the mean __. Standardizing into z-scores changes the _______ by making the standard deviation __.

Assignment Pg. 123 #1-8 all

68-95-99.7 Rule and Normal Probability Rule Chapter 6 68-95-99.7 Rule and Normal Probability Rule Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley

When Is a z-score Big? A z-score gives us an indication of how unusual a value is because it tells us how far it is from the _______. Remember that a negative z-score tells us that the data value is _________ the mean, while a positive z-score tells us that the data value is ____________ the mean. The larger a z-score is (negative or positive), the more unusual it is.

When Is a z-score Big? (cont.) There is no universal standard for z-scores, but there is a model that shows up over and over in Statistics. This model is called the ___________________ (You may have heard of “bell-shaped curves.”). Normal models are appropriate for distributions whose shapes are _____________ and roughly _________________. These distributions provide a measure of how _________ a z-score is.

When Is a z-score Big? (cont.) There is a Normal model for every possible combination of mean and standard deviation. We write N(μ,σ) to represent a Normal model with a _______ of μ and a ______________ _____________ of σ. We use Greek letters because this mean and standard deviation do not come from data—they are numbers (called __________________) that specify the model.

When Is a z-score Big? (cont.) Summaries of data, like the sample mean and standard deviation, are written with Latin letters. Such summaries of data are called _____________. When we standardize Normal data, we still call the standardized value a ________, and we write

When Is a z-score Big? (cont.) Once we have standardized, we need only one model: The N(0,1) model is called the standard Normal model (or the standard Normal distribution). Be careful—don’t use a Normal model for just any data set, since standardizing does not change the __________ of the distribution.

When Is a z-score Big? (cont.) When we use the Normal model, we are assuming the distribution is _____________. We cannot check this assumption in practice, so we check the following condition: Nearly Normal Condition: The shape of the data’s distribution is ___________ and ______________. This condition can be checked with a _________________ or a Normal probability plot (to be explained later).

The 68-95-99.7 Rule Normal models give us an idea of how ________ a value is by telling us how likely it is to find one that _____ from the mean. We can find these numbers precisely, but until then we will use a simple rule that tells us a lot about the Normal model…

The 68-95-99.7 Rule (cont.) It turns out that in a Normal model: about 68% of the values fall within ____ standard deviation of the mean; about 95% of the values fall within ____ standard deviations of the mean; and, about 99.7% (almost all!) of the values fall within _________ standard deviations of the mean.

The 68-95-99.7 Rule (cont.) The following shows what the 68-95-99.7 Rule tells us:

The First Three Rules for Working with Normal Models Make a ________. And, when we have data, make a ____________ to check the Nearly Normal Condition to make sure we can use the Normal model to model the distribution.

Are You Normal? How Can You Tell? When you actually have your own data, you must check to see whether a Normal model is __________________. Looking at a histogram of the data is a good way to check that the underlying distribution is roughly ___________ and ____________.

Are You Normal? How Can You Tell? (cont.) A more specialized graphical display that can help you decide whether a Normal model is appropriate is the ________________________. If the distribution of the data is roughly Normal, the Normal probability plot approximates a _____________________________________. Deviations from a straight line indicate that the distribution is not Normal.

Are You Normal? How Can You Tell? (cont.) Nearly Normal data have a histogram and a Normal probability plot that look somewhat like this example:

Are You Normal? How Can You Tell? (cont.) A skewed distribution might have a histogram and Normal probability plot like this:

TI help Enter data into List (L1) Turn STAT Plot on Choose Normal probability plot by choosing last icon Specify data list (L1) and which axis you want the plot on (we are using y in the examples) Select the type of mark you want Under ZOMM choose 9:Zoom Stat

Ex. 1 sketch Normal models using the 68-95-99.7 Rule. Birth weights of babies, N(7.6 lbs., 1.3 lbs.) ACT scores at a certain college, N(21.2, 4.4)

Assignment Pg. 124 #10, 12, 14, 15, 18, 22, 23, 24

Percent of Normal Curve and Inverse Normal Chapter 6 Percent of Normal Curve and Inverse Normal Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley

Finding Normal Percentiles Using Technology Many calculators and statistics programs have the ability to find normal percentiles for us. Normalcdf (found under DIST) finds the area between to z-score points by specifying normalcdf(______, ______) To find a tail region use _______________ and the fact that the mean divides the normal distribution in ______.

Ex. 1 Shade the corresponding area under the standard normal curve, and find the indicated probability. Round answers to 0.0001 a) -1 < z < 1.15 b) z > 1.5 c) z < 2.25 d) |z| > 0.5

From Percentiles to Scores: z in Reverse Sometimes we start with _________ and need to find the corresponding __________ or even the original data value. Example: What z-score represents the first quartile in a Normal model?

From Percentiles to Scores: z in Reverse (cont.) _____________ (found under DIST) finds the z-value such that the portion to the ________ of the z-value is equal to the percent entered. Ie. InvNorm(.25) = -.6744897495 This means that 25% of the normal curve lies to the left of this z-value

Ex. 2 In a standard Normal model, what value(s) of z cut(s) off the region described (0.01)? Draw a picture first. the lowest 12% b) the highest 30% c) the middle 80%

Assignment w/s areas under normal standard curve

The Standard Deviation as a Ruler and the Normal Model Chapter 6 The Standard Deviation as a Ruler and the Normal Model Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley

What to do when dealing with non-standardized data 2 options Convert raw score to standardized score using Or Use normcdf(lb, ub, mean, sd)

Practice Problems P. 127 #33, 36, 41e

What Can Go Wrong? Don’t use a Normal model when the distribution is not unimodal and symmetric.

What Can Go Wrong? (cont.) Don’t use the mean and standard deviation when outliers are present—the mean and standard deviation can both be distorted by outliers. Don’t round off too soon. Don’t round your results in the middle of a calculation.

What have we learned? The story data can tell may be easier to understand after shifting or rescaling the data. Shifting data by adding or subtracting the same amount from each value affects measures of center and position but not measures of spread. Rescaling data by multiplying or dividing every value by a constant changes all the summary statistics—center, position, and spread.

What have we learned? (cont.) We’ve learned the power of standardizing data. Standardizing uses the SD as a ruler to measure distance from the mean (z-scores). With z-scores, we can compare values from different distributions or values based on different units. z-scores can identify unusual or surprising values among data.

What have we learned? (cont.) A Normal model sometimes provides a useful way to understand data. We need to Think about whether a method will work—check the Nearly Normal Condition with a histogram or Normal probability plot. Normal models follow the 68-95-99.7 Rule, and we can use technology or tables for a more detailed analysis.

Assignment Pg. 126 #30, 31, 34, 35, 42