Transforms: What does the word transform mean?

Presentation transcript:

Transforms
What does the word transform mean?
- Changing something into another thing
In statistics it refers to changing a distribution into a different distribution.
- How can you change a distribution?

Linear Transforms
A linear transform happens when you add a constant to, or multiply a constant with, EACH number in a distribution.
- It usually has the form $z_i = a x_i + b$, where $z_i$ is the new "transformed" number, $x_i$ is the old "untransformed" number, and $a$ and $b$ are any constants (including zero!)

Linear Transforms
What would happen to the mean, the variance, and the standard deviation if you applied a linear transform?

Linear Transforms
- Adding or multiplying by a constant affects the mean.
- Multiplying by a constant affects the variance and standard deviation, but adding a constant does not!

Linear Transforms
More formally, when adding a constant c:
- The mean of the new distribution z is the mean of the old distribution x plus the constant c: $\bar{z} = \bar{x} + c$
- The variance of the distribution z is the same as the variance of the distribution x: $s_z^2 = s_x^2$

Linear Transforms
More formally, when multiplying by a constant c:
- The mean of the distribution z is the mean of x multiplied by the constant c: $\bar{z} = c\,\bar{x}$
- The variance of the distribution z is the variance of x multiplied by the square of the constant c: $s_z^2 = c^2 s_x^2$
- And the standard deviation of z is the standard deviation of x multiplied by the absolute value of c: $s_z = |c|\,s_x$
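
The rules above can be checked numerically. The following is a minimal sketch in Python, assuming a small made-up list of scores and the sample (n - 1) versions of the variance and standard deviation; the names `x`, `c`, `added`, and `scaled` are illustrative and not from the slides.

```python
import statistics

# Hypothetical scores, chosen only for illustration.
x = [2.0, 4.0, 4.0, 5.0, 7.0, 8.0]
c = 3.0  # the constant

added = [xi + c for xi in x]   # add c to EACH score
scaled = [xi * c for xi in x]  # multiply EACH score by c

mean_x = statistics.mean(x)
var_x = statistics.variance(x)  # sample variance (n - 1 in the denominator)
sd_x = statistics.stdev(x)      # sample standard deviation

# Adding a constant: the mean shifts by c, the variance is unchanged.
print(statistics.mean(added), mean_x + c)
print(statistics.variance(added), var_x)

# Multiplying by a constant: mean * c, variance * c^2, std. dev. * |c|.
print(statistics.mean(scaled), mean_x * c)
print(statistics.variance(scaled), var_x * c ** 2)
print(statistics.stdev(scaled), sd_x * abs(c))
```

Each printed pair should agree up to floating-point rounding.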

Linear Transforms
If these features of transforms aren't intuitive for you, work through pages 34 and 35!

Linear Transforms
Notice that you can work backward from what you want the mean or standard deviation to be, because:
- Adding or subtracting a constant changes the mean
- Multiplying or dividing by a constant changes the standard deviation (and also the mean, so rescale first and shift afterward)
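
Working backward can be sketched as follows. This is an illustration only: the function name `rescale`, the example scores, and the targets (mean 100, standard deviation 15) are assumptions, and the sample standard deviation is used.

```python
import statistics

def rescale(scores, target_mean, target_sd):
    """Linearly transform scores so they have the requested mean and std. dev."""
    mean = statistics.mean(scores)
    sd = statistics.stdev(scores)
    a = target_sd / sd          # multiply by this to fix the standard deviation
    b = target_mean - a * mean  # then add this to fix the mean
    return [a * x + b for x in scores]

# Hypothetical scores, rescaled to a mean of 100 and a std. dev. of 15.
scores = [2.0, 4.0, 4.0, 5.0, 7.0, 8.0]
new_scores = rescale(scores, target_mean=100.0, target_sd=15.0)
print(statistics.mean(new_scores), statistics.stdev(new_scores))
```

The multiplier is chosen first because multiplying also moves the mean; the additive constant then corrects for that.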

The Z Transform

The Z Transform
What if you wanted a very specific distribution: one with a mean of zero and a standard deviation of one? Why on earth would you want THAT?

Normal Distributions
Often we can assume that a set of numbers is normally distributed. Normally distributed numbers have interesting characteristics.

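Normal Distributions
Importantly, the probability density of any particular value can be computed using the Gaussian function:
$f(x) = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{(x-\mu)^2}{2\sigma^2}}$
where $\mu$ is the mean and $\sigma$ is the standard deviation. You will almost certainly never need to compute this yourself!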
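
For readers who do want to see the Gaussian function computed, here is a minimal sketch. The function name `gaussian_density` and its parameter names are illustrative assumptions; the formula is the standard normal density with mean `mean` and standard deviation `sd`.

```python
import math

def gaussian_density(x, mean, sd):
    """Height of the normal curve at x for the given mean and std. dev."""
    coefficient = 1.0 / (sd * math.sqrt(2.0 * math.pi))
    exponent = -((x - mean) ** 2) / (2.0 * sd ** 2)
    return coefficient * math.exp(exponent)

# The curve is highest at the mean and symmetric around it.
print(gaussian_density(0.0, mean=0.0, sd=1.0))
print(gaussian_density(1.0, mean=0.0, sd=1.0))
print(gaussian_density(-1.0, mean=0.0, sd=1.0))
```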

Normal Distribution
The probability (density) of a score is the height of the curve at that score.
[Figure: normal curve with the mean and the standard deviation marked]

The Normal Distribution
[Figure: normal curve with the x axis in standard deviations]
- 34% of scores fall between the mean and 1 standard deviation above the mean
- 34% of scores fall between the mean and 1 standard deviation below the mean
- 68% of scores fall between 1 standard deviation below and 1 standard deviation above the mean (34% on each side)
- About 95.4% of scores fall between 2 standard deviations below and 2 standard deviations above the mean (about 47.7% on each side)
- 95% of scores fall between 1.96 standard deviations below and 1.96 standard deviations above the mean (47.5% on each side)
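
These proportions can be reproduced with Python's standard library. The sketch below assumes Python 3.8 or later (for `statistics.NormalDist`); the helper name `proportion_within` and the example mean of 100 and standard deviation of 15 are made up for illustration.

```python
from statistics import NormalDist

def proportion_within(k, mean=0.0, sd=1.0):
    """Proportion of a normal distribution within k std. devs. of its mean."""
    dist = NormalDist(mean, sd)
    return dist.cdf(mean + k * sd) - dist.cdf(mean - k * sd)

# The proportion depends only on k, not on the particular mean or std. dev.
for k in (1.0, 1.96, 2.0):
    print(k, proportion_within(k), proportion_within(k, mean=100.0, sd=15.0))
```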

The Normal Distribution
The Normal distribution reveals the proportions (i.e. probabilities) of scores that fall within certain ranges when the ranges are expressed in terms of standard deviations.
If only there was some way to transform scores into units of standard deviation…

The Z Transform

The Z Transform
Break that down:
- Remember that adding a constant c shifts the mean: $\bar{z} = \bar{x} + c$
- And that multiplying by a constant c scales the standard deviation: $s_z = |c|\,s_x$
- If we use the first rule to make the new mean zero, we plug in the negative of the old mean for c: $c = -\bar{x}$
- And if we use the second rule to make the standard deviation equal 1, we plug in 1 / the old standard deviation: $c = 1/s_x$

The Z Transform
- Subtract the mean from each score
- Divide each difference by the standard deviation: $Z_i = (x_i - \bar{x}) / s_x$
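
The two steps can be sketched in Python as follows. The function name `z_transform` and the example scores are assumptions for illustration, and the sample (n - 1) standard deviation is used because the slides do not say which version is intended.

```python
import statistics

def z_transform(scores):
    """Subtract the mean from each score, then divide by the std. dev."""
    mean = statistics.mean(scores)
    sd = statistics.stdev(scores)  # sample standard deviation (assumption)
    return [(x - mean) / sd for x in scores]

scores = [2.0, 4.0, 4.0, 5.0, 7.0, 8.0]  # hypothetical scores
z = z_transform(scores)

# The transformed scores have a mean of 0 and a standard deviation of 1.
print(statistics.mean(z), statistics.stdev(z))
```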

The Z Transform
- Then any score that was exactly the mean would be zero standard deviations from the mean (Z = 0.0)
- A score that was 1 standard deviation above the mean would now be Z = 1.0
- 2 standard deviations above the mean would be Z = 2.0
- Halfway between 1 and 2 std. dev. above the mean would be Z = 1.5, etc.

The Z Transform
- Z scores are standardized
- Z scores are in units of standard deviation
- One can think of a Z score as the ratio of a score's difference from the mean to the typical ("average") difference from the mean
- Or one can think of a Z score as "what fraction (or multiple) of one standard deviation is this score's distance from the mean?"

The Z Transform
Uses of Z scores:
- They allow comparison across different samples (e.g. 25 degrees in Vancouver vs. 25 degrees in Lethbridge)
- If one assumes that scores are normally distributed, the Z score reveals the probability of that particular score occurring by chance
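
To illustrate the comparison across samples, here is a small sketch. The means and standard deviations for the two cities are made-up numbers, not real climate data, and the helper name `z_score` is an assumption.

```python
def z_score(x, mean, sd):
    """Number of standard deviations that x lies above (+) or below (-) the mean."""
    return (x - mean) / sd

# Hypothetical summer temperature distributions (NOT real data).
vancouver_mean, vancouver_sd = 22.0, 3.0
lethbridge_mean, lethbridge_sd = 26.0, 5.0

z_vancouver = z_score(25.0, vancouver_mean, vancouver_sd)
z_lethbridge = z_score(25.0, lethbridge_mean, lethbridge_sd)

# The same 25 degrees is warm for one city and slightly cool for the other.
print(z_vancouver, z_lethbridge)
```

Under these assumed numbers, the same raw score lands on opposite sides of the mean in the two samples, which is exactly what the raw value hides and the Z score reveals.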

The Standard Normal Distribution or Z Distribution
For any distribution:
- The probability for each number (on the x axis) is given by the height of the curve
- The probability of getting one out of a range of numbers is given by the area under the curve

Standard Normal Distribution
For the Standard Normal (a.k.a. Z distribution), the area under the curve for a given range is found in a Z table (e.g. pg 111).

Standard Normal Distribution
The table shows areas between μ and any z score you wish.
What's μ!? μ is the mean of the population of possible scores (more on that later).

Standard Normal Distribution
Using the Z table:
- Note that negative z scores yield the same probabilities because the curve is symmetric
- Total area under the curve = 1.0 (the probability that something will happen is 1!)
- Examples:
  - The probability of getting a z score between 0 and 1 is .3413
  - The probability of getting a z score within 1 std. dev. of the mean is .3413 + .3413 = .6826, or ~68%
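
A few Z-table entries can be regenerated in code. This sketch assumes Python 3.8 or later for `statistics.NormalDist`; the helper name `area_mean_to_z` and the chosen z values are illustrative.

```python
from statistics import NormalDist

def area_mean_to_z(z):
    """Area under the standard normal curve between the mean (0) and z."""
    # abs() reflects the symmetry: negative z scores give the same area.
    return NormalDist().cdf(abs(z)) - 0.5

# A miniature Z table.
for z in (0.5, 1.0, 1.5, 1.96, 2.0, 3.5):
    print(f"z = {z:4.2f}   area from mean to z = {area_mean_to_z(z):.4f}")

# Area within 1 std. dev. of the mean: both halves added together.
within_one_sd = 2 * area_mean_to_z(1.0)
print(within_one_sd)
```

Adding two rounded table entries (.3413 + .3413) gives .6826, while the unrounded area is closer to .6827; both round to about 68%.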

Standard Normal Distribution
What range above and below the mean contains 95% of all the z scores?
- The Z table tells you the positive half of the curve
- 1/2 of 95% = 47.5%, or .475, is on each side of the mean
- .475 corresponds to a z score of 1.96 (or negative 1.96!)
- Thus .95, or 95%, of z scores fall within +/- 1.96 standard deviations of the mean
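
The reverse lookup (from an area back to a z score) can be sketched with the inverse cumulative distribution function. This assumes Python 3.8 or later; the variable names are illustrative.

```python
from statistics import NormalDist

central = 0.95            # proportion we want around the mean
half = central / 2        # .475 on each side of the mean

# inv_cdf takes the total area to the LEFT of z, so add the lower half (.5).
z_cut = NormalDist().inv_cdf(0.5 + half)
print(z_cut)              # close to 1.96

# Check: the area between -z_cut and +z_cut is the requested proportion.
area = NormalDist().cdf(z_cut) - NormalDist().cdf(-z_cut)
print(area)
```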

Standard Normal Distribution
What do we do with this knowledge?
Knowing the probability of getting particular z scores helps us to know what population a given sample came from.

Standard Normal Distribution
For example: Blood Doping in Cross-Country Skiing
At the World Championships in Lahti, Finland, 13% of the athletes were found to have a red blood cell count between 3.5 and 5.5 standard deviations from the presumed population mean for all athletes (as measured in a previous IOC study).

Standard Normal Distribution
What percentage of athletes would you expect to be greater than 3.5 standard deviations from the mean?
- Look up z = 3.5
- The associated probability is .4998, so 2 x .4998 = .9996, or 99.96%, should fall within +/- 3.5!
- Less than 1 - .9996 = .0004, or .04%, should have red blood cell z scores beyond +/- 3.5!
- Only half of that (.02%) should be above +3.5!
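
The same arithmetic can be carried out without a table. The sketch below assumes Python 3.8 or later and takes the 13% figure from the example as the observed proportion; the variable names are illustrative.

```python
from statistics import NormalDist

std_normal = NormalDist()  # mean 0, standard deviation 1

# Expected proportion above z = +3.5 if the athletes come from the
# presumed (normally distributed) population.
upper_tail = 1.0 - std_normal.cdf(3.5)
both_tails = 2.0 * upper_tail

# Expected proportion between +3.5 and +5.5 standard deviations.
between = std_normal.cdf(5.5) - std_normal.cdf(3.5)

observed = 0.13  # proportion reported in the example
ratio = observed / upper_tail

print(f"expected above +3.5:   {upper_tail:.6f}")
print(f"expected beyond +/-3.5: {both_tails:.6f}")
print(f"expected 3.5 to 5.5:   {between:.6f}")
print(f"observed / expected:   {ratio:.0f}")
```

The unrounded tail area is slightly larger than the .0002 obtained from the four-decimal table, but the conclusion is unchanged: the observed proportion is hundreds of times larger than expected.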

Standard Normal Distribution
.02% is what you'd expect; 13% is what they observed!
What this tells us is that the sample of athletes at the Lahti World Championships was almost certainly not taken from the same population as the "normal" athletes in the IOC study.
At least some of the athletes sampled in Lahti had done something to artificially elevate their red blood cell count.

Problem: What good is it to know about normal distributions if there’s no guarantee that your scores will be normally distributed?