Review Measures of Central Tendency –Mean, median, mode Measures of Variation –Variance, standard deviation.

Slides:



Advertisements
Similar presentations
Chapter 5 Some Key Ingredients for Inferential Statistics: The Normal Curve, Probability, and Population Versus Sample.
Advertisements

1. Exams 2. Sampling Distributions 3. Estimation + Confidence Intervals.
Hypothesis Testing It is frequently expected that you have clear hypotheses when you have a study using quantitative data. Older citizens are more likely.
Objectives Look at Central Limit Theorem Sampling distribution of the mean.
Chapter 7 Introduction to Sampling Distributions
Topics: Inferential Statistics
The Normal Distribution
Chapter 7 Sampling and Sampling Distributions
PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.
The Normal Curve and Sampling A.A sample will always be different from the true population B.This is called “sampling error” C.The difference between a.
Introduction to Educational Statistics
Chapter 7 Probability and Samples: The Distribution of Sample Means
Variability Ibrahim Altubasi, PT, PhD The University of Jordan.
12.3 – Measures of Dispersion
Chapter 4 SUMMARIZING SCORES WITH MEASURES OF VARIABILITY.
QUIZ CHAPTER Seven Psy302 Quantitative Methods. 1. A distribution of all sample means or sample variances that could be obtained in samples of a given.
1. Homework #2 2. Inferential Statistics 3. Review for Exam.
Quiz 2 Measures of central tendency Measures of variability.
Variance Formula. Probability A. The importance of probability Hypothesis testing and statistical significance Probabilistic causation - because error.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
The Normal Distribution The “Bell Curve” The “Normal Curve”
Estimation Statistics with Confidence. Estimation Before we collect our sample, we know:  -3z -2z -1z 0z 1z 2z 3z Repeated sampling sample means would.
Chapter 11: Estimation Estimation Defined Confidence Levels
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
QBM117 Business Statistics Estimating the population mean , when the population variance  2, is known.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
LECTURE 16 TUESDAY, 31 March STA 291 Spring
PTP 560 Research Methods Week 8 Thomas Ruediger, PT.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Smith/Davis (c) 2005 Prentice Hall Chapter Six Summarizing and Comparing Data: Measures of Variation, Distribution of Means and the Standard Error of the.
Slide 1 © 2002 McGraw-Hill Australia, PPTs t/a Introductory Mathematics & Statistics for Business 4e by John S. Croucher 1 n Learning Objectives –Identify.
Chapter 6 Foundations of Educational Measurement Part 1 Jeffrey Oescher.
Chapter 7 Estimation Procedures. Basic Logic  In estimation procedures, statistics calculated from random samples are used to estimate the value of population.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 7 - Sampling Distribution of Means.
Determination of Sample Size: A Review of Statistical Theory
Chapter Thirteen Copyright © 2004 John Wiley & Sons, Inc. Sample Size Determination.
Chapter 10: Introduction to Statistical Inference.
Confidence Interval Estimation For statistical inference in decision making:
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
CONFIDENCE INTERVALS.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 8. Parameter Estimation Using Confidence Intervals.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Practice Page 128 –#6.7 –#6.8 Practice Page 128 –#6.7 =.0668 = test scores are normally distributed –#6.8 a =.0832 b =.2912 c =.4778.
1 Chapter 8 Interval Estimation. 2 Chapter Outline  Population Mean: Known  Population Mean: Unknown  Population Proportion.
INFERENTIAL STATISTICS DOING STATS WITH CONFIDENCE.
Confidence Intervals INTRO. Confidence Intervals Brief review of sampling. Brief review of the Central Limit Theorem. How do CIs work? Why do we use CIs?
The Normal Probability Distribution. What is a distribution? A collection of scores, values, arranged to indicate how common various values, or scores.
Chapter 2 Describing and Presenting a Distribution of Scores.
Chapter Eleven Sample Size Determination Chapter Eleven.
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Review Day 2 May 4 th Probability Events are independent if the outcome of one event does not influence the outcome of any other event Events are.
Chapter Six Summarizing and Comparing Data: Measures of Variation, Distribution of Means and the Standard Error of the Mean, and z Scores PowerPoint Presentation.
Chapter 6: Sampling Distributions
Sampling Distributions
Chapter 7 Sampling Distributions.
Week 10 Chapter 16. Confidence Intervals for Proportions
Chapter 3.
Chapter 7 Sampling Distributions.
Samples and Populations
Calculating Probabilities for Any Normal Variable
Chapter 7 Sampling Distributions.
Chapter 7 Sampling Distributions.
BUSINESS MARKET RESEARCH
Chapter 7 Sampling Distributions.
Chapter 4 (cont.) The Sampling Distribution
Presentation transcript:

Review Measures of Central Tendency –Mean, median, mode Measures of Variation –Variance, standard deviation

Variance is defined as

The Normal Curve The mean and standard deviation, in conjunction with the normal curve allow for more sophisticated description of the data and (as we see later) statistical analysis For example, a school is not that interested in the raw GRE score, it is interested in how you score relative to others.

Even if the school knows the average (mean) GRE score, your raw score still doesn’t tell them much, since in a perfectly normal distribution, 50% of people will score higher than the mean. This is where the standard deviation is so helpful. It helps interpret raw scores and understand the likelihood of a score. So if I told you if I scored 710 on the quantitative section and the mean score is 591. Is that good?

It’s above average, but who cares. What if I tell you the standard deviation is 148? What does that mean? What if I said the standard deviation is 5? Calculating z-scores

Converting raw scores to z scores What is a z score? What does it represent Z = (x-µ) / σ Converting z scores into raw scores X = z σ + µ Z = ( )/140 = 147/140 = 1.05

Finding Probabilities under the Normal Curve So what % of GRE takers scored above and below 710? The importance of Table A Why is this important? Inferential Statistics (to be cont.)

Stuff you don’t need to know: pi = ≈ e = ≈2.71

The Normal Curve and Sampling A.A sample will (almost) always be different from the true population B.This is called “sampling error” C.The difference between a sample and the true population, regardless of how well the survey was designed or implemented D.Different from measurement error or sample bias

Sampling distribution of Means The existence of sampling error means that if you take a 1000 random samples from a population and calculate a 1000 means and plot the distribution of those means you will get a consistent distribution that has the following characteristics:

Characteristics of a Sampling Distribution 1. the distribution approximates a normal curve 2. the mean of a sampling distribution of means is equal to the true population 3. the standard deviation of a sampling distribution is smaller than the standard deviation of the population. Less variation in the distribution because we are not dealing with raw scores but rather central tendencies.

Why is the normal curve so important? If we define probability in terms of the likelihood of occurrence, then the normal curve can be regarded as a probability distribution (the probability of occurrence decreases as we move away from the center). With this notion, we can find the probability of obtaining a raw score in a distribution, given a certain mean and SD.

Probability and the Normal Curve In chapter 6 – we are not interested in the distribution of raw scores but rather the distribution of sample means and making probability statements about those sample means.

Probability and the Sampling Distribution Why is making probabilistic statements about a central tendency important? 1. it will allow us to engage in inferential statistics (later in ch. 7) 2. it allows us to produce confidence intervals

Example of number 1: President of UNLV states that the average salary of a new UNLV graduate is $60,000. We are skeptical and test this by taking a random sample of a 100 UNLV students. We find that the average is only $55,000. Do we declare the President a liar?

Not Yet!!!! We need to make a probabilistic statement regarding the likelihood of Harter’s statement. How do we do that? With the aid of the standard error of the mean we can calculate confidence intervals - the range of mean values within with our true population mean is likely to fall.

How do we do that? First, we need the sample mean Second, we need the standard deviation of the sampling distribution of means (what’s another name for this?) a.k.a standard error of the mean

What’s the Problem? The problem is… We don’t have the standard deviation of the sampling distribution of means? What do we do?

First – let’s pretend Let’s pretend that I know the Standard Deviation of the Sampling Distribution of Means (a.k.a. the standard error of the mean). It’s 3000 For a 95% confidence interval we multiply the standard error of the mean by 1.96 and add & subtract that product to our sample mean Why 1.96? What’s the range?

So is President Ashley Lying? CI = Mean + or – 1.96 (SE) = 55,000 +/ (3000) = 55,000 +/ = $49,120 to 60,880

Let’s stop pretending We Can Estimate the Standard Error of the Mean. –Divide the standard deviation of the sample by √N-1 Multiply this estimate by t rather than 1.96 and then add this product to our sample mean. Why t?

The t Distribution Empirical testing and models shows that a standard deviation from a sample underestimates the standard deviation of the true population This is why we use N-1 not N when calculating the standard deviation and the standard error So in reality, we are calculating t-scores, not z-scores since we are not using the true sd.

So when we are using a sample and calculating a 95% confidence interval (CI) we need to multiply the standard error by t, not 1.96 How do we know what t is? Table in back of book Df = N - 1

Confidence Intervals for Proportions Calculate the standard error of the proportion: Sp = 95% conf. Interval = P +/- (1.96)S p

Example National sample of 531 Democrats and Democratic-leaning independents, aged 18 and older, conducted Sept , 2007 Clinton 47%; Obama 25%; Edwards 11% P(1-P) =.47(1-.47) =.47(.53) =.2491 Divide by N =.2491/531 = Take square root = % CI =.47 +/ (.0217).47 +/ or to.511

Midterm Key terms from Schutt chapters 1-5 Statistical Calculations by hand –Mean, Median, Mode –Variance/Standard Deviation –Z-scores –Standard errors and confidence intervals using z or t