Stat 13, Tue 6/5/12. 1. Hand in hw7. 2. Practice problems. Final exam is Thur, 6/7, in class. It has 20 multiple choice questions. Only about 50% are on.

Slides:



Advertisements
Similar presentations
Chapter 5 Some Key Ingredients for Inferential Statistics: The Normal Curve, Probability, and Population Versus Sample.
Advertisements

Objectives 10.1 Simple linear regression
The Normal Curve and Z-scores Using the Normal Curve to Find Probabilities.
6-1 Stats Unit 6 Sampling Distributions and Statistical Inference - 1 FPP Chapters 16-18, 20-21, 23 The Law of Averages (Ch 16) Box Models (Ch 16) Sampling.
Standard Normal Table Area Under the Curve
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #18.
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Monday, 4/29/02, Slide #1 MA 102 Statistical Controversies Monday, 4/29/02 Today: CLOSING CEREMONIES!  Discuss HW #3  Review for final exam  Evaluations.
1 Hypothesis Testing In this section I want to review a few things and then introduce hypothesis testing.
First we need to understand the variables. A random variable is a value of an outcome such as counting the number of heads when flipping a coin, which.
REGRESSION Predict future scores on Y based on measured scores on X Predictions are based on a correlation from a sample where both X and Y were measured.
Confidence Intervals: Estimating Population Mean
Quiz 6 Confidence intervals z Distribution t Distribution.
Chapter 11: Random Sampling and Sampling Distributions
STA Lecture 201 STA 291 Lecture 20 Exam II Today 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm – 9:15pm Location CB 234 Bring a calculator,
Copyright © 2005 by Evan Schofer
Review of normal distribution. Exercise Solution.
Stat 13, Thu 5/3/ Correction on normal notation. 2. Normal percentiles. 3. Normal probability plots. 4. Bernoulli and Binomial random variables.
Prediction concerning Y variable. Three different research questions What is the mean response, E(Y h ), for a given level, X h, of the predictor variable?
ESTIMATING with confidence. Confidence INterval A confidence interval gives an estimated range of values which is likely to include an unknown population.
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
Stat 13, Thu 5/10/ CLT again. 2. CIs. 3. Interpretation of a CI. 4. Examples. 5. Margin of error and sample size. 6. CIs using the t table. 7. When.
Statistics 101 Chapter 10. Section 10-1 We want to infer from the sample data some conclusion about a wider population that the sample represents. Inferential.
AP STATISTICS LESSON 10 – 1 (DAY 2)
Regression. Correlation and regression are closely related in use and in math. Correlation summarizes the relations b/t 2 variables. Regression is used.
When σ is Unknown The One – Sample Interval For a Population Mean Target Goal: I can construct and interpret a CI for a population mean when σ is unknown.
Stat 13, Tue 5/8/ Collect HW Central limit theorem. 3. CLT for 0-1 events. 4. Examples. 5.  versus  /√n. 6. Assumptions. Read ch. 5 and 6.
Statistical Interval for a Single Sample
Statistics 101 Class 6. Overview Sample and populations Sample and populations Why a Sample Why a Sample Types of samples Types of samples Revisiting.
FINAL EXAM REVIEW Last Class – Week 15. Measures of Central Tendency (Ex 1) – Using the following set of data, Find each of the following: 101, 98, 76,
TIMES 3 Technological Integrations in Mathematical Environments and Studies Jacksonville State University 2010.
Stat 13, Tue 5/15/ Hand in HW5 2. Review list. 3. Assumptions and CLT again. 4. Examples. Hand in Hw5. Midterm 2 is Thur, 5/17. Hw6 is due Thu, 5/24.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Multiple Regression SECTIONS 9.2, 10.1, 10.2 Multiple explanatory variables.
Correlation and Prediction Error The amount of prediction error is associated with the strength of the correlation between X and Y.
Stat 35b: Introduction to Probability with Applications to Poker Outline for the day, Tue 3/13/12: 1.Collect Hw WSOP main event. 3.Review list.
STA Lecture 181 STA 291 Lecture 18 Exam II Next Tuesday 5-7pm Memorial Hall (Same place) Makeup Exam 7:15pm – 9:15pm Location TBA.
6.2 Homework Questions.
Chapter 7 Lesson 7.6 Random Variables and Probability Distributions 7.6: Normal Distributions.
Inference for Regression Chapter 14. Linear Regression We can use least squares regression to estimate the linear relationship between two quantitative.
Stat 13, Tue 4/24/ Review list. 2. Examples of studies. 3. Summary of probability rules. 4. General probability strategy. 5. Probability examples.
STA Lecture 191 STA 291 Lecture 19 Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm – 9:15pm Location CB 234.
Introduction to Statistics Chapter 6 Feb 11-16, 2010 Classes #8-9
Stat 13, Tue 5/1/ Midterm Expected value, lotto and roulette. 3. Normal calculations. Read ch4. Hw4 is due Tue, 5/8, and Midterm 2 is Thur,
Discrete Random Variables
§ 5.3 Normal Distributions: Finding Values. Probability and Normal Distributions If a random variable, x, is normally distributed, you can find the probability.
 The point estimators of population parameters ( and in our case) are random variables and they follow a normal distribution. Their expected values are.
Stat 35b: Introduction to Probability with Applications to Poker Outline for the day: 1. Review list 2.Bayes’ Rule example 3.CLT example 4.Other examples.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Practice Page 128 –#6.7 –#6.8 Practice Page 128 –#6.7 =.0668 = test scores are normally distributed –#6.8 a =.0832 b =.2912 c =.4778.
Copyright © 2009 Pearson Education, Inc. 8.1 Sampling Distributions LEARNING GOAL Understand the fundamental ideas of sampling distributions and how the.
Math 3680 Lecture #15 Confidence Intervals. Review: Suppose that E(X) =  and SD(X) = . Recall the following two facts about the average of n observations.
Stat 13, Thu 4/19/ Hand in HW2! 1. Resistance. 2. n-1 in sample sd formula, and parameters and statistics. 3. Probability basic terminology. 4. Probability.
INFERENTIAL STATISTICS DOING STATS WITH CONFIDENCE.
1 Probability and Statistics Confidence Intervals.
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
AP Statistics Section 15 A. The Regression Model When a scatterplot shows a linear relationship between a quantitative explanatory variable x and a quantitative.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
The expected value The value of a variable one would “expect” to get. It is also called the (mathematical) expectation, or the mean.
The Normal Approximation for Data. History The normal curve was discovered by Abraham de Moivre around Around 1870, the Belgian mathematician Adolph.
Introduction Sample surveys involve chance error. Here we will study how to find the likely size of the chance error in a percentage, for simple random.
Regression Inference. Height Weight How much would an adult male weigh if he were 5 feet tall? He could weigh varying amounts (in other words, there is.
MIDTERM REVIEW October 7 th multiple choice questions (Scantron) ** Make sure you bring a #2 Pencil ** And your OWN GRAPHING CALCULATOR!
Basic Estimation Techniques
Stat 35b: Introduction to Probability with Applications to Poker
Basic Estimation Techniques
Stat 112 Notes 4 Today: Review of p-values for one-sided tests
Chapter 7 Sampling Distributions.
Chapter 7 Sampling Distributions.
BUSINESS MATHEMATICS & STATISTICS.
Statistical inference for the slope and intercept in SLR
Presentation transcript:

Stat 13, Tue 6/5/ Hand in hw7. 2. Practice problems. Final exam is Thur, 6/7, in class. It has 20 multiple choice questions. Only about 50% are on regression and correlation. The other 50% are on previous topics like study design and confounding factors (10%), probability (10%), normal calculations (10%), standard errors and confidence intervals (10%), and testing (10%). All these percentages here are approximate. You will get no credit for saying “none of the above” unless your answer is exactly right. You can use 2 pages, each double sided, of notes. Bring a calculator and a pen or pencil. Do not hand in your notes when you hand in the exam. 1

1. Suppose you are trying to use regression to predict wine prices by taste ratings. You take a SRS of 300 wines, have people rate their taste from 0 to 100, and record the prices of the wines. Suppose both taste ratings and wine prices are normally distributed. Suppose you find that your mean wine price is $10 and the sd is $4. a. What percentage of wines cost more than $15? Standardize the question: ($15 - $10)/$4 = 1.25, so we want the area under the standard normal curve greater than Using the z table, this is = = 10.56%. 2

1. Suppose you are trying to use regression to predict wine prices by taste ratings. You take a SRS of 300 wines, have people rate their taste from 0 to 100, and record the prices of the wines. Suppose both taste ratings and wine prices are normally distributed. Suppose you find that your mean wine price is $10 and the sd is $4. b. 95% of wine prices are in what range? $10 +/ ($4) = $10 +/- $

1. Suppose you are trying to use regression to predict wine prices by taste ratings. You take a SRS of 300 wines, have people rate their taste from 0 to 100, and record the prices of the wines. Suppose both taste ratings and wine prices are normally distributed. Suppose you find that your mean wine price is $10 and the sd is $4. c. What price would put a wine in the 80 th percentile of price? Using the z table, the 80 th percentile of the standard normal is about We need to convert this 0.84 from standard units to dollars ($4) + $10 = $

1. Suppose you are trying to use regression to predict wine prices by taste ratings. You take a SRS of 300 wines, have people rate their taste from 0 to 100, and record the prices of the wines. Suppose both taste ratings and wine prices are normally distributed. Suppose you find that your mean wine price is $10 and the sd is $4. d. Suppose the slope of the regression line is.1, and the correlation is 0.5. What is the sd of the taste ratings in your sample? b = r s y / s x, so s x = r s y / b = (0.5)(4)/.1 ~ 20. 5

1. Suppose you are trying to use regression to predict wine prices by taste ratings. You take a SRS of 300 wines, have people rate their taste from 0 to 100, and record the prices of the wines. Suppose both taste ratings and wine prices are normally distributed. Suppose you find that your mean wine price is $10 and the sd is $4. Suppose the slope of the regression line is.1, and the correlation is 0.5. e. Suppose the intercept of the regression line is $6. What is the sample mean wine taste rating? b =.1, and we know a = - b = = $6. So,.1 = = 4, so = 4/.1 = 40. 6

1. Suppose you are trying to use regression to predict wine prices by taste ratings. You take a SRS of 300 wines, have people rate their taste from 0 to 100, and record the prices of the wines. Suppose both taste ratings and wine prices are normally distributed. Suppose you find that your mean wine price is $10 and the sd is $4. Suppose again that the slope of the regression line is.1, the correlation is 0.5, and the intercept of the regression line is $6. f. Use the regression line to predict the cost of a wine that has a taste rating of 55. b =.1, and a = $6, so = $6 +.1 x, and here x = 55, so = $6 +.1 (55) = $

1. Suppose you are trying to use regression to predict wine prices by taste ratings. You take a SRS of 300 wines, have people rate their taste from 0 to 100, and record the prices of the wines. Suppose both taste ratings and wine prices are normally distributed. Suppose you find that your mean wine price is $10 and the sd is $4. Suppose again that the slope of the regression line is.1, the correlation is 0.5, and the intercept of the regression line is $6. g. +/- what? That is, roughly how much do you expect your prediction to be off by? √(1-r 2 ) s y = ($4) ~ $

1. Suppose you are trying to use regression to predict wine prices by taste ratings. You take a SRS of 300 wines, have people rate their taste from 0 to 100, and record the prices of the wines. Suppose both taste ratings and wine prices are normally distributed. Suppose you find that your mean wine price is $10 and the sd is $4. Suppose again that the slope of the regression line is.1 and the intercept of the regression line is $6. h. Which of the following is your interpretation of the regression estimates? (i) If you make a wine taste 1 unit better, then it will cost ten cents more. (ii) Wines that taste 1 unit better cost, on average, 10 cents more. (iii) A wine that gets a taste score of 200 would cost around $6 +.1(200) = $26. 9

2. Suppose we deal two cards without replacement from an ordinary deck, and consider a face card a J, Q or K. What is the expected number of face cards you are dealt? Let X = the number of face cards you are dealt. X = 0, 1, or 2. E(X) = 0 P(X=0) + 1 P(X=1) + 2 P(X=2) = 0 + 1P(X=1) + 2P(X=2). P(X=1) = P(you get 1 face card and 1 nonface card) = P(your 1st card is face and 2 nd isn’t) + P(your 1 st card isn’t face and your 2 nd is) = P(1 st is face) P(2 nd isn’t face | 1 st is face) + P(1 st isn’t face)P(2 nd is face | 1 st isn’t) = 12/52 x 40/ /52 x 12/51 = 36.2%. P(X=2) = P(1 st card is face and 2 nd is face) = P(1 st card is face) P(2 nd is face | 1 st is face) = 12/52 x 11/51 = 4.98%. So E(X) = x 36.2% + 2 x 4.98% =