You are about to take the AP Stats test and…. MAKE SURE YOU HAVE… MULTIPLE PENCILS… 1 OR 2 CALCULATORS… EXTRA BATTERIES… EAT A FULL MEAL BEFOREHAND!

Slides:



Advertisements
Similar presentations
Data: Quantitative (Histogram, Stem & Leaf, Boxplots) versus Categorical (Bar or Pie Chart) Boxplots: 5 Number Summary, IQR, Outliers???, Comparisons.
Advertisements

AP Statistics Course Review.
Chapter 7: Data for Decisions Lesson Plan
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
General Set up of the exam. Two Sections Multiple Choice Multiple Choice 90 minutes 90 minutes 40 questions 40 questions Free Response Free Response 90.
Tips for Writing Free Response Questions on the AP Statistics Exam Laura Trojan Cannon School.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
The Simple Regression Model
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Chapter 11 Multiple Regression.
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
Inference about a Mean Part II
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Inferences About Process Quality
Today Concepts underlying inferential statistics
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Correlation and Regression Analysis
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Chapter 12 Section 1 Inference for Linear Regression.
Relationships Among Variables
AP Stats Test Review  What are the four parts of the course? Inference, Experimental Design, Probability, and Data Analysis  How many multiple choice.
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 2: Basic techniques for innovation data analysis. Part I: Statistical inferences.
Chapter 5 Sampling Distributions
Inferences for Regression
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
Wednesday, May 13, 2015 Report at 11:30 to Prairieview.
Final Exam Review!!!!!. Experimental Design Random Sampling Stratified Sampling Convenience Sampling Cluster Sampling Systematic Sampling Multistage Sampling.
Chapter 7: Data for Decisions Lesson Plan Sampling Bad Sampling Methods Simple Random Samples Cautions About Sample Surveys Experiments Thinking About.
Research Process Parts of the research study Parts of the research study Aim: purpose of the study Aim: purpose of the study Target population: group whose.
Fall Final Topics by “Notecard”.
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
June 11, 2008Stat Lecture 10 - Review1 Midterm review Chapters 1-5 Statistics Lecture 10.
Section 10.1 Estimating with Confidence AP Statistics February 11 th, 2011.
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
Review Lecture 51 Tue, Dec 13, Chapter 1 Sections 1.1 – 1.4. Sections 1.1 – 1.4. Be familiar with the language and principles of hypothesis testing.
Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.
Chapter 14: Inference for Regression. A brief review of chapter 4... (Regression Analysis: Exploring Association BetweenVariables )  Bi-variate data.
Inference: Probabilities and Distributions Feb , 2012.
Review ~ Data & Displays Chapters 1-6 Key Terms: –Data : Categorical versus Quantitative –Frequency Table/Contingency Table/Marginal Distribution/Conditional.
© Copyright McGraw-Hill 2004
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Course Review. Distributions What are the important aspects needed to describe a distribution of one variable? List three types of graphs that could be.
AP Stats Review day 1 April 2, Basics Two Parts (90 Minutes each part) – 40 Multiple Choice Content Questions (10-15) Calculation Questions(25-30)
Marginal Distribution Conditional Distribution. Side by Side Bar Graph Segmented Bar Graph Dotplot Stemplot Histogram.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
A.P. STATISTICS EXAM REVIEW TOPIC #2 Tests of Significance and Confidence Intervals for Means and Proportions Chapters
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Howard Community College
Thursday, May 12, 2016 Report at 11:30 to Prairieview
CHAPTER 12 More About Regression
AP Stat Review Know Your Book!
CHAPTER 12 More About Regression
Correlation and Regression
When You See (This), You Think (That)
CHAPTER 12 More About Regression
Major Topics first semester by chapter
Major Topics first semester by chapter
CHAPTER 12 More About Regression
Fall Final Topics by “Notecard”.
Presentation transcript:

You are about to take the AP Stats test and…

MAKE SURE YOU HAVE… MULTIPLE PENCILS… 1 OR 2 CALCULATORS… EXTRA BATTERIES… EAT A FULL MEAL BEFOREHAND!

Statistics is about… Models A model is an attempt to represent reality… …but we know it’s not perfect.

Models “All models are wrong, but some are useful.” (George Box)

Models Common models:  Regression lines  Simulations AND MORE…

Avoid confusion … known vs unknowable samples vs populations statistics vs parameters THINK about Models

Probability Models: Normal Model Geometric & Binomial t - Models Chi-Square Models THINK about Models

The calculator… … can’t TELL what it all means. YOU have to do that, too!

Remember … Answers are not numbers, answers are sentences in context.

The 68, 95, 99.7 Rule Only works for Normal Distributions.

When describing Univariate Data (1 variable) Shape! Center! Spread!

Range is… A way to measure the “spread” of data It is a single number, such as 40. (not 30-70) Variance, standard deviation, IQR are other ways to measure the “spread” of data.

Mean or Median is… A way to measure the location of data (center) They are a single number

Adding a constant to every value in a data set… Changes the central location of the data (such as mean or median) Does NOT change the spread of the data (such as standard deviation, variance, IQR)

Multiplying by a constant to every value in a data set… Changes the central location of the data (such as the mean or the median) DOES CHANGE the spread of the data (such as standard deviation, variance, IQR)

When describing bivariate quantitative data… Form! Strength! Direction! This is when describing a scatterplot, linear regression, or the like...

A residual is… The vertical distance from point to LSRL It is calculated as Observed Y – Predicted Y All points ABOVE the LSRL have positive residuals! All points BELOW the LSRL have negative residuals!

An Exponential Model is best fit when… Log Y vs X is linear Variable is in the exponent

A Power Model is best fit when… Log Y vs. Log X is linear Variable is in the base

Interpreting Slope… “For every [unit] increase in [x], we expect an [? unit] increase in [y]

Interpreting Y Intercept When [x] is 0 [units], we expect [y] to be [? units]

You will see the word “COMPARE” at least once… So COMPARE!!! Don’t list attributes… use comparative words!

Lurking variable (Common Response) x y z

Confounding Variable x y z

“Correlation does not imply causation!” Be careful with the word “cause”. The only way to prove causation is a properly designed experiment.

R is called the… “Correlation Coefficient” Or just…. “Correlation” It measure the strength and direction of a linear relationship (no context for nonlinear relationships)

R-squared is called… “Coefficient of Determination” And is interpreted as “the % of the variation in [y] that is explained by [x]” It can also be thought of as sum of explained error/sum of total error

If r squared is.64 Then r =.8 OR r= -.8 Figure it out by looking at the direction of the scatterplot!

Simpson’s Paradox is… When combining the data from 2 groups results in a reversal of direction of the conclusion.

Placebo Effect Giving a person a sugar pill and telling them it will make them feel better

Double Blind Neither the subjects nor the experimenter know which treatment the subject is receiving

Disjoint Both cant occur simultaneously

Mutually Exclusive One or the other must occur

Checks for Independence P(A and B)=p(A)p(B) Or P(B)=p(B|A)

Rules for Means and Variances of Discrete Random Variables P420

Central Limit Theorem As sample size increases, the shape of the sampling distribution gets more and more normal, regardless of the shape of the parent distribution. The center (mean) of the sampling distributions stays exactly the same. The variability in the sampling distribution (standard deviation) decreases.

Law of Large Numbers As sample size increases, the mean, Xbar, tends to get closer and closer to u.

DON’T FORGET TO… Use the proper NOTATION. P(x>2)=….

Notation is communication a, b, n, p, q, r, s, t, x, y, z, E, H, P, π, ,  all have special meanings… …and “hats” or “bars” change those meanings. You are not free to substitute another letter even though it looks like algebra.

4 Requirements of a Binomial Setting are… Independence Success/Failure for each trial Equal probability of success for each trial Fixed number of trials

The only difference between binomial setting and geometric setting is… Geometric does not have a fixed number of trials, it is waiting for the first success…

The mean of a geometric distribution is… 1/p (this is not on your formula sheet, but you should know it) To calculate a geometric probability, use a tree diagram

Undercoverage Bias When some groups are systematically left out of the sampling process (like people without phones in a phone survey)

Voluntary Response Bias When a sample consists of volunteers (like calling in to a radio survey)

Nonresponse Bias When an individual cant be contacted or refuses to cooperate

P of your “Phantoms” is “Define the parameter”. So Define your parameter (either p or u) as specifically as possible.

The grader of your test… Doesn’t know what “PANIC” and “PHANTOMS” are… they are simply for your own organization.

Your hypotheses must… Be about the PARAMETERS. Why make a hypothesis about the sample? (X bar and p hat shouldn’t ever be in the Hypotheses).

When you do inference… You hypothesize about what the value of a single number is (that number is usually u, or p).

What IS an Assumption? an underlying hypothesis about the situation required by the mathematical justification for the statistical method. WE WILL PROBABLY NEVER KNOW IF AN ASSUMPTION IS TRUE.

Draw the graph if you are given the data set! An outlier/skewness can dramatically affect your Test Statistic.

Your sample should be an SRS of the POPULATION OF INTEREST. Be specific. If the sample is randomly selected and unbiased, then we can generalize the findings to the population

We rarely know the population standard deviation, so the only time you do z- tests will probably be when using proportions.

Don’t Forget… Degrees of Freedom! -for all t-statistics and Chi-Square statistics. ***(r-1)(c-1) for Chi-Square Tests for Homogeneity and Chi-Square Tests for Independence ***n-2 for LinReg T Test *** n-1 for all others.

Can you find and interpret critical values? 1-sided or 2-sided? Draw a picture! Degrees of freedom? Critical values interpret to be the number of std. devs. away from the mean, usually in order to reject a hypothesis

Your Calculations in PHANTOMS should… Include equation and values you are plugging in… Normal curves will earn you a PLUS (but don’t draw a normal curve for Chi- Square Distributions, they are ALWAYS skewed.

Interpretation of Confidence Interval: “We are 95% confident that _____ falls within the interval _________”

Interpretation of Confidence Level “If we repeated this sampling and calculation process many times, 95% of all calculated intervals will correctly contain ______.”

Interpretation of the P-Value “A p-value of ______ indicates that if Ho is true, then we would obtain a sample statistics as extreme as ours less than (more than) ____% of the time due to random chance alone.”

The communication rubric: Your conclusion should be … Clear, Concise, Complete, and in Context!

You will be given an essay question that asks you to describe an experiment… Be thorough! Your answer must include Repetition, Randomization, Control, and Comparison. Don’t abbreviate “R.A.” Spell it out! – They will assume you don’t know what you are doing, but you do! SAY WHAT YOU ARE COMPARING!

When describing an experiment or simulation… Simply stating one of these things is not enough… “use a random number generator…” “use a table or random digit table…” “use a coin to randomly select…” YOU MUST DESCRIBE YOUR PROCEDURE IN DETAIL

Describing random assignment using a RDT… Label subjects to digits Peel digits Assign subjects to treatments

If you are asked to simulate something… Include a stopping rule as well

Type 1 error is… The probability of incorrectly rejecting the Null Hypothesis.

Type 2 error is… The probability of incorrectly failing to reject the null hypothesis. 1- Power

How will each of these affect the power of a test?… Using a larger alpha? Using a larger sample size? Using a smaller sigma? Choosing an alternative that is further from the population mean? Increasing Type 1 error? Decreasing Type 2 error? ……They all will Increase the Power!!!

Medians… Are resistant! Means are not!

IQR… Is a measure of spread!

The 1.5 IQR Rule… Adds or subtracts from Q1 and Q3!!! Not the median…

A Five Number Summary is Min, Q1, Med, Q3, Max 0%, 25%, 50%, 75%, 100%

Variance is Std. dev squared!

Show all your work, even for little calculations! AND WRITE NEATLY! Even show your work if the answer is 2+2 = 4

If you erase on your scantron sheet, erase thoroughly!

Please use… All 90 minutes of each section… It cant hurt you and you cant go anywhere… Once you think you are done, go back and try to find at least 1 problem to change/correct/improve.

The Investigative Task… Will be difficult- but it will be difficult for everybody. Even if you can get a 2 on it, you are ahead of the game.

If you don’t know an answer to part A of a question… Say “Suppose Part A is 3.6”… just so you can go on to part B… You can still get full credit for Part B if Part A is incorrect.

Every answer… Should be in context

If you want to review one more thing…. Review mean and variance of a discrete random variable (last pages in your review packet) The answers and explanations are included in the packet.

And the most important thing I can tell you…

READ CAREFULLY and ANSWER THE QUESTION! THEY LIKE TO ASK MULTIPLE QUESTIONS AT ONCE… SO ANSWER COMPLETELY!