Using Lock5 Statistics: Unlocking the Power of Data

Slides:



Advertisements
Similar presentations
Panel at 2013 Joint Mathematics Meetings
Advertisements

Using Randomization Methods to Build Conceptual Understanding in Statistical Inference: Day 1 Lock, Lock, Lock, Lock, and Lock MAA Minicourse – Joint Mathematics.
StatKey Online Tools for Teaching a Modern Introductory Statistics Course Robin Lock Burry Professor of Statistics St. Lawrence University
Early Inference: Using Bootstraps to Introduce Confidence Intervals Robin H. Lock, Burry Professor of Statistics Patti Frazer Lock, Cummings Professor.
Hypothesis Testing I 2/8/12 More on bootstrapping Random chance
A Fiddler on the Roof: Tradition vs. Modern Methods in Teaching Inference Patti Frazer Lock Robin H. Lock St. Lawrence University Joint Mathematics Meetings.
STAT 101 Dr. Kari Lock Morgan Exam 2 Review.
Connecting Simulation- Based Inference with Traditional Methods Kari Lock Morgan, Penn State Robin Lock, St. Lawrence University Patti Frazer Lock, St.
Dr. Kari Lock Morgan Department of Statistics Penn State University Teaching the Common Core: Making Inferences and Justifying Conclusions ASA Webinar.
Starting Inference with Bootstraps and Randomizations Robin H. Lock, Burry Professor of Statistics St. Lawrence University Stat Chat Macalester College,
Using Simulation Methods to Introduce Statistical Inference Patti Frazer Lock Kari Lock Morgan Cummings Professor of Mathematics Assistant Professor of.
Building Conceptual Understanding of Statistical Inference with Lock 5 Dr. Kari Lock Morgan Department of Statistical Science Duke University Wake Forest.
Introducing Inference with Simulation Methods; Implementation at Duke University Kari Lock Morgan Department of Statistical Science, Duke University
StatKey Online Tools for Teaching a Modern Introductory Statistics Course Robin Lock Burry Professor of Statistics St. Lawrence University
Statistics: Unlocking the Power of Data Lock 5 Inference for Proportions STAT 250 Dr. Kari Lock Morgan Chapter 6.1, 6.2, 6.3, 6.7, 6.8, 6.9 Formulas for.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative.
Building Conceptual Understanding of Statistical Inference Patti Frazer Lock Cummings Professor of Mathematics St. Lawrence University
Using Simulation Methods to Introduce Inference Kari Lock Morgan Duke University In collaboration with Robin Lock, Patti Frazer Lock, Eric Lock, Dennis.
More About Significance Tests
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Statistics: Unlocking the Power of Data Lock 5 Synthesis STAT 250 Dr. Kari Lock Morgan SECTIONS 4.4, 4.5 Connecting bootstrapping and randomization (4.4)
Statistics: Unlocking the Power of Data Lock 5 Afternoon Session Using Lock5 Statistics: Unlocking the Power of Data Patti Frazer Lock University of Kentucky.
Building Conceptual Understanding of Statistical Inference Patti Frazer Lock Cummings Professor of Mathematics St. Lawrence University
Statistics: Unlocking the Power of Data Patti Frazer Lock Cummings Professor of Mathematics St. Lawrence University University of Kentucky.
Introducing Inference with Simulation Methods; Implementation at Duke University Kari Lock Morgan Department of Statistical Science, Duke University
+ Chapter 12: Inference for Regression Inference for Linear Regression.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.2.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
Robin Lock St. Lawrence University USCOTS Opening Session.
CHAPTER 17: Tests of Significance: The Basics
Lesson Inference for Regression. Knowledge Objectives Identify the conditions necessary to do inference for regression. Explain what is meant by.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
StatKey Online Tools for Teaching a Modern Introductory Statistics Course Robin Lock Burry Professor of Statistics St. Lawrence University
Introducing Inference with Bootstrapping and Randomization Kari Lock Morgan Department of Statistical Science, Duke University with.
Implementing a Randomization-Based Curriculum for Introductory Statistics Robin H. Lock, Burry Professor of Statistics St. Lawrence University Breakout.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.1.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.1.
Building Conceptual Understanding of Statistical Inference Patti Frazer Lock Cummings Professor of Mathematics St. Lawrence University Canton, New York.
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
Give your data the boot: What is bootstrapping? and Why does it matter? Patti Frazer Lock and Robin H. Lock St. Lawrence University MAA Seaway Section.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 12/6/12 Synthesis Big Picture Essential Synthesis Bayesian Inference (continued)
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Statistics: Unlocking the Power of Data Lock 5 Normal Distribution STAT 250 Dr. Kari Lock Morgan Chapter 5 Normal distribution (5.1) Central limit theorem.
Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
StatKey Online Tools for Teaching a Modern Introductory Statistics Course Robin Lock Burry Professor of Statistics St. Lawrence University
Bootstraps and Scrambles: Letting a Dataset Speak for Itself Robin H. Lock Patti Frazer Lock ‘75 Burry Professor of Statistics Cummings Professor of MathematicsSt.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.1.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Synthesis and Review for Exam 1.
Using Randomization Methods to Build Conceptual Understanding in Statistical Inference: Day 1 Lock, Lock, Lock, Lock, and Lock Minicourse – Joint Mathematics.
CHAPTER 10 Comparing Two Populations or Groups
Inference: Conclusion with Confidence
Using Simulation Methods to Introduce Inference
Using Simulation Methods to Introduce Inference
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 12 More About Regression
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Comparing Two Proportions
Comparing Two Proportions
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Teaching with Simulation-Based Inference, for Beginners
CHAPTER 10 Comparing Two Populations or Groups
Presentation transcript:

Using Lock5 Statistics: Unlocking the Power of Data Classroom Suggestions Using Lock5 Statistics: Unlocking the Power of Data Patti Frazer Lock

Chapter 1: Collecting Data Why is this first? Comes first in actual analysis More interesting than histograms and mean/median! Data! Categorical vs Quantitative Variables Concept of a dataset with cases as rows and variables as columns Data Collection “Random” in random sampling does not mean haphazard! And you can NOT do random! Randomized experiment necessary to make conclusions about causality ALWAYS think about how the data were collected before making conclusions

Chapter 1: Collecting Data Focus is not on memorizing methods, but on thinking critically about how data are collected Should be fun and interesting! (See Instructor Resources) Relatively hard to assess Can give only minimal coverage to some of the details if desired

Chapter 2: Describing Data Pretty straightforward Outline: Single variables Categorical Quantitative Relationships between variables Two categorical One categorical and one quantitative Two quantitative Discuss relevant graphs and summary statistics in each case

Chapter 2: Describing Data All graphs and most statistics found using technology Use interesting datasets! Reinforce ideas from Chapter 1 Possibly introduce StatKey or other relevant software at this point www.lock5stat.com/statkey

StatKey www.lock5stat.com/statkey

Unit A Essential Synthesis One day Flipped classroom Integrate ideas from Chapters 1 and 2

Chapter 3: Confidence Intervals Sampling variability/Sampling distributions Concepts of “margin of error” and “standard error” Concept of a confidence interval or interval estimate StatKey might be helpful

StatKey www.lock5stat.com/statkey

Chapter 3: Confidence Intervals Sampling Distribution: Have access to entire population Take many samples of the same size and record some statistic Not feasible in practice! Bootstrap Distribution Only have one sample Take many samples of the same size (with replacement) from that one sample and record some statistic Feasible!! And gives same approximate shape and standard error!!

Chapter 3: Confidence Intervals Using Bootstrap Distributions to reinforce the ideas of: Sampling variability/Sampling distributions Margin of error Standard error Interval estimate that is likely to contain the true value of the parameter

Chapter 3: Confidence Intervals Using Bootstrap Distributions to construct confidence intervals: Using: Statistic ± 2· SE (helps get them used to the formulas that will come later) Using middle 95% (helps them understand confidence level)

StatKey www.lock5stat.com/statkey

StatKey Sample mean Standard Error

𝑂𝑟𝑖𝑔𝑖𝑛𝑎𝑙 𝑆𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐 ±2∙𝑆𝐸 Using the Bootstrap Distribution to Get a Confidence Interval – Method #1 The standard deviation of the bootstrap statistics estimates the standard error of the sample statistic. Quick interval estimate : 𝑂𝑟𝑖𝑔𝑖𝑛𝑎𝑙 𝑆𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐 ±2∙𝑆𝐸 For the mean Mustang prices: 15.98±2∙2.178=15.98±4.36=(11.62, 20.34)

Using the Bootstrap Distribution to Get a Confidence Interval – Method #2 Keep 95% in middle Chop 2.5% in each tail Chop 2.5% in each tail We are 95% sure that the mean price for Mustangs is between $11,930 and $20,238

Chapter 3: Confidence Intervals At the end of this chapter, students should be able to understand and interpret confidence intervals (for a variety of different parameters) (And be able to construct them using the bootstrap method) (which is the same method for all parameters)

Chapter 4: Hypothesis Tests State null and alternative hypotheses (for many different parameters) Understand the idea behind a hypothesis test (stick with the null unless evidence is strong for the alternative) Understand a p-value (!) State the conclusion in context (Conduct a randomization hypothesis test)

P-value: The probability of seeing results as extreme as, or more extreme than, the sample results, if the null hypothesis is true. Say what????

Example 1: Beer and Mosquitoes Does consuming beer attract mosquitoes? Experiment: 25 volunteers drank a liter of beer, 18 volunteers drank a liter of water Randomly assigned! Mosquitoes were caught in traps as they approached the volunteers.1 1 Lefvre, T., et. al., “Beer Consumption Increases Human Attractiveness to Malaria Mosquitoes, ” PLoS ONE, 2010; 5(3): e9546.

Beer and Mosquitoes Number of Mosquitoes Beer 27 20 21 26 31 24 19 23 28 29 17 25 18 Water 21 22 15 12 16 19 24 23 13 20 18 Does drinking beer actually attract mosquitoes, or is the difference just due to random chance? H0: μB = μW Ha: μB > μW 𝑥 𝐵 =23.60 𝑥 𝑊 =19.22 𝑥 𝐵 − 𝑥 𝑊 =4.38

Traditional Inference 1. Check conditions 5. Which theoretical distribution? 2. Which formula? 𝑡= 𝑥 𝐵 − 𝑥 𝑊 𝑠 𝐵 2 𝑛 𝐵 + 𝑠 𝑊 2 𝑛 𝑊 6. df? 7. find p-value 8. Interpret a decision 3. Calculate numbers and plug into formula 𝑡= 23.6−19.22 4.1 2 25 + 3.7 2 18 4. Chug with calculator 𝑡=3.68 0.0005 < p-value < 0.001

Simulation Approach Number of Mosquitoes Beer 27 20 21 26 31 24 19 23 28 29 17 25 18 Water 21 22 15 12 16 19 24 23 13 20 18 Does drinking beer actually attract mosquitoes, or is the difference just due to random chance? H0: μB = μW Ha: μB > μW 𝑥 𝐵 =23.60 𝑥 𝑊 =19.22 𝑥 𝐵 − 𝑥 𝑊 =4.38

Simulation Approach Number of Mosquitoes Find out how extreme these results would be, if there were no difference between beer and water. What kinds of results would we see, just by random chance (i.e. beverage doesn’t matter)? Beer 27 20 21 26 31 24 19 23 28 29 17 25 18 Water 21 22 15 12 16 19 24 23 13 20 18 27 21 20 22 21 15 26 12 31 16 24 19 19 15 23 24 28 23 19 13 24 22 29 20 20 24 17 18 31 20 25 28 21 27 18 20 Re-randomize results into Beer and Water groups

Simulation Approach StatKey Number of Mosquitoes Find out how extreme these results would be, if there were no difference between beer and water. What kinds of results would we see, just by random chance (i.e. beverage doesn’t matter)? Beer Water 20 22 21 15 26 12 27 21 31 16 24 19 19 15 23 24 28 23 19 13 24 22 29 20 20 24 17 18 31 20 25 28 21 27 18 20 27 21 21 20 24 19 31 13 18 25 15 16 28 22 27 23 20 26 31 19 23 15 22 12 24 29 27 21 17 28 𝑥 𝐵 =21.76 𝑥 𝑊 =22.50 Re-randomize results into Beer and Water groups StatKey 𝑥 𝐵 − 𝑥 𝑊 =−0.84 Repeat MANY times

StatKey www.lock5stat.com/statkey

StatKey! www.lock5stat.com P-value

Traditional Inference 1. Which formula? 4. Which theoretical distribution? 5. df? 6. find p-value 2. Calculate numbers and plug into formula 3. Plug into calculator 0.0005 < p-value < 0.001

Beer and Mosquitoes The Conclusion! The results seen in the experiment are very unlikely to happen just by random chance (just 1 out of 1000!) We have strong evidence that drinking beer does attract mosquitoes!

P-value: The probability of seeing results as extreme as, or more extreme than, the sample results, if the null hypothesis is true. How extreme are the sample results in the randomization distribution? Randomization distribution must assume null hypothesis is true

Chapter 4: Hypothesis Tests State null and alternative hypotheses Understand the idea behind a hypothesis test Understand a p-value (!) State the conclusion in context Can minimize the details of how the randomization is carried out -- Important idea is that the process must assume the null hypothesis is true!

By this point in the course, students have all the key ideas of inference!!!! Take your time through Chapters 3 and 4 You can make up the time later – Chapters 5 and 6 go quickly!

Unit B Essential Synthesis One day Flipped classroom Integrate ideas from Chapters 1 through 4

Chapter 5: Normal Distribution Finding probabilities and cutoff values on a normal distribution Using a distribution for confidence intervals: And hypothesis tests: 𝑆𝑎𝑚𝑝𝑙𝑒 𝑆𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐± 𝑧 ∗ ∙𝑆𝐸 𝑡.𝑠.= 𝑆𝑎𝑚𝑝𝑙𝑒 𝑆𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐 −𝑁𝑢𝑙𝑙 𝑃𝑎𝑟𝑎𝑚𝑒𝑡𝑒𝑟 𝑆𝐸

StatKey www.lock5stat.com/statkey

Chapter 6: Short-cut Formulas Short sections can be covered in any order you want!! Proportions or means first One sample or two first Confidence intervals or hypothesis tests first Can be covered quickly! Mostly just lots of new SE formulas! Do more than one section a day!!!

StatKey Sample stats 𝑆𝐸= 𝑠 𝑛 = 11.114 25 =2.22

Additional Topics Chi-square Tests (Chapter 7) ANOVA for difference in means (Chapter 8) Inference for simple regression (Chapter 9) and multiple regression (Chapter 10) These can be done in any order (Also, probability – chapter 11 – can be omitted or covered at any point in the course)

StatKey www.lock5stat.com/statkey

Instructor Resources PowerPoint slides for every section Clicker questions for every section Notes and suggestions for every section Instructor video for every section Class worksheet(s) for every section Class activity for every section Videos for every example and every learning goal WileyPLUS (with most content designed by us) Software manuals for R, Minitab, Fathom, Excel, SAS, TI calculators Datasets ready to import in these formats Test bank

Feel free to contact me or any of the authors at any time if you have any questions or suggestions for improvement. Thanks! lock5stat.com