Using Real vs Simulated Data in an Introductory Statistics Course
Christopher J. Malone, Joint Statistical Meetings, 8/09/01
Sections: Lit Review, Real/Simulated, Existing Systems, Examples, Survey

Presentation transcript:

Using Real vs Simulated Data in an Introductory Statistics Course
Christopher J. Malone
Kansas State University

Lit Review
Caldwell (1983), “Combining Real and Generated Data in Lab Exercises to Demonstrate Problems in Inference”, Proceedings of the Section on Statistics Education
–“Limiting lab exercises to the analysis of real data is analogous to practicing dart-throwing by concentrating on one’s form without being able to see how close each dart comes to the bullseye.”
–Gives several examples of real/simulated data exercises

Lit Review
Halley (1991), “Teaching Social Statistics with Simulated Data”, Teaching Sociology
–Real data contains missing codes…leads to unnecessary confusion
–…interesting and significant relationships often disappear when providing unique data sets with real data

Real/Simulated
What does real data bring to the classroom?
–Self-motivating
–Students can use preconceived judgments to “complete” an analysis (subjective analysis + analytical analysis)
–Students get a feel for real problems inherent in real data
–Obviously(?) more realistic

Real/Simulated
What does simulated (realistic) data bring to the classroom?
–More easily provide individualized data sets
–Easily investigate the purpose, concept, and behavior of a statistical procedure
–Avoid many of the pre-analysis issues
–Verify a statistical procedure
–Time management issues
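One way to picture “verify a statistical procedure” with simulated data is a coverage check: because the population mean is chosen by the instructor, the class can count how often a nominal 95% interval actually contains it. The sketch below is only an illustration; the function names, the normal population, and all parameter values are assumptions and do not come from the talk.

```python
import random
import statistics


def ci_covers_mean(rng, true_mean, true_sd, n, z=1.96):
    """Draw one simulated sample and report whether the
    large-sample z-interval for the mean contains true_mean."""
    sample = [rng.gauss(true_mean, true_sd) for _ in range(n)]
    xbar = statistics.fmean(sample)
    half_width = z * statistics.stdev(sample) / n ** 0.5
    return xbar - half_width <= true_mean <= xbar + half_width


def coverage_rate(true_mean=50.0, true_sd=10.0, n=40, reps=2000, seed=1):
    """Fraction of simulated samples whose interval covers the known mean.
    All default values are hypothetical classroom settings."""
    rng = random.Random(seed)
    hits = sum(ci_covers_mean(rng, true_mean, true_sd, n) for _ in range(reps))
    return hits / reps


if __name__ == "__main__":
    print(f"Estimated coverage: {coverage_rate():.3f}")
```

With real data the true mean is unknown, so this kind of check is only possible on the simulated side.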

Real/Simulated
Best Solution ???
Real + Simulated

Existing Systems
Previous Work…
–Gitlow & Oppenheim (1982), Stat City
–Halley (1991), “Teaching Social Statistics with Simulated Data”, GENSTAT system
–Chang et al. (1992), “Teaching Survey Sampling Using Simulation”, SURVEY
–Schwarz (1997), “StatVillage: An On-Line Hypothetical City Based on Real Data for Use in an Introductory Class in Survey Sampling”

Existing Systems
Gitlow & Oppenheim (1982), Stat City
–Objectives:
  1. …complete statistical problems…totality of statistical studies, from inception through memorandum
  2. unified statistical problems…
–Used at the undergraduate and graduate levels
–Wide variety of problems (e.g., telephone bills, Tax Assessor’s Office, territorial shopping behavior)
–Students’ response: “extremely enthusiastic”

Existing Systems
Halley (1991), “Teaching Social Statistics with Simulated Data”, GENSTAT system
–Used to assist instructors in the creation of sample data for demonstration, homework, lab work, and testing
–Very flexible (specify variable names, parameters, etc.)
–Emphasis placed on individualized data sets
–Creates a file of data and provides a complete solution

Existing Systems
Chang et al. (1992), “Teaching Survey Sampling Using Simulation”, SURVEY
–Used in introductory and advanced survey courses
–Simulates samples drawn from a hypothetical county
–Specific purpose: Cablevision Company
–Costs and non-response issues are incorporated
–Students’ response: “gave a feeling of realism to the class”

Existing Systems
Schwarz (1997), “StatVillage: An On-Line Hypothetical City Based on Real Data for Use in an Introductory Class in Survey Sampling”
–Two main selling points:
  1. Accessibility (World Wide Web)
  2. Based on actual census records
–Multiple variables, single location (Vancouver BC)
–Mentions “easily modify”, but not sure to what extent???

Existing Systems
Grades:

                            Stat City   GENSTAT   SURVEY   StatVillage
Individualized Data Sets        A          A         A          A
Overall Flexibility             D          A         D          C
Overall Accessibility           D          C         D          A
Change Population?              F          A         D          C
Uses Real Data?                 F          D         D          A
Solutions Provided              F          A         F          F

“Best” Solution ??
–Combine GENSTAT and StatVillage

Examples
Personal Example #1 (GENSTAT)
–Multiple linear regression (indicators/interaction/non-constant variance/outliers)
–Modeling used car prices based on mileage, age, and domestic/foreign
–Each group gets data from a variety of models
–Parameter estimates are specified (by the instructor) so that students may start in the same spot, but may end in a very different spot
–Might have to “sufficientize” the data for grading purposes
Must communicate “important” issues that arise within groups across groups!!
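The idea of instructor-specified parameters with a different data set per group can be sketched as follows. This is not GENSTAT's actual interface, which the slides do not describe; the coefficient values, variable ranges, error structure, and the `make_group_data` name are all hypothetical choices made for illustration.

```python
import random

# Hypothetical "true" coefficients an instructor might specify (not from the talk).
PARAMS = {
    "intercept": 20000.0,
    "per_mile": -0.03,
    "per_year": -700.0,
    "foreign": 1500.0,
    "foreign_x_age": -150.0,  # interaction between origin and age
}


def make_group_data(group_id, n=30, params=PARAMS, n_outliers=1):
    """Return n simulated used-car records for one group.
    Seeding with group_id means a group always gets the same data."""
    rng = random.Random(group_id)
    rows = []
    for _ in range(n):
        age = rng.randint(1, 12)
        miles = age * rng.uniform(8000, 15000)
        foreign = rng.randint(0, 1)  # indicator: 1 = foreign, 0 = domestic
        mean_price = (params["intercept"]
                      + params["per_mile"] * miles
                      + params["per_year"] * age
                      + params["foreign"] * foreign
                      + params["foreign_x_age"] * foreign * age)
        # Non-constant variance: newer cars get a larger error spread.
        sd = 300.0 + 150.0 * (12 - age)
        rows.append({"age": age, "miles": round(miles), "foreign": foreign,
                     "price": round(mean_price + rng.gauss(0, sd), 2)})
    # Inflate a few prices so each group has outliers to discuss.
    for row in rng.sample(rows, n_outliers):
        row["price"] = round(row["price"] * 1.6, 2)
    return rows


if __name__ == "__main__":
    for record in make_group_data(group_id=3)[:5]:
        print(record)
```

Because every group's data come from known coefficients, an instructor could compare each group's fitted model against the values in `PARAMS` when grading.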

Examples
Personal Example #2 (StatVillage)
–Selling prices of homes in local area over the past 3 years
–Each group gets a particular “area” or a random sample from the entire database
–Students visit the database once for simple linear regression and return for multiple linear regression (same observations used the second time for comparison purposes)
Must communicate “important” issues that arise within groups across groups!!
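Returning the same observations on a second visit is straightforward if each group's draw is tied to a fixed seed. The sketch below assumes a list of record identifiers and an integer group number; it does not reflect how StatVillage itself selects records, and `draw_sample` is a hypothetical helper.

```python
import random


def draw_sample(record_ids, group_id, n):
    """Return the same n record ids every time a given group asks.
    record_ids: a list (or range) of identifiers in the database.
    group_id:   an integer used as the random seed for that group."""
    rng = random.Random(group_id)
    return sorted(rng.sample(record_ids, n))


if __name__ == "__main__":
    home_ids = list(range(1, 501))  # hypothetical database of 500 home sales
    first_visit = draw_sample(home_ids, group_id=4, n=20)   # simple regression
    second_visit = draw_sample(home_ids, group_id=4, n=20)  # multiple regression
    print(first_visit == second_visit)
```

The second visit can then pull additional variables for the same homes, so the simple and multiple regression fits are directly comparable.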

Examples
Personal Example #3 (StatVillage +, -Real)
–Planet X
  Students are asked to visit the planet to obtain data for all “missions” (projects)
  Data are “different” than here on earth (wanted relationships to be unknown)
  Students pose research questions, gather relevant variables, write briefings, missing values included,…
–Side-effects (Good/Bad, you decide…)
  Students never see real data
  Prevents subjective analysis
  Students’ results are not verified
Must communicate “important” issues that arise within groups across groups!!

Survey
Very Simple Survey
–Students (Spring 2001 semester): 87 respondents
  Second-semester introductory business statistics
–Faculty & GTAs (May 2001): 9 respondents
  Teach a variety of classes (undergraduate & graduate)

Survey
Is there a difference between real data and realistic data? (Students)
–20% said yes
–“Realistic means it was generated, but probably reflects the ‘norm’”
–“I like the real data because everything doesn’t come out all clean and nice feeling”

Survey
Is there a difference between real data and realistic data? (Faculty & GTAs)
–“Yes” by all
–“Real data often obscures the purpose…”
–“Context is what matters…”
–“I like real data much more”

Survey
Additional Questions (1=Low, 5=High)
–A. How important is it for you to create your own question of interest?
–B. How important is it to use real data?
–C. How important is it to use realistic data?
–D. How important is it that all students have the same data set?
–E. How important is it that all students do the same analyses?
–F. How important is individualism/ownership?

Survey
Results: Students (chart not preserved in the transcript)

Survey
Results: Students/Faculty (chart comparing Students and Faculty not preserved in the transcript)

Future work…
–Create a web-based interface so that students can easily get samples of real data or simulated data
–Instructors provide the file (Excel, say) and samples are obtained through the web
–Automate a procedure for verification of results
–Problems with groups need to be communicated across groups; this is very important for learning!
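As a rough illustration of what automated verification might look like, the sketch below regenerates a student's simulated sample from a seed and compares the submitted summary statistics against the recomputed ones. The talk does not specify any such procedure; the function names, the normal population, and the tolerance are assumptions.

```python
import math
import random
import statistics


def simulated_sample(student_id, n=25):
    """Regenerate the (hypothetical) sample issued to a student.
    The seed is the student's id, so the data never need to be stored."""
    rng = random.Random(student_id)
    return [round(rng.gauss(100, 15), 1) for _ in range(n)]


def check_submission(student_id, submitted_mean, submitted_sd, tol=0.05):
    """Return True if both submitted values match the recomputed
    sample mean and standard deviation within an absolute tolerance."""
    data = simulated_sample(student_id)
    return (math.isclose(submitted_mean, statistics.fmean(data), abs_tol=tol)
            and math.isclose(submitted_sd, statistics.stdev(data), abs_tol=tol))


if __name__ == "__main__":
    data = simulated_sample(student_id=7)
    print(check_submission(7, statistics.fmean(data), statistics.stdev(data)))
```

The same pattern would extend to regression output by recomputing the fitted coefficients instead of the mean and standard deviation.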