What is a Random Sample (and what if it’s not) ©Dr. B. C. Paul 2005.

Presentation transcript:

Some Commentary on Random Samples
- We are using mathematical models as surrogates for a reality that we either don’t have data for or can’t afford to collect data on.
- We’ve already discussed that we assumed a normal distribution (the t distribution is just an adaptation that accounts for uncertainty in the stdev).
- What does it mean to say our sample was random?
  1- No one cherry-picked the data set (this can be a problem when visual appearance differs; humans are born cherry pickers).
  2- The value of one sample has no bearing on what the next sample value will be.

When is That Not True
- When taking the sample alters the nature of the remaining population
- Example: playing blackjack
  - When a card is drawn and played, the number of that particular card left in the deck changes.
  - Casinos may play with several decks to more closely approximate a random chance draw (because the house has an advantage in a random game).
  - Casinos also tend to get upset if they find that someone is trying to recalculate the odds based on what has been played.
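The blackjack point can be checked with a little arithmetic. The sketch below is only an illustration: the function name `prob_next_is_ace` and the choice of six decks are assumptions, not something from the slides. It compares the chance that the next card is an ace after one ace has already been played, for a single deck and for six decks.

```python
from fractions import Fraction


def prob_next_is_ace(decks: int, aces_seen: int, cards_seen: int) -> Fraction:
    """Chance the next card is an ace, given how many cards and aces are gone.

    Hypothetical helper for illustration: each deck has 52 cards and 4 aces.
    """
    aces_left = 4 * decks - aces_seen
    cards_left = 52 * decks - cards_seen
    return Fraction(aces_left, cards_left)


# Fresh deck: the "random draw" probability.
fresh = prob_next_is_ace(decks=1, aces_seen=0, cards_seen=0)

# One ace has been played from a single deck.
one_deck = prob_next_is_ace(decks=1, aces_seen=1, cards_seen=1)

# One ace has been played from a six-deck shoe (six is an assumed example).
six_decks = prob_next_is_ace(decks=6, aces_seen=1, cards_seen=1)

print(f"fresh deck : {float(fresh):.4f}")
print(f"one deck   : {float(one_deck):.4f}")
print(f"six decks  : {float(six_decks):.4f}")
```

With more decks, a single played card shifts the odds less, so each draw looks more like an independent random draw.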

Another Time it is not True
- In the presence of spatial correlation
- Spatial correlation is commonly seen in mining ore grade problems and environmental engineering.
- If I take a soil sample and find it loaded with dioxin, what are the chances that a soil sample taken two inches away will show no dioxin?
- With the random formula for the variance of the mean, there are so many little samples in a truckload of ore that every truckload should have the average grade of the deposit, IF THINGS WERE RANDOM.
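To see why the random formula predicts nearly identical truckloads, here is a small sketch of the stdev of a mean under the random-sample assumption, σ/sqrt(n). The sample stdev of 2.0 and the sample counts are made-up numbers for illustration only.

```python
import math


def stdev_of_mean_if_random(sample_stdev: float, n: int) -> float:
    """Stdev of the mean of n samples, valid only if the samples are random."""
    return sample_stdev / math.sqrt(n)


sample_stdev = 2.0  # hypothetical grade stdev of individual small samples

# As the number of small samples in a truckload grows, the predicted
# spread of truckload averages shrinks toward zero.
for n in (1, 100, 10_000, 1_000_000):
    print(f"n = {n:>9,d}  stdev of mean = {stdev_of_mean_if_random(sample_stdev, n):.4f}")
```

Under this formula a truckload made of a million little samples would vary by only a tiny fraction of the sample stdev, which is not what happens when nearby samples are correlated.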

Variance of Means with Spatial Correlation
- First, one must define how correlation is influenced by distance and direction.
- Take the samples and create a “semivariogram”.
- Plot the average half squared difference for all samples a distance X apart.
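The plotting step can be sketched as follows. This is a minimal illustration under stated assumptions: samples lie along a single line (so direction is ignored), the data are invented, and the function name `experimental_semivariogram` is hypothetical.

```python
def experimental_semivariogram(positions, values, lag, tolerance=1e-9):
    """Average half squared difference over all sample pairs `lag` apart.

    Returns None when no pair of samples is separated by that distance.
    """
    half_sq_diffs = []
    n = len(values)
    for i in range(n):
        for j in range(i + 1, n):
            separation = abs(positions[i] - positions[j])
            if abs(separation - lag) <= tolerance:
                half_sq_diffs.append(0.5 * (values[i] - values[j]) ** 2)
    if not half_sq_diffs:
        return None
    return sum(half_sq_diffs) / len(half_sq_diffs)


# Made-up samples taken one unit apart along a line.
positions = [0.0, 1.0, 2.0, 3.0, 4.0]
values = [1.0, 2.0, 4.0, 3.0, 5.0]

# One point of the semivariogram plot per distance.
gammas = {
    lag: experimental_semivariogram(positions, values, lag)
    for lag in (1.0, 2.0, 3.0)
}
for lag, gamma in gammas.items():
    print(f"distance {lag}: half squared difference = {gamma:.3f}")
```

Real data are rarely spaced this evenly, so in practice pairs are usually grouped into distance bins (and direction bins) rather than matched to an exact distance.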

The Semivariogram
[Figure: semivariogram plot. Vertical axis: ½ squared difference. Horizontal axis: distance. A model line is fit to the data points from the samples.]
- Measures correlation using ½ the squared difference between samples a distance X apart
- The ½ squared difference is named gamma (symbol: γ)
- The ½ squared difference has the same units as the sample variance and levels out at the sample variance

Variance of Means
[Figure: a big block of ore loaded in a truck, next to an itsy-bitsy sample used to plot the semivariogram.]
- We know the big block of ore has a lower stdev than the samples. But how much lower?
- It’s not σ/sqrt(n).

Using Numerical Methods and Computers
- The computer creates a grid of points; about 25 is usually enough.
- The computer then exhaustively measures all combinations of distances between points (all 625 of them, 25 × 25).
- For each distance it uses the semivariogram model to calculate the expected variability of the points.
- It keeps a running total and then calculates the average value of gamma.
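The procedure on this slide might look roughly like the sketch below. Several things here are assumptions rather than content from the slides: the spherical model form, the sill of 4.0, the range of 30.0, the 10 by 10 block, and all function names. Counting every point against every point (including itself), a 5 by 5 grid gives 25 × 25 = 625 distances.

```python
import math
from itertools import product


def spherical_gamma(h: float, sill: float, range_: float) -> float:
    """Assumed spherical semivariogram model with no nugget effect."""
    if h >= range_:
        return sill
    ratio = h / range_
    return sill * (1.5 * ratio - 0.5 * ratio ** 3)


def average_gamma_in_block(width, height, gamma, points_per_side=5):
    """Average the model gamma over every pair of grid points in a block."""
    # Place the grid points at the centres of equal cells inside the block.
    xs = [(i + 0.5) * width / points_per_side for i in range(points_per_side)]
    ys = [(j + 0.5) * height / points_per_side for j in range(points_per_side)]
    grid = list(product(xs, ys))

    total = 0.0
    pairs = 0
    # Exhaustively visit every point paired with every point.
    for (x1, y1), (x2, y2) in product(grid, repeat=2):
        distance = math.hypot(x1 - x2, y1 - y2)
        total += gamma(distance)  # running total of expected variability
        pairs += 1
    return total / pairs, pairs


gamma_bar, pair_count = average_gamma_in_block(
    width=10.0,
    height=10.0,
    gamma=lambda h: spherical_gamma(h, sill=4.0, range_=30.0),
)
print(f"pairs measured: {pair_count}")
print(f"average gamma : {gamma_bar:.4f}")
```

Passing the semivariogram model in as a function keeps the averaging loop independent of whichever model was fit to the data.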

The Variance of Big Blocks is
variance of big blocks = variance of samples − γ̄ (gamma bar)
- Remember, variance is just standard deviation squared.
- We know the variance of the samples because we have the sample set and have calculated it.
- That gamma bar thing up there is the number our computer just chugged out for us.
- Hey, I can subtract even on a bad day!!
- We’ll look more at spatial statistics later.

Randomness and Using Normal Distribution Statistics
- Use ordinary normal distribution (or t) statistics if you are using random samples.
  - Don’t cherry-pick your samples.
  - Don’t determine what the test is after you collect your test statistics.
- Watch out for conditions that make a random sample impossible to take.
  - Cases where your sample noticeably changed the remaining population (the blackjack example)
  - Cases where your samples are in fact related to each other by virtue of how close together they were taken and in what direction (i.e., spatial correlation)
- We can handle these non-random sampling events, but it takes a different mathematical model (don’t use the wrong model).