Basic Statistics Probability and Sampling Distributions.


2 Basic Statistics Probability and Sampling Distributions


4 Sampling. A sample is a portion, or part, of the population of interest. Inferential statistics describes a population of data using the information contained in a sample: sample statistics are used to estimate or predict (infer) population parameters.

5 Everyone makes inferences, so why statistical inference? "I predict rain today!" The difference between a fortune teller and a statistician is a statement of goodness.

6 A statement of goodness is a statement indicating the chance or probability that an inference is wrong. Probability is a statement of one's belief that an event will happen. * What are the chances of a head appearing when you toss a coin? * What are the chances of selecting the Ace of Diamonds from a deck of cards?

7 Probability Has a Basis in Mathematics. Consider a coin-toss experiment: $P(\text{head}) = \lim_{N \to \infty} k/N = 1/2$, where k = number of heads and N = number of tosses.
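The idea that the proportion of heads settles near 1/2 as the number of tosses grows can be illustrated with a short simulation. This is a minimal sketch, not part of the original slides: the function name `relative_frequency_of_heads` and the fixed seed are assumptions chosen so that the run is repeatable.

```python
import random


def relative_frequency_of_heads(num_tosses, seed=0):
    """Return k / N for a simulated fair coin (k = heads, N = tosses)."""
    rng = random.Random(seed)  # fixed seed (an assumption) keeps the run repeatable
    heads = sum(1 for _ in range(num_tosses) if rng.random() < 0.5)
    return heads / num_tosses


# k / N drifts toward 1/2 as the number of tosses N grows.
for num_tosses in (10, 100, 10_000, 100_000):
    print(num_tosses, relative_frequency_of_heads(num_tosses))
```

With few tosses the proportion can be far from 1/2; with many tosses it should sit close to it.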

8 Relative Frequency Probability P(A) = Number of Events of Interest (A) divided by the Total Number of Events. –Probability of a Head on the flip of an honest coin = 1/2 (as we saw on the last slide). –Probability of drawing the Ace of Diamonds = 1/52 (since there are 52 cards in the deck and only 1 ace of diamonds). There are other elements of probability discussed in the text but we will not be using those portions for this class.
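The two examples on this slide can be checked with exact fractions. The helper name `relative_frequency_probability` is hypothetical and used only for illustration.

```python
from fractions import Fraction


def relative_frequency_probability(events_of_interest, total_events):
    """P(A) = number of events of interest / total number of events."""
    return Fraction(events_of_interest, total_events)


# One head out of two equally likely faces of an honest coin.
p_head = relative_frequency_probability(1, 2)
# One Ace of Diamonds out of 52 cards in the deck.
p_ace_of_diamonds = relative_frequency_probability(1, 52)

print(p_head, p_ace_of_diamonds)
```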

9 We can put probability in the context of a relative frequency distribution. Recall the example of 10 students who took a 5-point quiz. It turns out that the area under the curve also represents the probability of an event. For example, what is the probability that a student picked at random scored 4 on the quiz? [Figure: relative frequency histogram, with quiz score X (1 to 5) on the horizontal axis and relative frequency (.0 to .4) on the vertical axis.] The probability would be .2, the same as the area under the curve!

10 This idea transfers directly to a normal distribution. What is the probability that a randomly selected person scores at least 650 on the Verbal portion of the GRE? First, we must compute the person's z-score, which here is z = 1.5. [Figure: normal curve with z = 0 and z = 1.5 marked on the horizontal axis.] From Table A, Column C we find that .0668 of the area is in the tail. Thus, the probability is about .07.
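The table lookup can be reproduced numerically. The sketch below assumes a GRE Verbal mean of 500 and a standard deviation of 100; neither value survives in this transcript, but together they give a z-score of 1.5, which is consistent with the tail area of .0668 quoted from Table A.

```python
from statistics import NormalDist

# Assumed values, not stated on the slide: GRE Verbal mean 500, SD 100.
ASSUMED_MEAN = 500
ASSUMED_SD = 100

score = 650
z = (score - ASSUMED_MEAN) / ASSUMED_SD

# Area in the upper tail of the standard normal curve beyond z.
upper_tail = 1 - NormalDist().cdf(z)

print(z, round(upper_tail, 4))
```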

11 Some Notation (Necessary to Distinguish Between Sample and Population)

Characteristic        Sample       Population
Mean                  $\bar{X}$    $\mu$
Standard Deviation    $s$          $\sigma$
Sample Size           $n$          $N$

In descriptive statistics, the differentiation is not important as the sample and population numerical measures are the same.

12 Sampling. A key to statistical inference is the assumption that the sample is representative of the population from which it was drawn. Random sampling ensures that each possible sample has an equal chance of being selected and that all members of the population have an equal chance of being selected into the sample. We will assume that all samples are selected in a random fashion. Please note that a bigger sample is not better unless it is representative.

13 Introduction to Inferential Statistics How do we get from a sample to a prediction about a population? In statistics, we use a sampling distribution to infer the characteristics of the population.

14 Some Definitions. Sampling Distribution: A distribution of statistics obtained by selecting all possible samples of a specific size from a population. Sampling Error: The discrepancy between the statistic obtained from the sample and the parameter for the population. Standard Error: An estimate of how much error, on average, should exist between the statistic and the parameter. It measures chance variation and is the standard deviation of the sampling distribution.

15 What is the shape of the sampling distribution and can we describe it in terms of mean and standard deviation (standard error)? YES! The answer is the Central Limit Theorem (CLT).

16 Central Limit Theorem Applied to Means. For any population with mean $\mu$ and standard deviation $\sigma$, the distribution of sample means for sample size n (n > 30) will have a mean of $\mu$ and a standard deviation (standard error) of $\sigma / \sqrt{n}$, and will approach a normal distribution as n approaches infinity.

17 The Central Limit Theorem Recap. Whatever the mean of the population, the mean of the distribution of sample means (sampling distribution) will equal it. Whatever the SD of the population, the SD of the sampling distribution will equal the population SD divided by the square root of the sample size. Whatever the shape of the population, the shape of the sampling distribution will be approximately normal when the sample size is large.
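The three points of the recap can be checked with a small simulation. This is only a sketch under stated assumptions: the exponential population (mean 1, SD 1, strongly right-skewed), the sample size of 36, the number of samples and the seed are all choices made for illustration and do not come from the slides.

```python
import math
import random
import statistics

rng = random.Random(42)   # fixed seed (an assumption) for a repeatable run
SAMPLE_SIZE = 36          # n > 30, as the theorem statement requires
NUM_SAMPLES = 20_000

# Assumed population: exponential with rate 1, so mean = 1 and SD = 1.
POPULATION_MEAN = 1.0
POPULATION_SD = 1.0

# Draw many samples of size n and keep the mean of each one.
sample_means = [
    statistics.fmean(rng.expovariate(1.0) for _ in range(SAMPLE_SIZE))
    for _ in range(NUM_SAMPLES)
]

mean_of_means = statistics.fmean(sample_means)
sd_of_means = statistics.pstdev(sample_means)
expected_standard_error = POPULATION_SD / math.sqrt(SAMPLE_SIZE)

print("mean of sample means:", round(mean_of_means, 3))
print("SD of sample means:  ", round(sd_of_means, 3))
print("sigma / sqrt(n):     ", round(expected_standard_error, 3))
```

The mean of the sample means should land near the population mean, and their standard deviation near the population SD divided by the square root of n, even though the population itself is far from normal.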

18 An Example of a Sampling Distribution of the Means First, assume that we have a Population consisting of only four numbers N =4, (2, 4, 6, 8). Next, we will take all possible samples from this population of size n = 2. We will calculate the mean of each sample that we obtain. Finally, we will plot the means in a frequency histogram.

19 Our Population Parameters: $\mu = \sum X / N = 20 / 4 = 5$ and $\sigma = \sqrt{\sum (X - \mu)^2 / N} = \sqrt{20 / 4} = \sqrt{5} = 2.236$
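These two parameters can be reproduced with Python's standard library. The variable names are illustrative; `pstdev` is used because it divides by N, which is the population form of the standard deviation.

```python
import statistics

population = [2, 4, 6, 8]

mu = statistics.mean(population)        # sum of scores / N = 20 / 4
sigma = statistics.pstdev(population)   # population SD: divides by N, not N - 1

print(mu, round(sigma, 3))
```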

20 All Possible Samples (n = 2) from our Population

First Pick    Second Pick    Mean
2             2              2
2             4              3
2             6              4
2             8              5
4             2              3
4             4              4
4             6              5
4             8              6
6             2              4
6             4              5
6             6              6
6             8              7
8             2              5
8             4              6
8             6              7
8             8              8

These are all possible samples (16) of size n = 2 and the means of those samples that can be taken from our population of N = 4 objects.
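The 16 samples can be generated rather than listed by hand. The sketch assumes, as the table implies by including pairs such as (2, 2), that picks are made with replacement and that order matters.

```python
from itertools import product

population = [2, 4, 6, 8]

# Ordered pairs drawn with replacement: 4 * 4 = 16 possible samples of size 2.
samples = list(product(population, repeat=2))
sample_means = [(first + second) / 2 for first, second in samples]

for (first, second), mean in zip(samples, sample_means):
    print(first, second, mean)
```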

21 Frequency Histogram of Means All 16 means plotted
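The histogram image is not preserved in this transcript, but its bar heights follow from the 16 sample means. A rough text version, assuming the same with-replacement samples of size 2 from the population (2, 4, 6, 8), can be sketched as:

```python
from collections import Counter
from itertools import product

population = [2, 4, 6, 8]
means = [(first + second) / 2 for first, second in product(population, repeat=2)]

# Count how often each sample mean occurs among the 16 samples.
frequencies = Counter(means)

# One row per distinct mean, one asterisk per occurrence.
for value in sorted(frequencies):
    print(f"{value:g} {'*' * frequencies[value]}")
```

The counts rise to a peak at the population mean and fall away symmetrically on either side.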

22 Calculate the Mean of the Means and Standard Deviation of the Means
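The slide's own working is not preserved in this transcript. The following is a reconstruction from the 16 sample means of the samples of size 2, using the population form of the standard deviation (dividing by 16), which matches the value of 1.58 reported on the later slides:

```latex
\mu_{\bar{X}} = \frac{\sum \bar{X}}{16} = \frac{80}{16} = 5
\qquad
\sigma_{\bar{X}} = \sqrt{\frac{\sum (\bar{X} - \mu_{\bar{X}})^2}{16}}
                 = \sqrt{\frac{40}{16}} = \sqrt{2.5} \approx 1.58
```

Here 80 is the sum of the 16 means and 40 is the sum of their squared deviations from 5.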

23 The Central Limit Theorem. Recall that the Central Limit Theorem states that the mean of the sampling distribution of the means would be equal to $\mu$. From the previous slide we calculated the mean of the sampling distribution of our 16 means to be 5, which is the population mean we calculated earlier. Also, recall that the Central Limit Theorem states that the standard deviation (standard error) would be equal to $\sigma / \sqrt{n}$. If we take the standard deviation we calculated on our population ($\sigma$ = 2.236) and divide it by the square root of our sample size (n = 2), we obtain 2.236 / 1.4142 = 1.58, which is exactly the standard deviation we calculated from our 16 sample means.
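The agreement between the two routes to the standard error can be confirmed directly. This is a minimal sketch with illustrative variable names, again assuming samples of size 2 drawn with replacement from the population (2, 4, 6, 8).

```python
import math
import statistics
from itertools import product

population = [2, 4, 6, 8]
n = 2

# Route 1: standard deviation of the 16 sample means themselves.
sample_means = [sum(sample) / n for sample in product(population, repeat=n)]
sd_of_means = statistics.pstdev(sample_means)

# Route 2: population SD divided by the square root of the sample size.
standard_error = statistics.pstdev(population) / math.sqrt(n)

print(round(sd_of_means, 4), round(standard_error, 4))
```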

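24 Summary of Central Limit Theorem: $\mu_{\bar{X}} = \mu = 20 / 4 = 5$; $\sigma = 2.236$; $\sigma_{\bar{X}} = \sigma / \sqrt{n} = 2.236 / \sqrt{2} = 1.58$. Note the symbols used to denote the mean of the means and the standard error of the mean: $\mu_{\bar{X}}$ and $\sigma_{\bar{X}}$.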

25 Inferential Statistics. We will use our knowledge of the sampling distribution of the means (its mean $\mu_{\bar{X}}$ and its standard error $\sigma_{\bar{X}}$), given by the Central Limit Theorem, as the basis for inferential statistics. We will also use our ability to locate a single score in a distribution using z scores in hypothesis testing.
