Probability Distributions, the Law of Large Numbers and the Central Limit Theorem Compare theoretical probability with a single sample and with many samples.

Presentation transcript:

Dale Nelson, Salt Lake Community College, November 2013

Part I. Theoretical Probability
Use the theoretical probability distribution for the results of the spinner to calculate the expected value and the standard deviation, and to make a histogram. The distribution table has two columns: x = Number and P(x) = Probability.

The expected value, or mean, of a probability distribution is $EV = \mu = \sum x \cdot P(x)$. The standard deviation is $\sigma = \sqrt{\sum x^2 \cdot P(x) - \mu^2}$.

To find the expected value with an Excel spreadsheet, enter the title x in cell A1 and the values in cells A2 through A6. Enter the title P(x) in cell B1 and the probabilities in cells B2 through B6. Enter the title x * P(x) in cell C1 and the formula =A2*B2 in cell C2. Grab the fill handle (the small square in the lower right corner of the cell) and drag the formula down through cell C6. In cell C7 enter the formula =SUM(C2:C6).

To find the standard deviation with the spreadsheet, enter the title x^2 * P(x) in cell D1 and the formula =A2^2*B2 in cell D2. Grab the fill handle and drag the formula down through cell D6. In cell D7 enter the formula =SUM(D2:D6). Enter the title st. dev. = in cell A8 and in cell B9 enter the formula =SQRT(D7-C7^2). Give titles to the work done as shown below.

Columns A through D are titled x, P(x), x * P(x), and x^2 * P(x). The rows below the data are labeled total =, mean =, and st. dev. =.
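The same column arithmetic can be sketched in Python as a cross-check on the spreadsheet. The spinner's probabilities are not given in this transcript, so the `probs` list below is a hypothetical distribution, chosen only so that the mean comes out at 3.35, the value quoted in Part III. The function names are illustrative as well.

```python
import math

# Hypothetical spinner distribution (assumption): the actual probabilities
# from the slide are not available. These sum to 1 and give a mean of 3.35.
values = [1, 2, 3, 4, 5]
probs = [0.15, 0.15, 0.20, 0.20, 0.30]


def expected_value(xs, ps):
    """Sum of x * P(x), i.e. the total of spreadsheet column C."""
    return sum(x * p for x, p in zip(xs, ps))


def std_dev(xs, ps):
    """sqrt(sum of x^2 * P(x) - mean^2), mirroring =SQRT(D7-C7^2)."""
    mean = expected_value(xs, ps)
    second_moment = sum(x ** 2 * p for x, p in zip(xs, ps))
    return math.sqrt(second_moment - mean ** 2)


if __name__ == "__main__":
    print("mean =", round(expected_value(values, probs), 4))
    print("st. dev. =", round(std_dev(values, probs), 4))
```

Swapping in the real probabilities from the spinner should reproduce the values in cells C7 and B9.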

To make the probability histogram, highlight cells B2:B6, click the Insert tab, then select Column in the Charts section. Finally, click the most basic choice in the upper left corner. Add titles to the chart using the Layout tab in the Chart Tools.

Part II: Single sample of size n = 25
Use the Data Analysis tool in the Analysis section of the Data tab to create a random sample. If it's not there, add it in by enabling the Analysis ToolPak. On a Mac, I've heard there's a free download available by Googling StatPlus:mac.

In the Data Analysis tool, click Random Number Generation and enter the following:
Number of variables: 1
Number of random numbers: 25
Distribution: Discrete
Value and Probability Range: A2:B6
Random seed: any integer n with 0 < n < 32,767
Output Range: G1
The column of numbers generated represents 25 random spins.

Using the Excel spreadsheet, find the mean and standard deviation of the sample. Enter the title mean = in cell F26 and the formula =AVERAGE(G1:G25) in cell G26. Enter the title st. dev. = in cell F27 and the formula =STDEV(G1:G25) in cell G27. The distribution table is made using the Histogram tool in Data Analysis, in the Analysis section of the Data tab.
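For readers without the Analysis ToolPak, the 25 random spins and their statistics can be sketched in Python. This is an illustration under assumptions, not the Excel generator itself: the probabilities are hypothetical (the slide's values are not in this transcript), the seed 2013 is arbitrary, and `spin_sample` is a made-up helper name. `statistics.stdev` computes the sample standard deviation, which is what =STDEV does.

```python
import random
import statistics

# Hypothetical spinner distribution (assumption, not from the slides).
values = [1, 2, 3, 4, 5]
probs = [0.15, 0.15, 0.20, 0.20, 0.30]


def spin_sample(n, seed=None):
    """Draw n spins from the discrete distribution, reproducibly if seeded."""
    rng = random.Random(seed)
    return rng.choices(values, weights=probs, k=n)


sample = spin_sample(25, seed=2013)          # plays the role of cells G1:G25
sample_mean = statistics.mean(sample)        # like =AVERAGE(G1:G25)
sample_sd = statistics.stdev(sample)         # like =STDEV(G1:G25)

if __name__ == "__main__":
    print("sample:", sample)
    print("mean =", round(sample_mean, 2))
    print("st. dev. =", round(sample_sd, 4))
```

Reusing the same seed regenerates the same sample, and changing it produces a fresh set of 25 spins.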

In the Data Analysis tool, click Histogram and enter:
Input Range: G1:G25
Bin Range: A2:A5
Output Range: C22
The output is a table with the headings Bin and Frequency in columns C and D, next to the last few sample values and the statistics in columns F and G. In the random sample shown here, the mean is 3.64 and the More bin has a frequency of 10.
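The Bin and Frequency table can also be tallied directly. The sketch below assumes a short made-up sample (`example_sample`) rather than the 25 spins from the slide, and it labels the last bin 5 instead of More, since every spin above 4 is a 5.

```python
from collections import Counter


def frequency_table(sample, bins=(1, 2, 3, 4, 5)):
    """Return (bin, frequency) pairs, including bins that never occur."""
    counts = Counter(sample)
    return [(b, counts.get(b, 0)) for b in bins]


# Hypothetical sample of ten spins, for illustration only.
example_sample = [5, 3, 5, 1, 4, 2, 5, 3, 4, 5]
table = frequency_table(example_sample)

if __name__ == "__main__":
    print("Bin  Frequency")
    for bin_value, freq in table:
        print(f"{bin_value:>3}  {freq:>9}")
    print("total =", sum(freq for _, freq in table))
```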

The Bins in the distribution table need to be changed to General format in order to make the histogram. Retype each bin label: 1 as 1, 2 as 2, 3 as 3, 4 as 4, and More as 5. The change of format is visible in the cell alignment: by default Excel right-aligns numbers and left-aligns text. Select cells C23:D27 by highlighting them, click the Insert tab, then select Column in the Charts section. Finally, click the most basic icon in the upper left corner.

Titles should be added to the chart using the Layout tab in the Chart Tools. The worksheet now shows the last few values in the sample, the statistics (mean = and st. dev. =), the Bin and Frequency table with total = 25, and the histogram.

The shape of the histograms can be compared subjectively. The frequencies are scaled differently, but students should be able to decide if the sample is similar enough for a random sample.

Part III: 201 samples of size n = 25
Use the Data Analysis tool in the Analysis section of the Data tab to create another 200 random samples. In the Data Analysis tool, click Random Number Generation and enter:
Number of variables: 200
Number of random numbers: 25
Distribution: Discrete
Value and Probability Range: A2:B6
Random seed: remains the same
Output Range: H1

Together with the first sample, this gives an array of 201 random samples of size n = 25, from column G to column GY. To find the mean and standard deviation of each sample, select cells G26:G27, grab the fill handle (the small square in the bottom right corner), and drag the formulas through to column GY. Don't compare all of these samples to the theoretical probability distribution; instead, compare the mean of the sample means and the mean of the sample standard deviations.

Enter the title mean of sample means = in cell F29 and the title mean of sample standard deviations = in cell F30, and right-align these titles. In cells G29 and G30 enter the formulas =AVERAGE(G26:GY26) and =AVERAGE(G27:GY27), respectively. The worksheet then shows mean of sample means =, mean of sample st. dev. =, and st. dev. of sample means = 0.2824.
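The whole of Part III can be sketched as a short simulation. As before, the probabilities are a hypothetical stand-in for the spinner's real distribution, the seed is arbitrary, and `simulate` is an illustrative name, so the numbers it prints will not match the slide's worksheet exactly.

```python
import random
import statistics

# Hypothetical spinner distribution (assumption, not from the slides).
values = [1, 2, 3, 4, 5]
probs = [0.15, 0.15, 0.20, 0.20, 0.30]


def simulate(num_samples=201, n=25, seed=2013):
    """Return the mean and sample st. dev. of each of num_samples samples."""
    rng = random.Random(seed)
    means, sds = [], []
    for _ in range(num_samples):
        sample = rng.choices(values, weights=probs, k=n)
        means.append(statistics.mean(sample))   # row 26 of the worksheet
        sds.append(statistics.stdev(sample))    # row 27 of the worksheet
    return means, sds


means, sds = simulate()
mean_of_means = statistics.mean(means)   # like =AVERAGE(G26:GY26)
mean_of_sds = statistics.mean(sds)       # like =AVERAGE(G27:GY27)
sd_of_means = statistics.stdev(means)    # st. dev. of the 201 sample means

if __name__ == "__main__":
    print("mean of sample means =", round(mean_of_means, 4))
    print("mean of sample st. dev. =", round(mean_of_sds, 4))
    print("st. dev. of sample means =", round(sd_of_means, 4))
```

With this hypothetical distribution, the mean of the sample means should land close to 3.35, and the standard deviation of the sample means close to the population standard deviation divided by 5.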

To understand a distribution of sample means, the mean, the standard deviation, and the shape of the distribution all need to be considered. The Central Limit Theorem* states:
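As the remaining steps use it, the theorem has three parts: one for the mean, one for the standard deviation, and one for the shape. In standard notation (an assumption here, with $\mu$ and $\sigma$ the mean and standard deviation of the spinner distribution and $n$ the sample size), a common textbook formulation is:

```latex
\begin{align*}
\mu_{\bar{x}} &= \mu
  && \text{(the mean of the sample means equals the population mean)} \\
\sigma_{\bar{x}} &= \frac{\sigma}{\sqrt{n}}
  && \text{(the standard deviation of the sample means)} \\
\bar{x} &\overset{\text{approx.}}{\sim} N\!\left(\mu,\ \frac{\sigma^{2}}{n}\right)
  && \text{(the shape is approximately normal)}
\end{align*}
```

Textbooks usually attach a condition to the third part, such as a sufficiently large $n$ or a normally distributed population.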

The standard deviation of the simulated sample means has already been calculated. Using the formula $\sigma_{\bar{x}} = \sigma / \sqrt{n}$ with n = 25, find the theoretical standard deviation of the sample means. This should be compared to the standard deviation of the 201 simulated sample means, 0.2824.

Finally, the shape of the distribution of sample means must be determined. The mean should be approximately 3.35. The interval from three standard deviations below the mean to three standard deviations above it should contain nearly 100% of the sample means. With $\sigma_{\bar{x}}$ the standard deviation of the sample means, three standard deviations below the mean is approximately $3.35 - 3 \times \sigma_{\bar{x}}$, and three standard deviations above is approximately $3.35 + 3 \times \sigma_{\bar{x}}$.
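The interval arithmetic is easy to check in code. The sketch below plugs in the two numbers that do appear in this presentation, the mean of 3.35 and the simulated standard deviation of sample means of 0.2824. Using the simulated value in place of the theoretical one is an assumption made for illustration, and the function names and the short list of sample means are hypothetical.

```python
def three_sigma_interval(center, sd_of_means):
    """Return (center - 3 * sd, center + 3 * sd)."""
    return center - 3 * sd_of_means, center + 3 * sd_of_means


def fraction_within(data, low, high):
    """Fraction of the values in data that lie in [low, high]."""
    return sum(low <= x <= high for x in data) / len(data)


low, high = three_sigma_interval(3.35, 0.2824)

if __name__ == "__main__":
    print("range:", round(low, 4), "to", round(high, 4))
    # Hypothetical sample means, only to show how the check is used.
    demo_means = [2.4, 3.0, 3.4, 4.3]
    print("fraction inside:", fraction_within(demo_means, low, high))
```

Applied to the 201 simulated sample means, `fraction_within` should return a value at or very near 1.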

Use this range and a bin size of 0.2 to make a list of values for the Histogram program to sort the sample means into a frequency table. Enter this list somewhere out of the way like cell B

To make the frequency distribution, go back to the Histogram tool in Data Analysis, in the Analysis section of the Data tab, and enter:
Input Range: G26:GY26
Bin Range: B32:B41
Output Range: D34
Again, the Bins in the distribution table need to be changed to General format in order to make the histogram. Retype 2.4 as 2.4, 2.6 as 2.6, and so on.
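The binning step can be sketched as follows. It assumes ten bin values starting at 2.4 in steps of 0.2, matching the ten cells B32:B41, and it follows my understanding of the Histogram tool's rule: a value is counted in the first bin whose label is greater than or equal to it, and anything above the last label goes to More. The sample means in the example are hypothetical.

```python
from bisect import bisect_left

# Ten bin labels, 2.4 through 4.2, as they would sit in cells B32:B41.
bin_edges = [round(2.4 + 0.2 * i, 1) for i in range(10)]


def bin_counts(data, edges):
    """Count values per bin; the final slot is the 'More' bin."""
    counts = [0] * (len(edges) + 1)
    for x in data:
        # Index of the first edge that is >= x, or len(edges) for 'More'.
        counts[bisect_left(edges, x)] += 1
    return counts


# Hypothetical sample means, for illustration only.
example_means = [2.44, 3.2, 3.36, 3.4, 4.3, 2.4]
counts = bin_counts(example_means, bin_edges)

if __name__ == "__main__":
    for label, freq in zip(bin_edges + ["More"], counts):
        print(label, freq)
```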

Select cells D35:E45, click the Insert tab, click Column, and choose the simplest icon in the upper left corner. Put in titles using the Layout tab and

The third part of the Central Limit Theorem is suggested: For all samples of size n, the sampling distribution of the sample means can be approximated by a normal distribution.

Thank you,
Dale Nelson, Salt Lake Community College
Session: S172