Last lecture summary The nature of the normal distribution Non-Gaussian distributions.

Slides:



Advertisements
Similar presentations
Statistics : Statistical Inference Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University 1.
Advertisements

Sampling Distributions (§ )
Introduction to Statistics
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Central Limit Theorem.
Jan. 29 “Statistics” for one quantitative variable… Mean and standard deviation (last week!) “Robust” measures of location (median and its friends) Quartiles,
Chapter 3 Describing Data Using Numerical Measures
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
Standard Normal Distribution
1 Introduction to Biostatistics (PUBHLTH 540) Estimating Parameters Which estimator is best? Study possible samples, determine Expected values, bias, variance,
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 3 Describing Data Using Numerical Measures.
Continuous Probability Distribution  A continuous random variables (RV) has infinitely many possible outcomes  Probability is conveyed for a range of.
The normal distribution
Numerical Descriptive Techniques
MBA7020_04.ppt/June 120, 2005/Page 1 Georgia State University - Confidential MBA 7020 Business Analysis Foundations Descriptive Statistics June 20, 2005.
Chapter 4 Statistics. 4.1 – What is Statistics? Definition Data are observed values of random variables. The field of statistics is a collection.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Sec 6.2 Bluman, Chapter Applications of the Normal Distributions The standard normal distribution curve can be used to solve a wide variety.
The Normal Curve, Standardization and z Scores Chapter 6.
Analysis of Distribution If the sample is truly random and there is no bias in the sampling then the expected distribution would be a smooth bell-shaped.
Chapter 21 Basic Statistics.
Measures of Relative Standing Percentiles Percentiles z-scores z-scores T-scores T-scores.
Chapter 10 – Sampling Distributions Math 22 Introductory Statistics.
Lecture 3 Describing Data Using Numerical Measures.
Summary Five numbers summary, percentiles, mean Box plot, modified box plot Robust statistic – mean, median, trimmed mean outlier Measures of variability.
Statistical Methods II&III: Confidence Intervals ChE 477 (UO Lab) Lecture 5 Larry Baxter, William Hecker, & Ron Terry Brigham Young University.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Measure of Central Tendency Measures of central tendency – used to organize and summarize data so that you can understand a set of data. There are three.
Statistical Methods II: Confidence Intervals ChE 477 (UO Lab) Lecture 4 Larry Baxter, William Hecker, & Ron Terry Brigham Young University.
Sampling Distributions. Essential Question: How is the mean of a sampling distribution related to the population mean or proportion?
Sampling Error SAMPLING ERROR-SINGLE MEAN The difference between a value (a statistic) computed from a sample and the corresponding value (a parameter)
Last lecture summary Bias, Bessel's correction MAD Normal distribution Empirical rule.
1 Results from Lab 0 Guessed values are biased towards the high side. Judgment sample means are biased toward the high side and are more variable.
1 Topic 5 - Joint distributions and the CLT Joint distributions –Calculation of probabilities, mean and variance –Expectations of functions based on joint.
Chapter 18: Sampling Distribution Models
MBA7025_04.ppt/Jan 27, 2015/Page 1 Georgia State University - Confidential MBA 7025 Statistical Business Analysis Descriptive Statistics Jan 27, 2015.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Describing Data: One Quantitative Variable SECTIONS 2.2, 2.3 One quantitative.
SAMPLING DISTRIBUTIONS
Last lecture summary Standard normal distribution, Z-distribution
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Reminder: What is a sampling distribution? The sampling distribution of a statistic is the distribution of all possible values of the statistic when all.
Chapter 18 - Part 2 Sampling Distribution Models for.
Introduction to Inference Sampling Distributions.
Chapter 18 Sampling Distribution Models *For Means.
Lecture 5 Introduction to Sampling Distributions.
Chapter 4 – Measurements of Location and Position Math 22 Introductory Statistics.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
Distributions of Sample Means. z-scores for Samples  What do I mean by a “z-score” for a sample? This score would describe how a specific sample is.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Green Belt – SIX SIGMA OPERATIONAL Central Limit Theorem.
7.3 Sample Means Objectives SWBAT: FIND the mean and standard deviation of the sampling distribution of a sample mean. CHECK the 10% condition before calculating.
© 2012 W.H. Freeman and Company Lecture 2 – Aug 29.
STATISTICS Chapter 2 and and 2.2: Review of Basic Statistics Topics covered today:  Mean, Median, Mode  5 number summary and box plot  Interquartile.
Created by: Tonya Jagoe. Measures of Central Tendency & Spread Input the data for these test scores into your calculator to find.
The normal distribution
Statistical Methods Michael J. Watts
Chapter 7 Review.
Statistical Methods Michael J. Watts
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Chapter 18: Sampling Distribution Models
Lecture 2 Chapter 3. Displaying and Summarizing Quantitative Data
The normal distribution
What would be the typical temperature in Atlanta?
Sampling Distribution of the Mean
CHAPTER- 3.1 ERROR ANALYSIS.
Sampling Distributions (§ )
Essential Statistics Sampling Distributions
Presentation transcript:

Last lecture summary The nature of the normal distribution Non-Gaussian distributions

New stuff

Lognormal distribution Frazier et al. measured the ability of a drug isoprenaline to relax the bladder muscle. The results are expressed as the EC50, which is the concentration required to relax the bladder halfway between its minimum and maximum possible relaxation.

Lognormal distribution

Geometric mean Geometric mean – transform all values to their logarithms, calculate the mean of the logarithms, transform this mean back to the units of original data (antilog)

The nature of the lognormal distribution Lognormal distributions arise when multiple random factors are multiplied together to determine the value. A typical example: cancer (cell division is multiplicative) Lognormal distributions are very common in many scientific fields. Drug potency is lognormal To analyse lognormal data, do not use methods that assume the Gaussian distribution. You will get misleding results (e.g.,non-existing outliers). Better way is to convert data to logarithm and analyse the converted values.

How normal is normal? Checking normality 1.Eyball histograms 2.Eyball QQ plots 3.There are tests

QQ plot Q stands for ‘quantile’. Quantiles are values taken at regular intervals from the data. The 2-quantile is called the median, the 3-quantiles are called terciles, the 4-quantiles are called quartiles (deciles, percentiles).

Typical normal QQ plot

QQ plot of left-skewed distribution

QQ plot of right-skewed distribution

SAMPLING DISTRIBUTIONS výběrová rozdělení

Histogram

Sampling distribution of sample mean výběrové rozdělení výběrového průměru

Sweet demonstration of the sampling distribution of the mean

průměr = 3.3 průměr = 1.7

Data 2015 Population: 4,3,3,5,0,4,4,4,3,4,2,6,8,2,4,3,5,7,3,3 25 samples (n=3) and their averages 3,5,3,4,2,3,3,3,5,5,3,4,3,4,5,4,4,4,6,3,4,3,4,3,4

Histogram of 2015 data

2015, n = 3, number of samples = 25

Going further So far, we have generated 25 samples with n = 3. To improve our histogram, we need more samples. However, we don’t want to spend ages in the classroom. Thus, I have prepared a simulation for you. In this simulation, I use data from 2014 and I generate all possible samples, n = 3.

Sampling distribution, n = samples

Sampling distribution, n = samples

Sampling distribution, n = samples

Central limit theorem (CLT)

Non-Gaussian distribution 1,1,1,1,1,1,2,2,2,2,2,3,3,3,3,4,4,4,5,5,6,7,7,8,8,8,9,9,9,9,10,10,10,10,10,11,11,11,11,11,11

Sampling distribution n = 2

Sampling distribution n = 4

Sampling distribution n = 6

Sampling distribution n = 8

Back to CLT Once we know that the sampling distribution of the sample mean is normal, we want to characterize this distribution. By which numbers you characterize a distribution? mean standard deviation

Back to CLT

M and SE Let’s have a look at our demonstration data: 1. Calculate population mean, population standard deviation and standard error for n=3. 2. Take all our sample means and calculate their mean. It should be close to the population mean. 3. Take all our sample means and calculate their standard deviation. It should be close to the standard error.

M and SE pop_mean <- mean(data.set2015) pop_sd <- sd(data.set2015)*sqrt(19/20) se <- pop_sd/sqrt(3) sampl_mean <- mean(prumery2015) sampl_sd <- sd(prumery2015)

Quiz As the sample size increases, the standard error increases decreases As the sample size increases, the shape of the sampling distribution gets skinnier wider

Sampling distribution applet parent distribution sample data sampling distributions of selected statistics