Chapter 5: Sampling Distributions, Other Distributions, Continuous Random Variables http://www.socialresearchmethods.net/kb/sampstat.php.

Slides:



Advertisements
Similar presentations
Chapter 6 Continuous Random Variables and Probability Distributions
Advertisements

Continuous Distributions BIC Prepaid By: Rajyagor Bhargav.
Chapter 5 Some Important Discrete Probability Distributions
ฟังก์ชั่นการแจกแจงความน่าจะเป็น แบบไม่ต่อเนื่อง Discrete Probability Distributions.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Statistics.
CHAPTER 13: Binomial Distributions
Part VI: Named Continuous Random Variables
Copyright © Cengage Learning. All rights reserved. 4 Continuous Random Variables and Probability Distributions stat414/node/307.
Chapter 4 Discrete Random Variables and Probability Distributions
Discrete Random Variables and Probability Distributions
Probability Densities
Chapter 6 Continuous Random Variables and Probability Distributions
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Statistics.
CHAPTER 6 Statistical Analysis of Experimental Data
Discrete Probability Distributions
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 6-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Chapter 5 Continuous Random Variables and Probability Distributions
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
TOPIC 5 Normal Distributions.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Continuous Random Variables and Probability Distributions.
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Chapter 4 Continuous Random Variables and Probability Distributions
Random Variables A random variable A variable (usually x ) that has a single numerical value (determined by chance) for each outcome of an experiment A.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 4 and 5 Probability and Discrete Random Variables.
Chapter 7: Random Variables
6- 1 Chapter Six McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Chapter 5 Sampling Distributions
QA in Finance/ Ch 3 Probability in Finance Probability.
Standard Statistical Distributions Most elementary statistical books provide a survey of commonly used statistical distributions. The reason we study these.
Chapter 3 Basic Concepts in Statistics and Probability
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Basic Business Statistics.
Copyright ©2011 Nelson Education Limited The Normal Probability Distribution CHAPTER 6.
Theory of Probability Statistics for Business and Economics.
 A probability function is a function which assigns probabilities to the values of a random variable.  Individual probability values may be denoted by.
 A probability function is a function which assigns probabilities to the values of a random variable.  Individual probability values may be denoted by.
PROBABILITY DISTRIBUTIONS
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Basic Business Statistics.
 A probability function is a function which assigns probabilities to the values of a random variable.  Individual probability values may be denoted by.
4 - 1 © 1998 Prentice-Hall, Inc. Statistics for Business & Economics Discrete Random Variables Chapter 4.
4 - 1 © 2001 prentice-Hall, Inc. Behavioral Statistics Discrete Random Variables Chapter 4.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
1 Since everything is a reflection of our minds, everything can be changed by our minds.
Probability Review-1 Probability Review. Probability Review-2 Probability Theory Mathematical description of relationships or occurrences that cannot.
Random Variable The outcome of an experiment need not be a number, for example, the outcome when a coin is tossed can be 'heads' or 'tails'. However, we.
Chapter 5. Continuous Random Variables. Uniform Random Variable X is a uniform random variable on the interval (α, β) if its probability function is given.
Review of Chapter
Chapter 6: Continuous Probability Distributions A visual comparison.
Probability Theory and Specific Distributions (Moore Ch5 and Guan Ch6)
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 5 Discrete Random Variables.
Chapter 6: Continuous Probability Distributions A visual comparison.
Chap 5-1 Discrete and Continuous Probability Distributions.
Theoretical distributions: the other distributions.
MECH 373 Instrumentation and Measurements
Chapter Six McGraw-Hill/Irwin
Chapter 5 Created by Bethany Stubbe and Stephan Kogitz.
Random Variable 2013.
PROBABILITY DISTRIBUTIONS
Chapter 5 Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 5 Sampling Distributions
Chapter 5 Sampling Distributions
Chapter 5 Sampling Distributions
Probability Theory and Specific Distributions (Moore Ch5 and Guan Ch6)
Continuous Probability Distributions Part 2
Continuous Probability Distributions Part 2
Continuous Probability Distributions Part 2
Chapter 6 Continuous Probability Distributions
Continuous Probability Distributions Part 2
Chapter 5: Sampling Distributions
Presentation transcript:

Chapter 5: Sampling Distributions, Other Distributions, Continuous Random Variables http://www.socialresearchmethods.net/kb/sampstat.php

5.2: Binomial and Poisson Distributions - Goals Determine when the random variable (count) X can be modeled using the binomial or Poisson Distributions. Calculate the probability, mean and standard deviation when X has a binomial or Poisson distribution. Determine when you can use the normal approximation to the binomial and perform calculations using this approximation.

Binomial Setting - BINS Binary: There are only two possible outcomes for each trial. Independent: Trials must be independent of each other. Number: The number of trials n of the chance process must be fixed. Success: On each trial, the probability p of success must be the same.

Binomial Setting: Example Do the following use the Binomial Setting? Rolling a fair 4-sided die five times and observing whether the number showing is a 1 or not In a drug trial, 20 patients with the same condition are given a drug and some are given a placebo to see if the drug is effective or not. In quality control we want to see if a particular product is ‘not acceptable’. We take 20 random samples from an assembly line that uses different machines to produce the product.

Binomial Distribution The count X of successes in a binomial setting has the binomial distribution with parameters n and p, where n is the number of trials of the chance process and p is the probability of a success on any one trial. The possible values of X are the whole numbers from 0 to n. X ~ B(n,p)

Examples of Binomial Distribution In a clinical trial, a patient’s condition may improve or not. We study the number of patients who improved. Was a sales transaction considered pleasant? The binomial distribution describes the number of pleasant transactions. In quality control we assess the number of defective items in a lot of goods.

Binomial Probabilities If X has the binomial distribution with n trials and probability p of success on each trial, the possible values of X are 0, 1, 2, …, n. If k is any one of these values, 𝑃 𝑋=𝑘 = 𝑛 𝑘 𝑝 𝑘 (1−𝑝) 𝑛−𝑘 𝑛 𝑘 = 𝑛! 𝑘! 𝑛−𝑘 !

Example: Binomial Distribution Suppose 20% of all copies of a particular textbook fail a certain binding strength test. Let's check a batch of 15 such textbooks. Is this a binomial distribution? What is the chance that we get no defective textbooks? What is the chance that we get less than 3 defective textbooks? What is the chance that we get more than 2 defective textbooks?

Example: Binomial Distribution (cont)

Histograms of Binomial Distributions p = 0.5 n = 10 p = 0.25 n = 10 p = 0.75

Binomial Distribution: Mean and Standard Deviation If X ~ B(n,p) then E(X) = X = np 𝜎 𝑋 = 𝑛𝑝(1−𝑝)

Example: Binomial Distribution (cont) Suppose 20% of all copies of a particular textbook fail a certain binding strength test. Let's check a batch of 15 such textbooks. What are the mean and standard deviation of the number of textbooks that will fail the binding test?

Difficulties with the Normal Approximation to the Binomial Skewedness of the Binomial Distribution. The Binomial Distribution is discrete.

Continuity Correction http://wiki.axlesoft.com/index.php?title=Continuity_correction

Continuity Correction – Extra Actual Value Approximate Value P(X = a) P(a – 0.5 < X < a +0.5) P(a < X) P(a + 0.5 < X) P(a ≤ X) P(a – 0.5 < X) P(X < b) P(X < b – 0.5) P(X ≤ b) P(X < b + 0.5)

Example: Normal Approximation to the Binomial The ideal size of a first-year class at a particular college is 150 students. The college, knowing from past experience that on the average only 30 percent of those accepted for admission will actually attend, uses a policy of approving the applications of 450 students. Compute the probability that more than 150 students attend this college.

Poisson Distribution The number of times that an event occurs during a particular time period or in a particular area Example: The number of people who enter the Union from noon to 1 pm. The number of α-particles emitted from Uranium-238 in 1 minute. The number of DNA fragments found from a sequencing experiment. The number of dead trees in a square mile of forest.

Poisson Setting The number of successes that occur in two nonoverlapping units of measure are independent. The probability that a success will occur in a unit of measure is the same for all units of equal size and is proportional to the size of the unit. The probability that more than one event occurs in a unit of measure is negligible for very small-sized units.

Poisson Distribution X ~ Poisson() 𝑃 𝑋=𝑘 = 𝑒 −𝜆 𝜆 𝑘 𝑘! , 𝑘=0, 1, 2, … 𝜆>0 X =  𝜎 𝑋 = 𝜆

Example: Poisson Distribution An IT consultant receives an average of 3 calls per hour. Let X be the number of calls the consultant receives. X follows a Poisson distribution. a) What is the chance that the consultant receives exactly one call during the next hour? b) What is the chance that the consultant receives more than one call during the next hour? c) What is the chance that the consultant receives exactly 5 calls during the next two hours?

Example: Poisson Approximation to Binomial 0.2% of feral cats are infected with feline aids (FIV) in a region. What is the chance that there are exactly 10 cats infected with FIV among 1000 cats?

5.3: Continuous Random Variables Uniform and Exponential Distributions - Goals Describe the probability distribution of a continuous random variable. Use the distribution of a continuous random variable to calculate probabilities and percentiles (median) of events. Be able to use a probability distribution to find the mean of a continuous random variable. Be able to use a probability distribution to find the variance of a continuous random variable. Calculate the probability, mean and standard deviation when X has a Uniform or Exponential distribution.

Continuous Random Variable A continuous random variable X takes all values in an interval of numbers or collection of such intervals. y = f(x)

Continuous Random Variable A continuous probability model assigns probabilities as areas under a density curve.

Density Curves – Percentiles 𝑝= −∞ 𝑦 𝑓 𝑥 𝑑𝑥 The median of a density curve is the equal – areas point. 𝑝=0.5= −∞ 𝜇 𝑓 𝑥 𝑑𝑥

Example: Continuous Random Variable The distribution of the grade of a particular road in a particular 2 mile region is a continuous r.v. X with density 𝑓 𝑥 = 1 2 𝑥 0≤𝑥≤2 0 𝑒𝑙𝑠𝑒 Is this a valid density curve? What is the probability that the grade is in the last quarter mile of the region? What is the median of this distribution?

Example: Continuous Random Variable We know that the distribution of the grade of a particular road in a particular 2 mile region is a continuous r.v. X with a functional form which is proportional to x2. What is f(x)?

Formulas for the Mean of a Random Variable Discrete – Mean Discrete – Rule 3 𝐸 𝑋 = 𝜇 𝑋 = 𝑖 𝑥 𝑖 𝑝 𝑖 𝐸 𝑔 𝑋 = 𝑔( 𝑥 𝑖 ) 𝑝 𝑖 Continuous Continuous – Rule 3 𝐸 𝑋 = 𝜇 𝑋 = −∞ ∞ 𝑥𝑓 𝑥 𝑑𝑥 𝐸(𝑔(𝑋))= −∞ ∞ 𝑔(𝑥)𝑓(𝑥)𝑑𝑥

Variance of a Random Variable Var(X)=E X− 𝜇 𝑋 2 = ( 𝑥 𝑖 −  X ) 2 ∙ 𝑝 𝑖 = −∞ ∞ (𝑥−  X ) 2 𝑓(𝑥)𝑑𝑥 = E(X2) – (E(X))2 𝜎 𝑋 = 𝑉𝑎𝑟(𝑋)

Example: Continuous Random Variables For the following density function: What is the expected value? Calculate E(X2). Calculate the standard deviation.

Uniform: density function http://www.six-sigma-material.com/Uniform-Distribution.html

Uniform Distribution The density function of the uniform distribution over the interval [a,b] is 𝑓 𝑥 = 1 𝑏−𝑎 𝑎<𝑥<𝑏 0 𝑒𝑙𝑠𝑒 𝐸 𝑋 = 𝑎+𝑏 2 𝜎 𝑋 = 𝑏−𝑎 12

Example: Uniform A packaging line constantly packages 200 cartons per hour. After weighing every package variation the distribution of the weights was found to be uniform with weights ranging from 18.2 lbs. – 20.4 lbs., measured to the nearest tenths. The customer requires less than 20.0 lbs. for ergonomic reasons. What is the probability that the package weights less than 20 lbs.? What are the mean and the standard deviation of the package weights?

Exponential Distribution Uses: amount of time until some specific event occurs (the amount of time between successive events) 𝑓 𝑥 = 𝜆 𝑒 −𝜆𝑥 𝑥≥0 0 𝑒𝑙𝑠𝑒 𝐸 𝑋 = 1 𝜆 𝜎 𝑋 = 1 𝜆

Example: Exponential The life span of some bacteria (in hours) has an exponential distribution with an average life span of 0.5 hours. What is the proportion of bacteria that live at most 1 hour? What is the proportion of bacterial that live more than 1.5 hours? What is the standard deviation of the distribution of these bacteria?

Gamma Distribution Generalization of the exponential function Uses probability theory theoretical statistics actuarial science operations research engineering

Beta Distribution This distribution is only defined on an interval standard beta is on the interval [0,1] uses modeling proportions percentages Probabilities Uniform distribution is a member of this family.

Other Continuous Random Variables Weibull exponential is a member of family uses: lifetimes lognormal log of the normal distribution uses: products of distributions Cauchy symmetrical, long straggly tails

5.1: Sampling Distribution of a Sample Mean - Goals Explain the difference between the sampling distribution of x̄ and the population distribution of . Determine the mean and standard deviation of x̄ for an SRS of size n from a population with mean  and standard deviation . Use the central limit theorem (CLT) to approximate the shape of the sampling distribution of x̄ and use it to perform probability calculations.

Statistical Inference Parameter: number describing a characteristic of the population. Statistics: number describing a characteristic of the sample. Population Sample ?

Sampling Distributions The law of large numbers assures us that if we measure enough subjects, the statistic x̄ will eventually get very close to the unknown parameter µ. The sampling distribution of a statistic is the distribution of values taken by the statistic in all possible samples of the same size from the same population. The population distribution of a variable is the distribution of values of the variable among all individuals in the population.

Spread as a function of n Therefore, sample means are less variable than individual observations

Example: mean and SD of Sampling Distribution The time that it takes a randomly selected rat of a certain subspecies to find its way through a maze has a normal distribution with μ = 1.5 min and σ = 0.35 min. Suppose five rats are randomly selected. What is the mean of the average time? What is the standard deviation of the average time?

Shape of Sampling Distributions If a population X ~ N(, σ) then the sample distribution of X̄ ~ N 𝜇, 𝜎 𝑛 . Draw a SRS of size n from any population with mean  and finite standard deviation σ. When n is large, the sample distribution of the sample mean X̄ is approximately normal with N 𝜇, 𝜎 𝑛 .

Example – Sampling Distribution: Normal The time that it takes a randomly selected rat of a certain subspecies to find its way through a maze has a normal distribution with μ = 1.5 min and σ = 0.35 min. Suppose five rats are randomly selected. What is the probability that the average time is at most 2.0 minutes? What is the probability that the average time will be within 0.3 minutes of the mean?

Shape of Sampling Distributions If a population X ~ N(, σ) then the sample distribution of X̄ ~ N 𝜇, 𝜎 𝑛 . Draw a SRS of size n from any population with mean  and finite standard deviation σ. When n is large, the sample distribution of the sample mean X̄ is approximately normal with N 𝜇, 𝜎 𝑛 .

A Few More Facts Any linear combination of independent Normal random variables is also Normal. More generally, the distribution of a sum or average of many small random quantities is close to Normal whether independent or not. CLT also applies to discrete random variables.

9 Binomial distributions

CLT: Example 1 (in class) An electronics company manufactures resistors that have a mean resistance of 100 ohms and a standard deviation of 10 ohms. Assume that the distribution of resistance is normal. a) Find the probability that one resistor will have a resistance less than 95 ohms. (0.3085) b) Find the probability that a random sample of 25 resistors will have an average resistance less than 95 ohms. (0.0062) X

CLT: Example 2 (in class) Without checking the city bus web site, a student walks at random times to the Beering Hall bus stop to wait for the Ross Ade bus which is supposed to arrive every 10 minutes. This will be a Uniform distribution with 0 ≤ x ≤ 10. For a Uniform distribution on the interval (a,b), a) If one student walks to the bus stop to catch this bus, what is the probability that the wait time will be more than 6 minutes? (0.4) b) If 40 students walk to the bus stop to catch this bus, what is the probability that the average wait time will be more than 6 minutes? (0.0143) X