Probability Definition: randomness, chance, likelihood, proportion, percentage, odds. Probability is the mathematical ideal. Not sure what will happen.

Slides:



Advertisements
Similar presentations
1.3 Density Curves and Normal Distributions. What is a density curve?
Advertisements

The Standard Normal Curve Revisited. Can you place where you are on a normal distribution at certain percentiles? 50 th percentile? Z = 0 84 th percentile?
Cal State Northridge 320 Andrew Ainsworth PhD
Chapter 8: Probability: The Mathematics of Chance Lesson Plan
The Normal Distribution “the bell curve” Some slides downloaded from
2-5 : Normal Distribution
CHAPTER 6 Statistical Analysis of Experimental Data
First we need to understand the variables. A random variable is a value of an outcome such as counting the number of heads when flipping a coin, which.
Statistics and Quantitative Analysis U4320 Segment 4: Statistics and Quantitative Analysis Prof. Sharyn O’Halloran.
5-Minute Check on Section 2-1a Click the mouse button or press the Space Bar to display the answers. 1.What does a z-score represent? 2.According to Chebyshev’s.
Stat 1510: Introducing Probability. Agenda 2  The Idea of Probability  Probability Models  Probability Rules  Finite and Discrete Probability Models.
Sets, Combinatorics, Probability, and Number Theory Mathematical Structures for Computer Science Chapter 3 Copyright © 2006 W.H. Freeman & Co.MSCS SlidesProbability.
 A number expressing the likelihood that a specific event will occur, expressed as the ratio of the number of actual occurrences.
Standardized Score, probability & Normal Distribution
Ch 2 – The Normal Distribution YMS – 2.1
Chapter 7: The Normal Probability Distribution
Chapter 7 Continuous Distributions. Continuous random variables Are numerical variables whose values fall within a range or interval Are measurements.
Probability Distributions What proportion of a group of kittens lie in any selected part of a pile of kittens?
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Section 7.1 The STANDARD NORMAL CURVE
Statistics 303 Chapter 4 and 1.3 Probability. The probability of an outcome is the proportion of times the outcome would occur if we repeated the procedure.
Ex St 801 Statistical Methods Probability and Distributions.
Probability The definition – probability of an Event Applies only to the special case when 1.The sample space has a finite no.of outcomes, and 2.Each.
Chapter 3 Distributions. Continuous random variables Are numerical variables whose values fall within a range or interval Are measurements Can be described.
NOTES The Normal Distribution. In earlier courses, you have explored data in the following ways: By plotting data (histogram, stemplot, bar graph, etc.)
Applied Business Forecasting and Regression Analysis Review lecture 2 Randomness and Probability.
Chapter 2: The Normal Distribution Section 1: Density Curves and the Normal Distribution.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Chapter 6. Continuous Random Variables Reminder: Continuous random variable.
Chapter 8: Probability: The Mathematics of Chance Lesson Plan Probability Models and Rules Discrete Probability Models Equally Likely Outcomes Continuous.
Warm-Up 1. What is Benford’s Law?
Chapter 7 Continuous Distributions. Continuous random variables Are numerical variables whose values fall within a range or interval Are measurements.
Chapter 6 The Normal Curve. A Density Curve is a curve that: *is always on or above the horizontal axis *has an area of exactly 1 underneath it *describes.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 6 Normal Probability Distributions 6-1 Review and Preview 6-2 The Standard Normal.
Describing Location in a Distribution 2.1 Measures of Relative Standing and Density Curves 2.1 Measures of Relative Standing and Density Curves Text.
Think about this…. If Jenny gets an 86% on her first statistics test, should she be satisfied or disappointed? Could the scores of the other students in.
5.3 Random Variables  Random Variable  Discrete Random Variables  Continuous Random Variables  Normal Distributions as Probability Distributions 1.
AP Statistics Chapter 2 Notes. Measures of Relative Standing Percentiles The percent of data that lies at or below a particular value. e.g. standardized.
Density Curves Section 2.1. Strategy to explore data on a single variable Plot the data (histogram or stemplot) CUSS Calculate numerical summary to describe.
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
Warm-Up On the driving range, Tiger Woods practices his golf swing with a particular club by hitting many, many balls. When Tiger hits his driver, the.
Measures of Relative Standing and Density Curves
Two Main Uses of Statistics: 1)Descriptive : To describe or summarize a collection of data points The data set in hand = the population of interest 2)Inferential.
Essential Statistics Chapter 31 The Normal Distributions.
1 Chapter 5. Section 5-1 and 5-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
POSC 202A: Lecture 4 Probability. We begin with the basics of probability and then move on to expected value. Understanding probability is important because.
Chapter 2 Modeling Distributions of Data Objectives SWBAT: 1)Find and interpret the percentile of an individual value within a distribution of data. 2)Find.
Continuous Distributions. Continuous random variables Are numerical variables whose values fall within a range or interval Are measurements Can be described.
AP Statistics: Section 2.2 B. Recall finding a z-score in section 2.1: z =
Normal Distributions.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 5: Probability: What are the Chances? Section 5.2 Probability Rules.
Chapter 8: Probability: The Mathematics of Chance Probability Models and Rules 1 Probability Theory  The mathematical description of randomness.  Companies.
Chapter 8: Probability: The Mathematics of Chance Lesson Plan Probability Models and Rules Discrete Probability Models Equally Likely Outcomes Continuous.
Density Curves & Normal Distributions Textbook Section 2.2.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Chapter 7 Random Variables and Continuous Distributions.
MM150 ~ Unit 9 Statistics ~ Part II. WHAT YOU WILL LEARN Mode, median, mean, and midrange Percentiles and quartiles Range and standard deviation z-scores.
Conditional Probability 423/what-is-your-favorite-data-analysis-cartoon 1.
Chapter 7 Continuous Probability Distributions and the Normal Distribution.
Simulations and Normal Distribution Week 4. Simulations Probability Exploration Tool.
Continuous Distributions
THE STANDARD NORMAL DISTRIBUTION
Probability Distribution
Chapter 4 – Part 3.
Chapter 4 – Part 2.
Unit 1A - Modelling with Statistics
Measuring location: percentiles
Chapter 8: Probability: The Mathematics of Chance Lesson Plan
Normal Distribution with Graphing Calculator
Chapter 8: Probability: The Mathematics of Chance Lesson Plan
Presentation transcript:

Probability Definition: randomness, chance, likelihood, proportion, percentage, odds. Probability is the mathematical ideal. Not sure what will happen in a single event but, in the long run, certain patterns emerge. We use letters like X and Y to represent quantities. These will be called random variables.

Probability Model List the outcomes for a given event (experiment or question) and associated probabilities. Example: pick a card out of a standard deck S: sample space (contains all possible outcomes) Event: single outcome or collection of outcomes S: sample space contains Event: pick a pick a

Basic Rules 1.Event A has probability P(A), which is between 0 and 1 (inclusive). 2.Probability of entire sample space, P(S), is. 3.Addition: If two events are disjoint (nothing in common), then P(A or B) =. 4.Complement: P(not A) =

Probability Model for standard deck of cards 52 cards, 4 suits (Diamonds, Hearts, Clubs, Spades) Each suit as 13 cards: 2, 3, 4, 5, …, 9, 10, J, Q, K, A P(picking any single card)= A = event that a red 5 is picked B = event that a club is picked C = event that a face card (J, Q, K) is picked P(A or B)= P(not C) =

Discrete Model If sample space is finite, the probability model is called discrete. Roll 2 six-sided dice and record the sum. List all outcomes and associated probabilities in a table. Sum Prob

Continuous Model If sample space contains a range of values, the probability model is called continuous. Density curves record probability as the area under the curve for a given range of outcomes. So, total area under the curve will always equal 1.

Continuous Model – Example 1 The uniform distribution for any real number, X, from 3 to 7 looks like:

Continuous Model – Example 2 The symmetric triangular distribution for any real number, X, from 0 to 8 looks like:

More Probability Use Venn diagrams to visualize probability rules. If events are disjoint, don’t overlap circles. Sample space, S: rectangle Events (A, B, C, …): circles inside Keep track of # of outcomes in each region of the rectangle.

Venn diagram - example Example: pick a card out of a standard deck S: sample space contains 52 outcomes (52 cards) A = event that a red 5 is picked B = event that a club is picked C = event that a face card (J, Q, K) is picked S A B C

P(A or B) S A B C S A B C P(B or C)

General Addition Rule A = event that a red card is picked B = event that a number card is picked P(A or B) S A B General Addition: P(A or B) =

Conditional Probability Rule Given a condition (you know something happened), how does that change the chances of something else happening? P(B|A)= probability of B given A S A B

Venn Diagram of 70 students C: owns a cat D: owns a dog S C D

General Multiplication Rule Rewriteto get: Experiment: pick two cards out of a standard deck

Independent Events Two (or more) events are independent if knowledge of one event does not change the chances of the other. Multiplication Rule for Independent Events:

For a cholesterol-lowering drug, there is a 5% chance that a loss-of-sleep side effect will occur. What are the chances that two people picked at random take the drug and experience sleep loss? What are the chances that at least 1out of 3 loses sleep?

The Normal Distribution Curve will capture 100% of all observations. Hence, there will be a total area of 1 below it. Then the area under the curve for a given range of values will represent the proportion (percent, fraction) of observations that fall in that range. Use curves to describe overall pattern seen in a histogram.

The proportion of scores above 80 is roughly 26.8%. The area under the density curve for scores above 80 is roughly =26.1%. Curves and proportions % v % v

Mean and Medians Location of the median on a density curve is where area under is cut in half. Location of the mean on a density curve is where the length of the curve is cut in half. On symmetric curves: On skewed curves:

Normal curves are special kinds of density curves Symmetric, single peaked, bell-shaped Use  mu  and  sigma  to talk about mean and std. dev.  –  –  -    + 

Rule About 68% of data fall within About 95% of data fall within About 99.7% of data fall within  -    +   + 2  + 3   - 2  - 3 

Example 1 Grasshopper jumps can be described by a Normal distribution with  = 12 inches and  = 2 inches. About 68% of all jumps are between inches    About where would you find the top 2.5%?

Example 1 – continued What % falls below 14 inches? What % of jumps are more than 14 inches? What % of jumps are between 14 and 16 inches?

Finding values without We use tables or calculators to find harder values, like where is the top 10% or what percent falls below a given observation. N( ,  ) means observations come from a Normal distribution with a mean of  and a standard deviation of . Standardize observation x from N( ,  ) by: The standardized value is called a

z-scores from example 1, N( ,  )

Two functions on the calculator (found under 2 nd VARS => DISTR) normalcdf( : will give area between two bounds for a given , . invNorm( : will give the observation that has a particular area to its left for a given , . normalcdf(lower bound, upper bound, ,  ) invNorm(area, ,  )  -    + 

Using the calculator with grasshopper N( ,  ) What % of jumps fall below 17 inches? normalcdf(lower bound, upper bound, ,  ) = normalcdf( ) = area below 17 = No lower bound, so: What % of jumps fall above 11.5 inches? Since total area is 1 and we have : First, find area we want normalcdf(lower bound, upper bound, ,  ) = normalcdf( ) = area =

Using the table with grasshopper N( ,  ) What % of jumps fall between 10 and inches? Calculator does this all at once with the normalcdf( function. Area between = area below – area below = - normalcdf(lower bound, upper bound, ,  ) = normalcdf( ) = area between =

Using the table with grasshopper N( ,  ) What jumps fell in the top 10%? Use invNorm function to find that observation % What observation has an area of.10 above it? What observation has an area of.90 below it? invNorm(area, ,  ) = invNorm( ) = value with.9 area below=

Using the table with grasshopper N( ,  ) Where do the middle 50% fall? Use invNorm function to find those observations % What observation has an area of below it? invNorm(area, ,  ) = invNorm( ) = value with area below = invNorm(area, ,  ) = invNorm( ) = value with area below =

Rule 1,2,3 standard deviations away accurate to two decimal places

Sampling Distributions Know the entire population: (parameter) Know only a sample (SRS): (statistic)

Law of Large Numbers - As you increase the sample size, sample mean gets closer to population mean Population = 3, 3, 8, 15, 20, 21, 22, 31, 39 Sample of size 1= 8 Sample of size 2= 8, 22 Sample of size 3= 8, 22, 31 Sample of size 4= 8, 22, 31, 3 Sample of size 5= 8, 22, 31, 3, 20

Population of 7 people and their weights (in pounds) 120 Samples of size 1: {122}, {140}, {150}, {155}, {160}, {170}, {195} Mark off the sample mean for each sample with an “x” 122, 140, 150, 155, 160, 170, x xxxxxx 120 Mark off the sample mean for each sample with an “x” x Samples of size 2: {122, 140}, {122, 150}, {122, 155}, {122, 160}, {122, 170}, {122, 195}, {140, 150}, …, (170, 195}. There are 21 possible samples. xxxxxxxxxxxx x xxxxxxx

Population of 7 people (continued) 120 Samples of size 1: 140, 122, 160, 195, 150, 155, x xxxxxx Samples of size 2: xxxxxxxxxxxxx x xxxxxxx Samples of size 6: 7 possible sample of this size. {122, 140, 160, 150, 155, 170}, … xxxxxxx

Sampling distribution of Sampling from a large population with mean  and standard deviation  : samples of size n will have their sample means distributed with a mean  and standard deviation  over root n. If population is N( ,  ), then If population is not Normal but n is large, then

Ex. 1 - Weight of eggs is N(65, 3) Your egg carton holds 9 eggs, so consider each carton as a random sample of 9 eggs. Let X be the weight of a single egg in grams and X be average weight of your carton. What is the sampling distribution for your carton’s average weight?

Weight of eggs is N(65, 3) – continued Mean weight of carton is N(65,1) Convert 67 to a z-score for a single egg: Convert 67 to a z-score for the carton:

Ex. 2 - Length of trout is N(17.5, 2.5) Your local waters contain a multitude of trout. Let X be the length of a single fish in inches and X be average length of your daily catch of five fish. What is the sampling distribution for your daily catch?

Trout length is N(17.5, 2.5) – continued Mean length of daily catch is N(17.5,1.118) Convert 16 to a z-score for a single fish: Convert 16 to a z-score for the daily catch:

Trout length is N(17.5, 2.5) – continued Mean length of daily catch is N(17.5,1.118)

Ex 3 - Length of trout is N(10, 2) Your fishing pond has another type of trout. Let X be the length of a single fish in inches taken at random and X be average length of a sample of 16 fish. What is the sampling distribution for a sample of 16 fish?

Trout length is N(10, 2) – continued Mean length of 16 fish is N(10,0.5)