CS 3332 Probability & Statistics ( 機率與統計 ) Hung-Min Sun ( 孫宏民 ) Department of Computer Science National Tsing Hua University

Slides:



Advertisements
Similar presentations
Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
Advertisements

Discrete Random Variables
Sections 4.1 and 4.2 Overview Random Variables. PROBABILITY DISTRIBUTIONS This chapter will deal with the construction of probability distributions by.
QUANTITATIVE DATA ANALYSIS
1 Midterm Review Econ 240A. 2 The Big Picture The Classical Statistical Trail Descriptive Statistics Inferential Statistics Probability Discrete Random.
BCOR 1020 Business Statistics Lecture 9 – February 14, 2008.
Probability Distributions
Evaluating Hypotheses
Summarizing Measured Data Part I Visualization (Chap 10) Part II Data Summary (Chap 12)
BHS Methods in Behavioral Sciences I
Probability and Probability Distributions
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Created by Tom Wegleitner, Centreville, Virginia Section 5-2.
CHAPTER 6 Statistical Analysis of Experimental Data
Slide 1 Statistics Workshop Tutorial 4 Probability Probability Distributions.
Lecture Slides Elementary Statistics Twelfth Edition
Probability and Statistics in Engineering Philip Bedient, Ph.D.
Elec471 Embedded Computer Systems Chapter 4, Probability and Statistics By Prof. Tim Johnson, PE Wentworth Institute of Technology Boston, MA Theory and.
5-2 Probability Distributions This section introduces the important concept of a probability distribution, which gives the probability for each value of.
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Chapter 1 Probability and Distributions Math 6203 Fall 2009 Instructor: Ayona Chatterjee.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
CHAPTER 1 Basic Statistics Statistics in Engineering
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Review and Preview This chapter combines the methods of descriptive statistics presented in.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Theory of Probability Statistics for Business and Economics.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 34 Chapter 11 Section 1 Random Variables.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Applied Business Forecasting and Regression Analysis Review lecture 2 Randomness and Probability.
5.3 Random Variables  Random Variable  Discrete Random Variables  Continuous Random Variables  Normal Distributions as Probability Distributions 1.
Random Variables. A random variable X is a real valued function defined on the sample space, X : S  R. The set { s  S : X ( s )  [ a, b ] is an event}.
Biostatistics, statistical software III. Population, statistical sample. Probability, probability variables. Important distributions. Properties of the.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 5-2 Random Variables.
Fundamentals of Data Analysis Lecture 3 Basics of statistics.
The two way frequency table The  2 statistic Techniques for examining dependence amongst two categorical variables.
Lecture V Probability theory. Lecture questions Classical definition of probability Frequency probability Discrete variable and probability distribution.
Essential Statistics Chapter 91 Introducing Probability.
Chapter 10 Introducing Probability BPS - 5th Ed. Chapter 101.
Sections 5.1 and 5.2 Review and Preview and Random Variables.
Barnett/Ziegler/Byleen Finite Mathematics 11e1 Chapter 11 Review Important Terms, Symbols, Concepts Sect Graphing Data Bar graphs, broken-line graphs,
Statistics What is statistics? Where are statistics used?
Inference: Probabilities and Distributions Feb , 2012.
Random Variables Ch. 6. Flip a fair coin 4 times. List all the possible outcomes. Let X be the number of heads. A probability model describes the possible.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 6: Random Variables Section 6.1 Discrete and Continuous Random Variables.
Lesson Discrete Random Variables. Objectives Distinguish between discrete and continuous random variables Identify discrete probability distributions.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 5-1 Review and Preview.
Probability Theory Modelling random phenomena. Permutations the number of ways that you can order n objects is: n! = n(n-1)(n-2)(n-3)…(3)(2)(1) Definition:
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Chapter 8: Probability: The Mathematics of Chance Probability Models and Rules 1 Probability Theory  The mathematical description of randomness.  Companies.
Course Description Probability theory is a powerful tool that helps Computer Science and Electrical Engineering students explain, model, analyze, and design.
Computing Fundamentals 2 Lecture 7 Statistics, Random Variables, Expected Value. Lecturer: Patrick Browne
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.
Fundamentals of Data Analysis Lecture 3 Basics of statistics.
Random Variables Lecture Lecturer : FATEN AL-HUSSAIN.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate accuracy.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
Data Analysis.
Chapter 5 - Discrete Probability Distributions
Fundamentals of Probability and Statistics
Lecture 13 Sections 5.4 – 5.6 Objectives:
Lecture Slides Elementary Statistics Twelfth Edition
CS3332(01) Course Description
Lecture Slides Elementary Statistics Twelfth Edition
EECS3030(02) Course Description
Essential Statistics Introducing Probability
Lecture Slides Essentials of Statistics 5th Edition
Discrete Random Variables: Basics
Discrete Random Variables: Basics
Experiments, Outcomes, Events and Random Variables: A Revisit
Discrete Random Variables: Basics
Lecture Slides Essentials of Statistics 5th Edition
Presentation transcript:

CS 3332 Probability & Statistics ( 機率與統計 ) Hung-Min Sun ( 孫宏民 ) Department of Computer Science National Tsing Hua University Office: 資電館 Phone: 校內分機 2968,

Empirical and probability distributions Chapter 1

1.1 Basic concepts What are Statistics? Dealing with numbers? Consider the following. 1. There is some problem or situation that needs to be considered. Ex. The effectiveness of a new vaccine for mumps; whether an increase in yield can be attributed to a new strain of wheat; predicting the probability of rain; whether increasing speed limits will result in more accidents; estimate the unemployment rate; whether new controls have resulted in a reduction in pollution.

2. Some measures are needed to help us understand the situation better. How to create good measures? 3. After the measuring instrument has been developed, we must collect data through observation. 4. Using these data, statisticians summarize the results using descriptive statistics. 5. These summaries are then used to analyze the situation using statistical inferences. 6. A report is presented, along with some recommendations that are based upon the data and the analysis of them.

 The discipline of statistics deals with the collection & analysis data. --- Find a pattern: among uncertainties. Filter out the noise, bound the errors, derive the confidence Think carefully: about the investigations & problems. Make sense out of the observations, pick the proper math models.

 Random experiments--  Random experiments--Any act that may be repeated under similar conditions resulting in a trial which yields an outcome.  Sample--  Sample--a collection of actual outcomes from a repeated experiment.  Sample Space (Outcome Space)--  Sample Space (Outcome Space)--a set of all possible outcomes.  Event--  Event--a subset of sample space.

  Two dice are cast and the total number of spots on the sides that are ”up” are counted. The sample space is S = {2, 3, 4,..., 12}   Toss a fair coin. The sample space is S = {H, T}.   A fair coin is flipped successively at random until heads is observed on two successive flips. If we let y denote the number of flips of the coin that are required, then S = {y : y = 2, 3,....}.

 random variable  Given a random experiment with sample space S, a function X mapping each element of S to a unique real number is called a random variable.   For each element s from the sample space S, denote this function by X(s) = x and call the range of X or the space of X : R = {x : X(s) = x, for some s in S}

  When dealing with only two outcomes, one might use S = {success, failure}. Choose X(success) = 1, X(failure) =0. Then, R = {0, 1}   When gambling with a pair of dice, one might use S = ordered pairs of all possible rolls = {(a, b) : a = die 1 outcome, b = die 2 outcome}.Choose X((a, b)) = a + b. Then, R ={2, 3, 4, 5,..., 12}.   When rolling dice in a board game, one might use S = {(a, b) : a = die 1 outcome, b = die 2 outcome } Choose X((a, b)) = max{a, b}. Then, R = {1, 2, 3, 4, 5, 6}

  The members of sample space can be finite, countable infinite, uncountable.  frequency  The frequency f of some outcome is the number of times it occurs during a random experiment with n trials. (relative frequency: f/n)

Density (Relative Frequency) Histogram   The density histogram, say h(x), graphically reports the relative freq. of each possible outcome x 0.   For small n, f/n is very unstable.   As n increases,h(x 0 ) = f 0 /n →p 0 = f(x 0 ).   h(x) will approach the probability mass function(p.m.f.) f(x).   Density histogram ⇒ Probability histogram.

 Table 1.1-1: No. of children per family … No. of children per family …  Frequency …  Relative frequency …

1.2 The mean, variance, and standard deviation   ” measures of ”center”   mean   ” measures of ”spread”   variance

  Mean:    (1) Statistical measure of location   (2) Mathematical expectation of a corresponding random variable   (3) The first moment about the region of a mass function f(x)

  Variance:  2   (1) Statistical measure of variation   (2) Indication of the spread or dispersion of a probability distribution   (3) The second moment about the center of a mass function f(x)

  Standard deviation:    (1) Square root of variance

  x  {1, 2, 3} and the p.m.f. is given by f(1) = 3/6, f(2) =2/6, f(3) = 1/6. Weighted mean (weighted average) is 1 · 3/6 + 2 · 2/6+ 3 · 1/6= 10/6    =10/6    2 =(1-10/6) 2 ×3/6+(2-10/6) 2 ×2/6+(3- 10/6) 2 ×1/6=120/216    = (  2 ) 1/2 =(120/216) 1/2 =0.745

  Moments   (1) kth moment about the origin ( 第 k 級動差 )   (2) kth moment about the mean ( 第 k 級中央動差 )

1.3 Continuous-type data   Group the data into classes 1.Maximum , Minimum , Range 2. 2.Select the number of classes , k=5 to Each interval begins and ends halfway between two possible values The 1st interval begin about as much below the smallest value as the last interval ends above the largest. 5.class intervals classes boundariescutpoints. 5.The intervals are called class intervals and the boundaries are classes boundaries or cutpoints. (c0, c1), (c1, c2), …, (ck-1, ck): k class intervals. 6.class limits 6.The class limits are the smallest and largest possible observed values in a class. 7.class mark 7.The class mark ui is the midpoint of Class i.

 Candy bar weights   Visualization of the distribution: r= =6.2   k=7 classes of width 0.9   Relative frequency histogram (Density histogram)

Empirical Rule   If the histogram is bell-shaped,   ~68% of the data within the interval:  ~95%  ~99.7%   Relative Frequency Polygon   The polygon smoothes out the corresponding histogram somewhat.

  Class intervals of unequal lengths   Ex 1.3-4:   The modal class: the interval with the largest height.   The mode: the class mark of the modal class.   (1.5, 2.5) is the modal class and x=2 is the mode