Statistical Estimation Vasileios Hatzivassiloglou University of Texas at Dallas.

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

Point Estimation Notes of STAT 6205 by Dr. Fan.
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
Chapter 7. Statistical Estimation and Sampling Distributions
Estimation  Samples are collected to estimate characteristics of the population of particular interest. Parameter – numerical characteristic of the population.
Outline input analysis input analyzer of ARENA parameter estimation
Chapter 5 Discrete Random Variables and Probability Distributions
SOLVED EXAMPLES.
Copyright © Cengage Learning. All rights reserved.
Chapter 4 Discrete Random Variables and Probability Distributions
Chapter 5 Basic Probability Distributions
Sampling Distributions
Maximum likelihood (ML)
SUMS OF RANDOM VARIABLES Changfei Chen. Sums of Random Variables Let be a sequence of random variables, and let be their sum:
A gentle introduction to Gaussian distribution. Review Random variable Coin flip experiment X = 0X = 1 X: Random variable.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Introduction to Statistics Chapter 7 Sampling Distributions.
Maximum likelihood (ML)
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Discrete Probability Distributions
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 4 and 5 Probability and Discrete Random Variables.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Discrete Random Variables Chapter 4.
Estimation Basic Concepts & Estimation of Proportions
Modeling and Simulation CS 313
Business and Finance College Principles of Statistics Eng. Heba Hamad 2008.
Random Sampling, Point Estimation and Maximum Likelihood.
1 Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND STATISTICS FOR SCIENTISTS AND ENGINEERS Systems.
Random Variables. A random variable X is a real valued function defined on the sample space, X : S  R. The set { s  S : X ( s )  [ a, b ] is an event}.
P. STATISTICS LESSON 8.2 ( DAY 1 )
BINOMIALDISTRIBUTION AND ITS APPLICATION. Binomial Distribution  The binomial probability density function –f(x) = n C x p x q n-x for x=0,1,2,3…,n for.
Binomial Experiment A binomial experiment (also known as a Bernoulli trial) is a statistical experiment that has the following properties:
Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 5 Discrete Random Variables.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 5 Discrete Random Variables.
2.1 Introduction In an experiment of chance, outcomes occur randomly. We often summarize the outcome from a random experiment by a simple number. Definition.
CpSc 881: Machine Learning Evaluating Hypotheses.
Consistency An estimator is a consistent estimator of θ, if , i.e., if
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
1 Standard error Estimated standard error,s,. 2 Example 1 While measuring the thermal conductivity of Armco iron, using a temperature of 100F and a power.
Expectation. Let X denote a discrete random variable with probability function p(x) (probability density function f(x) if X is continuous) then the expected.
The final exam solutions. Part I, #1, Central limit theorem Let X1,X2, …, Xn be a sequence of i.i.d. random variables each having mean μ and variance.
Confidence Interval & Unbiased Estimator Review and Foreword.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Week 41 How to find estimators? There are two main methods for finding estimators: 1) Method of moments. 2) The method of Maximum likelihood. Sometimes.
Maximum Likelihood Estimation
Probability Theory Modelling random phenomena. Permutations the number of ways that you can order n objects is: n! = n(n-1)(n-2)(n-3)…(3)(2)(1) Definition:
AP STATISTICS Section 7.1 Random Variables. Objective: To be able to recognize discrete and continuous random variables and calculate probabilities using.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
Week 31 The Likelihood Function - Introduction Recall: a statistical model for some data is a set of distributions, one of which corresponds to the true.
Multiple Sequence Alignment Vasileios Hatzivassiloglou University of Texas at Dallas.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 5 Discrete Random Variables.
Week 21 Order Statistics The order statistics of a set of random variables X 1, X 2,…, X n are the same random variables arranged in increasing order.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
R. Kass/W03 P416 Lecture 5 l Suppose we are trying to measure the true value of some quantity (x T ). u We make repeated measurements of this quantity.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Business Statistics,
Probability Distribution. Probability Distributions: Overview To understand probability distributions, it is important to understand variables and random.
1 Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND STATISTICS FOR SCIENTISTS AND ENGINEERS Systems.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.
Conditional Expectation
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate accuracy.
Applied statistics Usman Roshan.
Probability Theory and Parameter Estimation I
Ch3: Model Building through Regression
Maximum Likelihood Estimation
Review of Probability and Estimators Arun Das, Jason Rebello
Chapter 5 Some Important Discrete Probability Distributions
Introduction to Probability and Statistics
The Binomial and Geometric Distributions
Econometric Models The most basic econometric model consists of a relationship between two variables which is disturbed by a random error. We need to use.
Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Presentation transcript:

Statistical Estimation Vasileios Hatzivassiloglou University of Texas at Dallas

2 Obama contract at intrade.com

3 Instance profiles Given k observations of maximum length n, construct a |Σ|×n matrix A (profile) where entry A ij is the estimated probability that the ith letter occurs in position j One way to estimate A ij is to count each letter occuring at this position (c ij ); then This is maximum likelihood estimation (MLE) Estimate becomes better as k increases

4 Example data 23 sample motif instances for the cyclic AMP receptor transcription factor (positions 3-9) TTGTGGC TTTTGAT AAGTGTC ATTTGCA CTGTGAG ATGCAAA GTGTTAA ATTTGAA TTGTGAT ATTTATT ACGTGAT ATGTGAG TTGTGAG CTGTAAC CTGTGAA TTGTGAC GCCTGAC TTGTGAT GTGTGAA CTGTGAC ATGAGAC TTGTGAG

5 Calculated profile A C G T

6 Probability of a motif Suppose that we consider M as a candidate motif consensus How do we find the best M given the observations in A? Assuming independence of positions,

7 Maximum likelihood estimation General method for estimating unknown parameters when we have –a sample of values that depend on these parameters –a formula specifying the probability of obtaining these values given the parameters

8 MLE example: three coins Suppose we have three coins with probability of heads ⅓, ½, and ⅔ One of them is used to generate a series of 20 tosses and we observe 11 heads θ = the heads probability of the coin used in the experiment Binomial distribution for the number of heads

9 Binomial distribution Count of one of two possible outcomes in a series of independent events The probabilities of the two outcomes are constant across events An example of iid events (independent, identically distributed)

10 Binomial probability mass If the probability of one outcome (let’s call it A) is p and there are n events –The probability of the other outcome is 1-p –The probability of obtaining a particular sequence of outcomes with m A’s is –There are sequences with the same number m of outcomes A Overall

11 MLE example: three coins Result: Choose θ = ½

12 MLE example: unknown coins θ can take any value between 0 and 1 m heads in n tosses Solve the differential equation

13 Solving the differential equation

14 MLE for binomial Of the three solutions, θ = 0 and θ = 1 result in P(X 1,X 2,...,X n | θ) = 0, i.e., local minima On the other hand, for 0 0, so θ = m/n must be a local maximum Therefore the MLE estimate is

15 Properties of estimators The estimation error for a given sample is where x is the unknown true value An estimator is a random variable –because it depends on the sample The mean square error represents the overall quality of the estimation across all samples

16 Expected values Recall that the expected value of a discrete random variable X is defined as The expected value of a dependent random variable f(X) is For continuous distributions, replace the sum with an integral

17 Bias in estimation An estimator is unbiased if MLE is not necessarily unbiased Example: standard deviation –Is the most commonly used measure of dispersion in a data set –For a random variable X, it is defined as

18 Estimators of standard deviation MLE estimator where “Almost unbiased” estimator ( is an unbiased estimator of σ 2 ) biased