July 3, 2015 1 Department of Computer and Information Science (IDA) Linköpings universitet, Sweden Minimal sufficient statistic.

A statistic $T$ defines a partition of the sample space of $(X_1, \ldots, X_n)$ into classes satisfying $T(x_1, \ldots, x_n) = t$ for different values of $t$. If such a partition puts the samples $x = (x_1, \ldots, x_n)$ and $y = (y_1, \ldots, y_n)$ into the same class if and only if the likelihood ratio $L(\theta; x) / L(\theta; y)$ does not depend on $\theta$, then $T$ is minimal sufficient for $\theta$.

Example
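A minimal worked illustration, assuming a Poisson($\lambda$) sample (this choice of model is an assumption, not necessarily the example used in the lecture). The ratio of the likelihoods of two samples $x$ and $y$ is

```latex
\[
\frac{L(\lambda; x)}{L(\lambda; y)}
= \frac{\prod_{i=1}^{n} e^{-\lambda}\lambda^{x_i}/x_i!}
       {\prod_{i=1}^{n} e^{-\lambda}\lambda^{y_i}/y_i!}
= \lambda^{\sum_i x_i - \sum_i y_i}\,\prod_{i=1}^{n}\frac{y_i!}{x_i!}
\]
```

This is free of $\lambda$ exactly when $\sum_i x_i = \sum_i y_i$, so under this assumption $T = \sum_i X_i$ is minimal sufficient for $\lambda$.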

Rao-Blackwell theorem
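In its usual statement, the theorem says that conditioning an unbiased estimator on a sufficient statistic gives an unbiased estimator whose variance is no larger. The sketch below checks this by simulation. The Poisson model, the target $e^{-\lambda} = \Pr(X = 0)$, and all function names are assumptions made for illustration. The crude estimator is the indicator of $X_1 = 0$; its conditional expectation given $T = \sum_i X_i$ is $((n-1)/n)^T$.

```python
import math
import random

def draw_poisson(rng, lam):
    """Draw one Poisson(lam) variate by multiplying uniforms (Knuth's method)."""
    limit = math.exp(-lam)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count

def compare_estimators(lam=2.0, n=10, reps=5000, seed=1):
    """Return (mean, variance) of the crude and the Rao-Blackwellized estimator."""
    rng = random.Random(seed)
    crude, improved = [], []
    for _ in range(reps):
        xs = [draw_poisson(rng, lam) for _ in range(n)]
        crude.append(1.0 if xs[0] == 0 else 0.0)    # unbiased for exp(-lam)
        improved.append(((n - 1) / n) ** sum(xs))   # E[crude | sum(xs)]

    def mean_var(values):
        m = sum(values) / len(values)
        return m, sum((v - m) ** 2 for v in values) / (len(values) - 1)

    return mean_var(crude), mean_var(improved)

(crude_mean, crude_var), (rb_mean, rb_var) = compare_estimators()
print("crude:", crude_mean, crude_var)
print("Rao-Blackwellized:", rb_mean, rb_var)
```

Both sample means should lie near $e^{-2}$, while the variance of the conditioned estimator should be markedly smaller.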

The exponential family of distributions
A random variable $X$ belongs to the ($k$-parameter) exponential family of probability distributions if the p.d.f. of $X$ can be written
$f(x; \theta) = \exp\{\sum_{j=1}^{k} A_j(\theta) B_j(x) + C(x) + D(\theta)\}$.
What about
• $N(\mu, \sigma^2)$?
• $\mathrm{Po}(\lambda)$?
• $U(0, \theta)$?
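One way to answer the three questions, sketched under the assumption that the family is written as $f(x;\theta) = \exp\{\sum_j A_j(\theta)B_j(x) + C(x) + D(\theta)\}$ (the labels $A_j$, $C$ and $D$ are this sketch's own; only $B_j$ appears elsewhere in the slides):

```latex
\begin{align*}
N(\mu,\sigma^2):\quad \ln f &= \frac{\mu}{\sigma^2}\,x - \frac{1}{2\sigma^2}\,x^2
   - \frac{\mu^2}{2\sigma^2} - \tfrac12\ln(2\pi\sigma^2)
   && (k=2,\ B_1(x)=x,\ B_2(x)=x^2) \\
\mathrm{Po}(\lambda):\quad \ln f &= x\ln\lambda - \ln x! - \lambda
   && (k=1,\ B_1(x)=x) \\
U(0,\theta):\quad f &= \theta^{-1}\,\mathbf{1}\{0 < x < \theta\}
   && \text{(support depends on } \theta\text{: not in the family)}
\end{align*}
```

The normal and Poisson distributions fit the form; the uniform distribution does not, because its support depends on the parameter.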

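For a random sample $x = (x_1, \ldots, x_n)$ from a distribution belonging to the exponential family, the likelihood is
$L(\theta; x) = \prod_{i=1}^{n} f(x_i; \theta) = \exp\{\sum_{j=1}^{k} A_j(\theta) \sum_{i=1}^{n} B_j(x_i) + \sum_{i=1}^{n} C(x_i) + n D(\theta)\}$,
where $A_j$, $B_j$, $C$ and $D$ are the functions appearing in the exponential-family form of the p.d.f.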

Exponential family written in canonical form: $f(x; \phi) = \exp\{\sum_{j=1}^{k} \phi_j B_j(x) + C(x) + D(\phi)\}$, where the $\phi_j$ are the natural parameters.

Completeness
Let $x_1, \ldots, x_n$ be a random sample from a distribution with p.d.f. $f(x; \theta)$, and let $T = T(x_1, \ldots, x_n)$ be a statistic. Then $T$ is complete for $\theta$ if, whenever $h_T(T)$ is a function of $T$ such that $E[h_T(T)] = 0$ for all values of $\theta$, then $\Pr(h_T(T) = 0) = 1$.
Important lemmas from this definition:
• Lemma 2.6: If $T$ is a complete sufficient statistic for $\theta$ and $h(T)$ is a function of $T$ such that $E[h(T)] = \theta$, then $h$ is unique (there is at most one such function).
• Lemma 2.7: If there exists a minimum variance unbiased estimator (MVUE) for $\theta$ and $h(T)$ is an unbiased estimator for $\theta$, where $T$ is a complete minimal sufficient statistic for $\theta$, then $h(T)$ is MVUE.
• Lemma 2.8: If a sample is from a distribution belonging to the exponential family, then $(\sum_i B_1(x_i), \ldots, \sum_i B_k(x_i))$ is complete and minimal sufficient for $\theta_1, \ldots, \theta_k$.

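Maximum-likelihood estimation
Consider as usual a random sample $x = (x_1, \ldots, x_n)$ from a distribution with p.d.f. $f(x; \theta)$ (and c.d.f. $F(x; \theta)$). The maximum likelihood point estimator of $\theta$ is the value of $\theta$ that maximizes the likelihood $L(\theta; x)$ or, equivalently, the log-likelihood $l(\theta; x) = \ln L(\theta; x)$.
Useful notation: $l'(\theta; x) = \partial l(\theta; x)/\partial\theta$ and $l''(\theta; x) = \partial^2 l(\theta; x)/\partial\theta^2$.
With a $k$-dimensional parameter: $l'(\theta; x)$ is the vector of partial derivatives $\partial l/\partial\theta_j$ and $l''(\theta; x)$ is the $k \times k$ matrix of second derivatives $\partial^2 l/\partial\theta_j\,\partial\theta_m$.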

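Complete sample case: If all sample values are explicitly known, then $L(\theta; x) = \prod_{i=1}^{n} f(x_i; \theta)$.
Censored data case: If some (say $n_c$) of the sample values are censored, e.g. $x_i < k_1$ or $x_i > k_2$, then $L(\theta; x) = \prod_{i\ \mathrm{uncensored}} f(x_i; \theta) \cdot [\Pr(X < k_1)]^{n_l}\,[\Pr(X > k_2)]^{n_u}$, where $n_l$ and $n_u$ are the numbers of values censored below $k_1$ and above $k_2$, so that $n_l + n_u = n_c$.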

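When the sample comes from a continuous distribution, the censored data case can be written $L(\theta; x) = \prod_{i\ \mathrm{uncensored}} f(x_i; \theta) \cdot [F(k_1; \theta)]^{n_l}\,[1 - F(k_2; \theta)]^{n_u}$, where $n_l$ and $n_u$ are the numbers of values censored below $k_1$ and above $k_2$.
If the distribution is discrete, the use of $F$ is also possible: if $k_1$ and $k_2$ are values that can be attained by the random variables, then we may write $L(\theta; x) = \prod_{i\ \mathrm{uncensored}} f(x_i; \theta) \cdot [F(k_1^-; \theta)]^{n_l}\,[1 - F(k_2; \theta)]^{n_u}$, where $k_1^-$ is the largest attainable value smaller than $k_1$.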
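The censored likelihood can be sketched numerically. The example below assumes an exponential distribution with rate parameter `rate` and right-censoring only (values above `k2`); the data, the function names and the use of golden-section search are illustrative assumptions. For this particular model a closed-form maximizer exists, which gives a check on the numerical answer.

```python
import math

def censored_loglik(rate, observed, n_upper, k2):
    """Log-likelihood of an Exp(rate) sample with n_upper values censored above k2."""
    fully_observed = sum(math.log(rate) - rate * x for x in observed)
    censored = n_upper * (-rate * k2)   # log(1 - F(k2)) = -rate * k2
    return fully_observed + censored

def golden_section_max(f, lo, hi, tol=1e-10):
    """Maximize a unimodal function f on [lo, hi] by golden-section search."""
    inv_phi = (math.sqrt(5) - 1) / 2
    a, b = lo, hi
    while b - a > tol:
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
        if f(c) > f(d):
            b = d   # maximum lies in [a, d]
        else:
            a = c   # maximum lies in [c, b]
    return (a + b) / 2

observed = [0.3, 1.2, 0.7, 2.5, 0.1, 1.9]   # hypothetical uncensored values
n_upper, k2 = 4, 3.0                         # four values known only to exceed 3.0

rate_hat = golden_section_max(
    lambda r: censored_loglik(r, observed, n_upper, k2), 1e-6, 10.0)
closed_form = len(observed) / (sum(observed) + n_upper * k2)
print(rate_hat, closed_form)
```

The two printed values should agree to several decimal places; for models without a closed form only the numerical route is available.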

Too complicated to find an analytical solution. Solve by a numerical routine!

Exponential family distributions: Use the canonical form (natural parameterization): $f(x; \phi) = \exp\{\sum_{j=1}^{k} \phi_j B_j(x) + C(x) + D(\phi)\}$. Let $T_j = \sum_{i=1}^{n} B_j(X_i)$, with observed value $t_j$. Then the maximum likelihood estimators (MLEs) of $\phi_1, \ldots, \phi_k$ are found by solving the system of equations $E[T_j] = t_j$, $j = 1, \ldots, k$.
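A short worked instance, assuming a Poisson($\lambda$) sample (the model and the symbol $\phi$ for the natural parameter are assumptions of this sketch). Equating the expectation of the sufficient statistic to its observed value gives the familiar estimator:

```latex
\begin{align*}
f(x;\phi) &= \exp\{\phi\,x - \ln x! - e^{\phi}\}, \qquad \phi = \ln\lambda,\quad B(x) = x \\
T &= \sum_{i=1}^{n} X_i, \qquad E[T] = n\,e^{\phi} \\
n\,e^{\hat\phi} &= t \;\Longrightarrow\; \hat\phi = \ln(t/n), \qquad \hat\lambda = e^{\hat\phi} = \bar{x}
\end{align*}
```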

Computational aspects
When the MLEs can be found by solving $\partial l(\theta; x)/\partial\theta = 0$, numerical routines for solving the generic equation $g(\theta) = 0$ can be used:
• Newton-Raphson method
• Fisher's method of scoring (makes use of the fact that under regularity conditions $E[\partial^2 l/\partial\theta\,\partial\theta^T] = -E[(\partial l/\partial\theta)(\partial l/\partial\theta)^T]$). This is the multidimensional analogue of Lemma 2.1 (see page 17).
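The two routines differ only in which information quantity divides the score. The sketch below assumes a logistic distribution with unknown location $\theta$ and unit scale, a model chosen for illustration because its MLE has no closed form; the data and function names are hypothetical. For this model the score is $\sum_i (2F(x_i - \theta) - 1)$ with $F(u) = 1/(1 + e^{-u})$, the observed information is $2\sum_i F(1 - F)$, and the expected (Fisher) information is $n/3$.

```python
import math

def score(theta, xs):
    """First derivative of the logistic-location log-likelihood."""
    return sum(2.0 / (1.0 + math.exp(-(x - theta))) - 1.0 for x in xs)

def observed_information(theta, xs):
    """Minus the second derivative of the log-likelihood."""
    total = 0.0
    for x in xs:
        p = 1.0 / (1.0 + math.exp(-(x - theta)))
        total += 2.0 * p * (1.0 - p)
    return total

def solve_score_equation(xs, use_expected, start, tol=1e-10, max_iter=100):
    """Iterate theta <- theta + score / information until the step is tiny."""
    theta = start
    for _ in range(max_iter):
        if use_expected:
            info = len(xs) / 3.0                    # Fisher scoring
        else:
            info = observed_information(theta, xs)  # Newton-Raphson
        step = score(theta, xs) / info
        theta += step
        if abs(step) < tol:
            break
    return theta

xs = [0.2, 1.9, 2.4, -0.7, 1.1, 3.0, 0.8, 1.6, 2.2, 0.5]  # hypothetical data
start = sorted(xs)[len(xs) // 2]                           # a central starting value
theta_newton = solve_score_equation(xs, use_expected=False, start=start)
theta_scoring = solve_score_equation(xs, use_expected=True, start=start)
print(theta_newton, theta_scoring)
```

Both iterations should reach the same root of the score equation; Newton-Raphson typically needs fewer steps, while scoring avoids recomputing second derivatives.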

When the MLEs cannot be found in the above way, other numerical routines must be used:
• Simplex method
• EM algorithm
For a description of the numerical routines, see the textbook.
Maximum likelihood estimation comes into natural use not for handling the standard case, i.e. a complete random sample from a distribution within the exponential family, but for finding estimators in more non-standard and complex situations.

Properties of MLEs
Invariance: If $\hat\theta$ is the MLE of $\theta$, then $g(\hat\theta)$ is the MLE of $g(\theta)$.
Consistency: Under some weak regularity conditions all MLEs are consistent.

Efficiency: Under the usual regularity conditions, $\hat\theta$ is asymptotically $N(\theta, I_\theta^{-1})$, where $I_\theta$ is the Fisher information of the sample (asymptotically efficient and normally distributed).
Sufficiency: The MLE, when unique, is a function of any sufficient statistic.

Invariance property

i.e. the two MLEs are asymptotically uncorrelated (and, by asymptotic normality, asymptotically independent).

Modifications and extensions
Ancillarity and conditional sufficiency:

Profile likelihood: With $\theta = (\theta_1, \theta_2)$, the profile likelihood is $L_P(\theta_1; x) = \max_{\theta_2} L(\theta_1, \theta_2; x)$, i.e. the likelihood maximized over $\theta_2$ for each fixed $\theta_1$. This concept has its main use in cases where $\theta_1$ contains the parameters of “interest” and $\theta_2$ contains nuisance parameters. The same ML point estimator for $\theta_1$ is obtained by maximizing the profile likelihood as by maximizing the full likelihood function.
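A minimal numerical sketch of that last statement, assuming a normal sample with mean $\mu$ as the parameter of interest and variance $\sigma^2$ as the nuisance parameter (the model, data and names are illustrative assumptions). For fixed $\mu$ the variance maximizing the likelihood is the mean squared deviation about $\mu$; substituting it back gives the profile log-likelihood.

```python
import math

def profile_loglik(mu, xs):
    """Normal log-likelihood with sigma^2 replaced by its maximizer for this mu."""
    n = len(xs)
    sigma2_hat = sum((x - mu) ** 2 for x in xs) / n
    return -0.5 * n * (math.log(2 * math.pi * sigma2_hat) + 1)

xs = [4.1, 5.3, 3.8, 6.0, 5.1, 4.7]               # hypothetical data
grid = [3.0 + 0.001 * i for i in range(3001)]      # candidate values of mu in [3, 6]

mu_profile = max(grid, key=lambda m: profile_loglik(m, xs))
mu_full = sum(xs) / len(xs)                        # full-likelihood MLE of mu
print(mu_profile, mu_full)
```

Up to the grid resolution, the maximizer of the profile log-likelihood coincides with the sample mean, which is the MLE of $\mu$ from the full likelihood.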

Marginal and conditional likelihood: Again, these concepts have their main use in cases where $\theta_1$ contains the parameters of “interest” and $\theta_2$ contains nuisance parameters.

Penalized likelihood: MLEs can be derived subject to some criterion of smoothness. In particular, this is applicable when the parameter is no longer a single value (one- or multidimensional) but a function, such as an unknown density function or a regression curve. The penalized log-likelihood function is written $l_{\mathrm{pen}}(\theta; x) = l(\theta; x) - \lambda R(\theta)$, where $R(\theta)$ is a roughness penalty and $\lambda \ge 0$ controls the degree of smoothing.

Method of moments estimation (MM)

The method of moments point estimator of $\theta = (\theta_1, \ldots, \theta_k)$ is obtained by solving for $\theta_1, \ldots, \theta_k$ the system of equations $E[X^r] = \frac{1}{n}\sum_{i=1}^{n} x_i^r$, $r = 1, \ldots, k$, in which the population moments on the left are functions of $\theta$.
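As a hedged illustration, assume a gamma distribution with shape $\alpha$ and scale $\beta$, so that $E[X] = \alpha\beta$ and $E[X^2] = \alpha\beta^2(1 + \alpha)$; the model, data and function name are assumptions of this sketch. Matching the first two sample moments gives closed-form estimators.

```python
def gamma_method_of_moments(xs):
    """Solve E[X] = m1 and E[X^2] = m2 for the gamma shape and scale."""
    n = len(xs)
    m1 = sum(xs) / n                   # first sample moment
    m2 = sum(x * x for x in xs) / n    # second sample moment
    spread = m2 - m1 ** 2              # equals shape * scale^2
    shape = m1 ** 2 / spread
    scale = spread / m1
    return shape, scale

xs = [1.2, 0.8, 2.5, 1.9, 3.1, 0.6]    # hypothetical data
shape_hat, scale_hat = gamma_method_of_moments(xs)
print(shape_hat, scale_hat)
```

Plugging the estimates back into the two moment formulas reproduces the sample moments, which is a quick way to check the algebra.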

Method of Least Squares (LS)
First principles: Assume a sample $x$ where the random variable $X_i$ can be written $X_i = m_i(\theta) + \varepsilon_i$, with $\varepsilon_i$ a zero-mean random error. The least-squares estimator of $\theta$ is the value of $\theta$ that minimizes $S(\theta) = \sum_{i=1}^{n} (x_i - m_i(\theta))^2$, i.e. $\hat\theta_{LS} = \arg\min_\theta S(\theta)$.

A more general approach
Assume the sample can be written $(x, z)$, where $x_i$ represents the random variable of interest (endogenous variable) and $z_i$ represents either an auxiliary random variable (exogenous) or a given constant for sample point $i$. The least squares estimator of $\theta$ is then $\hat\theta_{LS} = \arg\min_\theta \sum_{i=1}^{n} (x_i - E[X_i \mid z_i; \theta])^2$.

Special cases
The ordinary linear regression model:
The heteroscedastic regression model:
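Both special cases can be sketched with one routine, under the assumption of a straight-line mean $E[X_i \mid z_i] = \beta_0 + \beta_1 z_i$ (the parameter names, the function name and the data are illustrative, not taken from the slides). Ordinary least squares uses equal weights; in the heteroscedastic case a common choice is to weight each squared residual by the reciprocal of its error variance.

```python
def weighted_least_squares(z, x, weights=None):
    """Minimize sum_i w_i * (x_i - b0 - b1 * z_i)^2 and return (b0, b1)."""
    if weights is None:
        weights = [1.0] * len(z)        # ordinary least squares
    total = sum(weights)
    z_bar = sum(w * zi for w, zi in zip(weights, z)) / total
    x_bar = sum(w * xi for w, xi in zip(weights, x)) / total
    s_zx = sum(w * (zi - z_bar) * (xi - x_bar) for w, zi, xi in zip(weights, z, x))
    s_zz = sum(w * (zi - z_bar) ** 2 for w, zi in zip(weights, z))
    slope = s_zx / s_zz
    intercept = x_bar - slope * z_bar
    return intercept, slope

z = [1.0, 2.0, 3.0, 4.0, 5.0]
x = [2.1, 3.9, 6.2, 7.8, 10.1]              # hypothetical responses
variances = [0.1, 0.1, 0.4, 0.4, 0.9]       # hypothetical error variances

ols = weighted_least_squares(z, x)
wls = weighted_least_squares(z, x, [1.0 / v for v in variances])
print(ols, wls)
```

The weighted fit gives more influence to the observations with small error variance, so its estimates generally differ from the unweighted ones.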

The first-order autoregressive model:

The conditional least-squares estimator of one parameter (given the other) is
