Estimation of covariance matrix under informative sampling Julia Aru University of Tartu and Statistics Estonia Tartu, June 25-29, 2007.

Slides:



Advertisements
Similar presentations
Copula Representation of Joint Risk Driver Distribution
Advertisements

Agenda of Week VII Review of Week VI Multiple regression Canonical correlation.
Latent normal models for missing data Harvey Goldstein Centre for Multilevel Modelling University of Bristol.
The Normal Distribution
Copula Regression By Rahul A. Parsa Drake University &
Notes Sample vs distribution “m” vs “µ” and “s” vs “σ” Bias/Variance Bias: Measures how much the learnt model is wrong disregarding noise Variance: Measures.
Covariance and Correlation: Estimator/Sample Statistic: Population Parameter: Covariance and correlation measure linear association between two variables,
Master thesis presentation Joanna Gatz TU Delft 29 of July 2007 Properties and Applications of the T copula.
Probability theory 2010 Order statistics  Distribution of order variables (and extremes)  Joint distribution of order variables (and extremes)
1 Def: Let and be random variables of the discrete type with the joint p.m.f. on the space S. (1) is called the mean of (2) is called the variance of (3)
Different chi-squares Ulf H. Olsson Professor of Statistics.
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
GRA 6020 Multivariate Statistics The regression model OLS Regression Ulf H. Olsson Professor of Statistics.
The General LISREL MODEL and Non-normality Ulf H. Olsson Professor of Statistics.
Multivariate Distance and Similarity Robert F. Murphy Cytometry Development Workshop 2000.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Effect of Violations of Normality Edgell and Noon, 1984 On the Correlation Coefficient t-Test.
Different chi-squares Ulf H. Olsson Professor of Statistics.
GRA 6020 Multivariate Statistics The regression model OLS Regression Ulf H. Olsson Professor of Statistics.
Mixed models Various types of models and their relation
Analysis of Simulation Input.. Simulation Machine n Simulation can be considered as an Engine with input and output as follows: Simulation Engine Input.
The General LISREL Model Ulf H. Olsson Professor of statistics.
Computer vision: models, learning and inference Chapter 3 Common probability distributions.
Correlations and Copulas Chapter 10 Risk Management and Financial Institutions 2e, Chapter 10, Copyright © John C. Hull
The General LISREL MODEL and Non-normality Ulf H. Olsson Professor of Statistics.
Statistics for the Social Sciences Psychology 340 Spring 2005 Course Review.
The Multivariate Normal Distribution, Part 2 BMTRY 726 1/14/2014.
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
Variance and covariance Sums of squares General linear models.
Computer vision: models, learning and inference Chapter 5 The Normal Distribution.
Latent Variable Models Christopher M. Bishop. 1. Density Modeling A standard approach: parametric models  a number of adaptive parameters  Gaussian.
The Multiple Correlation Coefficient. has (p +1)-variate Normal distribution with mean vector and Covariance matrix We are interested if the variable.
Chapter 14 Monte Carlo Simulation Introduction Find several parameters Parameter follow the specific probability distribution Generate parameter.
Advanced Higher Statistics Data Analysis and Modelling Hypothesis Testing Statistical Inference AH.
1 G Lect 8b G Lecture 8b Correlation: quantifying linear association between random variables Example: Okazaki’s inferences from a survey.
Selecting Input Probability Distribution. Simulation Machine Simulation can be considered as an Engine with input and output as follows: Simulation Engine.
General ideas to communicate Dynamic model Noise Propagation of uncertainty Covariance matrices Correlations and dependencs.
The Multivariate Gaussian
–The shortest distance is the one that crosses at 90° the vector u Statistical Inference on correlation and regression.
Colorado Center for Astrodynamics Research The University of Colorado 1 STATISTICAL ORBIT DETERMINATION Probability and statistics review ASEN 5070 LECTURE.
University of Colorado Boulder ASEN 5070: Statistical Orbit Determination I Fall 2015 Professor Brandon A. Jones Lecture 14: Probability and Statistics.
Introduction to Machine Learning Multivariate Methods 姓名 : 李政軒.
Linear Correlation (12.5) In the regression analysis that we have considered so far, we assume that x is a controlled independent variable and Y is an.
Chapter 5 Joint Probability Distributions and Random Samples  Jointly Distributed Random Variables.2 - Expected Values, Covariance, and Correlation.3.
Probability and Statistics Dr. Saeid Moloudzadeh Uniform Random Variable/ Normal Random Variable 1 Contents Descriptive Statistics Axioms.
1 Sampling distributions The probability distribution of a statistic is called a sampling distribution. : the sampling distribution of the mean.
Université d’Ottawa / University of Ottawa 2003 Bio 8102A Applied Multivariate Biostatistics L4.1 Lecture 4: Multivariate distance measures l The concept.
Tests of hypothesis Contents: Tests of significance for small samples
Advanced Higher Statistics
Basic simulation methodology
CH 5: Multivariate Methods
Computer vision: models, learning and inference
Random Sampling Population Random sample: Statistics Point estimate
Chapter 9 Hypothesis Testing
Sampling Distribution
Sampling Distribution
Regression in the 21st Century
From Simulations to the Central Limit Theorem
CHAPTER 15 SUMMARY Chapter Specifics
Ch11 Curve Fitting II.
More Parameter Learning, Multinomial and Continuous Variables
The estimate of the proportion (“p-hat”) based on the sample can be a variety of values, and we don’t expect to get the same value every time, but the.
数据的矩阵描述.
Chapter 14 Monte Carlo Simulation
(Approximately) Bivariate Normal Data and Inference Based on Hotelling’s T2 WNBA Regular Season Home Point Spread and Over/Under Differentials
Multivariate Methods Berlin Chen
Correlation A measure of the strength of the linear association between two numerical variables.
The Multivariate Normal Distribution, Part I
Multivariate Methods Berlin Chen, 2005 References:
Statistical Inference for the Mean: t-test
Presentation transcript:

Estimation of covariance matrix under informative sampling Julia Aru University of Tartu and Statistics Estonia Tartu, June 25-29, 2007

Tartu, June 26-29, Outline Informative sampling Population and sample distribution Multivariate normal distribution and exponential inclusion probabilities Conclusions for normal case Simulation study

Tartu, June 26-29, Informative sampling Probability that an object belongs to the sample depends on the variable we are interested in For example: while studying income we see that people with higher income are not keen to respond Under informative sampling sample distribution of variable(s) of interest differs from that in population

Tartu, June 26-29, Population and sample distribution Vector of study variables Population distribution Sample distribution

Tartu, June 26-29, MVN case (1) Population distribution: multivariate normal with parameters µ and Σ: Inclusion probabilities: Matrix A is symmetrical and such that is positive-definite

Tartu, June 26-29, MVN case (2) Sample distribution is then again normal with parameters

Tartu, June 26-29, Conclusions for MVN case If variables are independent in the population (Σ is diagonal) then independence is preserved only in the case when matrix A is also diagonal Matrix A can be chosen to make variables independent in the sample or dependence structure to be very different from that in the population

Tartu, June 26-29, Simulation study (1) Population is bivariate standard normal with correlation coefficient r : Inclusion probabilities: Repetitions: 1000, population size: 10000, sample size: 1000

Tartu, June 26-29, Simulation study (2) rR

Tartu, June 26-29, Thank you!

Tartu, June 26-29, Exponential family (1) Population distribution belongs to expontial family With canonocal representation And inclusion probabilities have the form

Tartu, June 26-29, Exponential family (2) Then sample distribution belonds to the same family of distributions with canonical parameters