ALTERNATIVE SKEW-SYMMETRIC DISTRIBUTIONS Chris Jones THE OPEN UNIVERSITY, U.K.

Slides:



Advertisements
Similar presentations
Parametric Families of Distributions and Their Interaction with the Workshop Title Chris Jones The Open University, U.K.
Advertisements

NORMAL OR GAUSSIAN DISTRIBUTION Chapter 5. General Normal Distribution Two parameter distribution with a pdf given by:
Introduction to Non Parametric Statistics Kernel Density Estimation.
Modelling with parameter- mixture copulas October 2006 Xiangyuan Tommy Chen Econometrics & Business Statistics The University of Sydney
Pair-copula constructions of multiple dependence Workshop on ''Copulae: Theory and Practice'' Weierstrass Institute for Applied Analysis and.
Master thesis presentation Joanna Gatz TU Delft 29 of July 2007 Properties and Applications of the T copula.
Models for construction of multivariate dependence Workshop on Copulae and Multivariate Probability distributions in Finance – Theory, Applications,
Use of moment generating functions. Definition Let X denote a random variable with probability density function f(x) if continuous (probability mass function.
this photo View full size photo Visit the album this photo belongs to Check out the slide show Download photo Bookmark photo Publish photo Comment.
FAMILIES OF UNIMODAL DISTRIBUTIONS ON THE CIRCLE Chris Jones THE OPEN UNIVERSITY.
CHAPTER 16 MARKOV CHAIN MONTE CARLO
FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions.
Pattern Recognition and Machine Learning
Slide 1 EE3J2 Data Mining EE3J2 Data Mining Lecture 10 Statistical Modelling Martin Russell.
Maximum likelihood Conditional distribution and likelihood Maximum likelihood estimations Information in the data and likelihood Observed and Fisher’s.
G. Cowan Lectures on Statistical Data Analysis Lecture 2 page 1 Statistical Data Analysis: Lecture 2 1Probability, Bayes’ theorem 2Random variables and.
Probability theory 2011 The multivariate normal distribution  Characterizing properties of the univariate normal distribution  Different definitions.
Maximum likelihood (ML) and likelihood ratio (LR) test
Some standard univariate probability distributions
Linear and generalised linear models
July 3, Department of Computer and Information Science (IDA) Linköpings universitet, Sweden Minimal sufficient statistic.
Linear and generalised linear models Purpose of linear models Least-squares solution for linear models Analysis of diagnostics Exponential family and generalised.
Probability theory 2008 Outline of lecture 5 The multivariate normal distribution  Characterizing properties of the univariate normal distribution  Different.
Statistical Theory; Why is the Gaussian Distribution so popular? Rob Nicholls MRC LMB Statistics Course 2014.
The Multivariate Normal Distribution, Part 1 BMTRY 726 1/10/2014.
Confidence Interval A confidence interval (or interval estimate) is a range (or an interval) of values used to estimate the true value of a population.
CASA June 2006 BRATISLAVA Mária Bohdalová Faculty of Management, Comenius University Bratislava Oľga Nánásiová Faculty of Civil.
Random variables Petter Mostad Repetition Sample space, set theory, events, probability Conditional probability, Bayes theorem, independence,
Percentile Approximation Via Orthogonal Polynomials Hyung-Tae Ha Supervisor : Prof. Serge B. Provost.
Chapter 3 Basic Concepts in Statistics and Probability
Dept of Bioenvironmental Systems Engineering National Taiwan University Lab for Remote Sensing Hydrology and Spatial Modeling STATISTICS Random Variables.
ECE 8443 – Pattern Recognition LECTURE 03: GAUSSIAN CLASSIFIERS Objectives: Normal Distributions Whitening Transformations Linear Discriminants Resources.
Estimating parameters in a statistical model Likelihood and Maximum likelihood estimation Bayesian point estimates Maximum a posteriori point.
Functions of Random Variables. Methods for determining the distribution of functions of Random Variables 1.Distribution function method 2.Moment generating.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Elements of Financial Risk Management Second Edition © 2012 by Peter Christoffersen 1 Distributions and Copulas for Integrated Risk Management Elements.
1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to.
Quantification of the non- parametric continuous BBNs with expert judgment Iwona Jagielska Msc. Applied Mathematics.
Module 1: Statistical Issues in Micro simulation Paul Sousa.
The Normal Distribution Chapter 6. Outline 6-1Introduction 6-2Properties of a Normal Distribution 6-3The Standard Normal Distribution 6-4Applications.
Use of moment generating functions 1.Using the moment generating functions of X, Y, Z, …determine the moment generating function of W = h(X, Y, Z, …).
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
Chapter 5.6 From DeGroot & Schervish. Uniform Distribution.
Modern Navigation Thomas Herring MW 11:00-12:30 Room
Problem: 1) Show that is a set of sufficient statistics 2) Being location and scale parameters, take as (improper) prior and show that inferences on ……
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 07: BAYESIAN ESTIMATION (Cont.) Objectives:
Geology 6600/7600 Signal Analysis 02 Sep 2015 © A.R. Lowry 2015 Last time: Signal Analysis is a set of tools used to extract information from sequences.
The generalization of Bayes for continuous densities is that we have some density f(y|  ) where y and  are vectors of data and parameters with  being.
Topic 5: Continuous Random Variables and Probability Distributions CEE 11 Spring 2002 Dr. Amelia Regan These notes draw liberally from the class text,
STOCHASTIC HYDROLOGY Stochastic Simulation of Bivariate Distributions Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National.
Lecture 1: Basic Statistical Tools. A random variable (RV) = outcome (realization) not a set value, but rather drawn from some probability distribution.
STA347 - week 91 Random Vectors and Matrices A random vector is a vector whose elements are random variables. The collective behavior of a p x 1 random.
CHAPTER 5 CONTINUOUS PROBABILITY DISTRIBUTION Normal Distributions.
Lecture 8 Stephen G. Hall ARCH and GARCH. REFS A thorough introduction ‘ARCH Models’ Bollerslev T, Engle R F and Nelson D B Handbook of Econometrics vol.
1 Ka-fu Wong University of Hong Kong A Brief Review of Probability, Statistics, and Regression for Forecasting.
Generating Random Variates
Advanced Higher Statistics
The Exponential and Gamma Distributions
Basic simulation methodology
STATISTICS Random Variables and Distribution Functions
Classical Continuous Probability Distributions
7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to record.
POPULATION (of “units”)
Regression in the 21st Century
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005
The Multivariate Normal Distribution, Part I
7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to record.
7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to record.
Professor Ke-sheng Cheng
Presentation transcript:

ALTERNATIVE SKEW-SYMMETRIC DISTRIBUTIONS Chris Jones THE OPEN UNIVERSITY, U.K.

For most of this talk, I am going to be discussing a variety of families of univariate continuous distributions (on the whole of R ) which are unimodal, and which allow variation in skewness and, perhaps, tailweight. Let g denote the density of a symmetric unimodal distribution on R ; this forms the starting point from which the various skew-symmetric distributions in this talk will be generated. For want of a better name, let us call these skew-symmetric distributions!

FAMILY 0 Azzalini-Type Skew Symmetric Define the density of X A to be w(x) + w(-x) = 1 (Wang, Boyer & Genton, 2004, Statist. Sinica) The most familiar special cases take w(x) = F( α x) to be the cdf of a (scaled) symmetric distribution (Azzalini, 1985, Scand. J. Statist.) where

FAMILY 1 Transformation of Random Variable FAMILY 0 Azzalini-Type Skew-Symmetric FAMILY 2 Transformation of Scale SUBFAMILY OF FAMILY 2 Two-Piece Scale FAMILY 3 Probability Integral Transformation of Random Variable on [0,1 ]

Structure of Remainder of Talk a brief look at each family of distributions in turn, and their main interconnections; some comparisons between them; open problems and challenges: brief thoughts about bi- and multi-variate extensions, including copulas.

FAMILY 1 Transformation of Random Variable Let W: R → R be an invertible increasing function. If Z ~ g, then define X R = W(Z). The density of the distribution of X R is where w = W'

A particular favourite of mine is a flexible and tractable two-parameter transformation that I call the sinh-arcsinh transformation: b=1 a>0 varying a=0 b>0 varying (Jones & Pewsey, 2009, Biometrika) Here, a controls skewness … … and b>0 controls tailweight

FAMILY 2 Transformation of Scale The density of the distribution of X S is just … which is a density if W(x) - W(-x) = x … which corresponds to w = W' satisfying w(x) + w(-x) = 1 (Jones, 2013, Statist. Sinica)

FAMILY 1 Transformation of Random Variable FAMILY 0 Azzalini-Type Skew-Symmetric and U|Z=z is a random sign with probability w(z) of being a plus X R = W(Z)e.g. X A = UZ FAMILY 2 Transformation of Scale X S = W(X A ) where Z ~ g

FAMILY 3 Probability Integral Transformation of Random Variable on (0,1) Let b be the density of a random variable U on (0,1). Then define X U = G -1 (U) where G'=g. The density of the distribution of X U is cf.

There are three strands of literature in this class: bespoke construction of b with desirable properties (Ferreira & Steel, 2006, J. Amer. Statist. Assoc.) choice of popular b: beta-G, Kumaraswamy-G etc (Eugene et al., 2002, Commun. Statist. Theor. Meth., Jones, 2004, Test) indirect choice of obscure b: b=B' and B is a function of G such that B is also a cdf e.g. B = G/{α+(1-α)G} (Marshall & Olkin, 1997, Biometrika) and

Comparisons I SkewSymmT of RVT of STwoPieceB(G) Unimodal?usuallyoften often When unimodal, with explicit mode?  Skewness ordering? seems well- behaved (van Zwet) (density asymmetry) (both)  Straightforward distribution function?  usually Tractable quantile function?  usually

Comparisons I SkewSymmT of RVT of STwoPieceB(G) Unimodal?usuallyoften often When unimodal, with explicit mode?  Skewness ordering? seems well- behaved (van Zwet) (density asymmetry) (both)  Straightforward distribution function?  usually Tractable quantile function?  usually

Comparisons I SkewSymmT of RVT of STwoPieceB(G) Unimodal?usuallyoften often When unimodal, with explicit mode?  Skewness ordering? seems well- behaved (van Zwet) (density asymmetry) (both)  Straightforward distribution function?  usually Tractable quantile function?  usually

Comparisons I SkewSymmT of RVT of STwoPieceB(G) Unimodal?usuallyoften often When unimodal, with explicit mode?  Skewness ordering? seems well- behaved (van Zwet) (density asymmetry) (both)  Straightforward distribution function?  usually Tractable quantile function?  usually

Comparisons I SkewSymmT of RVT of STwoPieceB(G) Unimodal?usuallyoften often When unimodal, with explicit mode?  Skewness ordering? seems well- behaved (van Zwet) (density asymmetry) (both)  Straightforward distribution function?  usually Tractable quantile function?  usually

Comparisons II SkewSymmT of RVT of STwoPieceB(G) Easy random variate generation? usually Easy ML estimation? (“problems” overblown?) Nice Fisher information matrix?  (singularity in one case) full FI (considerable parameter orthogonality) full FI “Physical” motivation? perhaps?  some- times Transferable to circle?  (non- unimodality)  (not by two scales) equivalent to T of RV?

Comparisons II SkewSymmT of RVT of STwoPieceB(G) Easy random variate generation? usually Easy ML estimation? (“problems” overblown?) Nice Fisher information matrix?  (singularity in one case) full FI (considerable parameter orthogonality) full FI “Physical” motivation? perhaps?  some- times Transferable to circle?  (non- unimodality)  (not by two scales) equivalent to T of RV?

Comparisons II SkewSymmT of RVT of STwoPieceB(G) Easy random variate generation? usually Easy ML estimation? (“problems” overblown?) Nice Fisher information matrix?  (singularity in one case) full FI (considerable parameter orthogonality) full FI “Physical” motivation? perhaps?  some- times Transferable to circle?  (non- unimodality)  (not by two scales) equivalent to T of RV?

Comparisons II SkewSymmT of RVT of STwoPieceB(G) Easy random variate generation? usually Easy ML estimation? (“problems” overblown?) Nice Fisher information matrix?  (singularity in one case) full FI (considerable parameter orthogonality) full FI “Physical” motivation? perhaps?  some- times Transferable to circle?  (non- unimodality)  (not by two scales) equivalent to T of RV?

Comparisons II SkewSymmT of RVT of STwoPieceB(G) Easy random variate generation? usually Easy ML estimation? (“problems” overblown?) Nice Fisher information matrix?  (singularity in one case) full FI (considerable parameter orthogonality) full FI “Physical” motivation? perhaps?  some- times Transferable to circle?  (non- unimodality)  (not by two scales) equivalent to T of RV?

Miscellaneous Plus Points T of RVT of SB(G) symmetric members can have kurtosis ordering of van Zwet … beautiful Khintchine theorem contains some known specific families … and, quantile- based kurtosis measures can be independent of skewness no change to entropy

OPEN problems and challenges: bi- and multi-variate extension I think it’s more a case of what copulas can do for multivariate extensions of these families rather than what they can do for copulas “natural” bi- and multi-variate extensions with these families as marginals are often constructed by applying the relevant marginal transformation to a copula (T of RV; often B(G)) T of S and a version of SkewSymm share the same copula Repeat: I think it’s more a case of what copulas can do for multivariate extensions of these families than what they can do for copulas

In the ISI News Jan/Feb 2012, they printed a lovely clear picture of the Programme Committee for the 2012 European Conference on Quality in Official Statistics … … on their way to lunch!

X R = W(Z) where Z ~ g 1-d: 2-d: Let Z 1, Z 2 ~ g 2 (z 1,z 2 ) [with marginals g] Then set X R,1 = W(Z 1 ), X R,2 = W(Z 2 ) to get a bivariate transformation of r.v. distribution [with marginals f R ] Transformation of Random Variable This is simply the copula associated with g 2 transformed to f R marginals

Azzalini-Type Skew Symmetric 1 1-d: ~ X A = Z|Y≤Z where Z ~ g and Y is independent of Z with density w'(y) 2-d: For example, let Z 1, Z 2, Y ~ w'(y) g 2 (z 1,z 2 ) Then set X A,1 = Z 1, X A,2 = Z 2 conditional on Y < a 1 z 1 +a 2 z 2 to get a bivariate skew symmetric distribution with density 2 w(a 1 z 1 +a 2 z 2 ) g 2 (z 1,z 2 ) However, unless w and g 2 are normal, this does not have marginals f A

Azzalini-Type Skew Symmetric 2 4 Now let Z 1, Z 2, Y 1, Y 2 ~ 4 w'(y 1 ) w'(y 2 ) g 2 (z 1,z 2 ) and restrict g 2 → g 2 to be `sign-symmetric’, that is, g 2 (x,y) = g 2 (-x,y) = g 2 (x,-y) = g 2 (-x,-y). Then set X A,1 = Z 1, X A,2 = Z 2 conditional on Y 1 < z 1 and Y 2 < z 2 to get a bivariate skew symmetric distribution with density 4 w(z 1 ) w(z 2 ) g 2 (z 1,z 2 ) (Sahu, Dey & Branco, 2003, Canad. J. Statist.) This does have marginals f A

1-d: 2-d: Transformation of Scale X S = W(X A ) where Z ~ f A Let X A,1, X A,2 ~ 4 w(x A,1 ) w(x A,2 ) g 2 (x A,1,x A,2 ) [with marginals f A ] Then set X S,1 = W(X A,1 ), X S,2 = W(X A,2 ) to get a bivariate transformation of scale distribution [with marginals f S ] This shares its copula with the second skew-symmetric construction

Probability Integral Transformation of Random Variable on (0,1) 1-d: X U = G -1 (U) where U ~ b on (0,1) 2-d: Where does b 2 come from? Sometimes there are reasonably “natural” constructs (e.g bivariate beta distributions) … Let U 1, U 2 ~ b 2 (z 1,z 2 ) [with marginals b] Then set X U,1 = G -1 (U 1 ), X U,2 = G -1 (Z 2 ) to get a bivariate version [with marginals f U ] … but often it comes down to choosing its copula