FAMILIES OF UNIMODAL DISTRIBUTIONS ON THE CIRCLE Chris Jones THE OPEN UNIVERSITY.

Slides:



Advertisements
Similar presentations
Parametric Families of Distributions and Their Interaction with the Workshop Title Chris Jones The Open University, U.K.
Advertisements

Chapter 3 Properties of Random Variables
NORMAL OR GAUSSIAN DISTRIBUTION Chapter 5. General Normal Distribution Two parameter distribution with a pdf given by:
One-way nonparametric ANOVA with trigonometric scores by Kravchuk, O.Y. School of Land and Food Sciences, University of Queensland.
Chapter 6 Sampling and Sampling Distributions
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
Random Sampling and Data Description
Theoretical Probability Distributions We have talked about the idea of frequency distributions as a way to see what is happening with our data. We have.
Beginning the Visualization of Data
Prepared by Lloyd R. Jaisingh
ALTERNATIVE SKEW-SYMMETRIC DISTRIBUTIONS Chris Jones THE OPEN UNIVERSITY, U.K.
FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions.
Statistics for the Social Sciences
Probability and Statistics Review
1 STATISTICAL INFERENCE PART I EXPONENTIAL FAMILY & POINT ESTIMATION.
Some standard univariate probability distributions
Statistical Theory; Why is the Gaussian Distribution so popular? Rob Nicholls MRC LMB Statistics Course 2014.
Random Variables and Probability Distributions
Continuous Probability Distributions A continuous random variable can assume any value in an interval on the real line or in a collection of intervals.
1 Normal Distributions Heibatollah Baghi, and Mastee Badii.
Describing Data: Numerical
Chap 6-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 6 The Normal Distribution Business Statistics: A First Course 6 th.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
The horseshoe estimator for sparse signals CARLOS M. CARVALHO NICHOLAS G. POLSON JAMES G. SCOTT Biometrika (2010) Presented by Eric Wang 10/14/2010.
Percentile Approximation Via Orthogonal Polynomials Hyung-Tae Ha Supervisor : Prof. Serge B. Provost.
(a.k.a: The statistical bare minimum I should take along from STAT 101)
Business Statistics: Communicating with Numbers
Traffic Modeling.
Review Which of these is a parameter? A.The average height of all people B.The time it takes rat #3 to learn the maze C.The number of subjects in your.
Continuous Distributions. The distributions that we have looked at so far have involved DISCRETE Data The distributions that we have looked at so far.
Measures of Dispersion & The Standard Normal Distribution 2/5/07.
Education 793 Class Notes Normal Distribution 24 September 2003.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
Measures of Dispersion & The Standard Normal Distribution 9/12/06.
Chapter 2 Statistical Concepts Robert J. Drummond and Karyn Dayle Jones Assessment Procedures for Counselors and Helping Professionals, 6 th edition Copyright.
Section 1.3 Density Curves and Normal Distributions.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 Chapter 6 Estimates and Sample Sizes 6-1 Estimating a Population Mean: Large Samples / σ Known 6-2 Estimating a Population Mean: Small Samples / σ Unknown.
NORMAL DISTRIBUTION AND ITS APPL ICATION. INTRODUCTION Statistically, a population is the set of all possible values of a variable. Random selection of.
1 Statistical Analysis – Descriptive Statistics Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
1 8. One Function of Two Random Variables Given two random variables X and Y and a function g(x,y), we form a new random variable Z as Given the joint.
Measures of variability: understanding the complexity of natural phenomena.
Random Variable The outcome of an experiment need not be a number, for example, the outcome when a coin is tossed can be 'heads' or 'tails'. However, we.
Input Modeling for Simulation Chapter 3 “As far as the laws of mathematics refer to reality, they are not certain, and as far as they are certain, they.
Univariate Statistics PSYC*6060 Peter Hausdorf University of Guelph.
1 Chapter 2: The Normal Distribution 2.1Density Curves and the Normal Distributions 2.2Standard Normal Calculations.
From the population to the sample The sampling distribution FETP India.
Gil McVean, Department of Statistics Thursday February 12 th 2009 Monte Carlo simulation.
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
1 Ka-fu Wong University of Hong Kong A Brief Review of Probability, Statistics, and Regression for Forecasting.
Selecting Input Probability Distributions. 2 Introduction Part of modeling—what input probability distributions to use as input to simulation for: –Interarrival.
Central Bank of Egypt Basic statistics. Central Bank of Egypt 2 Index I.Measures of Central Tendency II.Measures of variability of distribution III.Covariance.
Biostatistics Class 3 Probability Distributions 2/15/2000.
Managerial Economics & Decision Sciences Department random variables  density functions  cumulative functions  business analytics II Developed for ©
Jacek Wallusch _________________________________ Statistics for International Business Lecture 8: Distributions and Densities.
Distributions.
The Exponential and Gamma Distributions
The Normal Distribution: Comparing Apples and Oranges
Univariate Descriptive Statistics
Essential Statistics (a.k.a: The statistical bare minimum I should take along from STAT 101)
Review of Probability Concepts
POPULATION (of “units”)
Regression in the 21st Century
8. One Function of Two Random Variables
Random Variables and Probability Distributions
Objective: To introduce the characteristics of normal distribution curve. Standard 5.10.
8. One Function of Two Random Variables
The Normal Distribution
Presentation transcript:

FAMILIES OF UNIMODAL DISTRIBUTIONS ON THE CIRCLE Chris Jones THE OPEN UNIVERSITY

FOR EMPIRICAL USE ONLY Structure of Talk 1)a quick look at three families of distributions on the real line R, and their interconnections; 2)extensions/adaptations of these to families of unimodal distributions on the circle C : a)somewhat unsuccessfully b)then successfully through direct and inverse Batschelet distributions c)then most successfully through our latest proposal … which Shogo will tell you about in Talk 2 [also Toshi in Talk 3?] Structure of Talks

To start with, then, I will concentrate on univariate continuous distributions on (the whole of) R a symmetric unimodal distribution on R with density g location and scale parameters which will be hidden one or more shape parameters, accounting for skewness and perhaps tail weight, on which I shall implicitly focus, via certain functions, w ≥ 0 and W, depending on them Here are some ingredients from which to cook them up: Part 1)

FAMILY 2 Transformation of Random Variable FAMILY 1 Azzalini-Type Skew-Symmetric FAMILY 3 Transformation of Scale SUBFAMILY OF FAMILY 3 Two-Piece Scale FAMILY 4 Probability Integral Transformation of Random Variable on [0,1 ]

FAMILY 1 Azzalini-Type Skew Symmetric Define the density of X A to be w(x) + w(-x) = 1 (Wang, Boyer & Genton, 2004, Statist. Sinica) The most familiar special cases take w(x) = F( ν x) to be the cdf of a (scaled) symmetric distribution (Azzalini, 1985, Scand. J. Statist., Azzalini with Capitanio, 2014, book) where

FAMILY 2 Transformation of Random Variable Let W: R → R be an invertible increasing function. If Z ~ g, define X R = W(Z). The density of the distribution of X R is, of course, where w = W' FOR EXAMPLE W(Z) = sinh( a + b sinh -1 Z ) (Jones & Pewsey, 2009, Biometrika)

FAMILY 3 Transformation of Scale The density of the distribution of X S is just … which is a density if W(x) - W(-x) = x … corresponding to w = W’ satisfying w(x) + w(-x) = 1 (Jones, 2014, Statist. Sinica) This works because X S = W(X A )

From a review and comparison of families on R in Jones, forthcoming,Internat. Statist. Rev.: x 0 =W(0)

So now let’s try to adapt these ideas to obtaining distributions on the circle C a symmetric unimodal distribution on C with density g location and concentration parameters which will often be hidden one or more shape parameters, accounting for skewness and perhaps “symmetric shape”, via certain specific functions, w and W, depending on them The ingredients are much the same as they were on R : Part 2)

ASIDE: if you like your “symmetric shape” incorporated into g, then you might use the specific symmetric family with densities g ψ (θ) ∝ { 1 + tanh(κψ) cos(θ-μ) } 1/ψ (Jones & Pewsey, 2005, J. Amer. Statist. Assoc.) EXAMPLES: Ψ = -1: wrapped Cauchy Ψ = 0: von Mises Ψ = 1: cardioid

The main example of skew-symmetric-type distributions on C in the literature takes w( θ ) = ½(1 + ν sin θ ), -1 ≤ ν ≤ 1: Part 2a) f A (θ) = (1 + ν sinθ) g(θ) This w is nonnegative and satisfies w(θ) + w(-θ) = 1 (Umbach & Jammalamadaka, 2009, Statist. Probab. Lett.; Abe & Pewsey, 2011, Statist. Pap.)

Unfortunately, these attractively simple skewed distributions are not always unimodal; And they can have problems introducing much in the way of skewness, plotted below as a function of ν and a parameter indexing a wide family of choices of g: Ψ, parameter indexing symmetric family

A nice example of transformation distributions on C uses a Möbius transformation M -1 (θ) = ν + 2 tan -1 [ ω tan(½(θ- ν)) ] f R (θ) = M′(θ) g(M(θ)) This has a number of nice properties, especially with regard to circular-circular regression, (Kato & Jones, 2010, J. Amer. Statist. Assoc.) What about transformation of random variables on C ? but f R isn’t always unimodal

That leaves “transformation of scale” … Part 2b) f S (θ) ∝ g(T(θ))... which is unimodal provided g is! (and its mode is at T -1 (0) ) A first skewing example is the “direct Batschelet distribution” essentially using the transformation B(θ) = θ - ν - ν cosθ, -1 ≤ ν ≤ 1. (Batschelet’s 1981 book; Abe, Pewsey & Shimizu, 2013, Ann. Inst. Statist. Math.)

B(θ) … ν: 0 …

Even better is the “inverse Batschelet distribution” which simply uses the inverse transformation B -1 (θ) where, as in the direct case, B(θ) = θ - ν - ν cosθ. (Jones & Pewsey, 2012, Biometrics)

Even better is the “inverse Batschelet distribution” which simply uses the inverse transformation B -1 (θ) where, as in the direct case, B(θ) = θ - ν - ν cosθ. (Jones & Pewsey, 2012, Biometrics) B(θ) … ν: 0 … B -1 (θ) … ν: 0 …

This is unimodal (if g is) with mode at B(θ) = - 2ν This has density f IB (θ) = g(B -1 (θ)) The equality arises because B′(θ) = 1 + ν sinθ equals 2w(θ), the w used in the skew- symmetric example described earlier; just as on R, if Θ ∼ f S, then Φ = B -1 ( Θ) ∼ f A.

κ=½κ=2 ν=½ ν=1

f IB is unimodal (if g is) – with mode explicitly at -2ν * includes g as special case has simple explicit density function – trivial normalising constant, independent of ν ** f IB (θ;-ν) = f IB (-θ;ν) with ν acting as a skewness parameter in a density asymmetry sense a very wide range of skewness and symmetric shape * a high degree of parameter orthogonality ** nice random variate generation * Some advantages of inverse Batschelet distributions * means not quite so nicely shared by direct Batschelet distributions ** means not (at all) shared by direct Batschelet distributions

no explicit distribution function no explicit characteristic function/trigonometric moments – method of (trig) moments not readily available ML estimation slowed up by inversion of B(θ) * Some disadvantages of inverse Batschelet distributions * means not shared by direct Batschelet distributions

Over to you, Shogo! Part 2c)

Comparisons: inverse Batschelet vs new model inverse Batschelet new model unimodal? with explicit mode? includes simple g as special case? (von Mises, WC, cardioid) (WC, cardioid) simple explicit density function? f(θ;-ν) = f(-θ;ν)? understandable skewness parameter? very wide range of skewness and kurtosis? high degree of parameter orthogonality?  nice random variate generation?

Comparisons continued inverse Batschelet new model explicit distribution function?  explicit characteristic function?  fully interpretable parameters? MoM estimation available?  ML estimation straightforward? closure under convolution?  FINAL SCORE: inverse Batschelet 10, new model 14