Measuring Inequality An examination of the purpose and techniques of inequality measurement.



What is inequality?
From Merriam-Webster:
in·equal·i·ty. Function: noun
1: the quality of being unequal or uneven, as (a) lack of evenness; (b) social disparity; (c) disparity of distribution or opportunity; (d) the condition of being variable: changeableness
2: an instance of being unequal

Our primary interest is in economic inequality. In this context, inequality measures the disparity between a percentage of population and the percentage of resources (such as income) received by that population. Inequality increases as the disparity increases.

If a single person holds all of a given resource, inequality is at a maximum. If all persons hold the same percentage of a resource, inequality is at a minimum. Inequality studies explore the levels of resource disparity and their practical and political implications.

Economic inequalities can occur for several reasons:
Physical attributes – the distribution of natural ability is not equal
Personal preferences – the relative valuation of leisure and work effort differs
Social process – pressure to work or not to work varies across particular fields or disciplines
Public policy – tax, labor, education, and other policies affect the distribution of resources

Why measure Inequality?
Measuring changes in inequality helps determine the effectiveness of policies aimed at affecting inequality and generates the data necessary to use inequality as an explanatory variable in policy analysis.

How do we measure Inequality?
Before choosing an inequality measure, the researcher must ask two additional questions:
Does the research question require the inequality metric to have particular properties (inflation resistance, comparability across groups, etc.)?
What metric best leverages the available data?

Choosing the best metric
Some popular measures include the Range, the Range Ratio, the McLoone Index, the Coefficient of Variation, the Gini Coefficient, and Theil’s T Statistic.

Range
The range is simply the difference between the highest and lowest observations.
Example data (number of employees, salary): 2 employees at $1,000,000; the remaining employees at $200,000, $100,000, $60,000, $45,000, or $24,000.
In this example, the Range = $1,000,000 - $24,000 = $976,000.
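The range calculation can be sketched in a few lines of Python. Only the salary levels and the two $1,000,000 earners come from the example; the remaining head counts in `payroll` are assumptions, chosen so that the totals agree with the figures quoted on the later slides.

```python
# Hypothetical payroll as (number of employees, salary) pairs.
# Assumption: every head count except the first is invented for illustration.
payroll = [
    (2, 1_000_000),
    (4, 200_000),
    (4, 100_000),
    (6, 60_000),
    (10, 45_000),
    (12, 24_000),
]

# Expand the grouped table into one salary per employee.
salaries = [salary for count, salary in payroll for _ in range(count)]

# Range = highest observation minus lowest observation.
salary_range = max(salaries) - min(salaries)
print(salary_range)  # 976000
```

Because only the two extreme values enter the calculation, changing any of the assumed head counts leaves the result unchanged.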

Range
The range is simply the difference between the highest and lowest observations.
Pros: easy to understand; easy to compute.
Cons: ignores all but two of the observations; does not weight observations; affected by inflation; skewed by outliers.

Range Ratio
The Range Ratio is computed by dividing a value at one predetermined percentile by the value at a lower predetermined percentile.
In this example, which uses the same salary data, the 95th percentile approximately equals the 36th person ($200,000) and the 5th percentile approximately equals the 2nd person ($24,000), so the Range Ratio = 200,000/24,000 = 8.33.
Note: Any two percentiles can be used in producing a Range Ratio. In some contexts, this 95/5 ratio is referred to as the Federal Range Ratio.
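A minimal sketch of the 95/5 Range Ratio follows. The head counts in `payroll` are assumptions (only the salary levels and the top group of two come from the example), and `percentile_value` is a hypothetical helper that uses a simple rounded-rank lookup to mirror the "approximately equals the Nth person" wording; other percentile conventions would give slightly different ranks.

```python
# Hypothetical payroll as (number of employees, salary) pairs.
# Assumption: head counts other than the first are invented for illustration.
payroll = [
    (2, 1_000_000),
    (4, 200_000),
    (4, 100_000),
    (6, 60_000),
    (10, 45_000),
    (12, 24_000),
]
salaries = sorted(salary for count, salary in payroll for _ in range(count))


def percentile_value(sorted_values, pct):
    """Return the value held by the person at rank round(pct/100 * n), counting from 1."""
    rank = max(1, round(pct / 100 * len(sorted_values)))
    return sorted_values[rank - 1]


high = percentile_value(salaries, 95)  # 36th of 38 people
low = percentile_value(salaries, 5)    # 2nd of 38 people
range_ratio = high / low
print(high, low, round(range_ratio, 2))  # 200000 24000 8.33
```

Unlike the plain range, the two $1,000,000 salaries never enter this calculation, which is why the ratio is not skewed by severe outliers.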

Range Ratio
The Range Ratio is computed by dividing a value at one predetermined percentile by the value at a lower predetermined percentile.
Pros: easy to understand; easy to calculate; not skewed by severe outliers; not affected by inflation.
Cons: ignores all but two of the observations; does not weight observations.

The McLoone Index
The McLoone Index divides the sum of all observations below the median by the median multiplied by the number of observations below the median.
In this example, which uses the same salary data, the sum of the 19 observations below the median = 603,000, and the median = 45,000. Thus, the McLoone Index = 603,000/(45,000 × 19) = 0.7053.
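The McLoone calculation might be sketched as follows. The head counts in `payroll` are assumptions picked to agree with the quoted totals (19 observations summing to 603,000, median 45,000). The sketch also assumes that "below the median" means the lower half of the ranked list, which is the reading that reproduces the quoted figures, since some of those 19 observations sit exactly at the median value.

```python
from statistics import median

# Hypothetical payroll as (number of employees, salary) pairs.
# Assumption: head counts other than the first are invented for illustration.
payroll = [
    (2, 1_000_000),
    (4, 200_000),
    (4, 100_000),
    (6, 60_000),
    (10, 45_000),
    (12, 24_000),
]
salaries = sorted(salary for count, salary in payroll for _ in range(count))


def mcloone_index(sorted_values):
    """Sum of the lower half of the ranked values over (median * size of that half)."""
    half = len(sorted_values) // 2
    lower_half = sorted_values[:half]
    return sum(lower_half) / (median(sorted_values) * half)


index = mcloone_index(salaries)
print(sum(salaries[:19]), median(salaries), round(index, 4))
```

A value of 1 would mean that everyone in the lower half earns the median; smaller values indicate a lower half that falls further short of it.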

The McLoone Index
The McLoone Index divides the sum of all observations below the median by the median multiplied by the number of observations below the median.
Pros: easy to understand; conveys comprehensive information about the bottom half.
Cons: ignores values above the median; relevance depends on the meaning of the median value.

The Coefficient of Variation
The Coefficient of Variation is a distribution’s standard deviation divided by its mean. Both distributions in the slide's figure have the same mean, 1, but the standard deviation is much smaller in the distribution on the left, resulting in a lower coefficient of variation.
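To make the comparison concrete, here is a small sketch with two made-up samples that share a mean of 1. The samples `tight` and `wide` are assumptions standing in for the two plotted distributions, and the population standard deviation (`pstdev`) is used; a sample standard deviation would be an equally reasonable choice.

```python
from statistics import mean, pstdev


def coefficient_of_variation(values):
    """Standard deviation divided by the mean (population form)."""
    return pstdev(values) / mean(values)


# Hypothetical data: both samples have mean 1, but different spread.
tight = [0.9, 1.0, 1.1]
wide = [0.2, 1.0, 1.8]

cv_tight = coefficient_of_variation(tight)
cv_wide = coefficient_of_variation(wide)
print(cv_tight, cv_wide)
```

Multiplying every value by the same factor scales the standard deviation and the mean together, so the ratio is unchanged; this is the sense in which the measure is not skewed by inflation.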

The Coefficient of Variation
The Coefficient of Variation is a distribution’s standard deviation divided by its mean.
Pros: fairly easy to understand; if data is weighted, it is immune to outliers; incorporates all data; not skewed by inflation.
Cons: requires comprehensive individual level data; no standard for an acceptable level of inequality.

The Gini Coefficient
The Gini Coefficient has an intuitive, but possibly unfamiliar construction. To understand the Gini Coefficient, one must first understand the Lorenz Curve, which orders all observations and then plots the cumulative percentage of the population against the cumulative percentage of the resource.

The Gini Coefficient
An equality diagonal represents perfect equality: at every point, cumulative population equals cumulative income. The Lorenz curve measures the actual distribution of income.
[Figure. Axes: Cumulative Population, Cumulative Income. A: equality diagonal (population = income). B: Lorenz curve. C: difference between equality and reality.]

The Gini Coefficient
Mathematically, the Gini Coefficient is equal to twice the area enclosed between the Lorenz curve and the equality diagonal. When there is perfect equality, the Lorenz curve is the equality diagonal, and the value of the Gini Coefficient is zero. When one member of the population holds all of the resource, the value of the Gini Coefficient is one.
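The area-based definition can be sketched directly: build the Lorenz curve as a piecewise-linear path, measure the area beneath it with trapezoids, and double the gap to the diagonal. The function names `lorenz_points` and `gini` are hypothetical, and this is one of several equivalent ways to compute the coefficient. With a finite population of n people, the single-holder case yields (n - 1)/n, which approaches one as n grows.

```python
def lorenz_points(values):
    """Cumulative population share vs. cumulative resource share, lowest values first."""
    ordered = sorted(values)
    total = sum(ordered)
    n = len(ordered)
    points = [(0.0, 0.0)]
    running = 0.0
    for i, value in enumerate(ordered, start=1):
        running += value
        points.append((i / n, running / total))
    return points


def gini(values):
    """Twice the area between the equality diagonal and the Lorenz curve."""
    pts = lorenz_points(values)
    # Trapezoid rule over each segment of the piecewise-linear Lorenz curve.
    area_under_lorenz = sum(
        (x1 - x0) * (y0 + y1) / 2 for (x0, y0), (x1, y1) in zip(pts, pts[1:])
    )
    # The area under the equality diagonal is 0.5.
    return 2 * (0.5 - area_under_lorenz)


print(gini([5, 5, 5, 5]))    # perfect equality
print(gini([0, 0, 0, 10]))   # one person holds everything
```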

The Gini Coefficient
Twice the area between the Lorenz curve and the equality diagonal.
Pros: generally regarded as the gold standard in economic work; incorporates all data; allows direct comparison between units with different size populations; attractive intuitive interpretation.
Cons: requires comprehensive individual level data; requires more sophisticated computations.

Theil’s T Statistic
Theil’s T Statistic lacks an intuitive picture and involves more than a simple difference or ratio. Nonetheless, it has several properties that make it a superior inequality measure. Theil’s T Statistic can incorporate group-level data and is particularly effective at parsing effects in hierarchical data sets.

Theil’s T Statistic
Theil’s T Statistic generates an element, or a contribution, for each individual or group in the analysis which weights the data point’s size (in terms of population share) and weirdness (in terms of proportional distance from the mean). When individual data is available, each individual has an identical population share (1/N), so each individual’s Theil element is determined by his or her proportional distance from the mean.

Theil’s T Statistic
Mathematically, with individual level data, Theil’s T statistic of income inequality is given by:
$$T = \frac{1}{n} \sum_{p=1}^{n} \frac{y_p}{\mu_y} \ln\left(\frac{y_p}{\mu_y}\right)$$
where $n$ is the number of individuals in the population, $y_p$ is the income of the person indexed by $p$, and $\mu_y$ is the population’s average income.
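The individual-level statistic can be sketched as below: each person contributes an element (income over mean income, times the natural log of that ratio), and the elements are averaged. The function names and the three sample incomes are assumptions for illustration. Zero incomes are treated as contributing zero, which follows the usual limiting convention rather than anything stated in the slides.

```python
import math


def theil_elements(incomes):
    """One Theil element per person: (y / mean) * ln(y / mean), with 0 for zero income."""
    mu = sum(incomes) / len(incomes)
    return [(y / mu) * math.log(y / mu) if y > 0 else 0.0 for y in incomes]


def theil_t(incomes):
    """Theil's T: the average of the individual Theil elements."""
    elements = theil_elements(incomes)
    return sum(elements) / len(elements)


# Hypothetical incomes with a mean of 60,000.
incomes = [100_000, 60_000, 20_000]
above, at_mean, below = theil_elements(incomes)
print(above, at_mean, below)  # positive, zero, negative
print(theil_t(incomes))
```

Although below-average incomes contribute negative elements, the positive elements always outweigh them, so the statistic is zero under perfect equality and positive otherwise.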

Theil’s T Statistic
The formula on the previous slide emphasizes several points:
The summation sign reinforces the idea that each person will contribute a Theil element.
$y_p/\mu_y$ is the ratio of the individual’s income to average income.
The natural logarithm of $y_p/\mu_y$ determines whether the element will be positive ($y_p/\mu_y > 1$), negative ($y_p/\mu_y < 1$), or zero ($y_p/\mu_y = 1$).

Theil’s T Statistic – Example 1
The following example assumes that exact salary information is known for each individual.
Example data (number of employees, exact salary): 2 employees at $100,000; the remaining employees at $80,000, $60,000, $40,000, or $20,000.
For this data, Theil’s T Statistic =
Individuals in the top salary group contribute large positive elements. Individuals in the middle salary group contribute nothing to Theil’s T Statistic because their salaries are equal to the population average. Individuals in the bottom salary group contribute large negative elements.

Theil’s T Statistic
Often, individual data is not available. Theil’s T Statistic has a flexible way to deal with such instances. If members of a population can be classified into mutually exclusive and completely exhaustive groups, then Theil’s T Statistic for the population ($T$) is made up of two components, the between group component ($T'_g$) and the within group component ($T^w_g$).

Theil’s T Statistic
Algebraically, we have:
$$T = T'_g + T^w_g$$
When aggregated data is available instead of individual data, $T'_g$ can be used as a lower bound for Theil’s T Statistic in the population.

Theil’s T Statistic
The between group element of the Theil index has a familiar form:
$$T'_g = \sum_{i} \frac{p_i}{P} \frac{y_i}{\mu} \ln\left(\frac{y_i}{\mu}\right)$$
where $i$ indexes the groups, $p_i$ is the population of group $i$, $P$ is the total population, $y_i$ is the average income in group $i$, and $\mu$ is the average income across the entire population.
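The decomposition can be checked numerically with a short sketch. The between group component weights each group's element by its population share; the within group component used here, the sum over groups of (population share) × (group mean over overall mean) × (Theil's T inside the group), is the standard form and is stated as an assumption because the slides do not spell it out. The individual salaries in `groups` are hypothetical, chosen so that the group averages are $95,000, $75,000, $60,000, $45,000, and $25,000.

```python
import math


def theil_t(incomes):
    """Theil's T from individual incomes (zero incomes contribute nothing)."""
    mu = sum(incomes) / len(incomes)
    return sum((y / mu) * math.log(y / mu) for y in incomes if y > 0) / len(incomes)


def theil_between(groups):
    """Between group component: sum of (p_i / P) * (y_i / mu) * ln(y_i / mu)."""
    total_pop = sum(len(g) for g in groups)
    mu = sum(sum(g) for g in groups) / total_pop
    result = 0.0
    for g in groups:
        group_mean = sum(g) / len(g)
        result += (len(g) / total_pop) * (group_mean / mu) * math.log(group_mean / mu)
    return result


def theil_within(groups):
    """Within group component: sum of (p_i / P) * (y_i / mu) * T_i."""
    total_pop = sum(len(g) for g in groups)
    mu = sum(sum(g) for g in groups) / total_pop
    return sum(
        (len(g) / total_pop) * ((sum(g) / len(g)) / mu) * theil_t(g) for g in groups
    )


# Hypothetical individual salaries, two employees per group (an assumption).
groups = [
    [90_000, 100_000],
    [70_000, 80_000],
    [60_000, 60_000],
    [40_000, 50_000],
    [20_000, 30_000],
]
everyone = [y for g in groups for y in g]

t_total = theil_t(everyone)
t_between = theil_between(groups)
t_within = theil_within(groups)
print(t_total, t_between, t_within)
```

Since every within group term is non-negative, the between group component computed from group averages alone can never exceed the population statistic, which is why it serves as a lower bound.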

Theil’s T Statistic – Example 2
Now assume the more realistic scenario where a researcher has average salary information across groups.
Example data (number of employees in group, group average salary): 2 employees in the group averaging $95,000; the remaining groups average $75,000, $60,000, $45,000, and $25,000.
For this data, $T'_g$ =
The top two salary groups contribute positive elements. The middle salary group contributes nothing to the between group Theil’s T Statistic because the group average salary is equal to the population average. The bottom two salary groups contribute negative elements.

Group analysis with Theil’s T Statistic
As Example 2 hints, Theil’s T Statistic is a powerful tool for analyzing inequality within and between various groupings, because:
The between group elements capture each group’s contribution to overall inequality.
The sum of the between group elements is a reasonable lower bound for Theil’s T Statistic in the population.
Sub-groups can be broken down within the context of larger groups.

Theil’s T Statistic
Pros: can effectively use group data; allows the researcher to parse inequality into within group and between group components.
Cons: no intuitive motivating picture; cannot directly compare populations with different sizes or group structures; comparatively mathematically complex.

Next Steps
Those interested in a more rigorous examination of inequality metrics with several numerical examples should proceed to The Theoretical Basics of Popular Inequality Measures.
Otherwise, proceed to A Nearly Painless Guide to Computing Theil’s T Statistic, which emphasizes constructing research questions and using a spreadsheet to conduct analysis.