Central Tendency and Variability

Slides:



Advertisements
Similar presentations
Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Advertisements

Class Session #2 Numerically Summarizing Data
Introduction to Summary Statistics
Statistics for the Social Sciences
Descriptive (Univariate) Statistics Percentages (frequencies) Ratios and Rates Measures of Central Tendency Measures of Variability Descriptive statistics.
Calculating & Reporting Healthcare Statistics
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
PSY 307 – Statistics for the Behavioral Sciences
Descriptive Statistics
Slides by JOHN LOUCKS St. Edward’s University.
Introduction to Educational Statistics
Data Transformation Data conversion Changing the original form of the data to a new format More appropriate data analysis New.
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
Data observation and Descriptive Statistics
Descriptive Statistics: Overview Measures of Center Mode Median Mean * Measures of Symmetry Skewness Measures of Spread Range Inter-quartile Range Variance.
Exploring Marketing Research William G. Zikmund
Measures of Central Tendency
Today: Central Tendency & Dispersion
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Describing Data: Numerical
Summarizing Scores With Measures of Central Tendency
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Chapter 3 – Descriptive Statistics
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Statistics 1 Measures of central tendency and measures of spread.
URBP 204A QUANTITATIVE METHODS I Statistical Analysis Lecture I Gregory Newmark San Jose State University (This lecture accords with Chapters 2 & 3 of.
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Variability.
1 1 Slide Descriptive Statistics: Numerical Measures Location and Variability Chapter 3 BA 201.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Measures of Dispersion & The Standard Normal Distribution 9/12/06.
Skewness & Kurtosis: Reference
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 Univariate Descriptive Statistics Heibatollah Baghi, and Mastee Badii George Mason University.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Determination of Sample Size: A Review of Statistical Theory
Basic Measurement and Statistics in Testing. Outline Central Tendency and Dispersion Standardized Scores Error and Standard Error of Measurement (Sm)
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Measures of Central Tendency: The Mean, Median, and Mode
What does Statistics Mean? Descriptive statistics –Number of people –Trends in employment –Data Inferential statistics –Make an inference about a population.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Central Tendency & Dispersion
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Central Tendency. Variables have distributions A variable is something that changes or has different values (e.g., anger). A distribution is a collection.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
LIS 570 Summarising and presenting data - Univariate analysis.
Describing Samples Based on Chapter 3 of Gotelli & Ellison (2004) and Chapter 4 of D. Heath (1995). An Introduction to Experimental Design and Statistics.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 5. Measuring Dispersion or Spread in a Distribution of Scores.
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
MATH 1107 Elementary Statistics Lecture 3 Describing and Exploring Data – Central Tendency, Variation and Relative Standing.
CHAPTER 2: Basic Summary Statistics
Measures of Central Tendency (MCT) 1. Describe how MCT describe data 2. Explain mean, median & mode 3. Explain sample means 4. Explain “deviations around.
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Univariate Statistics
Central Tendency and Variability
Descriptive Statistics
Characteristics of the Mean
Summary descriptive statistics: means and standard deviations:
Summary descriptive statistics: means and standard deviations:
Lecture 4 Psyc 300A.
Presentation transcript:

Central Tendency and Variability The two most essential features of a distribution

Questions Define Mean Median Mode What is the effect of distribution shape on measures of central tendency? When might we prefer one measure of central tendency to another?

Questions (2) Define Range Average Deviation Variance Standard Deviation When might we prefer one measure of variability to another? What is a z score? What is the point of Tchebycheff’s inequality?

Variables have distributions A variable is something that changes or has different values (e.g., anger). A distribution is a collection of measures, usually across people. Distributions of numbers can be summarized with numbers (called statistics or parameters).

Central Tendency refers to the Middle of the Distribution

Variability is about the Spread

1. Central Tendency: Mode, Median, & Mean The mode – the most frequently occurring score. Midpoint of most populous class interval. Can have bimodal and multimodal distributions.

Median Score that separates top 50% from bottom 50% Even number of scores, median is half way between two middle scores. 1 2 3 4 | 5 6 7 8 – Median is 4.5 Odd number of scores, median is the middle number 1 2 3 4 5 6 7 – Median is 4

Mean Sum of scores divided by the number of people. Population mean is (mu) and sample mean is (X-bar). We calculate the sample mean by: We calculate the population mean by:

Deviation from the mean x = X – . Deviations sum to zero. Deviation score – deviation from the mean Raw scores Deviation scores 9 8 10 7 11 -1 1 -2 2

Comparison of mean, median and mode Good for nominal variables Good if you need to know most frequent observation Quick and easy Median Good for “bad” distributions Good for distributions with arbitrary ceiling or floor

Comparison of mean, median & mode Used for inference as well as description; best estimator of the parameter Based on all data in the distribution Generally preferred except for “bad” distribution. Most commonly used statistic for central tendency.

Best Guess interpretations Mean – average of signed error will be zero. Mode – will be absolutely right with greatest frequency Median – smallest absolute error

Expectation Discrete and continuous variables Mean is expected value either way Discrete: Continuous: (The integral looks bad but just means take the average)

Influence of Distribution Shape

Review What is central tendency? Mode Median Mean

2. Variability aka Dispersion 4 Statistics: Range, Average Deviation, Variance, & Standard Deviation Range = high score minus low score. 12 14 14 16 16 18 20 – range=20-12=8 Average Deviation – mean of absolute deviations from the median: Note difference between this definition & undergrad text- deviation from Median vs. Mean

Variance Population Variance: Where means population variance, means population mean, and the other terms have their usual meaning. The variance is equal to the average squared deviation from the mean. To compute, take each score and subtract the mean. Square the result. Find the average over scores. Ta da! The variance.

Computing the Variance 5 15 -10 100 10 -5 25 20 Total: 75 250 Mean: Variance Is  50

Standard Deviation Variance is average squared deviation from the mean. To return to original, unsquared units, we just take the square root of the variance. This is the standard deviation. Population formula:

Standard Deviation Sometimes called the root-mean-square deviation from the mean. This name says how to compute it from the inside out. Find the deviation (difference between the score and the mean). Find the deviations squared. Find their mean. Take the square root.

Computing the Standard Deviation 5 15 -10 100 10 -5 25 20 Total: 75 250 Mean: Variance Is  50 Sqrt SD

Example: Age Distribution

Review Range Average deviation Variance Standard Deviation

Standard or z score A z score indicates distance from the mean in standard deviation units. Formula: Converting to standard or z scores does not change the shape of the distribution. Z-scores are not normalized.

Tchebycheff’s Inequality (1) General form Suppose we know mean height in inches is 66 and SD is 4 inches. We assume nothing about the shape of the distribution of height. What is the probability of finding people taller than 74 inches? (Note that b is a deviation from the mean; in this case 74-66=8.). Also 74 inches is 2 SDs above the mean; therefore, z = 2. [If we assume height is normally distributed, p is much smaller. But we will get to that later.]

Tchebycheff (2) Z-score form Probability of z score from any distribution being more than k SDs from mean is at most 1/k2. Z-scores from the worst distributions are rarely more than 5 or less than -5. For symmetric, unimodal distributions, |z| is rarely more than 3. For the problem in the previous slide:

Review Z-score in words Z-score in symbols Meaning of Tchebycheff’s theorem

Median House Price Data Find data Show Univariate Show plots