Summary descriptive statistics: means and standard deviations:

Slides:



Advertisements
Similar presentations
Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Advertisements

Quantitative Methods in HPELS 440:210
Brought to you by Tutorial Support Services The Math Center.
Measures of Central Tendency. Central Tendency “Values that describe the middle, or central, characteristics of a set of data” Terms used to describe.
QUANTITATIVE DATA ANALYSIS
Calculating & Reporting Healthcare Statistics
Descriptive Statistics Statistical Notation Measures of Central Tendency Measures of Variability Estimating Population Values.
Variability Measures of spread of scores range: highest - lowest standard deviation: average difference from mean variance: average squared difference.
Descriptive Statistics
Biostatistics Unit 2 Descriptive Biostatistics 1.
Introduction to Educational Statistics
Central Tendency and Variability
Central Tendency and Variability Chapter 4. Central Tendency >Mean: arithmetic average Add up all scores, divide by number of scores >Median: middle score.
The use of statistics in psychology. statistics Essential Occasionally misleading.
Measures of Central Tendency
Week 7: Means, SDs & z-scores problem sheet (answers)
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
EPE/EDP 557 Key Concepts / Terms –Empirical vs. Normative Questions Empirical Questions Normative Questions –Statistics Descriptive Statistics Inferential.
Statistics 1 Measures of central tendency and measures of spread.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability usually accompanies.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Descriptive Statistics
Psychology’s Statistics. Statistics Are a means to make data more meaningful Provide a method of organizing information so that it can be understood.
Copyright © 2014 by Nelson Education Limited. 3-1 Chapter 3 Measures of Central Tendency and Dispersion.
DATA ANALYSIS n Measures of Central Tendency F MEAN F MODE F MEDIAN.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Basic Measurement and Statistics in Testing. Outline Central Tendency and Dispersion Standardized Scores Error and Standard Error of Measurement (Sm)
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
Agenda Descriptive Statistics Measures of Spread - Variability.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Descriptive Statistics The goal of descriptive statistics is to summarize a collection of data in a clear and understandable way.
The use of statistics in psychology. statistics Essential Occasionally misleading.
3 common measures of dispersion or variability Range Range Variance Variance Standard Deviation Standard Deviation.
Introduction to Statistics Santosh Kumar Director (iCISA)
Measures of Location INFERENTIAL STATISTICS & DESCRIPTIVE STATISTICS Statistics of location Statistics of dispersion Summarise a central pointSummarises.
Statistical Analysis Quantitative research is first and foremost a logical rather than a mathematical (i.e., statistical) operation Statistics represent.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Part II Sigma Freud and Descriptive Statistics Chapter 3 Vive La Différence: Understanding Variability.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Welcome to MM570 Applies Statistics for Psychology Unit 2 Seminar Dr. Bob Lockwood.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
Chapter Three McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved. Describing Data: Numerical Measures.
CHAPTER 2: Basic Summary Statistics
1 Research Methods in Psychology AS Descriptive Statistics.
Measures of Central Tendency (MCT) 1. Describe how MCT describe data 2. Explain mean, median & mode 3. Explain sample means 4. Explain “deviations around.
Descriptive Statistics(Summary and Variability measures)
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
Measures of Central Tendency, Variance and Percentage.
Chapter 2 The Mean, Variance, Standard Deviation, and Z Scores.
An Introduction to Statistics
Descriptive Statistics
Measures of Central Tendency
How Psychologists Ask and Answer Questions Statistics Unit 2 – pg
26134 Business Statistics Week 3 Tutorial
Central Tendency and Variability
2-Way Tables, Statistics, & Probability
Description of Data (Summary and Variability measures)
Measures of Central Tendency
Theme 4 Describing Variables Numerically
Measures of Location Statistics of location Statistics of dispersion
Summary descriptive statistics: means and standard deviations:
Lesson 12: Presentation and Analysis of Data
CHAPTER 2: Basic Summary Statistics
Descriptive Statistics
Lecture 4 Psyc 300A.
Presentation transcript:

Summary descriptive statistics: means and standard deviations: Measures of central tendency ("averages") Measures of dispersion (spread of scores)

1. The Mode: The most frequent score in a set of scores. 6, 11, 22, 22, 96, 98. Mode = 22

Advantages of the mode: (i) Simple to calculate, easy to understand Advantages of the mode: (i) Simple to calculate, easy to understand. (ii) The only average which can be used with nominal data. Disadvantages of the mode: (i) May be unrepresentative and hence misleading. e.g.: 3, 4, 4, 5, 6, 7, 8, 8, 96, 96, 96. Mode is 96 - but most of the scores are low numbers. (ii) May be more than one mode in a set of scores. e.g.: 3, 3, 3, 4, 4, 4, 6, 6, 6 has three modes!

2. The Median: When scores are arranged in order of size, the median is either (a) the middle score (if there is an odd number of scores) 4, 5 ,6 ,7, 8, 8, 96. Median = 7. or (b) the average of the middle two scores (if there is an even number of scores). 4, 5, 6, 7, 8, 8, 96, 96. Median = (7+8)/2 = 7.5.

Advantages of the median: (i) Resistant to the distorting effects of extreme high or low scores. Disadvantages of the median: (i) Ignores scores' numerical values, which is wasteful if data are interval or ratio. (ii) More susceptible to sampling fluctuations than the mean. (iii) Less mathematically useful than the mean.

Add all the scores together and divide by the total number of scores.   3. The Mean: Add all the scores together and divide by the total number of scores. e.g. (3+4+4+5+6) / 5 = 22 / 5 = 4.4

Advantages of the mean: (i) Uses information from every single score. (ii) Resistant to sampling fluctuation - i.e., varies the least from sample to sample. (Important since we normally want to extrapolate from samples to populations). Disadvantages of the mean: (i) Susceptible to distortion from extreme scores. e.g.: 4, 5, 5, 6 : mean = 5. 4, 5, 5, 106: mean = 30. (ii) Can only be used with interval or ratio data, not with ordinal or nominal data.

1. The Range: The difference between the highest and lowest scores. (i.e. range = highest - lowest). Advantages: Quick and easy to calculate, easy to understand. Disadvantages: Unduly influenced by extreme scores. 3, 4, 4, 5, 100. Range = (100-3) = 97. 3, 4, 4, 5, 5. Range = (5-3) = 2. Conveys no information about the spread of scores between the highest and lowest scores. e.g. 2, 2, 2, 2, 2, 20 and 2, 20, 20, 20, 20, 20 have exactly the same range (18) but very different distributions.

2. The Standard Deviation (S.D.): The "average difference of scores from the mean". The bigger the s.d., the more scores differ from the mean and between themselves, and the less satisfactory the mean is as a summary of the data. Advantages: Like the mean, uses information from every score. Disadvantages: Not intuitively easy to understand! Can only be used with interval or ratio data.

X How to calculate the standard deviation: For the set of scores 5, 6, 7, 9, 11: X (a) Work out the mean: = 38 / 5 = 7.6

) ( - X X = s n å 2 (b) Subtract the mean from each score: 5 - 7.6 = -2.6 6 - 7.6 = -1.6 7 - 7.6 = -0.6 9 - 7.6 = 1.4 11- 7.6 = 3.4

( ) - X X = s n å 2 (c) Square the differences just obtained: -2.6 2 = 6.76 -1.6 2 = 2.56 -0.6 2 = 0.36 1.4 2 = 1.96 3.4 2 = 11.56

( ) - X X = s n å 2 (d) Add up the squared differences: 6.76 + 2.56 + 0.36 + 1.96 + 11.56 = 23.20

( ) 2 - å X X = s n (e) Divide this by the total number of scores, to get the variance: 23.20 / 5 = 4.64

( ) 2 - å X X = s n (f) Standard deviation is the square root of the variance (we do this to get back to the original units): 4.64 = 2.15. 2.15 is our sample standard deviation.

Complications in using the mean and s.d.: We usually obtain the mean and s.d. from a sample - very rarely from the parent population. Sometimes we are content to describe our sample per se, but usually we want to extrapolate to the population from our sample. A sample mean is a good estimate of the population mean. A sample s.d. tends to underestimate the population s.d. Hence, when using the sample s.d. as a description of the sample, divide by n. When using the sample s.d. as an estimate of the population s.d., divide by n-1 (to make the s.d. larger than it would otherwise have been).

sample s.d. as description of a sample (n ("sigma n") on calculators): sample s.d. as an estimate of the population s.d. (n-1 on calculators): population s.d. if you measure every member of the population (n on calculators): sample mean as description of a sample: sample mean as an estimate of population s.d.: population mean (“mu”):

The Standard Error of the Mean:   This is the standard deviation of a set of sample means.   Shows how much variation there is within a set of sample means, and hence how likely our particular sample mean is to be in error, as an estimate of the true population mean. means of different samples actual population mean

Formula for the standard error: We normally estimate this from our obtained data:

So - find the standard deviation; then divide this by the square root of the number of scores. If the S.E. is small, our obtained sample mean is more likely to be similar to the true population mean than if the S.E. is large.   Increasing n reduces the size of the S.E.. A sample mean based on 100 scores is probably closer to the population mean than a sample mean based on 10 scores.