PSY 307 – Statistics for the Behavioral Sciences

Slides:



Advertisements
Similar presentations
Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Advertisements

Measures of Central Tendency& Variability.
BHS Methods in Behavioral Sciences I April 18, 2003 Chapter 4 (Ray) – Descriptive Statistics.
Statistics for the Social Sciences
Calculating & Reporting Healthcare Statistics
Scales of Measurement S1-1. Scales of Measurement: important for selecting stat's (later on) 1. Nominal Scale: number is really a name! 1 = male 2 = female.
Descriptive Statistics
Introduction to Educational Statistics
PSY 307 – Statistics for the Behavioral Sciences Chapter 8 – The Normal Curve, Sample vs Population, and Probability.
Data observation and Descriptive Statistics
Chapter 5: Variability and Standard (z) Scores How do we quantify the variability of the scores in a sample?
Central Tendency and Variability
Chapter 3: Central Tendency
2 Textbook Shavelson, R.J. (1996). Statistical reasoning for the behavioral sciences (3 rd Ed.). Boston: Allyn & Bacon. Supplemental Material Ruiz-Primo,
Measures of Central Tendency
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Week 7: Means, SDs & z-scores problem sheet (answers)
Today: Central Tendency & Dispersion
Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately describes the center of the.
Measures of Central Tendency
Statistics for Linguistics Students Michaelmas 2004 Week 1 Bettina Braun.
Measurement Tools for Science Observation Hypothesis generation Hypothesis testing.
BIOSTATISTICS II. RECAP ROLE OF BIOSATTISTICS IN PUBLIC HEALTH SOURCES AND FUNCTIONS OF VITAL STATISTICS RATES/ RATIOS/PROPORTIONS TYPES OF DATA CATEGORICAL.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
What is statistics? STATISTICS BOOT CAMP Study of the collection, organization, analysis, and interpretation of data Help us see what the unaided eye misses.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
1.3 Psychology Statistics AP Psychology Mr. Loomis.
Methods for Describing Sets of Data
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
PPA 501 – Analytical Methods in Administration Lecture 5a - Counting and Charting Responses.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
NOTES The Normal Distribution. In earlier courses, you have explored data in the following ways: By plotting data (histogram, stemplot, bar graph, etc.)
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Chapter 2 Characterizing Your Data Set Allan Edwards: “Before you analyze your data, graph your data.
M07-Numerical Summaries 1 1  Department of ISM, University of Alabama, Lesson Objectives  Learn when each measure of a “typical value” is appropriate.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Descriptive Statistics
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Skewness & Kurtosis: Reference
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
STATISTICS. What is the difference between descriptive and inferential statistics? Descriptive Statistics: Describe data Help us organize bits of data.
Central Tendency & Dispersion
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Summary Statistics: Measures of Location and Dispersion.
IE(DS)1 Descriptive Statistics Data - Quantitative observation of Behavior What do numbers mean? If we call one thing 1 and another thing 2 what do we.
Statistical Analysis of Data. What is a Statistic???? Population Sample Parameter: value that describes a population Statistic: a value that describes.
LIS 570 Summarising and presenting data - Univariate analysis.
Introduction to statistics I Sophia King Rm. P24 HWB
Today: Standard Deviations & Z-Scores Any questions from last time?
Describing Distributions Statistics for the Social Sciences Psychology 340 Spring 2010.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 5. Measuring Dispersion or Spread in a Distribution of Scores.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 2 The Mean, Variance, Standard.
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
Chapter 2 Describing and Presenting a Distribution of Scores.
Descriptive Statistics(Summary and Variability measures)
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Central Tendency and Variability
Central Tendency.
Dispersion How values arrange themselves around the mean
Summary (Week 1) Categorical vs. Quantitative Variables
Summary (Week 1) Categorical vs. Quantitative Variables
Presentation transcript:

PSY 307 – Statistics for the Behavioral Sciences Chapter 3-5 – Mean, Variance, Standard Deviation and Z-scores

Measures of Central Tendency (Representative Values) Quantitative data: Mode – the most frequently occurring observation Median – the middle value in the data Mean – average Qualitative data: Mode – can always be used Median – can sometimes be used Mean – can never be used

Mode The value of the most frequently occurring observation. In a frequency distribution, look for the highest frequency. In a graph, look for the peaks or highest bar in a histogram. Distributions with two peaks are bimodal (have two modes). Even if the peaks are not exactly the same height.

Median The middle value when observations are ordered from least to most, or vice versa. Half the numbers are higher and half are lower. When there is an even number of observations, the median is the average of the two middle values.

Mean The most commonly used and most useful average. Mean = sum of all observations number of all observations = X n Observations can be added in any order.

Notation Sample vs population Individual value = x (lower case) Population notation = Greek letters Individual value = x (lower case) Sample mean = x or M Population mean = m Summation sign = Sample size = n Population size = N

Mean as Balance Point The sum of the deviations from the mean always equals zero. The mean is the single point of equilibrium (balance) in a data set. The mean is affected by all values in the data set. If you change a single value, the mean changes. Demo

The Most Descriptive Average When a distribution is not skewed (lopsided), the mean, median & mode are similar. When a distribution is skewed, the mean is closer to the extreme values, mode is farthest. Report both the mean and median for a skewed distribution. The mean is the preferred average.

Ranked Data Mean and modal ranks are not informative. The mean always equals the median (middle) rank, so use the median. The mode occurs when there is a tie in the data, but doesn’t mean much. Find the median by finding the middle rank (or the average of the two middle ranks).

Fedex Cup Rankings for Golfers Player Rank Points Walker 1 1650 Spieth 2 1409 Holmes 3 1233 Reed 4 1126 Watson 5 1088 Johnson 6 1005 Hoffman 7 948 Median = 1126

Fedex Cup Rankings for Golfers Player Rank Points Walker 1 1650 Spieth 2 1409 Holmes 3 1233 Reed 4 1126 Watson 5 1088 Johnson 6 1005 Hoffman 7 948 Streb 8 903 Median = (1126 + 1088)/2 = 1107

Qualitative Data Averages The mode can always be used. The median can only be used when classes can be ordered. The median is the category that contains 50% in its cumulative frequency. Never report a median with unordered classes. Never report the mean.

Psychology Majors Year N Cumulative Freq. Freshmen 205 .30 or 30% Sophomore 198 .59 or 59% Junior 155 .82 or 82% Senior 123 1.00 or 100% Total 681 The median is the category that contains the middle observation. The middle is at 50%. The category containing that observation is Sophomore, so Sophomore is the median.

Measures of Variability Range – difference between highest and lowest value. Variance – the mean of the squared deviations (differences) from the mean. Standard Deviation – square root of the variance. The average amount that observations deviate from the mean.

Interquartile Range (IQR) The range for the middle 50% of observations. Distance between the 25th and 75th percentiles. Remove the highest and lowest 25% of scores then calculate the range for the remaining values. Used because it is insensitive to extreme observations.

Using IQR (from Holcomb) In Rio, what percentage had been injecting from 4.5 to 14 years? Median Year Injecting = 10 IQR is 4.5-14 (from text). 100% Median = 50% IQR 4.5 14 25%

More Notation Sample variance = S2 Population variance = s2 Sample standard deviation = S or SD Population standard deviation = s Interquartile range = IQR

What Does Variance Describe? Variance and standard deviation describe the amount that actual observations differ from the mean. How spread out are the scores? The range doesn’t tell us how scores are distributed between the high and low values. Because the mean is the balance point, the mean of the unsquared deviations is always zero.

An example using dogs. First calculate the height of the dogs. Mean =   600 + 470 + 170 + 430 + 300  =   1970 = 394 mm 5 5   Source of example using dogs: http://www.mathsisfun.com/standard-deviation.html

Next, compare their heights to the mean. The green line shows the mean. Subtract the mean from each dog’s height. Because some dogs are taller and others are shorter, some of the differences will be positive and some negative numbers. These differences will cancel each other out because the mean is the balance point in the distribution of dog heights.

Square the differences and take the mean. σ2 =   2062 + 762 + (-224)2 + 362 + (-94)2   =   108,520  = 21,704 5 5

Take the square root to return to the original units of measure. σ = √21,704 = 147 Which dogs are within one standard deviation of the mean? Rottweillers are unusally tall dogs. And Dachsunds are a bit short.

Standard Deviation The variance is expressed in squared units (e.g., squared lbs) which are hard to interpret. Taking the square root of the variance expresses the average deviation in the original units. The square root of the variance gives a slightly different result than taking the average of the absolute deviations.

Interpreting the SD For most distributions, the majority of observations fall within one standard deviation of the mean. A very small minority fall outside two standard deviations. This generalization is true no matter what the shape of the distribution. It works for skewed distributions.

A Measure of Distance The mean shows the position of the balance point within a distribution. The standard deviation is a unit of distance that is useful for comparing scores. Standard deviations cannot have a negative value. They can measure in both positive and negative directions from the mean.

Definition Formula Definition formula – easier to understand conceptually. The numerator is also called the Sum of the Squares (squared differences), abbreviated SS

Computation Formula Computation formula – easier to use, especially with large data sets. The computational and definition formulas produce the same result.

Population vs Sample The formulas are different depending on whether a sample or a population is being measured. Use n-1 in the denominator when using s or s2 to estimate s or s2 for a population. Using n-1 more accurately estimates the variability in a population.

Formulas Variance for sample: Variance for population:

Z-Score Indicates how many SDs an observation is above or below the mean of the normal distribution. Formula for converting any score to a z-score: Z = X – m s = mean s = std. deviation

Properties of z-Scores A z-score expresses a specific value in terms of the standard deviation of the distribution it is drawn from. The z-score no longer has units of measure (lbs, inches). Z-scores can be negative or positive, indicating whether the score is above or below the mean.

Standard Normal Curve By definition has a mean of 0 and an SD of 1. Standard normal table gives proportions for z-scores using the standard normal curve. Proportions on either side of the mean equal .50 (50%) and both sides add up to 1.00 (100%).

Other Distributions Any distribution can be converted to z-scores, giving it a mean of 0 and a standard deviation of 1. The distribution keeps its original shape, even though the scores are now z-scores. A skewed distribution stays skewed. The standard normal table cannot be used to find its proportions.

Transformed Standard Scores Z-scores are useful for converting between different types of standard scores: IQ test scores, T scores, GRE scores The z-scores are transformed into the standard scores corresponding to standard deviations (z). New score = mean + (z)(std dev)