Last class we discussed:

Slides:



Advertisements
Similar presentations
Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Advertisements

Richard M. Jacobs, OSA, Ph.D.
A.k.a. “bell curve”.  If a characteristic is normally distributed in a population, the distribution of scores measuring that characteristic will form.
Measures of Dispersion
Statistics. Review of Statistics Levels of Measurement Descriptive and Inferential Statistics.
Review of Basics. REVIEW OF BASICS PART I Measurement Descriptive Statistics Frequency Distributions.
Review of Basics. REVIEW OF BASICS PART I Measurement Descriptive Statistics Frequency Distributions.
BHS Methods in Behavioral Sciences I April 18, 2003 Chapter 4 (Ray) – Descriptive Statistics.
Types of Measurement Continuous –Underlying continuum Index of amount Intermediate numbers make sense Discreet –Is or is not –Usually use only whole numbers.
QUANTITATIVE DATA ANALYSIS
Calculating & Reporting Healthcare Statistics
Why do we do statistics? To Make Inferences from a Small number of cases to a Large number of cases This means that we have to collect data.
Descriptive Statistics
Methods and Measurement in Psychology. Statistics THE DESCRIPTION, ORGANIZATION AND INTERPRATATION OF DATA.
Analysis of Research Data
Introduction to Educational Statistics
Data observation and Descriptive Statistics
Levels of Measurement Nominal measurement Involves assigning numbers to classify characteristics into categories Ordinal measurement Involves sorting objects.
Today: Central Tendency & Dispersion
Inferential Statistics
+ Quantitative Analysis: Supporting Concepts EDTEC 690 – Methods of Inquiry Minjuan Wang (based on previous slides)
@ 2012 Wadsworth, Cengage Learning Chapter 5 Description of Behavior Through Numerical 2012 Wadsworth, Cengage Learning.
CHAPTER 2 Percentages, Graphs & Central Tendency.
Objective To understand measures of central tendency and use them to analyze data.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
With Statistics Workshop with Statistics Workshop FunFunFunFun.
Chapter 3 Statistical Concepts.
EPE/EDP 557 Key Concepts / Terms –Empirical vs. Normative Questions Empirical Questions Normative Questions –Statistics Descriptive Statistics Inferential.
Psychometrics.
Statistics and Research methods Wiskunde voor HMI Betsy van Dijk.
Statistics. Question Tell whether the following statement is true or false: Nominal measurement is the ranking of objects based on their relative standing.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Overview Summarizing Data – Central Tendency - revisited Summarizing Data – Central Tendency - revisited –Mean, Median, Mode Deviation scores Deviation.
Smith/Davis (c) 2005 Prentice Hall Chapter Four Basic Statistical Concepts, Frequency Tables, Graphs, Frequency Distributions, and Measures of Central.
Chapter Eleven A Primer for Descriptive Statistics.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Descriptive Statistics
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Warsaw Summer School 2014, OSU Study Abroad Program Variability Standardized Distribution.
Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Descriptive Statistics
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Measures of Dispersion
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
Central Tendency & Dispersion
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Data Analysis.
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
IE(DS)1 Descriptive Statistics Data - Quantitative observation of Behavior What do numbers mean? If we call one thing 1 and another thing 2 what do we.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
LIS 570 Summarising and presenting data - Univariate analysis.
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
Educational Research: Data analysis and interpretation – 1 Descriptive statistics EDU 8603 Educational Research Richard M. Jacobs, OSA, Ph.D.
Measurements Statistics WEEK 6. Lesson Objectives Review Descriptive / Survey Level of measurements Descriptive Statistics.
Descriptive Statistics(Summary and Variability measures)
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Welcome to… The Exciting World of Descriptive Statistics in Educational Assessment!
THE ROLE OF STATISTICS IN RESEARCH. Reading APPENDIX A: Statistics pp
STATS DAY First a few review questions. Which of the following correlation coefficients would a statistician know, at first glance, is a mistake? A. 0.0.
Measurements Statistics
Descriptive measures Capture the main 4 basic Ch.Ch. of the sample distribution: Central tendency Variability (variance) Skewness kurtosis.
Science of Psychology AP Psychology
STATS DAY First a few review questions.
Descriptive Statistics
Basic Statistical Terms
Advanced Algebra Unit 1 Vocabulary
Presentation transcript:

Last class we discussed: Ways of knowing reality (science is chief means for psychology) Classification of Research Studies (1) Design (experiment / correlational / descriptive) (2) Setting (field / lab) (3) Data-collection method (self-report / observation)

By the end of today’s class, you will be able to: Define statistics and its two branches Understand the four scales of measurement Deal with measures of central tendancy Calculate standard deviation Discuss the properties of a distribution curve

Statistics is the branch of mathematics that deals with the collection, organization, and analysis of numerical data. Even if your data doesn’t begin as a set of numbers, you can quantify it (turn it into numbers) and use statistical analysis.

What are statistics? Numerical representations of large amounts of data Two Branches Descriptive – tell us about the numbers Inferential – tell us what the numbers mean The work of the statistician is no longer confined to gathering and tabulating data (descriptive statistics), but is chiefly a process of interpreting the information (inferential statistics).

How do we obtain statistics? Through Quantification (#) All behaviour needs to be quantified to be analysed statistically How do we quantify behaviour? Though scales of measurement

What are Scales of Measurement? Measurement is the assignment of numbers to objects or events in a systematic fashion. Four levels of measurement are commonly distinguished: Nominal Ordinal Interval Ratio “NOIR”

Nominal Based on Categories No quantitative information is conveyed and no ordering of the items is implied. Gender Religion Ethnicity

Nominal scale Classifies data according to a category only. Measurement: Frequency distributions. How often do certain items or responses occur?

Ordinal Ordered in the sense that higher numbers represent higher values. However, the intervals between the numbers are not necessarily equal. e.g., “Strongly Disagree, Disagree, Neither Agree nor Disagree, Agree, Strongly Agree”

Ordinal scale Classifies data according to rank. The difference between mild and average hotness may not represent the same difference as the difference between a rating of hot and very hot. There is no "true" zero point for ordinal scales since the zero point is chosen arbitrarily.

Interval One unit on the scale represents the same magnitude on the trait or characteristic being measured across the whole range of the scale. For example, if anxiety were measured on an interval scale, then a difference between a score of 10 and a score of 11 would represent the same difference in anxiety as would a difference between a score of 50 and a score of 51.

Interval scale Does not have true zero point (it is arbitrary). On the anxiety scale: score of 30 NOT twice as anxious as score of 15. We know they are MORE anxious, but that’s about where it ends. Similarly, a score of zero does not mean that they have zero anxiety. True interval measurement is rare to nonexistent in the behavioral sciences. Most measurement scales in psychology are ordinal.

Ratio All of the previous + absolute zero e.g. weight and height This scale has an absolute zero. Temperature of 300 Kelvin is twice as high as a temperature of 150 Kelvin. Time can have an absolute zero (no time).

Ratio scale Similar to interval scales, but has a true zero point. Racer C’s speed (below) was twice as fast as Racer A’s speed.

Levels of Measurement Mutually Exclusive Rank Order Equal Intervals Absolute Zero Nominal  Ordinal Interval Ratio Note. Mutually Exclusive: Every point on every measurement scale is separate from every other point. You can’t be red and blue at the same time.

Concept Review Which rating scale is based on categories? Nominal Which rating scale has an absolute zero? Ratio Most psychological scales, such as the CES-D, which measures depression, use which rating scale? Ordinal How do we interpret data? Inferential Statistics

2 main branches of these statistics: Inferential / Descriptive

Inferential Statistics Draw a sample from larger population The sample is smaller than the population and should be representative of the population. Use inferential statistics to draw inferences about the population based on the sample.

Statistics prescribes ways to take measurements of smaller groups of people and infer the findings true for all people matching the defined criteria, such as every Canadian adult between ages 25 and 34. The smaller group is our sample, and we infer from the sample to the population.

Descriptive Statistics Aims to describe sets of data values Ex: a group of individual test scores. Each data value is a single measurement of some attribute being observed. The term data set refers to all data values considered in a set of statistical calculations. Descriptive statistics summarize sets of information.

Descriptive Stats cont’d Examples Central Tendency (mean, median, mode) Variability Shape of Distributions Relations (Correlation) We’ll discuss each of these in turn

What are measures of central tendency? Each is a score that represents the typical performance of the sample Mode Mean Median

Mode Answers the question, "Which number or object appears the most often in our data set?“ Can have more than one mode. Example: Array of test scores 2,3,4,5,6,6,7,8,9,9  Mode is 6 & 9. We say it’s bimodal. The mean is the sum of the individual scores divided by the total number of scores.

Looking ahead: median and mean are a little more slippery (but not much more). It’s from the median and mean that we get the term “central tendency,” measures of the "center" (average) of the scores in a given data set.

Mean sum of the individual scores divided by the total number of scores. i.e. the average

Median …is the score that’s right in the middle of your data set Median …is the score that’s right in the middle of your data set. 1/2 of the data set’s values less than or equal to the median; 1/2 of the numbers in the data set will have values equal to or greater than the median. To find the median of a finite list of numbers: Arrange all observations from lowest value to highest value and pick the middle one. If there are an even number of observations, you can take the mean of the two middle values.

Central Tendency Example Hours watching The Bachelor per day for 10 people Subject Hours 1. 2 2. 3 3. 2 4. 7 5. 3 6. 5 7. 2 8. 2 9. 1 10. 3 Take a minute to calculate mean, median, and mode ?

Central Tendency Example Hours watching The Bachelor per day for 10 people: Subject Hours 1. 2 2. 3 3. 2 4. 7 5. 3 6. 5 7. 2 8. 2 9. 1 10. 3 Mean = 3 (add all scores and divide by 10) Median = 2.5 Mode = 2

Something important to keep in mind when writing papers: A conclusion is simply the place where you got tired of thinking.

Variability

Consider Two Streets Conservative Street 45, 46, 47, 44, 43 Mean = ? Wild Alley 25, 35, 45, 55, 65 Mean = ?

How are these streets different? Consider Two Streets Conservative Street 45, 46, 47, 44, 43 Mean = 45 Median = Wild Alley 25, 35, 45, 55, 65 Mean = 45 Median = How are these streets different?

Consider Two Streets Variability Conservative Street 45, 46, 47, 44, 43 Mean = 45 Median = 45 Wild Alley 25, 35, 45, 55, 65 Mean = 45 Median = 45 The means are the same. The medians are the same. How are these streets different? Variability

What is Variability? Extent of dispersion around the mean. Are the scores all pretty close together or more spread out?

Variance An index of variability (i.e. one way to express variability).

Variance Formula - 1 (Take the difference between each score and the mean, squaring each of these deviations, and then calculating the mean of the squared deviations. Finally, you divide by the number of scores minus one.)

The problem with variance Has an ATTITUDE It is an index in the wrong units! Everything was SQUARED to get the variance. We need something else…

H m m m (ideas?)

Standard Deviation Square root of the squared deviations about the mean. i.e. square root of variance (Wasn’t that disturbingly simple?)

Standard Deviation Formula Definitional Formula The standard deviation is equal to the square root of the sum of the squared deviations divided by the total number of scores - 1

Practice Run: Calculate SD and Variance Hours watching tv per day for 10 people Subject Hours (Hours – mean) squared 1. 2 2. 3 3. 2 4. 7 5. 3 6. 5 7. 2 8. 2 9. 1 10. 3 Calculate SD and variance for the data.

First step: Next: Then: raw score minus mean. then square the deviations. Sum these together. Next: Divide by n-1 to get variance. Then: Take the square root of this for SD.

Calculate SD and Variance Hours watching tv per day for 10 people Subject Hours (Hours – mean) squared 1. 2 2 – 3 1 2. 3 3 – 3 0 3. 2 2 – 3 1 4. 7 7 - 3 16 5. 3 3 - 3 0 6. 5 5 - 3 4 7. 2 2 - 3 1 8. 2 2 - 3 1 9. 1 1 - 3 4 10. 3 3 - 3 0

Sum = 28 N – 1 = 9 28/9 = 3.11 Square Root of 3.11 = 1.76

When you collect data… Assumption – normality. We assume that many things in life take the form of a “normal curve” when graphed, with most scores occurring close to the mean, and some scores more spread out. Assumption sometimes violated Example: Give a middle school-level math test to 6-year-olds and to 18-year-olds. Result will not be a normal curve; rather, you will see two clumps: high scores (18-yr-olds) and low scores (6-yr-olds).

Properties of a distribution curve Skewness Happens when the mean, median, and mode aren’t the same. symmetry of a curve determined by the tail in the curve

Positively skewed Tail on the right side of the distribution Most scores cluster on the left side of the distribution If this was a math test, most people had a low score and just a few performed very well.

Negatively skewed Tail on the left side of the distribution Most scores cluster on the right side of the distribution How did people perform on the math test this time?

Can’t use certain formula (t-tests, correlation) Why is Skew a Problem? Can’t use certain formula (t-tests, correlation) Assumption of normality is important Mathematically, we can manipulate the data to try to normalize distribution Or, use different formula

Kurtosis Peakedness of the curve compared to the normal curve. Are the scores grouped more toward the middle (the mean) or more toward the tails of the distribution

Leptokurtic Pointed shape Kurtosis >3 Most scores cluster at the middle of the distribution Simultaneously “peaked” centre and “fat” tails

Platykurtic Flat shape Kurtosis <3 Many scores spread toward the tails of the distribution Simultaneously less peaked and thinner tails

Mesokurtic Normal curve distribution Kurtosis = 0 (no skew)

Normal Distribution Bell shape Mean, median, and mode are equal Located at centre of distribution

Area of the Normal Curve 34% is one SD above or below the mean 68% within one SD above and below 47.5% of scores is between 0 and 2 SD 95% within two SD above and below