Chapter 11 Univariate Data Analysis; Descriptive Statistics


Descriptive statistics are summary measurements of a single variable.
I. Averages, or measures of central tendency: each describes a dataset with a single typical value.
A. Three kinds: mean, median, mode.
1. Mean: the most common. Sum all the values in a group, then divide by the total number of values in that group (hint: start by listing the values in columns under headings).

Weighted mean: multiply each value by its frequency, sum the products, and divide by the total frequency.
2. Median: the midpoint value. The mean is very sensitive to outlier scores that skew the distribution; the median is not. Instructions: order all the values and find the middle-most score; that is the median. With an even number of cases, find the two middle-most values, add them, and divide by two.
Percentiles: the 50th percentile is the median. A score at the 75th percentile is at or above 75% of the other scores.
3. Mode: the most frequent value.
B. When to use what.
1. Three kinds of data
a. Nominal: categorical data (race, region).
b. Ordinal: values are ranked, but not necessarily equal in distance (7 values indicating GOP support).
c. Interval: values are equal in distance (income).
2. Use the mean for interval data (and sometimes ordinal). Use the mode for nominal data (and sometimes ordinal). Use the median for interval data if you think there are outliers.
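The procedures above can be sketched in a few lines of Python. This is a minimal illustration, not part of the original slides: the scores, the frequency table, and the helper name `weighted_mean` are all hypothetical. The data include one outlier (40) to show how it pulls the mean but leaves the median alone.

```python
from statistics import mean, median, multimode


def weighted_mean(freq):
    """Multiply each value by its frequency, sum, divide by total frequency."""
    total = sum(value * count for value, count in freq.items())
    return total / sum(freq.values())


# Hypothetical scores; 40 is a deliberate outlier.
scores = [2, 3, 3, 4, 5, 6, 40]

central = {
    "mean": mean(scores),        # pulled upward by the outlier
    "median": median(scores),    # middle-most value of the ordered scores
    "mode": multimode(scores),   # list of the most frequent value(s)
}

# Hypothetical frequency table: value -> number of times it occurs.
ratings = {1: 2, 2: 5, 3: 3}
wm = weighted_mean(ratings)

print(central)
print("weighted mean:", wm)
```

With these numbers the mean (9) sits well above the median (4), which is the situation in which the outline recommends the median.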

II. Variability: how much scores differ from one another.
Which set of scores has greater variability?
Set 1: 8,9,5,2,1,3,1,9
Set 2: 3,4,3,5,4,6,2,3
The means are Set 1: 4.75 and Set 2: 3.75, which tells us nothing about variability. More precisely, variability is how far the scores are from the mean.
III. Computing the Range
Subtract the lowest score from the highest (r = h - l).
What is the range of these scores? 98,86,77,56,48
Answer: 50 (98 - 48 = 50)
IV. Computing the Standard Deviation
The standard deviation (s) is the average amount of variability in a set of scores (the average distance from the mean).
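As a quick check of the two sets and the range exercise, here is a small sketch. The function name `score_range` is an assumption made for illustration.

```python
from statistics import mean

set1 = [8, 9, 5, 2, 1, 3, 1, 9]
set2 = [3, 4, 3, 5, 4, 6, 2, 3]


def score_range(scores):
    """Range: highest score minus lowest score (r = h - l)."""
    return max(scores) - min(scores)


for name, scores in (("Set 1", set1), ("Set 2", set2)):
    print(name, "mean:", mean(scores), "range:", score_range(scores))

exercise = score_range([98, 86, 77, 56, 48])
print("Exercise range:", exercise)  # 98 - 48 = 50
```

The means are 4.75 and 3.75, while the ranges are 8 and 4, so Set 1 is the more variable of the two.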

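A. Formula: $s = \sqrt{\frac{\sum (X - \bar{X})^2}{n - 1}}$, where $X$ is each score, $\bar{X}$ is the mean, and $n$ is the number of scores.
Compute s for the following: 5,8,5,4,6,7,8,8,3,6
The mean is 6 and the squared deviations sum to 28, so $s = \sqrt{28/9} \approx 1.76$. An s of 1.76 tells us that each score differs from the mean by an average of 1.76 points.
B. Purpose: to compare scores between different distributions, even when the means and standard deviations are different (e.g., men and women). The larger the s, the greater the variability.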
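The worked example can be reproduced step by step. This sketch assumes the sample formula with n - 1 in the denominator, since that is what yields the 1.76 quoted on the slide; dividing by n instead would give about 1.67. The function name `sample_sd` is hypothetical.

```python
from math import sqrt


def sample_sd(scores):
    """Sample standard deviation, assuming an n - 1 denominator."""
    n = len(scores)
    m = sum(scores) / n                              # step 1: the mean
    sum_squares = sum((x - m) ** 2 for x in scores)  # step 2: squared deviations, summed
    return sqrt(sum_squares / (n - 1))               # step 3: divide by n - 1, take the root


s = sample_sd([5, 8, 5, 4, 6, 7, 8, 8, 3, 6])
print(round(s, 2))  # 1.76
```

Python's standard library function `statistics.stdev` uses the same n - 1 convention and can serve as a cross-check.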

V. Graphing and Tables. Why? They describe data visually and more clearly.
Frequency Distribution (Table 11-4)
A. Class Interval Column: divides the scores into categories (0-4, 5-9, etc.). Intervals usually span 2, 5, 10, or 25 data points. Main thing: be consistent!
B. Frequency Column: the number of scores within that range or category.
VI. Graphs
A. Histogram: shows the distribution of scores by class interval. Different distributions can be compared on the same histogram. Shows:
1. Variability
2. Skewness: if the mean is greater than the median, positive skewness. If the median is greater than the mean, negative skewness.
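The class-interval and frequency columns of a frequency distribution can be built as follows. This is a rough sketch that assumes non-negative integer scores and equal-width intervals starting at a multiple of the width. The scores and the function name `frequency_distribution` are hypothetical, and this is not Table 11-4 from the text.

```python
from collections import Counter


def frequency_distribution(scores, width):
    """Return (class interval, frequency) rows for equal-width intervals."""
    # Map every score to the lower bound of its interval, then count.
    counts = Counter((score // width) * width for score in scores)
    rows = []
    for lower in range(min(counts), max(counts) + width, width):
        label = f"{lower}-{lower + width - 1}"
        rows.append((label, counts.get(lower, 0)))  # keep empty intervals as 0
    return rows


# Hypothetical scores, grouped into intervals of width 5.
scores = [1, 3, 4, 5, 7, 8, 9, 12, 13, 14, 14, 17]
table = frequency_distribution(scores, 5)

for label, freq in table:
    print(f"{label:>6}  {freq}")
```

Printing one row per interval gives the two columns described above; plotting the frequencies as adjacent bars would turn the same table into a histogram.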

[Figure: Central Tendency and Variability (Centre)]

[Figure: Central Tendency and Variability (Spread)]

Skewness: if the data set is symmetric, the mean equals the median.

Skewness: if the data set is skewed to the right, the mean is greater than the median.

Skewness: if the data set is skewed to the left, the mean is less than the median.
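The mean-versus-median comparison can be expressed as a small helper. This is a sketch of the rule of thumb from the slides, not a formal skewness statistic, and it can mislead on unusual distributions. The function name and the three data sets are hypothetical.

```python
from statistics import mean, median


def skew_direction(scores):
    """Rule of thumb: compare the mean with the median."""
    m, md = mean(scores), median(scores)
    if m > md:
        return "positive (skewed right)"
    if m < md:
        return "negative (skewed left)"
    return "no skew indicated"


# Hypothetical data sets.
right_tail = [1, 2, 3, 4, 20]      # mean 6, median 3
left_tail = [1, 17, 18, 19, 20]    # mean 15, median 18
balanced = [1, 2, 3, 4, 5]         # mean 3, median 3

for data in (right_tail, left_tail, balanced):
    print(data, "->", skew_direction(data))
```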

B. Column Charts: simply show the quantity of a category according to some scale. SCALE IS IMPORTANT (C-SPAN drug use story).
C. Bar Charts: same as a column chart, but with the axes reversed.
D. Line Chart: used to show trends (e.g., rise and fall in presidential popularity; line on page 317).
E. Pie Charts: great for proportions (percent of MS budget going to each budget category).

[Figure: Line Graph]

VII. The Normal Curve and Probability Theory
A. Tells us the likelihood of an outcome.
B. Tells us the degree of confidence in a finding or outcome (i.e., how sure are we that the observed outcome is due to X rather than random chance, and how likely is it that our research hypothesis is true?).
VIII. Normal Curve or Bell-Shaped Curve Properties (Fig. 11-6)
A. Mean, median, and mode are the same: the curve is NOT skewed.

B. Perfectly symmetrical about the mean (i.e., the two halves fit perfectly together).
C. Tails of the normal curve are asymptotic: they come close to the horizontal axis but never touch it.
Are curves usually normal? Yes, especially with large sets of data (more than 30). Most scores are concentrated in the center and few at the ends (height, intelligence, coin flipping).

IX. Divisions of the Normal Curve (Fig. 11-9)
A. The mean is at the center.
B. Scores along the x-axis correspond to standard deviations.
C. Sections within the bell curve represent the % of cases expected to fall therein. This is geometrically true (these are percentages of the entire normal distribution).
D. For normal distributions (most data sets), practically all scores fall between +3 and -3 sd's (99.74%). Look at the probabilities of falling in between: 34.13% x 2 = 68.26% of cases fall within 1 to -1 sd's of the mean.
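The percentages for the divisions of the normal curve can be checked numerically. The sketch below uses the error function from Python's standard library; the function name `share_within` is an assumption. The exact values differ slightly from the rounded table figures quoted above (for example, about 99.73% rather than 99.74% within 3 sd's).

```python
from math import erf, sqrt


def share_within(k):
    """Proportion of a normal distribution within k sd's of the mean."""
    return erf(k / sqrt(2))


for k in (1, 2, 3):
    print(f"within +/-{k} sd: {share_within(k) * 100:.2f}%")
```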

X. Z-scores (standard scores; i.e., the number of standard deviations from the mean)
A. Allow us to compare distributions with one another because they are standardized in units of standard deviations (scores measured on different scales cannot be compared directly; doing so is nonsensical). Different variables or groups have different means, so their raw scores cannot be compared. But z-scores from different groups of data can be compared because they are equivalent (e.g., one unit above or below the mean, respectively).

B. Formula and interpretation: $z = \frac{X - \bar{X}}{s}$, the distance of a raw score $X$ from the mean $\bar{X}$ in units of the standard deviation $s$.
XI. Comparing z-scores from different distributions (p. 158 example).
- The raw scores of 12.8 and 64.8 in our data are equal distances from their respective means (z = .4 for both).
XII. What z-scores represent
A. Z-scores correspond to sections under the curve (percentages under the curve).
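A z-score is the raw score minus the mean, divided by the standard deviation. The sketch below applies that to the two raw scores mentioned above. The means and standard deviations are not given in the slides, so the values used here are hypothetical, chosen only so that both raw scores come out at z = .4.

```python
def z_score(raw, mean, sd):
    """Number of standard deviations a raw score lies from its mean."""
    return (raw - mean) / sd


# Hypothetical distribution parameters (not from the slides).
z_a = z_score(12.8, mean=12.0, sd=2.0)
z_b = z_score(64.8, mean=59.0, sd=14.5)

print(round(z_a, 2), round(z_b, 2))
```

Although 12.8 and 64.8 look very different as raw scores, both sit the same standardized distance above their own means, which is what makes them comparable.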

B. These percentages can be read as probabilities of a certain score occurring; they are given in Appendix D. Example of what we are saying: "In a distribution with a mean of 100 and standard deviation of 10, what is the probability that any score will be 110 or above?" The answer = _________.
C. What about a z-score of 1.38? What are the chances that a score will fall between the mean and a z-score of 1.38? _______ What about above a z-score of 1.38? ____ What about at or below 1.38? ______

What about between a z-score of 1 and 2.5? Answer: ______ (look at Fig. 11-9) Again, we are asking: what is the probability that a score will fall between 1 and 2.5 standard deviations (z's) from the mean? What about between -1 and 2.5?
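Questions like these are normally answered from the table in Appendix D, but the same areas can be computed directly. The sketch below assumes a normal distribution and uses the standard-library error function. The function names are hypothetical, and the results may differ from table values in the last decimal place because tables are rounded.

```python
from math import erf, sqrt


def normal_cdf(z):
    """Probability that a standard normal score is at or below z."""
    return 0.5 * (1 + erf(z / sqrt(2)))


def prob_above(z):
    """Probability of a score above z."""
    return 1 - normal_cdf(z)


def prob_between(z_low, z_high):
    """Probability of a score between two z-scores."""
    return normal_cdf(z_high) - normal_cdf(z_low)


# A score of 110 with mean 100 and sd 10 corresponds to z = 1.0.
z_110 = (110 - 100) / 10

print("110 or above:          ", round(prob_above(z_110), 4))
print("mean to z = 1.38:      ", round(prob_between(0, 1.38), 4))
print("above z = 1.38:        ", round(prob_above(1.38), 4))
print("at or below z = 1.38:  ", round(normal_cdf(1.38), 4))
print("between z = 1 and 2.5: ", round(prob_between(1, 2.5), 4))
print("between z = -1 and 2.5:", round(prob_between(-1, 2.5), 4))
```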