Stats Unit 3: Summary Statistics (Descriptive Statistics), FPP Chapter 4

Presentation transcript:

3-1 Stats Unit 3: Summary Statistics (Descriptive Statistics), FPP Chapter 4
For one variable:
- Center of distribution: "central value", "typical value"
- Spread of distribution: how variable are the values in a set of data?
- Measure how many / what proportion of observations are above / below a given value.

3-2 Stats Summary Statistics
Purposes:
- compact reporting
- easy comparison
Important considerations:
- interpretable
- stable
We will discuss:
- how the statistics are defined
- when each is (in)appropriate
- how to interpret them
- how to compute them
- "guesstimation" techniques

3-3 Stats Example: Hospital Charges
Total charge (in dollars) of the hospital stay for 29 normal deliveries of babies
Charges:
1,905  2,324  2,048  2,888  2,907  2,840  2,607  2,823  2,310  2,953
2,138  3,418  4,903  3,729  3,709  5,063  3,932  3,392  3,287  3,819
4,248  2,640  2,921  2,785  2,804  2,955  2,219  2,184  2,681  14,898

3-4 Stats Definitions
[Histogram: Hospital Charges (in Dollars) vs. frequency]
mode = most frequently occurring value = _______________
median = "middle value" = __________________
mean = sum / # measurements in the data set = __________/___________ = _________
another way to compute the mean:

3-5 Stats Locating These Summary Statistics on a Histogram
[Histogram: Hospital Charges (in Dollars) vs. frequency]
mode:
median:
mean:
comparing mean & median:
For skewed histograms, the mean could be deceiving.
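As a rough illustration of why the mean can be deceiving for a skewed histogram, the sketch below computes the mean and median of the hospital charges exactly as they are listed in this transcript (30 values appear in the list, including the 14,898 charge). The variable names are illustrative only and are not part of the original slides.

```python
from statistics import mean, median

# Hospital charges (dollars) as listed on slide 3-3 of this transcript.
charges = [
    1905, 2324, 2048, 2888, 2907, 2840, 2607, 2823, 2310, 2953,
    2138, 3418, 4903, 3729, 3709, 5063, 3932, 3392, 3287, 3819,
    4248, 2640, 2921, 2785, 2804, 2955, 2219, 2184, 2681, 14898,
]

charges_mean = mean(charges)      # sum of values / number of values
charges_median = median(charges)  # middle value of the sorted list

# Dropping the single largest charge shows how sensitive each statistic is.
trimmed = sorted(charges)[:-1]
trimmed_mean = mean(trimmed)
trimmed_median = median(trimmed)

print(f"all values:      mean = {charges_mean:.1f}, median = {charges_median:.1f}")
print(f"without largest: mean = {trimmed_mean:.1f}, median = {trimmed_median:.1f}")
```

Removing the one very large charge shifts the mean far more than the median, which is the sense in which the mean is less stable for skewed data.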

3-6 Stats

3-7 Stats Event Day Abnormal Returns (ref. "Marketing Science", Fall 1987, vol 6, no 4, pages , "Does It Pay to Change Your Company's Name?")

3-8 Stats
mode = most frequently occurring value = ______
median = "middle value" = __________
mean = "average" = (sum of values in list) / (# values in list) = _____ / _____ = _____
pth percentile = the value with p percent of the list less than or equal to it and (100 - p) percent greater than it
10th percentile = _____
25th percentile = _____
80th percentile = _____
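The percentile definition can be sketched in code. This is a minimal sketch using the "nearest rank" convention, which is one common reading of the definition; other textbooks and software interpolate between values and may give slightly different answers. The function name and the sample list are hypothetical and are not the abnormal-returns data from the slides.

```python
import math

def percentile_nearest_rank(values, p):
    """Smallest value in the list with at least p percent of the list <= it."""
    ordered = sorted(values)
    n = len(ordered)
    # Rank of the p-th percentile, counting from 1.
    rank = max(1, math.ceil(p * n / 100))
    return ordered[rank - 1]

# Hypothetical list of ten values, used only for illustration.
sample = [-3, -1, 0, 1, 2, 2, 4, 5, 7, 10]

p10 = percentile_nearest_rank(sample, 10)
p25 = percentile_nearest_rank(sample, 25)
p80 = percentile_nearest_rank(sample, 80)
print(p10, p25, p80)
```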

3-9 Stats Histogram for Abnormal Returns RETURNS

3-10 Stats Does This Statistic Make Sense? Some summary statistics make sense only for certain types of data. mean: median: mode:

3-11 Stats Water Watch

3-12 Stats
Aug 1-22 the average consumption was million gallons per day.
Aug 1-25 the average consumption was million gallons per day.
Q1: Was the average consumption higher Aug 1-22 or Aug 23-25?
Q2: What was the total amount of water consumed Aug 23-25?
Q3: What was the average daily consumption Aug 23-25?
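The reasoning behind Q2 and Q3 rests on the identity total = average x number of days. The sketch below applies it with made-up figures, because the actual consumption values are missing from this transcript; the function name and the numbers are hypothetical.

```python
def recent_total_and_average(avg_first, days_first, avg_all, days_all):
    """Recover the total and daily average for the days after the first period."""
    total_first = avg_first * days_first  # total = average x number of days
    total_all = avg_all * days_all
    total_recent = total_all - total_first
    days_recent = days_all - days_first
    return total_recent, total_recent / days_recent

# Hypothetical figures in million gallons per day (NOT the slide's values).
total, average = recent_total_and_average(
    avg_first=100.0, days_first=22, avg_all=103.0, days_all=25
)
print(total, average)
```

With these illustrative inputs the overall average rose when three days were added, so the average over those three days must exceed the earlier average, which is the logic Q1 asks for.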

3-13 Stats Baseball Batting Averages
Suppose batting average = (# hits / # at bats) x 1000
Before the game starts, a player has batting average = first at bat, strikes out - new batting average = 200
Q1: How many times has this batter been up?
Another player starts the game with batting average 500. After his first at bat, his new batting average is 524.
Q2: Did he get a hit?
Q3: How many times has this batter been up?
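One way to check an answer to Q2 and Q3 is a brute-force search over possible hit and at-bat counts. This is only a sketch: it assumes the reported average is rounded to the nearest whole number, and the function name and the search limit are hypothetical.

```python
def at_bats_before(old_avg, new_avg, got_hit, max_at_bats=200):
    """List (hits, at_bats) pairs before the game that match both averages.

    got_hit is 1 if the first at bat of the game was a hit, 0 otherwise.
    """
    matches = []
    for at_bats in range(1, max_at_bats + 1):
        for hits in range(at_bats + 1):
            before = round(1000 * hits / at_bats)
            after = round(1000 * (hits + got_hit) / (at_bats + 1))
            if before == old_avg and after == new_avg:
                matches.append((hits, at_bats))
    return matches

# Second player on the slide: average 500 before, 524 after one at bat.
with_hit = at_bats_before(500, 524, got_hit=1)
without_hit = at_bats_before(500, 524, got_hit=0)
print(with_hit, without_hit)
```

An average can only rise after a hit, so the search without a hit finds nothing; the search with a hit returns the hits and at bats the player had before the game, and one more at bat gives the count after it.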

3-14 Stats

3-15 Stats Measures of Location & Spread of a Data Set
LOCATION:
- mean
- median
- mode
SPREAD:
- standard deviation (SD)
- range
- variance

3-16 Stats Range RANGE: (largest measurement) - (smallest measurement) example:

3-17 Stats Deviation from Average definition: deviation from average = data value - average note: A deviation can be zero data value

3-18 Stats Standard Deviation of a list of numbers
definition: standard deviation = SD = rms size of the deviations from average = square root of the average of the squared deviations

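3-19 Stats rms (root mean square) size of a list of numbers
root-mean-square (rms) operation, read backwards: square each value, take the mean of the squares, then take the square root.
[Worked table columns: data value, deviation]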

3-20 Stats Standard Deviation
Try another list of numbers. Find the standard deviation (rms size of the deviations from average) for this list of numbers:
2, -6, 12, 4, 6
I. Find the average of this list of numbers.
II. Find the deviation of each value from this average.
III. Find the rms size of the list of deviations.
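The three steps can be sketched in code. This minimal sketch follows the slides' definition of the SD as the rms size of the deviations, which divides by the number of values (not by one less); the function names are illustrative.

```python
from math import sqrt

def rms(values):
    """Root-mean-square size: square each value, average the squares, take the root."""
    return sqrt(sum(v * v for v in values) / len(values))

def standard_deviation(values):
    """SD as the rms size of the deviations from the average (divides by n)."""
    average = sum(values) / len(values)
    deviations = [v - average for v in values]
    return rms(deviations)

data = [2, -6, 12, 4, 6]

average = sum(data) / len(data)            # step I
deviations = [v - average for v in data]   # step II
sd = rms(deviations)                       # step III

print(average, deviations, sd)
```

Python's `statistics.pstdev` uses the same divide-by-n convention and can be used to cross-check the result; `statistics.stdev` divides by n - 1 and will give a slightly larger number.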

3-21 Stats Standard Deviation The STANDARD DEVIATION (SD) OF A DATA SET measures how far away numbers are from their average. Most entries on the list will be somewhere around one SD away from the average. Very few will be more than two or three SDs away.

3-22 Stats Interpreting the Standard Deviation
* Roughly 68% of the entries on a list (roughly 2/3 of the entries) are within one SD of the average.
* The other 32% (approximately 1/3) are further away.
** Roughly 95% (19 out of 20) are within two SDs of the average.
** The other 5% are further away.
The 2/3 rule is true for most data sets. The 95% rule is true for many data sets, but not all.
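To see how these rules can be checked against a particular list, the sketch below counts the fraction of entries within k SDs of the average. It reuses the short list from slide 3-20 purely for illustration; the function name is hypothetical, and the SD divides by n, matching the rms definition.

```python
from statistics import mean, pstdev

def fraction_within(values, k):
    """Fraction of entries lying within k SDs of the average."""
    avg = mean(values)
    sd = pstdev(values)  # SD that divides by n
    inside = [v for v in values if abs(v - avg) <= k * sd]
    return len(inside) / len(values)

data = [2, -6, 12, 4, 6]
within_one = fraction_within(data, 1)
within_two = fraction_within(data, 2)
print(within_one, within_two)
```

With only five entries the fractions can move only in steps of 20%, so an exact match with 68% or 95% should not be expected; the rules are rough guides that work better on larger, mound-shaped data sets.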

3-23 Stats Delivery Times Example
[Tally table: Class Limits (time in days), Tallies, Frequency]

3-24 Stats Delivery Times Continued
[Histogram: Days Elapsed Between Order Date and Delivery Date for 50 Orders; elapsed time to delivery (days) vs. rel. freq.]
average (mean) =
median =
SD =

3-25 Stats Delivery Times - 3
“The 2/3 Rule” says that roughly 2/3 or 68% of the entries on a list are within one SD of the average.
Actually, in this data set, 34 out of 50 deliveries took between 59.4 and days. 34/50 = 0.68 = 68%
“The 95% Rule” says that roughly 95% of the entries on a list are within two SDs of the average.
Actually, 49 out of 50 deliveries took between 35.1 and days. 49/50 = 0.98 = 98%

3-26 Stats

3-27 Stats Guesstimating the SD
Middle 2/3 Rule
1. Locate the middle 2/3 of the data.
2. The range of the middle 2/3 of the data is approximately 2 SDs. So, 1/2 of this range is approximately 1 SD.
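The middle 2/3 rule is meant for eyeballing a histogram, but it can be mimicked numerically. The sketch below is one possible reading: it takes the cut points that leave 1/6 of the data in each tail, so the values between them are the middle 2/3. The function name and the data are hypothetical, and `statistics.quantiles` requires Python 3.8 or later.

```python
from statistics import pstdev, quantiles

def guesstimate_sd(values):
    """Half the range of the middle 2/3 of the data, as a rough SD estimate."""
    cuts = quantiles(values, n=6)  # 5 cut points at 1/6, 2/6, ..., 5/6
    low, high = cuts[0], cuts[-1]  # bounds of the middle 2/3
    return (high - low) / 2

# Hypothetical, roughly mound-shaped data used only for illustration.
data = [48, 52, 55, 57, 58, 60, 60, 61, 62, 63, 65, 66, 68, 71, 75]

estimate = guesstimate_sd(data)
actual = pstdev(data)
print(estimate, actual)
```

The estimate and the computed SD should be of similar size rather than equal; the rule is a guesstimation technique, not an exact formula.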

3-28 Stats Variance The variance of a list of numbers is the SD squared. That is, the SD is the square root of the variance.

3-29 Stats z-score
The z-score says how many SDs above (+) or below (-) the average a value is.
The sample z-score for a measurement x is z = (x - sample mean) / (sample SD)
The population z-score for a measurement x is z = (x - population mean) / (population SD)
example:
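A z-score calculation can be sketched as follows. The list is the one from slide 3-20, reused for illustration, and the SD here divides by n to match the rms definition used earlier in the unit; a sample z-score computed with a divide-by-(n - 1) SD would differ slightly. The function name is illustrative.

```python
from statistics import mean, pstdev

def z_score(value, average, sd):
    """Number of SDs that value lies above (+) or below (-) the average."""
    return (value - average) / sd

data = [2, -6, 12, 4, 6]
avg = mean(data)
sd = pstdev(data)

z_values = [z_score(v, avg, sd) for v in data]
for v, z in zip(data, z_values):
    print(f"value {v:>3}: z = {z:+.2f}")
```

Values above the average get positive z-scores and values below it get negative ones, and the z-scores of a whole list always average to zero.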

3-30 Stats Interpreting z-scores
Interpretation of z-Scores for "Mound-Shaped" Distributions of Data
1. Approximately 68% of the measurements will have a z-score between -1 and +1.
2. Approximately 95% of the measurements will have a z-score between -2 and +2.
3. All or almost all of the measurements will have a z-score between -3 and +3.

3-31 Stats Wonderlic Scores

3-32 Stats
USC had average team score What is their z-score? Is this value extreme among NCAA Division I teams?
How about Michigan State, whose average team score is 16.6? Find their z-score and interpret it.
How about Stanford, whose average team score is 28.2? Find their z-score and interpret it.