Chapter 4 Displaying and Summarizing Quantitative Data Math2200.

Slides:



Advertisements
Similar presentations
DESCRIBING DISTRIBUTION NUMERICALLY
Advertisements

CHAPTER 4 Displaying and Summarizing Quantitative Data Slice up the entire span of values in piles called bins (or classes) Then count the number of values.
Chapter 4 Displaying and Summarizing Quantitative Data.
Displaying and Summarizing Quantitative Data Copyright © 2010, 2007, 2004 Pearson Education, Inc.
Copyright © 2010 Pearson Education, Inc. Chapter 4 Displaying and Summarizing Quantitative Data.
Copyright © 2009 Pearson Education, Inc. Chapter 4 Displaying and Summarizing Quantitative Data.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 4 Displaying and Summarizing Quantitative Data.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 4- 1.
Chapter 4: Displaying Quantitative Data
Displaying & Summarizing Quantitative Data
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Displaying and Summarizing Quantitative Data.
It’s an outliar!.  Similar to a bar graph but uses data that is measured.
ISE 261 PROBABILISTIC SYSTEMS. Chapter One Descriptive Statistics.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 4: Displaying & Summarizing Quantitative Data
Objectives 1.2 Describing distributions with numbers
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
Have out your calculator and your notes! The four C’s: Clear, Concise, Complete, Context.
Objectives Vocabulary
What is Statistics? Statistics is the science of collecting, analyzing, and drawing conclusions from data –Descriptive Statistics Organizing and summarizing.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Measures of Center.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 4 Describing Numerical Data.
Copyright © 2010 Pearson Education, Inc. Chapter 4 Displaying and Summarizing Quantitative Data.
Categorical vs. Quantitative…
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
Displaying Quantitative Data Graphically and Describing It Numerically AP Statistics Chapters 4 & 5.
Slide 4-1 Copyright © 2004 Pearson Education, Inc. Dealing With a Lot of Numbers… Summarizing the data will help us when we look at large sets of quantitative.
Chapter 4 Displaying Quantitative Data *histograms *stem-and-leaf plots *dotplot *shape, center, spread.
1 Chapter 4 Displaying and Summarizing Quantitative Data.
Unit 4 Statistical Analysis Data Representations.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Describing Distributions Numerically.
Copyright © 2011 Pearson Education, Inc. Describing Numerical Data Chapter 4.
Displaying Quantitative Data AP STATS NHS Mr. Unruh.
UNIT #1 CHAPTERS BY JEREMY GREEN, ADAM PAQUETTEY, AND MATT STAUB.
Notes Unit 1 Chapters 2-5 Univariate Data. Statistics is the science of data. A set of data includes information about individuals. This information is.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 4 Displaying and Summarizing Quantitative Data.
1 Never let time idle away aimlessly.. 2 Chapters 1, 2: Turning Data into Information Types of data Displaying distributions Describing distributions.
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
Chapter 3: Displaying and Summarizing Quantitative Data Part 1 Pg
Displaying and Summarizing Quantitative Data 90 min.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Analyzing Data Week 1. Types of Graphs Histogram Must be Quantitative Data (measurements) Make “bins”, no overlaps, no gaps. Sort data into the bins.
Chapter 4 Histograms Stem-and-Leaf Dot Plots Measures of Central Tendency Measures of Variation Measures of Position.
AP Statistics. Chapter 1 Think – Where are you going, and why? Show – Calculate and display. Tell – What have you learned? Without this step, you’re never.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 3, Slide 1 Chapter 3 Displaying and Summarizing Quantitative Data.
Section 2.1 Visualizing Distributions: Shape, Center, and Spread.
UNIT ONE REVIEW Exploring Data.
Displaying and Summarizing Quantitative Data
Displaying and Summarizing Quantitative Data
Objective: Given a data set, compute measures of center and spread.
Displaying and Summarizing Quantitative Data
Bell Ringer Create a stem-and-leaf display using the Super Bowl data from yesterday’s example
Midrange (rarely used)
Displaying and Summarizing Quantitative Data
Histograms: Earthquake Magnitudes
Give 2 examples of this type of variable.
Displaying and Summarizing Quantitative Data
Displaying Quantitative Data
Displaying Distributions with Graphs
Displaying and Summarizing Quantitative Data
Displaying and Summarizing Quantitative Data
POPULATION VS. SAMPLE Population: a collection of ALL outcomes, responses, measurements or counts that are of interest. Sample: a subset of a population.
Displaying and Summarizing Quantitative Data
Displaying and Summarizing Quantitative Data
Means & Medians.
Honors Statistics Review Chapters 4 - 5
Displaying and Summarizing Quantitative Data
Lesson Plan Day 1 Lesson Plan Day 2 Lesson Plan Day 3
Presentation transcript:

Chapter 4 Displaying and Summarizing Quantitative Data Math2200

Example: Tsunamis and Earthquake - The most disastrous tsunami - Dec 26, 2004, in Sumatra - Earthquake: magnitude Killed 225,000 people

Question Was the earthquake that caused it truly unusually big? US National Geophysical Data Center Data on the magnitude of underlying earthquakes for 1240 historical tsunamis How do we learn the data YearCountryMagnitude -1300GREECE6 -479GREECE GREECE GREECE GREECE7 -227GREECE ALBANIA CYPRUS ISLAND NEW ZEALAND JAPAN JAPAN USA INDONESIA JAPAN JAPAN INDONESIA9

Histogram Display a quantitative variable by discretizing it into equal-width bins Counts for the bins give the distribution of the quantitative variable Make a bar chart based on these counts and align the bar according to the bin values, we get a histogram –Do not leave gaps between bars

With different number of bins

Summarize the histogram Magnitudes are typically around 7 Most are between 5.5 and 8.5 Minimum is around 3 Maximum is around 9 Why there is a sharp peak in the middle?

How to make a histogram using TI-83? Input the data –Suppose our data is 1.34, 2.21, 3.64, 6.2 –2nd + ( , 2.21, 3.64, nd + ) + STO + 2 nd + 1 –That gives L1 = {1.34, 2.21, 3.64, 6.2} Make the histogram –Press 2 nd +Y= (STAT PLOT) –You will see STAT PLOTS 1: Plot 1 … Off –Move the cursor to highlight 1:, then press ENTER –Select the following: ON Type: histogram (3 rd one) Xlist: L1 –Press ZOOM –Press 9 To adjust the graph window, press WINDOW and change the options

Stem-and-Leaf Plot John W. Tukey Useful for small data sets Similar to histogram, but the bars give numerical values more than counts 5 | 6 6 | | | | | | 8 Pulse-rates of 24 woman (8|8 means 88 beats/min)

Handwriting Handwriting may not give the same space for different digits. That violates the area principle When you make a stem-and-leaf plot, be sure to give each digit the same width.

Dotplot Replace digits in stem-and-leaf plot by dots

How to summarize the distribution of a quantitative variable? shape mode, symmetry, outlier center mean, median Spread sd, IQR

Shape Peak / Mode –Is there a peak? If so, how many peaks? –For quantitative variables, the mode is where the peak is at. –No peak: uniform –One peak: unimodal –Two peaks: bimodal –More than two peaks: multimodal

Shape Symmetry –Tail: thinner ends of a distribution –Skewed: If one tail stretches out farther than the other, we say the histogram is skewed to the side of the longer tail

Shape Outliers –Those that stand away from the body of the distribution –The judgment is vague sometimes

Center When a histogram is symmetric and unimodal, the center is obvious –The corresponding numerical value can be taken as the sample average, or say the sample mean –The sample mean is actually where the histogram balances

Center For skewed distribution –The sample mean is dragged to the side of the longer tail –Usually, much more than 50% values will be less or larger than the sample mean –Median is more appropriate Median is the value that splits the data in half

Finding the median Suppose that we have n numbers Order them first –If n is odd, the median is middle value. That is, the value in the (n+1)/2 position –If n is even, we take median as the average of the values in positionsn/2 and n/2+1

Mean versus median Extreme values / outliers: –Median only considers the order of the values, so it is resistant to extreme values –Mean is very sensitive Skewed distribution –Median is preferred than mean Unimodal and symmetric distribution –Mean is preferred because it uses more information from the data

Spread To quantify the variation Range Interquartile range (IQR) Standard deviation

Range Range = max – min Very sensitive to extreme values

Interquartile Range Quartiles –Q1 (lower quartile or the 25 th percentile): one quarter of the data lies below Q1 –Q2 (median or the 50 th percentile) –Q3 (upper quartile or the 75 th percentile): one quarter of the data lies above Q3 IQR = Q3-Q1 –Not sensitive to extreme values How to find Q1 and Q3? –Split the order values into two halves using the median –Q1 is the median of the first half –Q3 is the median of the second half

Standard deviation Sample variance = average of squared deviations Standard deviation (sd) –Sensitive to extreme values

σ X in TI-83

How to obtain these numbers using TI-83? Press STAT Move the cursor to CALC Press 1 The screen shows 1-Var Stats Put the list you want the statistics for. For example, L1. Press ENTER, then you will see –Sample mean, sample sum, sample sum squares, sample standard deviation (S x ),σ x (the same as except divided by n instead of n-1), sample size n, minimum, Q1, median, Q3, maximum

Summary Make a picture –Histogram, stem-and-leaf plot, dot plot Shape –How many modes? –Symmetric? –Outliers? If there are outliers, summarize once with the outliers and another time without the outliers Center and spread –Skewed distribution: median and IQR –Symmetric and unimodal distribution: mean and sd

What can go wrong? Do not use what we learned in chapter 4 for a categorical variable –Do not make histogram of a categorical variable –Do not look for shape and center and spread of a bar chart –Do not use mean, sd, IQR, etc. for a categorical variable Graph with bars are not always histograms or bar charts Choose a bin width appropriate to the data Check the summary numbers. Do they make sense? Do not worry about small differences when using different methods –No need to use too many digits for the summary numbers –Using one or two more digits than data is enough Do not round in the middle of a calculation Multiple modes, outliers (make a picture)