Displaying and Summarizing Quantitative Data

Slides:



Advertisements
Similar presentations
So What Do We Know? Variables can be classified as qualitative/categorical or quantitative. The context of the data we work with is very important. Always.
Advertisements

Describing Quantitative Variables
DESCRIBING DISTRIBUTION NUMERICALLY
CHAPTER 4 Displaying and Summarizing Quantitative Data Slice up the entire span of values in piles called bins (or classes) Then count the number of values.
Displaying and Summarizing Quantitative Data Copyright © 2010, 2007, 2004 Pearson Education, Inc.
Copyright © 2009 Pearson Education, Inc. Chapter 4 Displaying and Summarizing Quantitative Data.
Displaying & Summarizing Quantitative Data
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Chapter 1 – Exploring Data YMS Displaying Distributions with Graphs xii-7.
Chapter 4 Displaying and Summarizing Quantitative Data Math2200.
What is Statistics? Statistics is the science of collecting, analyzing, and drawing conclusions from data –Descriptive Statistics Organizing and summarizing.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 4 Describing Numerical Data.
Copyright © 2010 Pearson Education, Inc. Chapter 4 Displaying and Summarizing Quantitative Data.
Categorical vs. Quantitative…
Copyright © 2011 Pearson Education, Inc. Describing Numerical Data Chapter 4.
UNIT #1 CHAPTERS BY JEREMY GREEN, ADAM PAQUETTEY, AND MATT STAUB.
Notes Unit 1 Chapters 2-5 Univariate Data. Statistics is the science of data. A set of data includes information about individuals. This information is.
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
Statistics and Data Analysis
Chapter 14 Statistics and Data Analysis. Data Analysis Chart Types Frequency Distribution.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
Section 2.1 Visualizing Distributions: Shape, Center, and Spread.
Displaying and Summarizing Quantitative Data
Methods for Describing Sets of Data
Chapter 1.1 Displaying Distributions with graphs.
Introduction To compare data sets, use the same types of statistics that you use to represent or describe data sets. These statistics include measures.
ISE 261 PROBABILISTIC SYSTEMS
Chapter 5 : Describing Distributions Numerically I
Chapter 1: Exploring Data
Warm Up.
Objective: Given a data set, compute measures of center and spread.
Displaying and Summarizing Quantitative Data
1st Semester Final Review Day 1: Exploratory Data Analysis
Laugh, and the world laughs with you. Weep and you weep alone
Jeopardy Final Jeopardy Chapter 1 Chapter 2 Chapter 3 Chapter 4
Displaying and Summarizing Quantitative Data
Chapter 5: Describing Distributions Numerically
Topic 5: Exploring Quantitative data
Histograms: Earthquake Magnitudes
Displaying and Summarizing Quantitative Data
Displaying Quantitative Data
Warm-up 8/25/14 Compare Data A to Data B using the five number summary, measure of center and measure of spread. A) 18, 33, 18, 87, 12, 23, 93, 34, 71,
Drill {A, B, B, C, C, E, C, C, C, B, A, A, E, E, D, D, A, B, B, C}
Lesson 1: Summarizing and Interpreting Data
Displaying Distributions with Graphs
POPULATION VS. SAMPLE Population: a collection of ALL outcomes, responses, measurements or counts that are of interest. Sample: a subset of a population.
Displaying and Summarizing Quantitative Data
Displaying and Summarizing Quantitative Data
Displaying and Summarizing Quantitative Data
Basic Practice of Statistics - 3rd Edition
Exploratory Data Analysis
Chapter 1: Exploring Data
Basic Practice of Statistics - 3rd Edition
Chapter 1: Exploring Data
Describing Distributions Numerically
Honors Statistics Review Chapters 4 - 5
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Displaying and Summarizing Quantitative Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Math 341 January 24, 2007.
Chapter 1: Exploring Data
MATH 2311 Section 1.5.
Chapter 1: Exploring Data
Presentation transcript:

Displaying and Summarizing Quantitative Data Chapter 4 Displaying and Summarizing Quantitative Data

A bar chart or pie chart is often used to display categorical data. These types of displays, however, are not appropriate for quantitative data. Quantitative data is often displayed using either a histogram, dot plot, or a stem-and-leaf plot.

In a histogram, the interval corresponding to the width of each bar is called a bin. A histogram displays the bin counts as the height of the bars (like a bar chart). Unlike a bar chart, however, the bars in a histogram touch one another. An empty space between bars represents a gap in data values. If a value falls on the border between two consecutive bars, it is placed in the bin on the right.

A relative frequency histogram displays the proportion of cases in each bin instead of the count. Histograms are useful when working with large sets of data, and they can easily be constructed using a graphing calculator. A disadvantage of histograms is that they do not show individual values. Be sure to choose an appropriate bin width when constructing a histogram. As a general rule of thumb, your histogram should contain about 10 bars. A stem-and-leaf plot is similar to a histogram, but it shows individual values rather than bars. It may be necessary to split stems if the range of data values is small.

Number of Pairs of Shoes Owned

The stems of the stem-and-leaf plot correspond to the bins of a histogram. You may only use one digit for the leaves. Round or truncate your values if necessary. Stem-and-leaf plots are useful when working with sets of data that are small to moderate in size, and when you want to display individual values.

How would you setup the following stem-and-leaf plots? quiz scores (out of 100) states visited student weights SAT scores weights of cattle (1000-2000 pounds)

Dot plots may also be used to display quantitative variables. Dot plots are useful when working with small sets of data.

When describing a distribution, you should tell about three things: shape, center, and spread. You should also mention any unusual features, like outliers or gaps.

Identify the shapes of the following distributions:

When describing distributions, we need to discuss shape, center, and spread. How we measure the center and spread of a distribution depends on its shape. The center of a distribution is a “typical” value. If the shape is unimodal and symmetric, a “typical” value is in the middle. If the shape is skewed, however, a “typical” value is not necessarily in the middle.

For skewed distributions, use the median to determine the center of the distribution and the interquartile range to describe the spread of the distribution.   The median: is the middle data value (when the data have been ordered) that divides the histogram into two equal areas has the same units as the data is resistant to outliers (extreme data values)

The range: is the difference between the maximum value and the minimum value is a number, NOT an interval is sensitive to outliers

The interquartile range (IQR): contains the middle 50% of the data is the difference between the lower (Q1) and upper (Q3) quartiles is a number, NOT an interval is resistant to outliers The Five-Number Summary gives: minimum, lower quartile, median, upper quartile, maximum.

For symmetric distributions, use the mean to determine the center of the distribution and the standard deviation to describe the spread of the distribution.

The mean: is the arithmetic average of the data values is the balancing point of a histogram has the same units as the data is sensitive to outliers is given by the formula

The standard deviation: measures the “typical” distance each data value is from the mean Because some values are above the mean and some are below the mean, finding the sum is not useful (positives cancel out negatives); therefore we first square the deviations, then calculate an adjusted average. This is called the variance. This statistics does not have the same units as the data, since we squared the deviations. Therefore, the final step is to take the square root of the variance, which gives us the standard deviation. is given by the formula is sensitive to outliers, since its calculation involves the mean