Organizing and Summarizing Data


Chapter 2: Organizing and Summarizing Data

Section 2.1: Organizing Qualitative Data

When data are collected from a survey or designed experiment, they must be organized into a manageable form. Data that are not organized are referred to as raw data.

Ways to organize data: tables, graphs, and numerical summaries (Chapter 3).

A frequency distribution lists each category of data and the number of occurrences for each category of data.

EXAMPLE: Organizing Qualitative Data into a Frequency Distribution

A physical therapist wants to determine the types of rehabilitation required by her patients. To do so, she obtains a simple random sample of 20 of her patients and records the body part requiring rehabilitation. Construct a frequency distribution of location of injury.

Back, Wrist, Elbow, Back, Hip, Neck, Shoulder, Back, Knee, Hand, Back, Back, Back, Shoulder, Knee, Knee, Shoulder, Back, Knee, Back

Location    Tally        Frequency
Back        ||||| |||    8
Wrist       |            1
Elbow       |            1
Hip         |            1
Neck        |            1
Shoulder    |||          3
Knee        ||||         4
Hand        |            1
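
The tally can be reproduced in a few lines of code. This is a minimal sketch, assuming Python and its standard `collections.Counter`; the variable names are illustrative and not part of the original slides.

```python
from collections import Counter

# The 20 recorded injury locations from the example.
injuries = [
    "Back", "Wrist", "Elbow", "Back", "Hip", "Neck", "Shoulder", "Back",
    "Knee", "Hand", "Back", "Back", "Back", "Shoulder", "Knee", "Knee",
    "Shoulder", "Back", "Knee", "Back",
]

# Counter tallies how many times each category occurs.
frequency = Counter(injuries)

# Print one row per category: location, then its frequency.
for location, count in frequency.items():
    print(f"{location:<10}{count}")
```

The frequencies must add up to the sample size (20 here), which is a quick check that no observation was missed.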

The relative frequency is the proportion (or percent) of observations within a category and is found using the formula:

    relative frequency = frequency / sum of all frequencies

A relative frequency distribution lists each category of data with its relative frequency.

Use the frequency distribution obtained to construct a relative frequency distribution of the color of plain M&Ms.

Color     Tally             Frequency
Brown     ||||| ||||| ||    12
Yellow    ||||| |||||       10
Red       ||||| ||||        9
Orange    ||||| |           6
Blue      |||               3
Green     |||||             5

Color     Tally             Frequency    Relative Frequency
Brown     ||||| ||||| ||    12           12/45 ≈ 0.2667
Yellow    ||||| |||||       10           0.2222
Red       ||||| ||||        9            0.2
Orange    ||||| |           6            0.1333
Blue      |||               3            0.0667
Green     |||||             5            0.1111
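
The relative frequencies in this table can be checked by dividing each frequency by the total count. This is a minimal sketch, assuming Python; the dictionary names are illustrative.

```python
# Frequencies from the M&M table (color -> count).
frequency = {
    "Brown": 12, "Yellow": 10, "Red": 9,
    "Orange": 6, "Blue": 3, "Green": 5,
}

# Sum of all frequencies: 45 candies in total.
total = sum(frequency.values())

# Relative frequency = frequency / sum of all frequencies.
relative_frequency = {
    color: count / total for color, count in frequency.items()
}

# Print each color with its relative frequency to four decimal places.
for color, rel in relative_frequency.items():
    print(f"{color:<8}{rel:.4f}")
```

Apart from rounding, the relative frequencies always sum to 1.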

EXAMPLE: Organizing Qualitative Data into a Relative Frequency Distribution

Use the frequency distribution obtained in the prior example to construct a relative frequency distribution of the location of injury.

Location    Tally        Frequency    Relative Frequency
Back        ||||| |||    8            8/20 = 0.4
Wrist       |            1            0.05
Elbow       |            1            0.05
Hip         |            1            0.05
Neck        |            1            0.05
Shoulder    |||          3            0.15
Knee        ||||         4            0.2
Hand        |            1            0.05

Bar Graphs

A bar graph is constructed by labeling each category of data on either the horizontal or vertical axis and the frequency or relative frequency of the category on the other axis. Rectangles of equal width are drawn for each category. The height of each rectangle represents the category’s frequency or relative frequency.

[Figure: bar graph constructed from the frequency table]
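
A bar graph can be roughed out in plain text before reaching for a plotting tool. The sketch below assumes Python and draws horizontal bars, with one `#` per observation. The function name `text_bar_graph` is hypothetical, and the counts are the ones tallied from the 20 injury observations in the earlier example.

```python
def text_bar_graph(frequency, symbol="#"):
    """Return one line per category: label, a bar of `symbol`, and the count."""
    # Pad every label to the longest one so the bars start in the same column.
    width = max(len(label) for label in frequency)
    lines = []
    for label, count in frequency.items():
        lines.append(f"{label:<{width}} | {symbol * count} {count}")
    return lines


# Frequencies tallied from the injury-location sample (20 patients).
injury_frequency = {
    "Back": 8, "Wrist": 1, "Elbow": 1, "Hip": 1,
    "Neck": 1, "Shoulder": 3, "Knee": 4, "Hand": 1,
}

lines = text_bar_graph(injury_frequency)
for line in lines:
    print(line)
```

Each bar has the same thickness (one text row), and its length plays the role of the rectangle's height in the definition.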


Section 2.2: Organizing Quantitative Data: The Popular Displays

EXAMPLE: Constructing Frequency and Relative Frequency Distributions from Discrete Data

The following data represent the number of available cars in a household based on a random sample of 50 households. Construct a frequency and relative frequency distribution.

3 0 1 2 1 1 1 2 0 2
4 2 2 2 1 2 2 0 2 4
1 1 3 2 4 1 2 1 2 2
3 3 2 1 2 2 0 3 2 2
2 3 2 1 2 2 1 1 3 5

Data based on results reported by the United States Bureau of the Census.

Number of Cars    Tally                         Frequency    Relative Frequency
0                 ||||                          4            4/50 = 0.08
1                 ||||| ||||| |||               13           13/50 = 0.26
2                 ||||| ||||| ||||| ||||| ||    22           0.44
3                 ||||| ||                      7            0.14
4                 |||                           3            0.06
5                 |                             1            0.02
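
For discrete data, each distinct value serves as its own class, so the same counting approach applies. This is a minimal sketch, assuming Python and `collections.Counter`; the variable names are illustrative.

```python
from collections import Counter

# Number of available cars in each of the 50 sampled households.
cars = [
    3, 0, 1, 2, 1, 1, 1, 2, 0, 2,
    4, 2, 2, 2, 1, 2, 2, 0, 2, 4,
    1, 1, 3, 2, 4, 1, 2, 1, 2, 2,
    3, 3, 2, 1, 2, 2, 0, 3, 2, 2,
    2, 3, 2, 1, 2, 2, 1, 1, 3, 5,
]

n = len(cars)              # sample size
frequency = Counter(cars)  # value -> number of households

# List the values in increasing order with frequency and relative frequency.
for value in sorted(frequency):
    count = frequency[value]
    print(f"{value}  {count:>2}  {count / n:.2f}")
```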

A histogram is constructed by drawing rectangles for each class of data. The height of each rectangle is the frequency or relative frequency of the class. The width of each rectangle is the same, and the rectangles touch each other. (Note: a histogram can also be thought of as a bar graph.)

EXAMPLE: Drawing a Histogram for Discrete Data

Draw a frequency and relative frequency histogram for the “number of cars per household” data.

Number of Cars    Frequency    Relative Frequency
0                 4            4/50 = 0.08
1                 13           13/50 = 0.26
2                 22           0.44
3                 7            0.14
4                 3            0.06
5                 1            0.02

Classes are categories into which data are grouped. When a data set consists of a large number of different discrete data values or when a data set consists of continuous data, we must create classes by using intervals of numbers.

The following data represent the number of persons aged 25 - 64 who are currently work-disabled.

The lower class limit of a class is the smallest value within the class, while the upper class limit of a class is the largest value within the class. The lower class limit of the first class is 25. The lower class limit of the second class is 35. The upper class limit of the first class is 34.

The class width is the difference between consecutive lower class limits. The class width of the data given above is 35 - 25 = 10.

EXAMPLE: Organizing Continuous Data into a Frequency and Relative Frequency Distribution

The following data represent the time between eruptions (in seconds) for a random sample of 45 eruptions at the Old Faithful Geyser in Wyoming. Construct a frequency and relative frequency distribution of the data.

Source: Ladonna Hansen, Park Curator

The smallest data value is 672 and the largest data value is 738. We will create the classes so that the lower class limit of the first class is 670 and the class width is 10, and obtain the following classes:

670 - 679
680 - 689
690 - 699
700 - 709
710 - 719
720 - 729
730 - 739
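
The class limits follow mechanically once the first lower class limit and the class width are chosen. This is a minimal sketch, assuming Python and whole-number data (so each upper limit is one less than the next lower limit); `make_classes` and `class_of` are hypothetical helper names.

```python
def make_classes(first_lower_limit, class_width, largest_value):
    """Build (lower, upper) class limits until largest_value is covered.

    Assumes integer-valued data, so upper = lower + class_width - 1.
    """
    classes = []
    lower = first_lower_limit
    while lower <= largest_value:
        classes.append((lower, lower + class_width - 1))
        lower += class_width
    return classes


def class_of(value, classes):
    """Return the (lower, upper) class that contains value."""
    for lower, upper in classes:
        if lower <= value <= upper:
            return (lower, upper)
    raise ValueError(f"{value} falls outside every class")


# First lower class limit 670, class width 10, largest observation 738.
classes = make_classes(670, 10, 738)
for lower, upper in classes:
    print(f"{lower} - {upper}")

print(class_of(672, classes))  # class holding the smallest observation
print(class_of(738, classes))  # class holding the largest observation
```

Every observation lands in exactly one class because the classes do not overlap and together cover the range from 670 to 739.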

Time between Eruptions (seconds)    Tally            Frequency    Relative Frequency
670 - 679                           ||               2            2/45 = 0.0444
680 - 689
690 - 699                           ||||| ||         7            0.1556
700 - 709                           ||||| ||||       9            0.2
710 - 719                           ||||| ||||       9            0.2
720 - 729                           ||||| ||||| |    11           0.2444
730 - 739

The choices of the lower class limit of the first class and the class width were rather arbitrary. There is not one correct frequency distribution for a particular set of data. However, some frequency distributions can better illustrate patterns within the data than others, so constructing frequency distributions is something of an art form. Use the distribution that seems to provide the best overall summary of the data.

Time between Eruptions (seconds)    Tally       Frequency    Relative Frequency
670 - 674                           |           1            1/45 = 0.0222
675 - 679                           |           1            0.0222
680 - 684
685 - 689
690 - 694                                       0            0
695 - 699                           ||||| ||    7            0.1556
700 - 704                           ||||| ||    7            0.1556
705 - 709                           ||          2            0.0444
710 - 714                           |||||       5            0.1111
715 - 719                           ||||        4            0.0889
720 - 724                           ||||| |     6            0.1333
725 - 729                           |||||       5            0.1111
730 - 734                           |||         3            0.0667
735 - 739

Guidelines for Determining the Lower Class Limit of the First Class and Class Width

Choosing the Lower Class Limit of the First Class
Choose the smallest observation in the data set or a convenient number slightly lower than the smallest observation in the data set.

Determining the Class Width
Decide on the number of classes. Generally, there should be between 5 and 20 classes. The smaller the data set, the fewer classes you should have. Determine the class width by computing

    class width ≈ (largest data value - smallest data value) / number of classes

Round this value up to a convenient number.
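
The width rule can be sketched in code, assuming the usual formula (largest data value minus smallest data value, divided by the number of classes) and Python's `math.ceil` for rounding up. The function name `class_width`, the `round_to` parameter, and the choice of 7 and 14 classes are assumptions made for illustration only.

```python
import math


def class_width(smallest, largest, number_of_classes, round_to=1):
    """Range divided by number of classes, rounded UP to a multiple of round_to."""
    raw = (largest - smallest) / number_of_classes
    return math.ceil(raw / round_to) * round_to


# Old Faithful example: smallest value 672, largest value 738.
# With 7 classes the raw width is 66 / 7, about 9.43, which rounds up to 10.
width_7 = class_width(672, 738, 7, round_to=5)

# With 14 classes the raw width is 66 / 14, about 4.71, which rounds up to 5.
width_14 = class_width(672, 738, 14, round_to=5)

print(width_7, width_14)
```

Rounding up rather than to the nearest value means the chosen number of classes is always enough to reach the largest observation.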

EXAMPLE: Constructing a Frequency and Relative Frequency Histogram for Continuous Data

[Figure: frequency histogram using a class width of 10]

[Figure: relative frequency histogram]

[Figure: histogram using a class width of 5]

Uniform distribution: the frequency of each value of the variable is evenly spread out across the values of the variable.
Bell-shaped distribution: the highest frequency occurs in the middle and frequencies tail off to the left and right of the middle.
Skewed right: the tail to the right of the peak is longer than the tail to the left of the peak.
Skewed left: the tail to the left of the peak is longer than the tail to the right of the peak.
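
The two skewness definitions compare tail lengths on either side of the peak, which can be read off a list of class frequencies. The sketch below assumes Python and is a crude heuristic rather than a standard statistical test: it only counts classes on each side of the tallest one and cannot tell a uniform distribution from a bell-shaped one. The function name `describe_shape` and the sample frequency lists are hypothetical.

```python
def describe_shape(frequencies):
    """Compare tail lengths, in classes, on each side of the tallest class."""
    if not any(frequencies):
        raise ValueError("at least one class must have a nonzero frequency")

    # Index of the tallest class (the first one if there is a tie).
    peak = frequencies.index(max(frequencies))

    # Indices of the classes that contain at least one observation.
    occupied = [i for i, f in enumerate(frequencies) if f > 0]

    left_tail = peak - occupied[0]    # classes from first occupied class to peak
    right_tail = occupied[-1] - peak  # classes from peak to last occupied class

    if right_tail > left_tail:
        return "skewed right"
    if left_tail > right_tail:
        return "skewed left"
    return "roughly symmetric"


# Hypothetical class frequencies, listed from the lowest class to the highest.
print(describe_shape([2, 5, 9, 6, 3, 2, 1]))
print(describe_shape([1, 2, 3, 6, 9, 5, 2]))
print(describe_shape([1, 4, 9, 4, 1]))
```

In practice the shape is judged by eye from the histogram. A count like this only formalizes the longer-tail wording of the definitions.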

EXAMPLE: Identifying the Shape of the Distribution

Identify the shape of the following histograms, which represent the time between eruptions at Old Faithful.

Skewed Right Skewed Left Skewed Left (slightly) Bell Shaped