Chapter 3 Displaying and Describing Categorical Data Math2200.

Slides:



Advertisements
Similar presentations
Displaying and Describing Categorical Data 60 min.
Advertisements

Introduction to Stats Honors Analysis. Data Analysis Individuals: Objects described by a set of data. (Ex: People, animals, things) Variable: Any characteristic.
Displaying & Describing Categorical Data Chapter 3.
Area Principle  The area occupied by a part of the graph should correspond to the magnitude of the value it represents.
Exploring Two Categorical Variables: Contingency Tables
You have 15 min. to: Meet me at the table in the middle for HW Questions Should have paper labeled #1-10 with each question answered.
Chapter 3 Graphical and Numerical Summaries of Categorical Data UNIT OBJECTIVES At the conclusion of this unit you should be able to: n 1)Construct graphs.
In 2007, deaths of a large number of pet dogs and cats were ultimately traced to contamination of some brands of pet food. The manufacturer NOW claims.
Slide Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 3 Displaying and Describing Categorical Data.
The Three Rules of Data Analysis
AP STATISTICS Section 4.2 Relationships between Categorical Variables.
Chapter 3 Graphical and Numerical Summaries of Qualitative Data UNIT OBJECTIVES At the conclusion of this unit you should be able to: n 1)Construct graphs.
WARM UP JAKE IS A CAR BUFF WHO WANTS TO FIND OUT MORE ABOUT THE VEHICLES THAT STUDENTS AT HIS SCHOOL DRIVE. HE GETS PERMISSION TO GO TO THE STUDENT PARKING.
. Chapter 3 Displaying and Describing Categorical Data.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 3 Displaying and Describing Categorical Data.
  The three rules of data analysis won’t be difficult to remember: 1. Make a picture—things may be revealed that are not obvious in the raw data. These.
Copyright © 2012 Pearson Education. Chapter 4 Displaying and Describing Categorical Data.
Copyright © 2010 Pearson Education, Inc. Chapter 3 Displaying and Describing Categorical Data.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 2, Slide 1 Chapter 2 Displaying and Describing Categorical Data.
Do Now Have you: Read Harry Potter and the Deathly Hallows Seen Harry Potter and the Deathly Hallows (part 2)
Displaying & Describing Categorical Data Chapter 3.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 2, Slide 1 Chapter 2 Displaying and Describing Categorical Data.
Chapter 3 Displaying and Describing Categorical Data
Chapters 1 and 2 Week 1, Monday. Chapter 1: Stats Starts Here What is Statistics? “Statistics is a way of reasoning, along with a collection of tools.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 3- 1.
Chapter 2 DISPLAYING AND DESCRIBING CATEGORICAL DATA.
Unit 3 Relations in Categorical Data. Looking at Categorical Data Grouping values of quantitative data into specific classes We use counts or percents.
Displaying Categorical Data THINK SHOW TELL What is categorical data? Bar, Segmented Bar, and Pie Charts Frequency vs. Relative Frequency Tables/Charts.
Chapter 3: Displaying and Describing Categorical Data *Data Analysis *Frequency Tables, Bar Charts, Pie Charts Contingency Tables.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. - use pie charts, bar graphs, and tables to display data Chapter 3: Displaying and Describing Categorical.
Chapter 2 Displaying and Describing Categorical Data UNIT OBJECTIVES At the conclusion of this unit you should be able to: n 1)Construct graphs that appropriately.
1 Chapter 3 Displaying and Describing Categorical Data.
Chapter 11 Chi-Square Procedures 11.2 Contingency Tables; Association.
Slide 3-1 Copyright © 2004 Pearson Education, Inc.
Categorical Data! Frequency Table –Records the totals (counts or percentage of observations) for each category. If percentages are shown, it is a relative.
Unit 2 Descriptive Statistics Objective: To correctly identify and display sets of data.
Lesson 2 9/4/12.
Chapter 3 Displaying and Describing Categorical Data.
Displaying & Describing Categorical Data Chapter 3.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 3- 1.
Objectives Given a contingency table of counts, construct a marginal distribution. Given a contingency table of counts, create a conditional distribution.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Unit 6, Module 15 – Two Way Tables (Part I) Categorical Data Comparing 2.
1 Copyright © 2014, 2012, 2009 Pearson Education, Inc. Chapter 2 Displaying and Describing Categorical Data.
Copyright © 2009 Pearson Education, Inc. Chapter 3 Displaying and Describing Categorical Data.
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data Chapter 3.
August 25,  Passengers on the Titanic by class of ticket. ClassCount 1 st nd rd th 885.
CATEGORICAL DATA CHAPTER 3 GET A CALCULATOR!. Slide 3- 2 THE THREE RULES OF DATA ANALYSIS won’t be difficult to remember: 1. Make a picture — things may.
Sections TAKE OUT YOUR NOTES, Book & Do Page 8 #7-8
Smart Start In June 2003, Consumer Reports published an article on some sport-utility vehicles they had tested recently. They had reported some basic.
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
AP Statistics Chapter 3 Part 3
Displaying and Describing Categorical Data
Chapter 3: Displaying and Describing Categorical Data
CATEGORICAL DATA CHAPTER 3
Displaying and Describing
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
AP Statistics Chapter 3 Part 2
Displaying and Describing Categorical Data
Stats Starts Here Copyright © 2009 Pearson Education, Inc.
Displaying and Describing Categorical Data
Displaying and Describing Categorical data
Displaying and Describing Categorical Data
Grab a post it note and place it in the correct bin for where you went to middle school
Displaying and Describing Categorical Data
Displaying and Describing Categorical Data
Presentation transcript:

Chapter 3 Displaying and Describing Categorical Data Math2200

Categorical variable A categorical variable has only finite number of possible values Gender Car size Course grade

Titanic WHO People on the Titanic WHAT Survival status, age, sex, ticket class WHY Historical interest WHEN April 14,1912 WHERE North Atlantic HOW A variety of sources and Internet sites SurvivedAgeSexClass DeadAdultMaleThird DeadAdultMaleCrew DeadAdultMaleThird DeadAdultMaleCrew DeadAdultMaleCrew DeadAdultMaleCrew AliveAdultFemaleFirst DeadAdultMaleThird DeadAdultMaleCrew

Three rules of data analysis 1. Make a picture A picture can reveal the pattern and relationship hidden in your data 2. Make a picture A picture can show extraordinary data values or unexpected patterns 3. Make a picture Easy to understand

Florence Nightingale Founder of modern nursing First female member of British Statistical Society Used a picture to argue forcefully for better hospital conditions for soldiers

Frequency tables Making piles: count the number of cases corresponding to each category and pile them up People on Titanic: by ticket class ClassCount First325 Second285 Third706 Crew885

Relatively frequency table Proportion: divide counts by the total number of cases Percentage: multiply by 100 The frequency table or relative frequency table describe the distribution of a categorical variable ClassPercentage First14.77% Second12.95% Third32.08% Crew40.21%

What is your feeling about the proportion of crew members on board?

Why is the picture misleading? The length of each ship corresponds to the number of people in each category Our eyes tend to be more impressed by the area than by other aspects of the image. Even though the length of the ship is about 3 times, but the area is about 9 times. And that is misleading.

The area principle the area occupied by a part of the graph should correspond to the magnitude of the value it represents.

Bar chart Display of counts of a categorical variable with bars

Pie Charts

When you make a bar chart or pie chart, pay attention to the following Make sure the variable is indeed categorical Your data are counts or percentages of cases in categories Make sure that the categories do not overlap

Was there a relationship between the kind of ticket a passenger held and the passenger’s chances of making it into the lifeboat? What table should we make to answer this question?

Contingency table A two-way table The table shows how the subjects are distributed along each variable, contingent on the value of the other variable FirstSecondThirdCrewtotal Alive Dead Total

Add relative frequencies FirstSecondThirdCrewtotal Alive Counts % of Row28.55%16.60%25.04%29.82% % % of Column62.46%41.40%25.21%23.95%32.30% % of Table9.22%5.36%8.09%9.63%32.30% Dead Counts % of Row8.19%11.21%35.44%45.17% % % of Column37.54%58.60%74.79%76.05%67.70% % of Table5.54%7.59%23.99%30.58%67.70% Total Counts % of Row14.77%12.95%32.08%40.21% % % of Column100.00% % of Table14.77%12.95%32.08%40.21% %

Percent of what? What percent of the survivors were in second class? 118/711 = 16.60% What percent were second-class passengers who survived? The Who is everyone on board, i.e., 2201 is the denominator 118/2201 What percent of the second-class passengers survived? 118/285

A simplified table FirstSecondThirdCrewtotal Alive9.22%5.36%8.09%9.63%32.30% Dead5.54%7.59%23.99%30.58%67.70% Total14.76%12.95%32.08%40.21%100.00%

Marginal distribution In the margins of a contingency table, the frequency distribution of one of the variables is called its marginal distribution

Conditional distribution 1 FirstSecondThirdCrewtotal Alive %16.60%25.04%29.82%100.00% Dead %11.21%35.44%45.17%100.00%

Pie chart for conditional distributions of ticket Class for survivors and non-survivors

Conditional distribution 2 FirstSecondThirdCrewtotal Alive Counts % of Column62.46%41.40%25.21%23.95%32.30% Dead Counts % of Column37.54%58.60%74.79%76.05%67.70% Total Counts % of Column100.00%

Bar chart for conditional distributions of Ticket Class

Segmented Bar Chart

What can go wrong? Do not violate the area principle Incorrect correct

What can go wrong? Keep it honest Pay attention to labels Whether all percentages add up to 1? Do not confuse similar-sounding percentages The percentage of passengers who were both in first class and survived The percentage of the first class passengers who survived The percentage of the survivors who were in first class

What can go wrong? Do not forget to look at the variables separately, too. Look at both conditional and marginal distributions Be sure to use enough individuals Do not overstate your case

What can go wrong? Be careful with averages of proportion across several different groups Simpson’s Paradox ( Calculation in last column makes no sense) On-time record for two pilots DayNightOverall Moe90/100=90%10/20=50%100/120=83% Jill19/20=95%75/100=75%94/120=78%

Summary Chapter 3 Bar charts and pie charts are displays for categorical variables. A contingency table shows how cases are distributed along each variable conditioned on the other variable. Row/ column sums of table percentage of each cell in a contingency table give the marginal distributions. Row/column percentage in a contingency table show the conditional distributions. Contingency tables help to show the relationship of two categorical variables.