Displaying and Describing Categorical Data 60 min.



The three rules of data analysis are easy to remember:
1. Make a picture: things may be revealed that are not obvious in the raw data. These will be things to think about.
2. Make a picture: important features of and patterns in the data will show up. You may also see things that you did not expect.
3. Make a picture: the best way to tell others about your data is with a well-chosen picture.

Area principle: The area occupied by a part of the graph should correspond to the magnitude of the value it represents.

We can pile the data by counting the number of data values in each category of interest. We can organize these counts into a frequency table, which records the totals and the category names.

A relative frequency table is similar, but gives the percentages (instead of counts) for each category.
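The tables themselves did not survive in this transcript, so here is a minimal sketch of how a frequency table becomes a relative frequency table. The ticket-class counts are an assumption: they are the commonly cited Titanic figures, not values taken from these slides, and the function name is purely illustrative.

```python
# Ticket-class counts for the Titanic.
# ASSUMPTION: commonly cited figures, not given in this transcript.
counts = {"First": 325, "Second": 285, "Third": 706, "Crew": 885}


def relative_frequencies(freq):
    """Convert a frequency table (category -> count) into percentages."""
    total = sum(freq.values())
    return {category: 100 * n / total for category, n in freq.items()}


rel = relative_frequencies(counts)

# Print both tables side by side: counts and percentages per category.
print(f"{'Class':<8}{'Count':>7}{'Percent':>9}")
for category, n in counts.items():
    print(f"{category:<8}{n:>7}{rel[category]:>8.1f}%")
```

With these counts the percentages come out to roughly 14.8%, 12.9%, 32.1%, and 40.2%, which add up to 100% as a relative frequency table should.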

A bar chart displays the distribution of a categorical variable, showing the counts for each category next to each other for easy comparison. A bar chart stays true to the area principle. Thus, a better display for the ship data is:

A relative frequency bar chart displays the relative proportion of counts for each category. A relative frequency bar chart also stays true to the area principle. Replacing counts with percentages in the ship data:
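The charts are missing from this transcript. As a rough stand-in, the sketch below draws a text bar chart in which every bar has the same height and a length proportional to its value, which is what the area principle asks for. The counts are assumed (commonly cited Titanic figures), and `text_bar_chart` is a hypothetical helper, not something from the slides.

```python
# ASSUMPTION: commonly cited Titanic ticket-class counts.
counts = {"First": 325, "Second": 285, "Third": 706, "Crew": 885}


def text_bar_chart(freq, width=40):
    """Return one line per category; bar length is proportional to the value."""
    largest = max(freq.values())
    lines = []
    for category, value in freq.items():
        bar = "#" * round(width * value / largest)
        lines.append(f"{category:<7}|{bar} {value}")
    return lines


chart = text_bar_chart(counts)
print("\n".join(chart))
```

Passing percentages instead of counts changes only the labels: the bars keep the same relative lengths, which is why the relative frequency bar chart also respects the area principle.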

When you are interested in parts of the whole, a pie chart might be your display of choice. Pie charts show the whole group of cases as a circle. They slice the circle into pieces whose size is proportional to the fraction of the whole in each category.
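To make "proportional to the fraction of the whole" concrete, each slice's central angle is that fraction of 360 degrees. A minimal sketch, again assuming the commonly cited Titanic ticket-class counts (they are not given in this transcript):

```python
# ASSUMPTION: commonly cited Titanic ticket-class counts.
counts = {"First": 325, "Second": 285, "Third": 706, "Crew": 885}


def pie_angles(freq):
    """Central angle in degrees for each category's slice."""
    total = sum(freq.values())
    return {category: 360 * n / total for category, n in freq.items()}


angles = pie_angles(counts)
for category, angle in angles.items():
    print(f"{category:<7}{angle:6.1f} degrees")
```

The angles always sum to 360, so the slices together make up the whole circle.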

A contingency table allows us to look at 2 categorical variables together. It shows how individuals are distributed along each variable, contingent on the value of the other variable. Example: we can examine the class of ticket and whether a person survived the Titanic:

The margins of the table, both on the right and on the bottom, give totals and the frequency distributions for each of the variables. Each frequency distribution is called a marginal distribution of its respective variable. The marginal distribution of Survival is:
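The contingency table and its margins are missing from this transcript. The sketch below rebuilds them under an assumption: the cell counts are the commonly cited Titanic figures, of which only the 673 crew deaths is confirmed by the text. Row totals give the marginal distribution of Survival, and column totals give the marginal distribution of Class.

```python
# Contingency table: Survival (rows) by ticket Class (columns).
# ASSUMPTION: commonly cited Titanic counts; only the 673 crew deaths
# appears in this transcript.
table = {
    "Alive": {"First": 203, "Second": 118, "Third": 178, "Crew": 212},
    "Dead": {"First": 122, "Second": 167, "Third": 528, "Crew": 673},
}

# Row totals: the marginal distribution of Survival.
survival_margin = {status: sum(row.values()) for status, row in table.items()}

# Column totals: the marginal distribution of Class.
classes = list(table["Alive"])
class_margin = {c: sum(table[status][c] for status in table) for c in classes}

grand_total = sum(survival_margin.values())

for status, n in survival_margin.items():
    print(f"{status:<6}{n:>6}{100 * n / grand_total:>7.1f}%")
```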

Each cell of the table gives the count for a combination of values of the two variables. For example, the second cell in the crew column tells us that 673 crew members died when the Titanic sank.

A conditional distribution shows the distribution of one variable for just the individuals who satisfy some condition on another variable. The following is the conditional distribution of ticket Class, conditional on having survived:

The following is the conditional distribution of ticket Class, conditional on having perished:
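Both conditional distribution tables are missing here. A conditional distribution is just one row of the contingency table divided by that row's total, as this sketch shows. The cell counts are assumed (commonly cited Titanic figures), and the helper name is illustrative.

```python
# ASSUMPTION: commonly cited Titanic counts, not taken from this transcript.
table = {
    "Alive": {"First": 203, "Second": 118, "Third": 178, "Crew": 212},
    "Dead": {"First": 122, "Second": 167, "Third": 528, "Crew": 673},
}


def conditional_distribution(row):
    """Percentages of each class within a single survival row."""
    total = sum(row.values())
    return {category: 100 * n / total for category, n in row.items()}


given_alive = conditional_distribution(table["Alive"])
given_dead = conditional_distribution(table["Dead"])

for label, dist in (("Survived", given_alive), ("Perished", given_dead)):
    cells = ", ".join(f"{c} {p:.1f}%" for c, p in dist.items())
    print(f"{label}: {cells}")
```

Under these counts, first-class passengers make up about 28.6% of survivors but only about 8.2% of those who perished.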

The conditional distributions tell us that there is a difference in class for those who survived and those who perished. This is better shown with pie charts of the two distributions:

We see that the distribution of Class for the survivors is different from that of the nonsurvivors. This leads us to believe that Class and Survival are associated, that they are not independent. The variables would be considered independent when the distribution of one variable in a contingency table is the same for all categories of the other variable.
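One informal way to apply this definition is to compute the survival rate within each class and see whether the rates agree. This is only a descriptive check under assumed counts (commonly cited Titanic figures), not a formal test of independence, and the function name is hypothetical.

```python
# ASSUMPTION: commonly cited Titanic counts, not taken from this transcript.
table = {
    "Alive": {"First": 203, "Second": 118, "Third": 178, "Crew": 212},
    "Dead": {"First": 122, "Second": 167, "Third": 528, "Crew": 673},
}


def survival_rate_by_class(tbl):
    """Percentage surviving within each ticket class."""
    rates = {}
    for c in tbl["Alive"]:
        alive, dead = tbl["Alive"][c], tbl["Dead"][c]
        rates[c] = 100 * alive / (alive + dead)
    return rates


rates = survival_rate_by_class(table)

# If Class and Survival were independent, every class would show roughly
# the same survival rate, so the spread between rates would be near zero.
spread = max(rates.values()) - min(rates.values())

for c, r in rates.items():
    print(f"{c:<7}{r:5.1f}% survived")
print(f"Spread between highest and lowest rate: {spread:.1f} points")
```

With these counts the rates run from about 24% for the crew to about 62.5% for first class, a gap far too large to call the variables independent.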

A segmented bar chart displays the same information as a pie chart, but in the form of bars instead of circles. Here is the segmented bar chart for ticket Class by Survival status:
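The chart is missing from this transcript. As a rough text stand-in, the sketch below draws one fixed-width bar per survival status and splits it into segments sized by each class's share. The counts are assumed (commonly cited Titanic figures), and the symbols and helper name are illustrative choices.

```python
# ASSUMPTION: commonly cited Titanic counts, not taken from this transcript.
table = {
    "Alive": {"First": 203, "Second": 118, "Third": 178, "Crew": 212},
    "Dead": {"First": 122, "Second": 167, "Third": 528, "Crew": 673},
}
symbols = {"First": "1", "Second": "2", "Third": "3", "Crew": "C"}


def segmented_bar(row, marks, width=50):
    """Build a bar of exactly `width` characters, one segment per category."""
    total = sum(row.values())
    bar, running, filled = "", 0, 0
    for category, n in row.items():
        running += n
        # Round the cumulative position so the segments always sum to `width`.
        end = round(width * running / total)
        bar += marks[category] * (end - filled)
        filled = end
    return bar


bars = {status: segmented_bar(row, symbols) for status, row in table.items()}
for status, bar in bars.items():
    print(f"{status:<6}|{bar}|")
```

Because both bars have the same total length, the segments can be compared directly across the two groups, which is harder to do with two separate pie charts.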

Example Professor Weiss asked his introductory statistics students to state their political party affiliations as Democratic (D), Republican (R), or Other (O). The responses are given in the table. Determine the frequency and relative-frequency distributions for these data.

Solution

Display the relative-frequency distribution of these qualitative data with (a) a pie chart and (b) a bar graph.

Solution

Keep it honest: make sure your display shows what it says it shows. This plot of the percentage of high-school students who engage in specified dangerous behaviors has a problem. Can you see it?

Don't overstate your case: don't claim something you can't.

Don't use unfair or silly averages: this could lead to Simpson's paradox, so be careful when you average one variable across different levels of a second variable.

Pilot   Day            Night          Overall
Moe     90/100 (90%)   10/20 (50%)    100/120 (83%)
Jill    19/20 (95%)    75/100 (75%)   94/120 (78%)

The table shows the number of flights each pilot landed on time during the daytime, at night, and overall. Who is the better pilot?
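The reversal in the pilot table can be checked directly. This sketch uses only the on-time counts given above; the variable and function names are illustrative.

```python
# (on-time flights, total flights) for each pilot and time of day,
# taken from the pilot table.
flights = {
    "Moe": {"Day": (90, 100), "Night": (10, 20)},
    "Jill": {"Day": (19, 20), "Night": (75, 100)},
}


def on_time_rate(on_time, total):
    """Percentage of flights landed on time."""
    return 100 * on_time / total


# Rates within each time of day.
by_condition = {
    pilot: {when: on_time_rate(*record) for when, record in records.items()}
    for pilot, records in flights.items()
}

# Overall rates pool day and night flights together.
overall = {
    pilot: on_time_rate(
        sum(on_time for on_time, _ in records.values()),
        sum(total for _, total in records.values()),
    )
    for pilot, records in flights.items()
}

for pilot in flights:
    day, night = by_condition[pilot]["Day"], by_condition[pilot]["Night"]
    print(f"{pilot:<5} day {day:.0f}%  night {night:.0f}%  overall {overall[pilot]:.1f}%")
```

Jill has the higher on-time rate both by day and by night, yet Moe has the higher overall rate (about 83.3% against 78.3%). The overall figures mislead because Moe flew mostly in the easier daytime conditions while Jill flew mostly at night.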

Pages 40–45: Problems 5, 7, 11, 13, 15, 19, 23, 25, 27, 35, 41, 45, 47.