X y Exploratory data analysis Cross tabulations and scatter diagrams.

Slides:



Advertisements
Similar presentations
Microsoft Office XP Microsoft Excel
Advertisements

1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Excel Charts – Basic Skills Creating Charts in Excel.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 5 Analyzing.
Microsoft Excel 2010 Chapter 7
1 1 Slide © University of Minnesota-Duluth, Summer-2009 Econ-2030(Dr. Tadesse) Chapter-2: Descriptive Statistics: Tabular and Graphical Presentations Part.
1/54 Statistics Descriptive Statistics— Tables and Graphics.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Descriptive Statistics Summarizing qualitative data Summarizing quantitative data.
XP New Perspectives on Microsoft Excel 2002 Tutorial 1 1 Microsoft Excel.
Chapter 12a Simple Linear Regression
First-Year Engineering Program 1 Autumn 2009 Graphing with Microsoft Excel Lecture 11 Engineering H191 Engineering Fundamentals and Laboratory.
1 1 Slide © 2006 Thomson/South-Western Chapter 2 Descriptive Statistics: Tabular and Graphical Presentations Part A n Summarizing Qualitative Data n Summarizing.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
X y Exploratory data analysis Cross tabulations and scatter diagrams.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Summarizing Quantitative Data Frequency Distribution Relative Frequency and Percent Frequency Distributions Histogram Cumulative Distributions Ogive.
© 2006 by Thomson Learning, a division of Thomson Asia Pte Ltd.. 1 Slide Slide Slides Prepared by Juei-Chao Chen Fu Jen Catholic University Slides Prepared.
1 1 Slide 統計學 Fall 2003 授課教師:統計系余清祥 日期: 2003 年 9 月 23 日 第二週:敘述性統計量.
Econ 3790: Business and Economics Statistics
Scatter Diagrams Summer Cadre What is a Scatter Diagram? A Scatter Diagram is used to confirm a relationship between two variables. A Scatter Diagram.
Tutorial 5: Working with Excel Tables, PivotTables, and PivotCharts
Instructor: Professor Cora Martinez, PhD Department of Civil and Environmental Engineering Florida International University.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
© 2016 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
1 1 Slide © 2006 Thomson/South-Western Chapter 2 Descriptive Statistics: Tabular and Graphical Presentations Part B n Exploratory Data Analysis n Crosstabulations.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
STATISTICS Microsoft Excel “Frequency Distribution”
1 1 Slide Simple Linear Regression Part A n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
This is the first sheet of a spreadsheet workbook. The workbook begins initially with 3 work sheets. A spreadsheet is made of columns and rows. The intersection.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Demographic Profiles of Agency Clients - Part 2 Next, we will create a table and a column chart for the conservator field in my database. Because we are.
Microsoft ® Office Excel 2007 Working with Charts.
1 Excel Lesson 8 Working with Charts Microsoft Office 2010 Introductory Pasewark & Pasewark.
Honors Physics Lab Journals Graphing. Lab Journal Entries  Number pages in consecutive order  Date all entries  Title all activities and investigations.
BIA 2610 – Statistical Methods Chapter 2 – Descriptive Statistics: Tabular and Graphical Displays.
1 1 Slide © 2005 Thomson/South-Western Introduction to Statistics Chapter 2 Descriptive Statistics.
1. Chapter 15 Creating Charts 3 Charting Data in Word A chart or graph presents data visually. A chart depicts numeric data in a graphical format. If.
Chapter 2 – Descriptive Statistics
Chapter 2, Part A Descriptive Statistics: Tabular and Graphical Presentations n Summarizing Categorical Data n Summarizing Quantitative Data Categorical.
XP. Objectives Sort data and filter data Summarize an Excel table Insert subtotals into a range of data Outline buttons to show or hide details Create.
1 1 Slide © 2005 Thomson/South-Western OPIM 303-Lecture #1 Jose M. Cruz Assistant Professor.
EXCEL CHAPTER 6. OBJECTIVES Create a PivotTable Change the values field Modify and Format PivotTable Create a PivotChart 2.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
LSP 120: Quantitative Reasoning and Technological Literacy Topic 1: Introduction to Quantitative Reasoning and Linear Models Lecture Notes 1.3 Prepared.
LOGO Chapter 4 Charts 1. LOGO What is a Chart?  Charts are visual representations of worksheet data. Charts often makes it easier to understand the data.
1 1 Slide Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Copyright © 2005 by Nelson, a division of Thomson Canada Limited 14-0 EXCEL CHAPTER 14 PHILIP BEDIENT.
Excel Part 4 Working with Charts and Graphics. XP Objectives Create an embedded chart Work with chart titles and legends Create and format a pie chart.
Excel Part 4 Working with Charts and Graphics. XP Objectives Create an embedded chart Work with chart titles and legends Create and format a pie chart.
Fundamentals of Business Statistics chapter2 descriptive statistics: tabular and graphical presentations.
Chapter 8 Working with Trendlines, PivotTable Reports, PivotChart Reports, and Slicers Microsoft Excel 2013.
Excel Lesson 8 Working with Charts
Summarizing Categorical Data
Chapter 2 Descriptive Statistics
Chapter 2 Descriptive Statistics: Tabular and Graphical Methods
Analyzing Bivariate Data
Essentials of Statistics for Business and Economics (8e)
Excel Lesson 8 Working with Charts
Microsoft Excel 2007 – Level 2
Fu Jen Catholic University
Presentation transcript:

x y Exploratory data analysis Cross tabulations and scatter diagrams

Exploratory data analysis consists of simple arithmetic and easy-to-draw graphs that can be used to summarize data quickly

The Stem and Leaf Display A stem-and-leaf display shows both the rank order A stem-and-leaf display shows both the rank order and shape of the distribution of the data. and shape of the distribution of the data. It is similar to a histogram on its side, but it has the It is similar to a histogram on its side, but it has the advantage of showing the actual data values. advantage of showing the actual data values. The first digits of each data item are arranged to theThe first digits of each data item are arranged to the left of a vertical line. left of a vertical line. To the right of the vertical line we record the lastTo the right of the vertical line we record the last digit for each item in rank order. digit for each item in rank order.

Example: Hudson Auto Repair The manager of Hudson Auto would like to have a better understanding of the cost of parts used in the engine tune-ups performed in the shop. She examines 50 customer invoices for tune-ups. The costs of parts, rounded to the nearest dollar, are listed on the next slide.

Stretched Stem and Leaf If we believe the original stem-and-leaf display has condensed the data too much, we can stretch the display by using two stems for each leading digit(s).If we believe the original stem-and-leaf display has condensed the data too much, we can stretch the display by using two stems for each leading digit(s). Whenever a stem value is stated twice, the first value corresponds to leaf values of 0 - 4, and the second value corresponds to leaf values of Whenever a stem value is stated twice, the first value corresponds to leaf values of 0 - 4, and the second value corresponds to leaf values of

Sample parts cost for 50 tune-ups

A Stem and Leaf Display for the Auto Parts Cost data Stem Leaf

Stretched Stem and Leaf for Hudson Auto parts data

Leaf Units A single digit is used to define each leaf. In the preceding example, the leaf unit was 1. But it does not have to be 1. The leaf unit can be 0.1, 10, or 100.

Example: Leaf unit =.1 Suppose we have the following data: The leaf unit is.1. Thus:

Example: Leaf Unit = 10 If we have data with values such as Leaf Unit = a stem-and-leaf display of these data will be The 82 in 1682 is rounded down to 80 and is represented as an 8.

Crosstabulations and Scatter Diagrams So far we have considered only ONE variable (parts cost, audit time). But often we are interested in tabular and graphical data that uncover the relationship between TWO variables.

Crosstabulations A tabular method for summarizing the data for two variables simultaneously Crosstabulations can be used when one variable is qualitative and the other is quantitative, one variable is qualitative and the other is quantitative, both variables are qualitative, or both variables are qualitative, or both variables are quantitative. both variables are quantitative.

Price Range Colonial Log Split Split A-Frame Total < $99,000 > $99, Total Home Style Home Style Example: Finger Lakes Homes n Crosstabulation The number of Finger Lakes homes sold for each style and price for the past two years is shown below. quantitative variable variable qualitative

Crosstabulation: Row or Column Percentages Converting the entries in the table into row percentages or column percentages can provide additional insight about the relationship between the two variables.

Price Range Colonial Log Split A-Frame Total < $99,000 > $99, Note: row totals are actually due to rounding Home Style Home Style (Colonial and > $99K)/(All >$99K) x 100 = (12/45) x 100 Crosstabulation: Row Percentages

Price Range Colonial Log Split A-Frame < $99,000 > $99, Home Style Home Style Total (Colonial and > $99K)/(All Colonial) x 100 = (12/30) x 100 Crosstabulation: Column Percentages

Using Excel’s PivotTable Report to Construct a Crosstabulation Step 3 When the Create Pivot Table dialog box appears: Choose Select a table or range Enter A1:C301 in the Table/Range box Select New Worksheet Click OK Step 2: In the Tables group, click the icon above PivotTable Step 1: Click on the Insert tab on the ribbon Chapter 2 file Restaurant.xlsx

Using the Pivot table Field List Step 1: In the PivotTable Field List, go to Choose Fields to add to report: –Drag the Quality Rating Field to the Row Labels area. –Drag the ($)Meal Price field to the Column Labels area. –Drag the Restaurant field to the Values area. Step 2: Click Sum of Restaurant in the Values area –Select Value Field Settings. Step 3: When the Value Field Settings dialog box appears: –Under Summarize value field by, choose Count –Click OK

Finalizing the PivtotTable Report Step 1: Right-click in cell B4 (or any other cell containing meal prices) –Select Group Step 2: When the Grouping dialog box appears: –Enter 10 in the Starting at box –Enter 49 in the Ending at box –Enter 10 in the By box Step 3: Right-click on Excellent in cell A5 –Choose Move –Select Move “Excellent” to End Step 4: Close the PivotTable Field List box

Crosstabulation for the LA Restaurant Example Meal Price ($) Quality Rating Grand Total Good Very Good Excellent Grand Total Chapter 2 file Restaurant.xlsx

Crosstabulation: Simpson’s Paradox Simpson’ Paradox: In some cases the conclusions Simpson’ Paradox: In some cases the conclusions based upon an aggregated crosstabulation can be based upon an aggregated crosstabulation can be completely reversed if we look at the unaggregated completely reversed if we look at the unaggregated data. data. We must be careful in drawing conclusions about the We must be careful in drawing conclusions about the relationship between the two variables in the relationship between the two variables in the aggregated crosstabulation. aggregated crosstabulation. Data in two or more crosstabulations are often Data in two or more crosstabulations are often aggregated to produce a summary crosstabulation. aggregated to produce a summary crosstabulation.

Judge Verdict KendallLuckettTotal Upheld Reversed 129 (86%) 21 (14%) 110 (88%) 15 (12%) Total (%) 150 (100%)125 (100%)275 You might think Luckett is the better Judge. However, a larger share of Kendall’s cases were in municipal court—where the likelihood of being overturned on appeal is higher.

The general pattern of the plotted points suggests the The general pattern of the plotted points suggests the overall relationship between the variables. overall relationship between the variables. One variable is shown on the horizontal axis and the One variable is shown on the horizontal axis and the other variable is shown on the vertical axis. other variable is shown on the vertical axis. A scatter diagram is a graphical presentation of the A scatter diagram is a graphical presentation of the relationship between two quantitative variables. relationship between two quantitative variables. Scatter Diagram and Trendline A trendline is an approximation of the relationship. A trendline is an approximation of the relationship.

A Positive Relationship Y X 0

A Negative Relationship Y X 0

No Apparent Relationship Y X 0

Example: Panthers Football Team Scatter Diagram The Panthers football team is interested in investigating the relationship, if any, between interceptions made and points scored x = Number of Interceptions y = Number of Points Scored Points Scored

Scatter Diagram y x Number of Interceptions Number of Points Scored

n Insights Gained from the Preceding Scatter Diagram The relationship is not perfect; all plotted points in The relationship is not perfect; all plotted points in the scatter diagram are not on a straight line. the scatter diagram are not on a straight line. Higher points scored are associated with a higher Higher points scored are associated with a higher number of interceptions. number of interceptions. The scatter diagram indicates a positive relationship The scatter diagram indicates a positive relationship between the number of interceptions and the between the number of interceptions and the number of points scored. number of points scored. Example: Panthers Football Team

Using Excel’s Chart Wizard to Construct a Scatter Diagram and Trendline n Formula Worksheet (showing data entered)

Step 1 Select cells A1:B6 Step 2 Click the Chart Wizard button on standard toolbar Step 3 When the Chart Wizard - Step 1 of 4 - Chart Type dialog box appears: dialog box appears: Choose XY (Scatter) in the Chart Type list Choose XY (Scatter) in the Chart Type list Choose Scatter from the Chart subtype display Choose Scatter from the Chart subtype display Click Next > Click Next > Using Excel’s Chart Wizard to Construct a Scatter Diagram... continue

Using Excel’s Chart Wizard to Construct a Scatter Diagram Step 4 When the Chart Wizard - Step 2 of 4 - Chart Source Data dialog box appears: Source Data dialog box appears: Click Next > Click Next >... continue

Using Excel’s Chart Wizard to Construct a Scatter Diagram Step 5 When the Chart Wizard - Step 3 of 4 – Chart Options dialog box appears: Options dialog box appears: Select the Titles tab and then Select the Titles tab and then Type Scatter Diagram for the Panthers Type Scatter Diagram for the Panthers in the Chart title: box in the Chart title: box Type Number of Interceptions in the Type Number of Interceptions in the Value (X) axis: box Value (X) axis: box Type Number of Points Scored in the Type Number of Points Scored in the Value (Y) axis: box Value (Y) axis: box... continue

Step 5 (continued) Select the Legend tab and then Select the Legend tab and then Remove the check in the Show Legend box Click Next > Click Next > Using Excel’s Chart Wizard to Construct a Scatter Diagram Step 6 When the Chart Wizard – Step 4 of 4 - Chart Location dialog box appears: Location dialog box appears: Specify a location for the new chart Specify a location for the new chart Click Finish Click Finish

Using Excel’s Chart Wizard to Construct a Scatter Diagram

n Adding a Trendline Using Excel’s Chart Wizard to Construct a Scatter Diagram and Trendline Step 3 When the Add Trendline dialog box appears: Select the Type tab and then Select the Type tab and then Choose Linear from the Trend/ Choose Linear from the Trend/ Regression type display Regression type display Click OK Click OK Step 2 Choose the Add Trendline option Step 1 Position the mouse pointer over any data point in the scatter diagram and right click point in the scatter diagram and right click

Using Excel’s Chart Wizard to Construct a Scatter Diagram and Trendline

Scatter Diagram for the Stereo and Sound Equipment Store Example

Scatter Diagram for the Stereo and Sound Equipment Store Example—with a Trendline