Chapter 3: Organizing Data. Raw data is useless to us unless we can meaningfully organize and summarize it (descriptive statistics). Organization techniques.

Slides:



Advertisements
Similar presentations
Chapter 3 Graphic Methods for Describing Data. 2 Basic Terms  A frequency distribution for categorical data is a table that displays the possible categories.
Advertisements

2- 1 Chapter Two McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Chapter 2 Describing Data: Graphical
Chapter 2 Presenting Data in Tables and Charts
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 2-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Ch. 2: The Art of Presenting Data Data in raw form are usually not easy to use for decision making. Some type of organization is needed Table and Graph.
Chapter 2 Graphs, Charts, and Tables – Describing Your Data
Chapter 2 Describing Data Sets
Business 90: Business Statistics
PPA 415 – Research Methods in Public Administration Lecture 2 - Counting and Charting Responses.
Descriptive statistics (Part I)
Ka-fu Wong © 2003 Chap 2-1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.
Basic Descriptive Statistics Healey, Chapter 2
Frequency Distributions and Graphs
Descriptive Statistics
2.1: Frequency Distributions and Their Graphs. Is a table that shows classes or intervals of data entries with a count of the number of entries in each.
Frequency Distributions and Their Graphs
CHAPTER 2 Frequency Distributions and Graphs. 2-1Introduction 2-2Organizing Data 2-3Histograms, Frequency Polygons, and Ogives 2-4Other Types of Graphs.
STATISTICAL GRAPHS.
Frequency Distributions and Graphs
Frequency Distributions and Graphs
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 2-1 What is a Frequency Distribution? A frequency distribution is a list or a.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation Chapter 2.
2- 1 Chapter Two McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Copyright © 2012 by Nelson Education Limited.2-1 Chapter 2 Basic Descriptive Statistics: Percentages, Ratios and Rates, Tables, Charts, and Graphs.
© Copyright McGraw-Hill CHAPTER 2 Frequency Distributions and Graphs.
Chapter 2 Summarizing and Graphing Data Sections 2.1 – 2.4.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 3 Organizing and Displaying Data.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation Chapter 3.
Data Presentation.
Copyright © 2015 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 C H A P T E R T W O Frequency Distributions and Graphs.
Basic Descriptive Statistics Percentages and Proportions Ratios and Rates Frequency Distributions: An Introduction Frequency Distributions for Variables.
Unit 2 Sections 2.1.
Lecture 2 Graphs, Charts, and Tables Describing Your Data
Organizing Quantitative Data: The Popular Displays
Frequency Distribution
Copyright © Cengage Learning. All rights reserved. 2 Descriptive Analysis and Presentation of Single-Variable Data.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
Statistics Unit 2: Organizing Data Ms. Hernandez St. Pius X High School
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 2-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 3 Graphical Methods for Describing Data.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 2-1 Chapter 2 Presenting Data in Tables and Charts Statistics For Managers 4 th.
McGraw-Hill/ Irwin © The McGraw-Hill Companies, Inc., 2003 All Rights Reserved. 2-1 Chapter Two Describing Data: Frequency Distributions and Graphic Presentation.
Chap 2-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course in Business Statistics 4 th Edition Chapter 2 Graphs, Charts, and Tables.
Chapter 11 Data Descriptions and Probability Distributions Section 1 Graphing Data.
DESCRIPTIVE STATISTICS
Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation Chapter 2.
© Copyright McGraw-Hill CHAPTER 2 Frequency Distributions and Graphs.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation Chapter 2.
Chapter 2 Frequency Distributions and Graphs 1 Copyright © 2012 The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Educational Research Descriptive Statistics Chapter th edition Chapter th edition Gay and Airasian.
Chapter 2 Summarizing and Graphing Data  Frequency Distributions  Histograms  Statistical Graphics such as stemplots, dotplots, boxplots, etc.  Boxplots.
Chapter 4: Measures of Central Tendency. Measures of central tendency are important descriptive measures that summarize a distribution of different categories.
Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation Chapter 2.
Relative Cumulative Frequency Graphs
Honors Statistics Chapter 4 Part 1
Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation Chapter 2.
Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation Chapter 2.
Descriptive Statistics
Frequency Distributions and Graphs
Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation Chapter 2.
Sexual Activity and the Lifespan of Male Fruitflies
Frequency Distributions and Graphs
Descriptive Statistics
Descriptive Statistics
Organizing, Displaying and Interpreting Data
Descriptive Statistics
Frequency Distributions
Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation Chapter 2.
Presentation transcript:

Chapter 3: Organizing Data

Raw data is useless to us unless we can meaningfully organize and summarize it (descriptive statistics). Organization techniques include: –T–Tables such as frequency distributions –G–Graphs such as histograms, bar graphs, line graphs, pie charts, stem-and-leaf plots, and scatterplots

Frequency Distribution A frequency distribution is a table that lists all the categories or values of a variable as well as the corresponding number of occurrences or responses for each category or value of the variable (its frequency, or how often the category occurs). Frequency distributions can be used for both categorical variables (nominal or ordinal) and numerical variables (interval or ratio).

To create a frequency distribution for categorical data: First create a list of all the categories or values of the variable and then count the number of times each different category or value occurred in the data. Then find the percentage of respondents for each category. Set up your basic table to have 3 columns: (1) the list of categories or the values of the variable of interest, (2) the frequency count, and (3) the percentage. When dealing with ordinal variables, make sure the categories are ranked (lowest to highest or highest to lowest).

Examples of frequency distributions for nominal variables:

Examples of frequency distributions for ordinal variables:

Statistical Software There are many statistical software packages that exist. Most of the output looks the same, so reading the output will be similar amongst programs. The textbook shows output from SPSS (Statistical Package for the Social Sciences). Some other useful programs are StatCrunch, SAS (Statistical Analytics System), and even Excel, amongst many others. Let’s look at SPSS output for a frequency distribution.

Notice this is similar to our frequency distributions, but has a few extra columns. Percent is calculated the same way it was calculated prior. Valid percent accounts for the missing data. It divides the frequency by the total minus the missing data (1485 for this example). Cumulative percent is a running total (based on valid %).

It is important to be able to utilize the frequency distribution to interpret the data and answer questions. Utilize the valid percent column when answer percent questions.

The two common ways to represent frequency distributions of categorical data are bar graphs and pie charts. For a bar graph, place the categories on the horizontal axis and either the frequency or the percent on the vertical axis.

For a pie chart, make sure each sector is labeled, appropriately sized, and contains the percent.

Simple Frequency Distribution for Numerical Data Frequency distributions for numerical data are either simple (the individual values are displayed with their frequencies) or grouped (list grouped frequencies, or equal sized classes). Grouped frequency distributions are used when we have a large number of observations.

Examples:

To construct a simple frequency distribution: 1)Find the lowest and highest numbers. 2)In column form, write in ascending order all the consecutive numbers from the lowest to highest. 3)Count the frequency.

Example: Construct a simple frequency distribution

In a grouped frequency, the numbers are usually grouped into equal-sized ranges called class intervals. Each class interval contains a lower class limit and an upper class limit. The class width is how wide each interval is, and should usually be equal amongst intervals. If the data contains an extremely small or large value, it might not be possible to have intervals of equal width. In this case, use an open-end class interval.

Example: Identify the class width for the following class intervals.

The class mid-point is the average of the lower and upper class limits. The mid-point of a class interval would be: What is the class mid-point for a class interval of 20-40?

Steps to Construct a Grouped Frequency Table 1)Find the highest and lowest values in the dataset and subtract to find the range. 2)Decide on the desired number of classes (it should be between 5 and 20) and then compute the class width by dividing the range by the desired number of classes. Note: There is no clear right or wrong answer for the number of classes. 3)Select a starting point (lower class limit). Use either the lowest number or a convenient number slightly lower than the lowest score. 4)Add the class width to the starting point to get the second lower limit. List the lower limits in a vertical column and enter the corresponding upper limits. Then fill in the values.

Example: The daily high temperature in degrees F for the month of July in Carucciville was as follows: Construct a grouped frequency distribution for the data.

Note: There are 8 classes instead of the desired 7 due to rounding.

We can also find the relative frequencies for each class.

Sometimes, if a variable is continuous, the values recorded in the study may be rounded off. In these instances, we want to know the real limits or class boundaries. The real limits of a continuous variable are usually the values that are above or below the recorded value by one-half of the place value to which the numbers were rounded. Example: Say we are examining height and rounding to the nearest centimeter. If 130 cm is recorded, the real limits are cm, because anything in those limits would result in us recording 130 cm.

Example: Find the real class limits or the class boundaries for the following class intervals:

A histogram is a graph in which the areas in the form of vertical bars represent the frequency of occurrence in a distribution of scores.

A relative frequency histogram has the same shape as a histogram, but the vertical scale is marked with relative frequencies instead of actual frequencies.