COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.

Slides:



Advertisements
Similar presentations
Frequency Distributions Quantitative Methods in HPELS 440:210.
Advertisements

Chapter 2: Frequency Distributions
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 1-3 Peer Tutor Slides Instructor: Mr. Ethan W. Cooper, Lead Tutor © 2013.
Histograms, Frequency Polygons and Ogives
Chapter 1 Displaying the Order in a Group of Numbers
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
PRED 354 TEACH. PROBILITY & STATIS. FOR PRIMARY MATH Lesson 2 Variables, Scales of Measurement, Frequency Distribution and its Shape, Percentiles.
Lecture 2 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Frequency Distributions
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Frequency Distribution Ibrahim Altubasi, PT, PhD The University of Jordan.
Chapter 2 CREATING AND USING FREQUENCY DISTRIBUTIONS.
Chapter 3: Central Tendency
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Describing Data with Tables and Graphs.  A frequency distribution is a collection of observations produced by sorting observations into classes and showing.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Frequency Distributions and Percentiles
Frequency Distributions and Graphs
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Chapter 3: Central Tendency Peer Tutor Slides Instructor: Mr. Ethan W. Cooper, Lead Tutor.
Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately describes the center of the.
Chapter 1: Introduction to Statistics
Sample Distributions. Review Parameter vs. Statistic Parameter vs. Statistic Population and Sample Population and Sample Construct Construct Variables.
COURSE: JUST 3900 TIPS FOR APLIA Developed By: Ethan Cooper (Lead Tutor) John Lohman Michael Mattocks Aubrey Urwick Chapter 2: Frequency Distributions.
Graphs of Frequency Distribution Introduction to Statistics Chapter 2 Jan 21, 2010 Class #2.
Descriptive Statistics: Tabular and Graphical Methods
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 3 Organizing and Displaying Data.
Data Presentation.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Chapter 2: Frequency Distributions Peer Tutor Slides Instructor: Mr. Ethan W. Cooper, Lead.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Smith/Davis (c) 2005 Prentice Hall Chapter Four Basic Statistical Concepts, Frequency Tables, Graphs, Frequency Distributions, and Measures of Central.
Statistical Tools in Evaluation Part I. Statistical Tools in Evaluation What are statistics? –Organization and analysis of numerical data –Methods used.
Copyright © Cengage Learning. All rights reserved. 2 Descriptive Analysis and Presentation of Single-Variable Data.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
Chapter 2 Frequency Distributions
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Chapter 2 Describing Data.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Chapter 2 Frequency Distributions. Learning Outcomes Understand how frequency distributions are used 1 Organize data into a frequency distribution table…
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 2-2 Frequency Distributions.
Dr. Serhat Eren Other Uses for Bar Charts Bar charts are used to display data for different categories where the data are some kind of quantitative.
FREQUENCY DISTRIBUTIONS FREQUENCY DISTRIBUTIONS. FREQUENCY DISTRIBUTIONS   Overview.   Frequency Distribution Tables.   Frequency Distribution Graphs.
Chapter 2 EDRS 5305 Fall Descriptive Statistics  Organize data into some comprehensible form so that any pattern in the data can be easily seen.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Chapter 2Prepared by Samantha Gaies, M.A.1 Example: Based on a stress questionnaire given to N =151 students (from Aron, Paris, & Aron, 1995). Sample question:
Lecture 2.  A descriptive technique  An organized tabulation showing exactly how many individuals are located in each category on the scale of measurement.
Chapter 2: Frequency Distributions. 2 Control GroupExperimental Group Control Group Exam Score Experimental.
Frequency Distributions
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Chapter 2: Frequency Distributions. Frequency Distributions After collecting data, the first task for a researcher is to organize and simplify the data.
1 Frequency Distributions. 2 After collecting data, the first task for a researcher is to organize and simplify the data so that it is possible to get.
Introduction to statistics I Sophia King Rm. P24 HWB
Chapter 3: Central Tendency 1. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 4-6 Peer Tutor Slides Instructor: Mr. Ethan W. Cooper, Lead Tutor © 2013.
Chapter 2 Describing and Presenting a Distribution of Scores.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 2 Section 2 – Slide 1 of 37 Chapter 2 Section 2 Organizing Quantitative Data.
Chapter 2 Frequency Distributions PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Seventh Edition by Frederick J Gravetter.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
Frequency Distributions
Figure 2-7 (p. 47) A bar graph showing the distribution of personality types in a sample of college students. Because personality type is a discrete variable.
Chapter 2 Descriptive Statistics.
Chapter 2 Descriptive Statistics.
Frequency Distributions
Frequency Distributions
Chapter 2 Descriptive Statistics
An Introduction to Statistics
Experimental Design Experiments Observational Studies
Organizing, Displaying and Interpreting Data
Displaying the Order in a Group of Numbers Using Tables and Graphs
Presentation transcript:

COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology Chapter 2: Frequency Distributions © DO NOT CITE, QUOTE, REPRODUCE, OR DISSEMINATE WITHOUT WRITTEN PERMISSION FROM THE AUTHOR: Dr. John J. Kerbs can be ed for permission at

After collecting data, the first task for a researcher is to organize and simplify the data so that it is possible to get a general overview of the results. After collecting data, the first task for a researcher is to organize and simplify the data so that it is possible to get a general overview of the results. This is the goal of descriptive statistical techniques. This is the goal of descriptive statistical techniques. One method for simplifying and organizing data is to construct a frequency distribution. One method for simplifying and organizing data is to construct a frequency distribution. Frequency Distributions

A frequency distribution is an organized tabulation showing exactly how many individuals are located in each category on the scale of measurement. A frequency distribution presents an organized picture of the entire set of scores, and it shows where each individual is located relative to others in the distribution. A frequency distribution is an organized tabulation showing exactly how many individuals are located in each category on the scale of measurement. A frequency distribution presents an organized picture of the entire set of scores, and it shows where each individual is located relative to others in the distribution. Frequency Distributions (continued)

Frequency Distribution Tables A frequency distribution table consists of at least two columns - one listing categories on the scale of measurement (X) and another for frequency (f). A frequency distribution table consists of at least two columns - one listing categories on the scale of measurement (X) and another for frequency (f). In the X column, values are listed from the highest to lowest, without skipping any. In the X column, values are listed from the highest to lowest, without skipping any. For the frequency column, tallies are determined for each value (how often each X value occurs in the data set). These tallies are the frequencies for each X value. For the frequency column, tallies are determined for each value (how often each X value occurs in the data set). These tallies are the frequencies for each X value. The sum of the frequencies should equal N. The sum of the frequencies should equal N.

Frequency Distribution Tables (continued.) A third column can be used for the proportion (p) for each category: p = f/N. The sum of the p column should equal A third column can be used for the proportion (p) for each category: p = f/N. The sum of the p column should equal A fourth column can display the percentage of the distribution corresponding to each X value. The percentage is found by multiplying p by 100. The sum of the percentage column is 100%. A fourth column can display the percentage of the distribution corresponding to each X value. The percentage is found by multiplying p by 100. The sum of the percentage column is 100%.

Frequency Distribution Table: Example Xfp = f/N% = p(100) 511/10 = % 422/10 = % 333/10 = % 233/10 = % 111/10 = % NOTE: Σf = N = = 10

Regular Frequency Distribution When a frequency distribution table lists all of the individual categories (X values) it is called a regular frequency distribution. When a frequency distribution table lists all of the individual categories (X values) it is called a regular frequency distribution.

Grouped Frequency Distribution Sometimes, however, a set of scores covers a wide range of values. In these situations, a list of all the X values would be quite long - too long to be a “simple” presentation of the data. Sometimes, however, a set of scores covers a wide range of values. In these situations, a list of all the X values would be quite long - too long to be a “simple” presentation of the data. To remedy this situation, a grouped frequency distribution table is used. To remedy this situation, a grouped frequency distribution table is used.

Grouped Frequency Distribution (continued.) In a grouped table, the X column lists groups of scores, called class intervals, rather than individual values. In a grouped table, the X column lists groups of scores, called class intervals, rather than individual values. These intervals all have the same width, usually a simple number such as 2, 5, 10, and so on. These intervals all have the same width, usually a simple number such as 2, 5, 10, and so on. Each interval begins with a value that is a multiple of the interval width. The interval width is selected so that the table will have approximately ten intervals. Each interval begins with a value that is a multiple of the interval width. The interval width is selected so that the table will have approximately ten intervals.

Frequency Distribution Graphs In a frequency distribution graph, the score categories (X values) are listed on the X axis and the frequencies are listed on the Y axis. In a frequency distribution graph, the score categories (X values) are listed on the X axis and the frequencies are listed on the Y axis. When the score categories consist of numerical scores from an interval or ratio scale, the graph should be either a histogram or a polygon. When the score categories consist of numerical scores from an interval or ratio scale, the graph should be either a histogram or a polygon.

4 Guidelines for Grouped Frequency Distribution Tables 1. Limit to around 10 class intervals 1. Limit to around 10 class intervals 2. Interval width should be a simple number (e.g., 2, 5, 10, 20, etc) 2. Interval width should be a simple number (e.g., 2, 5, 10, 20, etc) 3. The bottom interval should be a multiple of the width 3. The bottom interval should be a multiple of the width For example, if you use a width of 5 points, the intervals should start with 5, 10, 15, 20, and so on For example, if you use a width of 5 points, the intervals should start with 5, 10, 15, 20, and so on 4. All intervals should be the same width 4. All intervals should be the same width Intervals should cover the range of scores without gaps and overlap. Intervals should cover the range of scores without gaps and overlap.

Histograms In a histogram, a bar is centered above each score (or class interval) so that the height of the bar corresponds to the frequency and the width extends to the real limits, so that adjacent bars touch. In a histogram, a bar is centered above each score (or class interval) so that the height of the bar corresponds to the frequency and the width extends to the real limits, so that adjacent bars touch. Apparent Limits versus Real Limits: Apparent Limits versus Real Limits: Apparent limits for class intervals (e.g., 50-59) are represented by the lowest score (50) and the highest score (59) in any given interval (the upper and lower boundaries for the class interval) Apparent limits for class intervals (e.g., 50-59) are represented by the lowest score (50) and the highest score (59) in any given interval (the upper and lower boundaries for the class interval) Real Limits are defined by the boundaries of the lowest and highest scores Real Limits are defined by the boundaries of the lowest and highest scores Example: Class Interval of for continuous variable Example: Class Interval of for continuous variable Lowest Score = 49.5 to 50.5 Lowest Score = 49.5 to 50.5 Highest Score = 58.5 to 59.5 Highest Score = 58.5 to 59.5 Thus, real limits for interval are 49.5 and 59.5 Thus, real limits for interval are 49.5 and 59.5

Histogram Examples For discrete variables For grouped data Indicates omitted scores

Polygons In a polygon, a dot is centered above each score so that the height of the dot corresponds to the frequency. The dots are then connected by straight lines. An additional line is drawn at each end to bring the graph back to a zero frequency. In a polygon, a dot is centered above each score so that the height of the dot corresponds to the frequency. The dots are then connected by straight lines. An additional line is drawn at each end to bring the graph back to a zero frequency.

Polygon Examples Mid-point for group with grouped data

Bar Graphs When the score categories (X values) are measurements from a nominal or an ordinal scale, the graph should be a bar graph. When the score categories (X values) are measurements from a nominal or an ordinal scale, the graph should be a bar graph. A bar graph is just like a histogram except that gaps or spaces are left between adjacent bars. A bar graph is just like a histogram except that gaps or spaces are left between adjacent bars.

Bar Graph Example

Relative Frequency Many populations are so large that it is impossible to know the exact number of individuals (frequency) for any specific category. Many populations are so large that it is impossible to know the exact number of individuals (frequency) for any specific category. In these situations, population distributions can be shown using relative frequency instead of the absolute number of individuals for each category. In these situations, population distributions can be shown using relative frequency instead of the absolute number of individuals for each category.

Relative Frequency Example

Smooth Curve If the scores in the population are measured on an interval or ratio scale, it is customary to present the distribution as a smooth curve rather than a jagged histogram or polygon. If the scores in the population are measured on an interval or ratio scale, it is customary to present the distribution as a smooth curve rather than a jagged histogram or polygon. The smooth curve emphasizes the fact that the distribution is not showing the exact frequency for each category. The smooth curve emphasizes the fact that the distribution is not showing the exact frequency for each category.

Smooth Curve Example

Frequency Distribution Graphs Frequency distribution graphs are useful because they show the entire set of scores. Frequency distribution graphs are useful because they show the entire set of scores. At a glance, you can determine the highest score, the lowest score, and where the scores are centered. At a glance, you can determine the highest score, the lowest score, and where the scores are centered. The graph also shows whether the scores are clustered together or scattered over a wide range. The graph also shows whether the scores are clustered together or scattered over a wide range.

Three Characteristics of Distribution Graphs 1. Shape - - i.e., form of distribution 1. Shape - - i.e., form of distribution 2. Central Tendency - - i.e., where the center of the distribution is located 2. Central Tendency - - i.e., where the center of the distribution is located 3. Variability - - i.e., the spread of scores, either over a wide range or clustered together in a small range 3. Variability - - i.e., the spread of scores, either over a wide range or clustered together in a small range

Shape A graph shows the shape of the distribution. A graph shows the shape of the distribution. A distribution is symmetrical if the left side of the graph is (roughly) a mirror image of the right side. A distribution is symmetrical if the left side of the graph is (roughly) a mirror image of the right side. One example of a symmetrical distribution is the bell-shaped normal distribution. One example of a symmetrical distribution is the bell-shaped normal distribution. On the other hand, distributions are skewed when scores pile up on one side of the distribution, leaving a "tail" of a few extreme values on the other side. On the other hand, distributions are skewed when scores pile up on one side of the distribution, leaving a "tail" of a few extreme values on the other side.

Positively and Negatively Skewed Distributions In a positively skewed distribution, the scores tend to pile up on the left side of the distribution with the tail tapering off to the right. In a positively skewed distribution, the scores tend to pile up on the left side of the distribution with the tail tapering off to the right. In a negatively skewed distribution, the scores tend to pile up on the right side and the tail points to the left. In a negatively skewed distribution, the scores tend to pile up on the right side and the tail points to the left.

Symmetrical and Skewed Distributions: Examples Tail of Distribution

Percentiles, Percentile Ranks, and Interpolation The relative location of individual scores within a distribution can be described by percentiles and percentile ranks. The relative location of individual scores within a distribution can be described by percentiles and percentile ranks. The percentile rank for a particular X value is the percentage of individuals with scores equal to or less than that X value. The percentile rank for a particular X value is the percentage of individuals with scores equal to or less than that X value. When an X value is described by its rank, it is called a percentile. When an X value is described by its rank, it is called a percentile.

Percentiles, Percentile Ranks, and Interpolation (continued.) To find percentiles and percentile ranks, two new columns are placed in the frequency distribution table: One is for cumulative frequency (cf) and the other is for cumulative percentage (c%). To find percentiles and percentile ranks, two new columns are placed in the frequency distribution table: One is for cumulative frequency (cf) and the other is for cumulative percentage (c%). Each cumulative percentage identifies the percentile rank for the upper real limit of the corresponding score or class interval. Each cumulative percentage identifies the percentile rank for the upper real limit of the corresponding score or class interval.

Percentiles versus Percentile Ranks Xfcfc% % % % 24630% 12210% = 14(14/20)*100 = 70%

Interpolation When scores or percentages do not correspond to upper real limits or cumulative percentages, you must use interpolation to determine the corresponding ranks and percentiles. When scores or percentages do not correspond to upper real limits or cumulative percentages, you must use interpolation to determine the corresponding ranks and percentiles. Interpolation is a mathematical process based on the assumption that the scores and the percentages change in a regular, linear fashion as you move through an interval from one end to the other. Interpolation is a mathematical process based on the assumption that the scores and the percentages change in a regular, linear fashion as you move through an interval from one end to the other.

Interpolation Example

4 Steps for Interpolation Step 1: Find the width of the interval on both scales Step 1: Find the width of the interval on both scales Step 2: Locate position of intermediate value in the interval based upon the respective fraction of the whole interval Step 2: Locate position of intermediate value in the interval based upon the respective fraction of the whole interval Fraction = Distance from top of interval / Width Fraction = Distance from top of interval / Width Step 3: Use same fraction to determine corresponding position on other scale. First, determine the distance from the top of the interval Step 3: Use same fraction to determine corresponding position on other scale. First, determine the distance from the top of the interval Distance = Fraction x Width Distance = Fraction x Width Step 4: Use the distance from the top to determine the position on the other scale Step 4: Use the distance from the top to determine the position on the other scale

Interpolation Example Notice that the 50 th percentile is between 10% and 60%, with associated upper real limits of 4.5 and 9.5, respectively. Notice that the 50 th percentile is between 10% and 60%, with associated upper real limits of 4.5 and 9.5, respectively. Xfcfc% % % % % % Using the distribution of scores below, please locate the 50 th percentile.

Interpolation Example (continued) Scores (X)Percentages Top9.560% ???50% Intermediate Value Bottom4.510% Step 1: Find the width of the interval on both scales Step 1: Find the width of the interval on both scales 5 and 50 points, respectively 5 and 50 points, respectively Step 2: Locate position of intermediate value Step 2: Locate position of intermediate value 50% is located 10 points from top (10/50 = 1/5 of interval) 50% is located 10 points from top (10/50 = 1/5 of interval) Step 3: Use same fraction to determine corresponding position on other scale. First, determine the distance from the top of the interval Step 3: Use same fraction to determine corresponding position on other scale. First, determine the distance from the top of the interval Distance = Fraction x Width = (1/5) * (5 points) = 1 Point Distance = Fraction x Width = (1/5) * (5 points) = 1 Point Step 4: Use distance from top to determine the position on the other scale Step 4: Use distance from top to determine the position on the other scale 9.5 – 1 = – 1 = 8.5 Thus, 50 th percentile for X is 8.5 Thus, 50 th percentile for X is 8.5

Stem-and-Leaf Displays A stem-and-leaf display provides an efficient method for obtaining and displaying a frequency distribution. A stem-and-leaf display provides an efficient method for obtaining and displaying a frequency distribution. Each score is divided into a stem consisting of the first digit or digits, and a leaf consisting of the final digit. Each score is divided into a stem consisting of the first digit or digits, and a leaf consisting of the final digit. Then, go through the list of scores, one at a time, and write the leaf for each score beside its stem. Then, go through the list of scores, one at a time, and write the leaf for each score beside its stem.

Stem-and-Leaf Displays (continued.) The resulting display provides an organized picture of the entire distribution. The number of leaves beside each stem corresponds to the frequency, and the individual leaves identify the individual scores. The resulting display provides an organized picture of the entire distribution. The number of leaves beside each stem corresponds to the frequency, and the individual leaves identify the individual scores.

Stem-and-Leave Displays: Example