STA 291 Spring 2008 Lecture 3 Dustin Lueker.

Slides:



Advertisements
Similar presentations
Displaying Data Objectives: Students should know the typical graphical displays for the different types of variables. Students should understand how frequency.
Advertisements

Chapter 2: Frequency Distributions
Chapter Two Organizing and Summarizing Data
Copyright © 2014 Pearson Education, Inc. All rights reserved Chapter 2 Picturing Variation with Graphs.
LECTURE 5 5 FEBRUARY 2009 STA291 Fall Itinerary 2.3 Graphical Techniques for Interval Data (mostly review) 2.4 Describing the Relationship Between.
QMS 6351 Statistics and Research Methods Chapter 2 Descriptive Statistics: Tabular and Graphical Methods Prof. Vera Adamchik.
QM Spring 2002 Statistics for Decision Making Descriptive Statistics.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Welcome to Data Analysis and Interpretation
Copyright © 2012 by Nelson Education Limited.2-1 Chapter 2 Basic Descriptive Statistics: Percentages, Ratios and Rates, Tables, Charts, and Graphs.
Chapter 2 Presenting Data in Tables and Charts. 2.1 Tables and Charts for Categorical Data Mutual Funds –Variables? Measurement scales? Four Techniques.
Descriptive Statistics: Tabular and Graphical Methods
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
LECTURE 5 10 SEPTEMBER 2009 STA291 Fall Itinerary Graphical Techniques for Interval Data (mostly review) Describing the Relationship Between Two.
STAT 211 – 019 Dan Piett West Virginia University Lecture 1.
LECTURE 4 3 FEBRUARY 2008 STA 291 Fall Administrative (Review) 5.3 Sampling Plans 2.2 Graphical and Tabular Techniques for Nominal Data 2.3 Graphical.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
Lecture 3 Dustin Lueker. 2  Suppose the population can be divided into separate, non-overlapping groups (“strata”) according to some criterion ◦ Select.
Lecture 3 Dustin Lueker.  Simple Random Sampling (SRS) ◦ Each possible sample has the same probability of being selected  Stratified Random Sampling.
LECTURE 6 15 SEPTEMBER 2009 STA291 Fall Review: Graphical/Tabular Descriptive Statistics Summarize data Condense the information from the dataset.
Chapter 2 Describing Data.
Statistics Unit 2: Organizing Data Ms. Hernandez St. Pius X High School
STA Lecture 51 STA 291 Lecture 5 Chap 4 Graphical and Tabular Techniques for categorical data Graphical Techniques for numerical data.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
BUSINESS STATISTICS I Descriptive Statistics & Data Collection.
Essential Statistics Chapter 11 Picturing Distributions with Graphs.
© Copyright McGraw-Hill CHAPTER 2 Frequency Distributions and Graphs.
Lecture 2.  A descriptive technique  An organized tabulation showing exactly how many individuals are located in each category on the scale of measurement.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Sampling ‘Scientific sampling’ is random sampling Simple random samples Systematic random samples Stratified random samples Random cluster samples What?
1 Frequency Distributions. 2 After collecting data, the first task for a researcher is to organize and simplify the data so that it is possible to get.
Copyright 2011 by W. H. Freeman and Company. All rights reserved.1 Introductory Statistics: A Problem-Solving Approach by Stephen Kokoska Chapter 2 Tables.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Descriptive Statistics – Graphic Guidelines Pie charts – qualitative variables, nominal data, eg. ‘religion’ Bar charts – qualitative or quantitative variables,
Objectives Organize discrete data in tables Construct histograms of discrete data Organize continuous data in tables Construct histograms of continuous.
Descriptive Statistics: Tabular and Graphical Methods
Descriptive Statistics: Tabular and Graphical Methods
Advanced Quantitative Techniques
Chapter 12 Statistics 2012 Pearson Education, Inc.
Chapter 12 Statistics.
Chapter(2) Frequency Distributions and Graphs
MAT 135 Introductory Statistics and Data Analysis Adjunct Instructor
Frequency Distributions and Graphs
Chapter 2 Frequency Distribution and Graph
3.2 Picturing Distributions of Data
Chapter 2 Descriptive Statistics
Unit 4 Statistical Analysis Data Representations
Basic Statistics.
CHAPTER 1: Picturing Distributions with Graphs
Frequency Distributions and Graphs
THE STAGES FOR STATISTICAL THINKING ARE:
Chapter 2 Organizing data
STA 291 Summer 2008 Lecture 4 Dustin Lueker.
Descriptive Statistics
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 3rd Edition
Descriptive Statistics
Descriptive Statistics
Constructing and Interpreting Visual Displays of Data
Experimental Design Experiments Observational Studies
DESIGN OF EXPERIMENT (DOE)
STA 291 Spring 2008 Lecture 4 Dustin Lueker.
Organizing, Displaying and Interpreting Data
Descriptive Statistics
Displaying Distributions with Graphs
Frequency Distribution and Graphs
Presentation transcript:

STA 291 Spring 2008 Lecture 3 Dustin Lueker

Stratified Sampling Suppose the population can be divided into separate, non-overlapping groups (“strata”) according to some criterion Select a simple random sample independently from each group Usefulness We may want to draw inference about population parameters for each subgroup Sometimes, (“proportional stratified sample”) estimators from stratified random samples are more precise than those from simple random samples STA 291 Spring 2008 Lecture 3 2

Proportional Stratification The proportions of the different strata are the same in the sample as in the population Mathematically Population size N Subpopulation Ni Sample size n Subpopulation ni STA 291 Spring 2008 Lecture 3 3

Descriptive Statistics Summarize data Condense the information from the dataset Graphs Table Numbers Interval data Histogram Nominal/Ordinal data Bar chart Pie chart STA 291 Spring 2008 Lecture 3

Data Table: Murder Rates Alabama 11.6 Alaska 9.0 Arizona 8.6 Arkansas 10.2 California 13.1 Colorado 5.8 Connecticut 6.3 Delaware 5.0 D C 78.5 Florida 8.9 Georgia 11.4 Hawaii 3.8 … Difficult to see the “big picture” from these numbers We want to try to condense the data STA 291 Spring 2008 Lecture 3

Frequency Distribution A listing of intervals of possible values for a variable Together with a tabulation of the number of observations in each interval. STA 291 Spring 2008 Lecture 3

Frequency Distribution Murder Rate Frequency 0-2.9 5 3-5.9 16 6-8.9 12 9-11.9 12-14.9 4 15-17.9 18-20.9 1 >21 Total 51 STA 291 Spring 2008 Lecture 3

Frequency Distribution Conditions for intervals Equal length Mutually exclusive Any observation can only fall into one interval Collectively exhaustive All observations fall into an interval Rule of thumb: If you have n observations then the number of intervals should approximately STA 291 Spring 2008 Lecture 3

Relative Frequencies Relative frequency for an interval Proportion of sample observations that fall in that interval Sometimes percentages are preferred to relative frequencies STA 291 Spring 2008 Lecture 3

Frequency, Relative Frequency, and Percentage Distribution Murder Rate Frequency Relative Frequency Percentage 0-2.9 5 .10 10 3-5.9 16 .31 31 6-8.9 12 .24 24 9-11.9 12-14.9 4 .08 8 15-17.9 18-20.9 1 .02 2 >21 Total 51 100 STA 291 Spring 2008 Lecture 3

Frequency Distributions Notice that we had to group the observations into intervals because the variable is measured on a continuous scale For discrete data, grouping may not be necessary Except when there are many categories Intervals are sometimes called classes Class Cumulative Frequency Number of observations that fall in the class and in smaller classes Class Relative Cumulative Frequency Proportion of observations that fall in the class and in smaller classes STA 291 Spring 2008 Lecture 3

Frequency and Cumulative Frequency Murder Rate Frequency Relative Frequency Cumulative Relative Cumulative Frequency 0-2.9 5 .10 3-5.9 16 .31 21 .41 6-8.9 12 .24 33 .65 9-11.9 45 .89 12-14.9 4 .08 49 .97 15-17.9 18-20.9 1 .02 50 .99 >21 51 Total STA 291 Spring 2008 Lecture 3

Histogram (Interval Data) Use the numbers from the frequency distribution to create a graph Draw a bar over each interval, the height of the bar represents the relative frequency for that interval Bars should be touching Equally extend the width of the bar at the upper and lower limits so that the bars are touching. STA 291 Spring 2008 Lecture 3

Histogram STA 291 Spring 2008 Lecture 3

Histogram w/o DC STA 291 Spring 2008 Lecture 3

Bar Graph (Nominal/Ordinal Data) Histogram: for interval (quantitative) data Bar graph is almost the same, but for qualitative data Difference: The bars are usually separated to emphasize that the variable is categorical rather than quantitative For nominal variables (no natural ordering), order the bars by frequency, except possibly for a category “other” that is always last STA 291 Spring 2008 Lecture 3

Pie Chart (Nominal/Ordinal Data) First Step Create a frequency distribution Highest Degree Obtained Frequency (Number of Employees) Grade School 15 High School 200 Bachelor’s 185 Master’s 55 Doctorate 70 Other 25 Total 550 STA 291 Spring 2008 Lecture 3

We could display this data in a bar chart… Bar graph If the data is ordinal, classes are presented in the natural ordering STA 291 Spring 2008 Lecture 3

Pie Chart Pie is divided into slices Area of each slice is proportional to the frequency of each class STA 291 Spring 2008 Lecture 3

Pie Chart for Highest Degree Achieved STA 291 Spring 2008 Lecture 3

Sample/Population Distribution Frequency distributions and histograms exist for the population as well as for the sample Population distribution vs. sample distribution As the sample size increases, the sample distribution looks more and more like the population distribution STA 291 Spring 2008 Lecture 3 21

Population Distribution The population distribution for a continuous variable is usually represented by a smooth curve Like a histogram that gets finer and finer Similar to the idea of using smaller and smaller rectangles to calculate the area under a curve when learning how to integrate Symmetric distributions Bell-shaped U-shaped Uniform Not symmetric distributions: Left-skewed Right-skewed Skewed STA 291 Spring 2008 Lecture 3 22

Skewness Symmetric Right-skewed Left-skewed STA 291 Spring 2008 Lecture 3

Summary of Graphical and Tabular Techniques Discrete data Frequency distribution Continuous data Grouped frequency distribution Small data sets Stem and leaf plot Interval data Histogram Categorical data Bar chart Pie chart Grouping intervals should be of same length, but may be dictated more by subject-matter considerations STA 291 Spring 2008 Lecture 3 24

Good Graphics Present large data sets concisely and coherently Can replace a thousand words and still be clearly understood and comprehended Encourage the viewer to compare two or more variables Do not replace substance by form Do not distort what the data reveal STA 291 Spring 2008 Lecture 3 25

Bad Graphics Don’t have a scale on the axis Have a misleading caption Distort by using absolute values where relative/proportional values are more appropriate Distort by stretching/shrinking the vertical or horizontal axis Use bar charts with bars of unequal width STA 291 Spring 2008 Lecture 3 26