LECTURE 4 3 FEBRUARY 2008 STA 291 Fall 2008 1. Administrative (Review) 5.3 Sampling Plans 2.2 Graphical and Tabular Techniques for Nominal Data 2.3 Graphical.

Slides:



Advertisements
Similar presentations
Chapter 3 Graphic Methods for Describing Data. 2 Basic Terms  A frequency distribution for categorical data is a table that displays the possible categories.
Advertisements

Chapter Two Organizing and Summarizing Data
LECTURE 5 5 FEBRUARY 2009 STA291 Fall Itinerary 2.3 Graphical Techniques for Interval Data (mostly review) 2.4 Describing the Relationship Between.
Chapter 2 Graphs, Charts, and Tables – Describing Your Data
Ka-fu Wong © 2003 Chap 2-1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.
Statistics - Descriptive statistics 2013/09/23. Data and statistics Statistics is the art of collecting, analyzing, presenting, and interpreting data.
2.1 Summarizing Qualitative Data  A graphic display can reveal at a glance the main characteristics of a data set.  Three types of graphs used to display.
Chapter 2 Frequency Distributions and Graphs 1 © McGraw-Hill, Bluman, 5 th ed, Chapter 2.
SECTION 12-1 Visual Displays of Data Slide
Frequency Distributions and Graphs
Welcome to Data Analysis and Interpretation
© 2016 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Frequency Distributions and Graphs
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-1 Visual Displays of Data.
Frequency Distributions and Graphs
Thinking Mathematically
Copyright © 2012 by Nelson Education Limited.2-1 Chapter 2 Basic Descriptive Statistics: Percentages, Ratios and Rates, Tables, Charts, and Graphs.
© Copyright McGraw-Hill CHAPTER 2 Frequency Distributions and Graphs.
Chapter 2 Presenting Data in Tables and Charts. 2.1 Tables and Charts for Categorical Data Mutual Funds –Variables? Measurement scales? Four Techniques.
Descriptive Statistics: Tabular and Graphical Methods
Data Presentation.
DATA FROM A SAMPLE OF 25 STUDENTS ABBAB0 00BABB BB0A0 A000AB ABA0BA.
Basic Descriptive Statistics Percentages and Proportions Ratios and Rates Frequency Distributions: An Introduction Frequency Distributions for Variables.
Slide 2-2 Copyright © 2008 Pearson Education, Inc. Chapter 2 Organizing Data.
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
LECTURE 5 10 SEPTEMBER 2009 STA291 Fall Itinerary Graphical Techniques for Interval Data (mostly review) Describing the Relationship Between Two.
Graphical summaries of data
STAT 211 – 019 Dan Piett West Virginia University Lecture 1.
Chapter Two Organizing and Summarizing Data 2.2 Organizing Quantitative Data I.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
Lecture 3 Dustin Lueker. 2  Suppose the population can be divided into separate, non-overlapping groups (“strata”) according to some criterion ◦ Select.
Lecture 3 Dustin Lueker.  Simple Random Sampling (SRS) ◦ Each possible sample has the same probability of being selected  Stratified Random Sampling.
LECTURE 6 15 SEPTEMBER 2009 STA291 Fall Review: Graphical/Tabular Descriptive Statistics Summarize data Condense the information from the dataset.
Chapter 2 Describing Data.
LECTURE 3 THURSDAY, 22 JANUARY STA 291 Fall
Statistics Unit 2: Organizing Data Ms. Hernandez St. Pius X High School
STA Lecture 51 STA 291 Lecture 5 Chap 4 Graphical and Tabular Techniques for categorical data Graphical Techniques for numerical data.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Probability & Statistics
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 3 Graphical Methods for Describing Data.
When data is collected from a survey or designed experiment, they must be organized into a manageable form. Data that is not organized is referred to as.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
McGraw-Hill/ Irwin © The McGraw-Hill Companies, Inc., 2003 All Rights Reserved. 2-1 Chapter Two Describing Data: Frequency Distributions and Graphic Presentation.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
© Copyright McGraw-Hill CHAPTER 2 Frequency Distributions and Graphs.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
Chapter 2: Frequency Distributions. Frequency Distributions After collecting data, the first task for a researcher is to organize and simplify the data.
Chapter Descriptive Statistics 1 of © 2012 Pearson Education, Inc. All rights reserved.
1 Frequency Distributions. 2 After collecting data, the first task for a researcher is to organize and simplify the data so that it is possible to get.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
Copyright 2011 by W. H. Freeman and Company. All rights reserved.1 Introductory Statistics: A Problem-Solving Approach by Stephen Kokoska Chapter 2 Tables.
CHAPTER 2 CHAPTER 2 FREQUENCY DISTRIBUTION AND GRAPH.
Copyright © 2009 Pearson Education, Inc. 3.2 Picturing Distributions of Data LEARNING GOAL Be able to create and interpret basic bar graphs, dotplots,
Frequency Distributions and Graphs. Organizing Data 1st: Data has to be collected in some form of study. When the data is collected in its’ original form.
 2012 Pearson Education, Inc. Slide Chapter 12 Statistics.
Chapter(2) Frequency Distributions and Graphs
Chapter 2: Methods for Describing Data Sets
Frequency Distributions and Graphs
Chapter 2 Frequency Distribution and Graph
Chapter 2 Descriptive Statistics
Frequency Distributions and Graphs
STA 291 Spring 2008 Lecture 3 Dustin Lueker.
THE STAGES FOR STATISTICAL THINKING ARE:
THE STAGES FOR STATISTICAL THINKING ARE:
Experimental Design Experiments Observational Studies
Organizing, Displaying and Interpreting Data
Presentation transcript:

LECTURE 4 3 FEBRUARY 2008 STA 291 Fall

Administrative (Review) 5.3 Sampling Plans 2.2 Graphical and Tabular Techniques for Nominal Data 2.3 Graphical Techniques for Interval Data Suggested problems from the textbook (not graded, but good as exam preparation): 2.12, 2.16, 2.20, 2.36 (a,c), 2.40 (a,c) 2

Where we left off: Types of Bias Selection Bias – Selection of the sample systematically excludes some part of the population of interest Measurement/Response Bias – Method of observation tends to produce values that systematically differ from the true value Nonresponse Bias – Occurs when responses are not actually obtained from all individuals selected for inclusion in the sample 3

Where we left off: Sampling Error Assume you take a random sample of 100 UK students and ask them about their political affiliation (Democrat, Republican, Independent) Now take another random sample of 100 UK students Will you get the same percentages? 4

Sampling Error (cont’d) No, because of sampling variability. Also, the result will not be exactly the same as the population percentage, unless you 1. take a “sample” consisting of the whole population of 30,000 students (this would be called a ? ) - or - 2. you get very lucky 5

Sampling Error Sampling Error is the error that occurs when a statistic based on a sample estimates or predicts the value of a population parameter. In random samples, the sampling error can usually be quantified. In nonrandom samples, there is also sampling variability, but its extent is not predictable. 6

Nonsampling Error Any error that could also happen in a census, that is, when you ask the whole population Examples: Bias due to question wording, question order, nonresponse (people refuse to answer), wrong answers (especially to delicate questions) 7

Chapter 2: Descriptive Statistics Summarize data Use graphs, tables (and numbers, see Chapter 4) Condense the information from the dataset Interval data: Histogram Nominal/Ordinal data: Bar chart, Pie chart 8

Data Table: Murder Rates Difficult to see the “big picture” from these numbers Try to condense the data… Alabama11.6Alaska9 Arizona8.6Arkansas10.2 California13.1Colorado5.8 Connecticut6.3Delaware5 D.C.78.5Florida8.9 Georgia11.4Hawaii3.8 …… 9

Frequency Distribution Murder RateFrequency 0 – – – – – – – > 211 Total51 A listing of intervals of possible values for a variable And a tabulation of the number of observations in each interval. 10

Frequency Distribution Use intervals of same length (wherever possible) Intervals must be mutually exclusive: Any observation must fall into one and only one interval Rule of thumb: If you have n observations, the number of intervals should be about 11

Frequency, Relative Frequency, and Percentage Distribution Murder RateFrequencyRelative Frequency Percentage 0 – ( = 5 / 51 )10 ( =.10 * 100% ) 3 – ( = 16 / 51 )31 ( =.31 * 100% ) 6 – – – – – > Total

Frequency Distributions Notice that we had to group the observations into intervals because the variable is measured on a continuous scale For discrete data, grouping may not be necessary (except when there are many categories) 13

Frequency and Cumulative Frequency Class Cumulative Frequency: Number of observations that fall in the class and in smaller classes Class Relative Cumulative Frequency: Proportion of observations that fall in the class and in smaller classes 14

Cumulative Frequencies & Relative Frequencies Murder RateFrequencyRelative Frequency Cumulative Frequency Cumulative Relative Frequency 0 – – ( = ).41 (= ) 6 – (= ).65(= ) 9 – – – – > Total511 15

Histogram (Interval Data) Use the numbers from the frequency distribution to create a graph Draw a bar over each interval, the height of the bar represents the relative frequency for that interval Bars should be touching; i.e., equally extend the width of the bar at the upper and lower limits so that the bars are touching. 16

Histogram (version I) 17

Histogram (version II) 18

Bar Graph (Nominal/Ordinal Data) Histogram: for interval (quantitative) data Bar graph is almost the same, but for qualitative data Difference: – The bars are usually separated to emphasize that the variable is categorical rather than quantitative – For nominal variables (no natural ordering), order the bars by frequency, except possibly for a category “other” that is always last 19

Pie Chart (Nominal/Ordinal Data) First Step: Create a Frequency Distribution Highest DegreeFrequency (Number of Responses) Relative Frequency Grade School15 High School200 Bachelor’s185 Master’s55 Doctorate70 Other25 Total550 20

We could display this data in a bar chart… 21

Pie Chart Highest DegreeRelative FrequencyAngle ( = Rel. Freq. * 360 ◦ ) Grade School.027 ( = 15/550 )9.72 ( =.027 * 360 ◦ ) High School Bachelor’s Master’s Doctorate Other Pie Chart: Pie is divided into slices; The area of each slice is proportional to the frequency of each class. 22

Pie Chart 23

Stem and Leaf Plot Write the observations ordered from smallest to largest Each observation is represented by a stem (leading digit(s)) and a leaf (final digit) Looks like a histogram sideways Contains more information than a histogram, because every single measurement can be recovered 24

Stem and Leaf Plot Useful for small data sets (<100 observations) – Example of an EDA Practical problem: – What if the variable is measured on a continuous scale, with measurements like , , , , , , , etc. – Use common sense when choosing “stem” and “leaf” 25

Stem-and-Leaf Example: Age at Death for Presidents 26

Attendance Survey Question #4 On an index card – Please write down your name and section number – Today’s Question: