Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 1 Overview and Descriptive Statistics.

Slides:



Advertisements
Similar presentations
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 1 Overview and Descriptive Statistics.
Advertisements

Describing Quantitative Variables
DESCRIBING DISTRIBUTION NUMERICALLY
Chapter 2 Exploring Data with Graphs and Numerical Summaries
Measures of Dispersion
EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES
1 Chapter 1: Sampling and Descriptive Statistics.
IB Math Studies – Topic 6 Statistics.
Displaying & Summarizing Quantitative Data
Looking at data: distributions - Describing distributions with numbers IPS chapter 1.2 © 2006 W.H. Freeman and Company.
Copyright © Cengage Learning. All rights reserved. 1 Overview and Descriptive Statistics /are-your-conversion-test-results-accurate-enough.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
ISE 261 PROBABILISTIC SYSTEMS. Chapter One Descriptive Statistics.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter Two Treatment of Data.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Statistics: Use Graphs to Show Data Box Plots.
Copyright © Cengage Learning. All rights reserved. 1 Overview and Descriptive Statistics.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 2 Describing Data with Numerical Measurements
Business Statistics for Managerial Decision Making
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Programming in R Describing Univariate and Multivariate data.
Describing distributions with numbers
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 1 Exploring Data
REPRESENTATION OF DATA.
Objectives 1.2 Describing distributions with numbers
Let’s Review for… AP Statistics!!! Chapter 1 Review Frank Cerros Xinlei Du Claire Dubois Ryan Hoshi.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 1 Chapter 4 Numerical Methods for Describing Data.
© 2008 Brooks/Cole, a division of Thomson Learning, Inc. 1 Chapter 4 Numerical Methods for Describing Data.
NA387(3) Lecture 2: Populations, Samples, and Frequency Analysis (Devore, Ch )
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
Chapter 2 Describing Data.
6-1 Numerical Summaries Definition: Sample Mean.
Describing distributions with numbers
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Measures of Center.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 3 Graphical Methods for Describing Data.
Bellwork 1. If a distribution is skewed to the right, which of the following is true? a) the mean must be less than the.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Engineering Statistics KANCHALA SUDTACHAT. Statistics  Deals with  Collection  Presentation  Analysis and use of data to make decision  Solve problems.
Numerical Measures. Measures of Central Tendency (Location) Measures of Non Central Location Measure of Variability (Dispersion, Spread) Measures of Shape.
1 Chapter 4 Numerical Methods for Describing Data.
Unit 3: Averages and Variations Week 6 Ms. Sanchez.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
LIS 570 Summarising and presenting data - Univariate analysis.
1 Never let time idle away aimlessly.. 2 Chapters 1, 2: Turning Data into Information Types of data Displaying distributions Describing distributions.
Copyright 2011 by W. H. Freeman and Company. All rights reserved.1 Introductory Statistics: A Problem-Solving Approach by Stephen Kokoska Chapter 2 Tables.
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
Descriptive Statistics Unit 6. Variable Any characteristic (data) recorded for the subjects of a study ex. blood pressure, nesting orientation, phytoplankton.
Descriptive Statistics(Summary and Variability measures)
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
Descriptive Statistics
ISE 261 PROBABILISTIC SYSTEMS
Lesson 8 Introduction to Statistics
4. Interpreting sets of data
Description of Data (Summary and Variability measures)
Laugh, and the world laughs with you. Weep and you weep alone
Descriptive Statistics
DAY 3 Sections 1.2 and 1.3.
An Introduction to Statistics
Chapter 5: Describing Distributions Numerically
Displaying Distributions with Graphs
Displaying and Summarizing Quantitative Data
POPULATION VS. SAMPLE Population: a collection of ALL outcomes, responses, measurements or counts that are of interest. Sample: a subset of a population.
Displaying and Summarizing Quantitative Data
Basic Practice of Statistics - 3rd Edition
Honors Statistics Review Chapters 4 - 5
Basic Practice of Statistics - 3rd Edition
Presentation transcript:

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 1 Overview and Descriptive Statistics

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. 1.1 Populations, Samples, and Processes

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Populations and Samples A population is a well-defined collection of objects. When information is available for the entire population we have a census. A subset of the population is a sample.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Data and Observations Univariate data consists of observations on a single variable (multivariate – more than two variables).

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Branches of Statistics Descriptive Statistics – summary and description of collected data. Inferential Statistics – generalizing from a sample to a population.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Relationship Between Probability and Inferential Statistics Population Sample Probability Inferential Statistics

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. 1.2 Pictorial and Tabular Methods in Descriptive Statistics

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Stem-and- Leaf Displays 1. Select one or more leading digits for the stem values. The trailing digits become the leaves. 2. List stem values in a vertical column. 3. Record the leaf for every observation. 4. Indicate the units for the stem and leaf on the display.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Stem-and-Leaf Example 9, 10, 15, 22, 9, 15, 16, 24,11 Observed values: Stem: tens digitLeaf: units digit

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Stem-and- Leaf Displays Identify typical value Extent of spread about a value Presence of gaps Extent of symmetry Number and location of peaks Presence of outlying values

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Another stem-and-leaf example The decimal point is 1 digit to the right of the | 2 | 55 3 | | | | | | | 088

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Dotplots 9, 10, 15, 22, 9, 15, 16, 24,11 Observed values: Represent data with dots.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Types of Variables A variable is discrete if its set of possible values constitutes a finite set or an infinite sequence. A variable is continuous if its set of possible values consists of an entire interval on a number line.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Histograms Discrete Data Determine the frequency and relative frequency for each value of x. Then mark possible x values on a horizontal scale. Above each value, draw a rectangle whose height is the relative frequency of that value.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Ex. Students from a small college were asked how many charge cards they carry. x is the variable representing the number of cards and the results are below. x#people Rel. Freq Frequency Distribution

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Histograms xRel. Freq Credit card results:

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Histograms Continuous Data: Equal Class Widths Determine the frequency and relative frequency for each class. Then mark the class boundaries on a horizontal measurement axis. Above each class interval, draw a rectangle whose height is the relative frequency.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Histogram example

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Histograms Continuous Data: Unequal Widths After determining frequencies and relative frequencies, calculate the height of each rectangle using: The resulting heights are called densities and the vertical scale is the density scale.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Histogram Shapes symmetric unimodalbimodal positively skewednegatively skewed

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Histogram example: symmetric, slightly bimodal

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. 1.3 Measures of Location

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. The Mean The average (mean) of the n numbers Population mean:

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. The sample median, is the middle value in a set of data that is arranged in ascending order. For an even number of data points the median is the average of the middle two. Median Population median:

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Median example In a class of 85 exam scores, the median, is the 43 rd number if the scores are listed in ascending order. (Note: In this case there are 42 above the median and 42 below the median.)

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Three Different Shapes for a Population Distribution symmetric positive skew negative skew

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Slight positive skew

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. 1.4 Measures of Variability

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Sample Variance Variance is a measure of the spread of the data. The sample variance of the sample x 1, x 2, …x n of n values of X is given by We refer to s 2 as being based on n – 1 degrees of freedom.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Sample variance example First, find sample mean: Next, add up squared deviations from mean: Divide by n-1, where n is the number of observations (in this case, 85):

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. The sample standard deviation is the square root of the sample variance: Standard Deviation Standard deviation is a measure of the spread of the data using the same units as the data.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Standard deviation example

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Formula for s 2 An alternative expression for the numerator of s 2 is

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Formula for s 2 : Shortcut example First, sum the scores: Next, sum the squares: Numerator of variance equals

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Properties of s 2 Let x 1, x 2,…,x n be any sample and c be any nonzero constant. where is the sample variance of the x’s and is the sample variance of the y’s.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Upper and Lower Fourths After the n observations in a data set are ordered from smallest to largest, the lower (upper) fourth is the median of the smallest (largest) half of the data, where the median is included in both halves if n is odd. A measure of the spread that is resistant to outliers is the fourth spread f s = upper fourth – lower fourth.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Third and first quartiles After the n observations in a data set are ordered from smallest to largest, the first (third) quartile is the median of the smallest (largest) half of the data, where the median is included in both halves if n is odd. A measure of the spread that is resistant to outliers is the interquartile range or IQR f s = 3 rd quartile – 1 st quartile.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Outliers Any observation farther than 1.5f s from the closest fourth is an outlier. An outlier is extreme if it is more than 3f s from the nearest fourth, and it is mild otherwise.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Boxplots median extreme outliers upper fourth lower fourth mild outliers

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Boxplot example

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Another boxplot example