1 Descriptive Statistics Descriptive Statistics Ernesto Diaz Faculty – Mathematics Redwood High School.

Slides:



Advertisements
Similar presentations
Calculating & Reporting Healthcare Statistics
Advertisements

B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Created by Tom Wegleitner, Centreville, Virginia Section 3-1.
Slides by JOHN LOUCKS St. Edward’s University.
12.3 – Measures of Dispersion
12.2 – Measures of Central Tendency
Measures of Central Tendency Section 2.3 Statistics Mrs. Spitz Fall 2008.
SECTION 12-1 Visual Displays of Data Slide
Section 12-2 Measures of Central Tendency.
Measures of Central Tendency
Describing Data: Numerical
Chapter 3 Descriptive Measures
Chapter 13 Section 5 - Slide 1 Copyright © 2009 Pearson Education, Inc. AND.
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
12.3 – Measures of Dispersion Dispersion is another analytical method to study data. Two of the most common measures of dispersion are the range and the.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
SECTION 12-3 Measures of Dispersion Slide
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Three Averages and Variation.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-3 Measures of Dispersion.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
9.3 – Measures of Dispersion
Summary Statistics: Measures of Location and Dispersion.
Can't Type? press F11 Can’t Hear? Check: Speakers, Volume or Re-Enter Seminar Put ? in front of Questions so it is easier to see them. 1 Graded Projects.
Symbol Description It would be a good idea now to start looking at the symbols which will be part of your study of statistics.  The uppercase Greek letter.
9.1 – Visual Displays of Data Objective – TSW construct visual displays of data. Chapter 1.
Statistics Unit 9 only requires us to do Sections 1 & 2. * If we have time, there are some topics in Sections 3 & 4, that I will also cover. They tie in.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Lesson 25 Finding measures of central tendency and dispersion.
Adapted from Pearson Education, Inc. Copyright © 2009 Pearson Education, Inc. Welcome to MM150! Kirsten Meymaris Thursday, Mar. 31st Plan for the hour.
4.1 Measures of Center We are learning to…analyze how adding another piece of data can affect the measures of center and spread.
Descriptive Statistics(Summary and Variability measures)
Copyright © 2016 Brooks/Cole Cengage Learning Intro to Statistics Part II Descriptive Statistics Intro to Statistics Part II Descriptive Statistics Ernesto.
Slide Copyright © 2009 Pearson Education, Inc. Unit 9 Seminar Agenda Final Project and Due Dates Measures of Central Tendency Measures of Dispersion.
 2012 Pearson Education, Inc. Slide Chapter 12 Statistics.
 2012 Pearson Education, Inc. Slide Chapter 12 Statistics.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Statics – Part II Chapter 9. Mean The mean, is the sum of the data divided by the number of pieces of data. The formula for calculating the mean is where.
MM150 ~ Unit 9 Statistics ~ Part II. WHAT YOU WILL LEARN Mode, median, mean, and midrange Percentiles and quartiles Range and standard deviation z-scores.
 2012 Pearson Education, Inc. Slide Chapter 12 Statistics.
Copyright © 2009 Pearson Education, Inc. Chapter 13 Section 5 - Slide 1 Section 5 Measures of Central Tendency.
AND.
Descriptive Statistics Ernesto Diaz Faculty – Mathematics
Chapter 12 Statistics 2012 Pearson Education, Inc.
Chapter 12 Statistics.
Intro to Statistics Part II Descriptive Statistics
Chapter 12 Statistics 2012 Pearson Education, Inc.
Intro to Statistics Part II Descriptive Statistics
Chapter 12 Statistics 2012 Pearson Education, Inc.
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
NUMERICAL DESCRIPTIVE MEASURES
Descriptive Statistics
Description of Data (Summary and Variability measures)
Numerical Descriptive Measures
9.2 - Measures of Central Tendency
12.2 – Measures of Central Tendency
Numerical Descriptive Measures
10.2 Statistics Part 1.
Measures of Dispersion
Chapter 12 Statistics.
Chapter 12 Statistics.
Numerical Descriptive Measures
Presentation transcript:

1 Descriptive Statistics Descriptive Statistics Ernesto Diaz Faculty – Mathematics Redwood High School

2 Basic Concepts In statistics a population, includes all of the items of interest, and a sample, includes some of the items in the population. The study of statistics can be divided into two main areas. Descriptive statistics, has to do with collecting, organizing, summarizing, and presenting data (information). Inferential statistics, has to do with drawing inferences or conclusions about populations based on information from samples.

3 Basic Concepts Information that has been collected but not yet organized or processed is called raw data. It is often quantitative (or numerical ), but can also be qualitative (or nonnumerical ).

4 Basic Concepts Quantitative data : The number of siblings in ten different families: 3, 1, 2, 1, 5, 4, 3, 3, 8, 2 Qualitative data: The makes of five different automobiles: Toyota, Ford, Nissan, Chevrolet, Honda Quantitative data can be sorted in mathematical order. The number siblings can appear as 1, 1, 2, 2, 3, 3, 3, 4, 5, 8

5 © 2008 Pearson Addison-Wesley. All rights reserved Measures of Central Tendency Mean Median Mode Symmetry in Data Sets

6 © 2008 Pearson Addison-Wesley. All rights reserved Mean The mean (more properly called the arithmetic mean ) of a set of data items is found by adding up all the items and then dividing the sum by the number of items. (The mean is what most people associate with the word “average.”) The mean of a sample is denoted (read “ x bar”), while the mean of a complete population is denoted (the lower case Greek letter mu ).

7 © 2008 Pearson Addison-Wesley. All rights reserved Mean The mean of n data items x 1, x 2,…, x n, is given by the formula We use the symbol for “summation,” (the Greek letter sigma ).

8 Example-find the mean Find the mean amount of money parents spent on new school supplies and clothes if 5 parents randomly surveyed replied as follows: $327 $465 $672 $150 $230

9 © 2008 Pearson Addison-Wesley. All rights reserved Weighted Mean The weighted mean of n numbers x 1, x 2,…, x n, that are weighted by the respective factors f 1, f 2,…, f n is given by the formula

10 © 2008 Pearson Addison-Wesley. All rights reserved Median Another measure of central tendency, which is not so sensitive to extreme values, is the median. This measure divides a group of numbers into two parts, with half the numbers below the median and half above it.

11 © 2008 Pearson Addison-Wesley. All rights reserved Median To find the median of a group of items: Step 1 Rank the items. Step2 If the number of items is odd, the median is the middle item in the list. Step 3 If the number of items is even, the median is the mean of the two middle numbers.

12 © 2008 Pearson Addison-Wesley. All rights reserved Example: Median Solution Ten students in a math class were polled as to the number of siblings in their individual families and the results were: 3, 2, 2, 1, 1, 6, 3, 3, 4, 2. Find the median number of siblings for the ten students. In order: 1, 1, 2, 2, 2, 3, 3, 3, 4, 6 Median = (2+3)/2 = 2.5

13 © 2008 Pearson Addison-Wesley. All rights reserved Mode The mode of a data set is the value that occurs the most often. Sometimes, a distribution is bimodal (literally, “two modes”). In a large distribution, this term is commonly applied even when the two modes do not have exactly the same frequency

14 © 2008 Pearson Addison-Wesley. All rights reserved Example: Mode for a Set Solution Ten students in a math class were polled as to the number of siblings in their individual families and the results were: 3, 2, 2, 1, 3, 6, 3, 3, 4, 2. Find the mode for the number of siblings. 3, 2, 2, 1, 3, 6, 3, 3, 4, 2 The mode for the number of siblings is 3.

15 © 2008 Pearson Addison-Wesley. All rights reserved Example: Mode for Distribution Solution The mode is 5 since it has the highest frequency (8). Find the median for the distribution. Value12345 Frequency43268

16 Measures of Position Measures of position are often used to make comparisons. Two measures of position are percentiles and quartiles.

17 To Find the Quartiles of a Set of Data Order the data from smallest to largest. Find the median, or 2 nd quartile, of the set of data. If there are an odd number of pieces of data, the median is the middle value. If there are an even number of pieces of data, the median will be halfway between the two middle pieces of data.

18 To Find the Quartiles of a Set of Data continued The first quartile, Q 1, is the median of the lower half of the data; that is, Q 1, is the median of the data less than Q 2. The third quartile, Q 3, is the median of the upper half of the data; that is, Q 3 is the median of the data greater than Q 2.

19 Example: Quartiles The weekly grocery bills for 23 families are as follows. Determine Q 1, Q 2, and Q

20 Example: Quartiles continued Order the data: Q 2 is the median of the entire data set which is 190. Q 1 is the median of the numbers from 50 to 172 which is 95. Q 3 is the median of the numbers from 210 to 330 which is 270.

21 Measures of Dispersion

22 Measures of Dispersion Sometimes we want to look at a measure of dispersion, or spread, of data. Two of the most common measures of dispersion are the range and the standard deviation.

23 Measures of Dispersion Measures of dispersion are used to indicate the spread of the data. The range is the difference between the highest and lowest values; it indicates the total spread of the data.

24 Example: Range Nine different employees were selected and the amount of their salary was recorded. Find the range of the salaries. $24,000 $32,000 $26,500 $56,000 $48,000 $27,000 $28,500 $34,500 $56,750 Range = $56,750  $24,000 = $32,750

© 2008 Pearson Addison-Wesley. All rights reserved Standard Deviation One of the most useful measures of dispersion, the standard deviation, is based on deviations from the mean of the data.

© 2008 Pearson Addison-Wesley. All rights reserved Example: Deviations from the Mean Solution Data Value Deviation–6–5146 Find the deviations from the mean for all data values of the sample 1, 2, 8, 11, 13. The mean is 7. Subtract to find deviation. The sum of the deviations for a set is always – 7 = 6

© 2008 Pearson Addison-Wesley. All rights reserved Standard Deviation The variance is found by summing the squares of the deviations and dividing that sum by n – 1 (since it is a sample instead of a population). The square root of the variance gives a kind of average of the deviations from the mean, which is called a sample standard deviation. It is denoted by the letter s. (The standard deviation of a population is denoted the lowercase Greek letter sigma.)

28 Standard Deviation The standard deviation measures how much the data differ from the mean.

© 2008 Pearson Addison-Wesley. All rights reserved Calculation of Standard Deviation The individual steps involved in this calculation are as follows Step 1 Calculate the mean of the numbers. Step 2 Find the deviations from the mean. Step 3 Square each deviation. Step 4 Sum the squared deviations. Step 5 Divide the sum in Step 4 by n – 1. Step 6 Take the square root of the quotient in Step 5.

© 2008 Pearson Addison-Wesley. All rights reserved Example Find the standard deviation of the sample 1, 2, 8, 11, 13. Data Value Deviation–6–5146 ( Deviation ) The mean is 7. Sum = = 114 Solution

© 2008 Pearson Addison-Wesley. All rights reserved Example Solution (continued) Divide by n – 1 with n = 6: Take the square root:

32 Example Find the standard deviation of the following prices of selected washing machines: $280, $217, $665, $684, $939, $299 Find the mean.

33 Example continued, mean = , , , , , , (-297) 2 = 88, (Data - Mean) 2 Data - Mean Data

34 Example continued, mean = 514 The standard deviation is $

35 Interpreting Measures of Dispersion A main use of dispersion is to compare the amounts of spread in two (or more) data sets. A common technique in inferential statistics is to draw comparisons between populations by analyzing samples that come from those populations.

36 Example: Interpreting Measures Two companies, A and B, sell small packs of sugar for coffee. The mean and standard deviation for samples from each company are given below. Which company consistently provides more sugar in their packs? Which company fills its packs more consistently? Company A Company B

37 Example: Interpreting Measures Solution We infer that Company A most likely provides more sugar than Company B (greater mean). We also infer that Company B is more consistent than Company A (smaller standard deviation).

38 © 2008 Pearson Addison-Wesley. All rights reserved Symmetry in Data Sets The most useful way to analyze a data set often depends on whether the distribution is symmetric or non- symmetric. In a “symmetric” distribution, as we move out from a central point, the pattern of frequencies is the same (or nearly so) to the left and right. In a “non- symmetric” distribution, the patterns to the left and right are different.

39 © 2008 Pearson Addison-Wesley. All rights reserved Some Symmetric Distributions

40 © 2008 Pearson Addison-Wesley. All rights reserved Non-symmetric Distributions A non-symmetric distribution with a tail extending out to the left, shaped like a J, is called skewed to the left. If the tail extends out to the right, the distribution is skewed to the right.

41 © 2008 Pearson Addison-Wesley. All rights reserved Some Non-symmetric Distributions

© 2008 Pearson Addison-Wesley. All rights reserved Chebyshev’s Theorem For any set of numbers, regardless of how they are distributed, the fraction of them that lie within k standard deviations of their mean (where k > 1) is at least

© 2008 Pearson Addison-Wesley. All rights reserved Example: Chebyshev’s Theorem What is the minimum percentage of the items in a data set which lie within 3 standard deviations of the mean? Solution With k = 3, we calculate