LECTURE 8 Thursday, 19 February STA291 Fall 2008.

Slides:



Advertisements
Similar presentations
CHAPTER 1 Exploring Data
Advertisements

Measures of Variation Sample range Sample variance Sample standard deviation Sample interquartile range.
Measures of Dispersion
Numerically Summarizing Data
LECTURE 7 THURSDAY, 11 FEBRUARY STA291 Fall 2008.
Descriptive Statistics
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
MEASURES OF SPREAD – VARIABILITY- DIVERSITY- VARIATION-DISPERSION
Basic Business Statistics 10th Edition
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Describing Data: Numerical
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
STA Lecture 111 STA 291 Lecture 11 Describing Quantitative Data – Measures of Central Location Examples of mean and median –Review of Chapter 5.
Numerical Descriptive Techniques
Exploration of Mean & Median Go to the website of “Introduction to the Practice of Statistics”website Click on the link to “Statistical Applets” Select.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
1.3: Describing Quantitative Data with Numbers
STA Lecture 131 STA 291 Lecture 13, Chap. 6 Describing Quantitative Data – Measures of Central Location – Measures of Variability (spread)
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Lecture 3 Describing Data Using Numerical Measures.
Applied Quantitative Analysis and Practices LECTURE#09 By Dr. Osman Sadiq Paracha.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
Lecture 2 Dustin Lueker.  Center of the data ◦ Mean ◦ Median ◦ Mode  Dispersion of the data  Sometimes referred to as spread ◦ Variance, Standard deviation.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Business Statistics, A First Course.
Basic Business Statistics Chapter 3: Numerical Descriptive Measures Assoc. Prof. Dr. Mustafa Yüzükırmızı.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures (Summary Measures) Basic Business Statistics.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Basic Business Statistics 11 th Edition.
1.3 Describing Quantitative Data with Numbers Pages Objectives SWBAT: 1)Calculate measures of center (mean, median). 2)Calculate and interpret measures.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Chapter 6: Interpreting the Measures of Variability.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Descriptive Statistics ( )
NUMERICAL DESCRIPTIVE MEASURES
CHAPTER 1 Exploring Data
Numerical Descriptive Measures
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Please take out Sec HW It is worth 20 points (2 pts
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
CHAPTER 1 Exploring Data
Describing Quantitative Data with Numbers
Basic Practice of Statistics - 3rd Edition
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Basic Practice of Statistics - 3rd Edition
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
The Five-Number Summary
CHAPTER 1 Exploring Data
Basic Practice of Statistics - 3rd Edition
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Presentation transcript:

LECTURE 8 Thursday, 19 February STA291 Fall 2008

Chap. 4: Numerical Descriptive Techniques 4.3 Measures of Relative Standing and Box Plots 4.2 Measures of Variability 4.4 Measures of Linear Relationship (next time) 2

Homework and Suggested Study Material 3 [10 points] Due Saturday 11pm Assignment on CengageNOW. Use the Study Tools at Thomson Now, click on our Courseware Book, and work through “Chapter 4 – Numerical Descriptive Techniques”. (Pre-test, study plan, and post-test) Suggested problems from the textbook: 4.42, 4.44, 4.46

Five-Number Summary (Review) 4 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software SAS output (Murder Rate Data) Note the distance from the median to the maximum compared to the median to the minimum.

Interquartile Range 5 The Interquartile Range (IQR) is the difference between upper and lower quartile IQR = Q 3 – Q 1 IQR= Range of values that contains the middle 50% of the data IQR increases as variability increases

Box Plot (AKA Box-and-Whiskers Plot) A box plot is basically a graphical version of the five- number summary (unless there are outliers) It consists of a box that contains the central 50%of the distribution (from lower quartile to upper quartile), A line within the box that marks the median, And whiskers that extend to the maximum and minimum values, unless there are outliers 6

Outliers An observation is an outlier if it falls – more than 1.5 IQR above the upper quartile or – more than 1.5 IQR below the lower quartile Example: Murder Rate Data w/o DC – upper quartile Q3 = 10.3 – IQR = 6.4 – Q IQR = _______ – Any outliers? 7

Illustrating Boxplot with Murder Rate Data 8 (w/o DC—key: 20|3 = 20.3)

Measures of Variation 9 Mean and Median only describe a typical value, but not the spread of the data Two distributions may have the same mean, but different variability Statistics that describe variability are called measures of variation (or dispersion)

Sample Measures of Variation 10 Sample Range: Difference between maximum and minimum sample value Sample Variance: Sample Standard Deviation: Sample Interquartile Range: Difference between upper and lower quartile of the sample

Population Measures of Variation 11 Population Range: Difference between maximum and minimum population values Population Variance: Population Standard Deviation: Population Interquartile Range: Difference between upper and lower quartile of the population

Range 12 Range: Difference between the largest and smallest observation Very much affected by outliers (one misreported observation may lead to an outlier, and affect the range) The range does not always reveal different variation about the mean

Deviations 13 The deviation of the i th observation, x i, from the sample mean,, is, the difference between them The sum of all deviations is zero because the sample mean is the center of gravity of the data (remember the balance beam?) Therefore, people use either the sum of the absolute deviations or the sum of the squared deviations as a measure of variation

Sample Variance 14 The variance of n observations is the sum of the squared deviations, divided by n – 1.

Variance: Interpretation 15 The variance is about the average of the squared deviations “average squared distance from the mean” Unit: square of the unit for the original data Difficult to interpret Solution: Take the square root of the variance, and the unit is the same as for the original data

Sample standard deviation 16 The standard deviation s is the positive square root of the variance

Standard Deviation: Properties 17 s ≥ 0 always s = 0 only when all observations are the same If data is collected for the whole population instead of a sample, then n-1 is replaced by n s is sensitive to outliers

Standard Deviation Interpretation: Empirical Rule 18 If the histogram of the data is approximately symmetric and bell-shaped, then – About 68% of the data are within one standard deviation from the mean – About 95% of the data are within two standard deviations from the mean – About 99.7% of the data are within three standard deviations from the mean

Standard Deviation Interpretation: Empirical Rule 19

Sample Statistics, Population Parameters 20 Population mean and population standard deviation are denoted by the Greek letters μ (mu) and  (sigma) They are unknown constants that we would like to estimate Sample mean and sample standard deviation are denoted by and s They are random variables, because their values vary according to the random sample that has been selected

Attendance Survey Question 8 21 On a your index card: – Please write down your name and section number – Today’s Question: