Analyze Data USE MEAN & MEDIAN TO COMPARE THE CENTER OF DATA SETS. IDENTIFY OUTLIERS AND THEIR EFFECT ON DATA SETS.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

DESCRIBING DISTRIBUTION NUMERICALLY
Descriptive Measures MARE 250 Dr. Jason Turner.
Dot Plots & Box Plots Analyze Data.
Unit 1.1 Investigating Data 1. Frequency and Histograms CCSS: S.ID.1 Represent data with plots on the real number line (dot plots, histograms, and box.
Understanding and Comparing Distributions 30 min.
HOW TALL ARE APRENDE 8 TH GRADERS? Algebra Statistics Project By, My Favorite Math Teacher In my project I took the heights of 25 Aprende 8 th graders.
Histograms & Comparing Graphs
IB Math Studies – Topic 6 Statistics.
Unit 4: Describing Data.
Two-Way Frequency Tables
Homework Questions. Quiz! Shhh…. Once you are finished you can work on the warm- up (grab a handout)!
Describing Distributions Numerically
Standard Deviation In addition to level 3.0 and above and beyond what was taught in class, the student may: · Make connection with other concepts.
Statistics: Use Graphs to Show Data Box Plots.
Quartiles & Extremes (displayed in a Box-and-Whisker Plot) Lower Extreme Lower Quartile Median Upper Quartile Upper Extreme Back.
Programming in R Describing Univariate and Multivariate data.
2011 Summer ERIE/REU Program Descriptive Statistics Igor Jankovic Department of Civil, Structural, and Environmental Engineering University at Buffalo,
WHAT IS AN INTEGER? Integers can be thought of as discrete, equally spaced points on an infinitely long number line. (Nonnegative integers (purple) and.
Table of Contents 1. Standard Deviation
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Statistics: Mean of Absolute Deviation
Measure of Central Tendency Measures of central tendency – used to organize and summarize data so that you can understand a set of data. There are three.
Calculate Standard Deviation ANALYZE THE SPREAD OF DATA.
CCGPS Advanced Algebra UNIT QUESTION: How do we use data to draw conclusions about populations? Standard: MCC9-12.S.ID.1-3, 5-9, SP.5 Today’s Question:
LIS 570 Summarising and presenting data - Univariate analysis.
Vocabulary to know: *statistics *data *outlier *mean *median *mode * range.
Summary Statistics, Center, Spread, Range, Mean, and Median Ms. Daniels Integrated Math 1.
2-6 Box-and-Whisker Plots Indicator  D1 Read, create, and interpret box-and whisker plots Page
Introductory Statistics Lesson 2.5 A Objective: SSBAT find the first, second and third quartiles of a data set. SSBAT find the interquartile range of a.
Statistics and Data Analysis
Unit 4 Describing Data Standards: S.ID.1 Represent data on the real number line (dot plots, histograms, and box plots) S.ID.2 Use statistics appropriate.
Probability & Statistics Box Plots. Describing Distributions Numerically Five Number Summary and Box Plots (Box & Whisker Plots )
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
Unit 3 Guided Notes. Box and Whiskers 5 Number Summary Provides a numerical Summary of a set of data The first quartile (Q 1 ) is the median of the data.
Statistics Review  Mode: the number that occurs most frequently in the data set (could have more than 1)  Median : the value when the data set is listed.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Interpreting Categorical and Quantitative Data. Center, Shape, Spread, and unusual occurrences When describing graphs of data, we use central tendencies.
Bell Ringer What does the word “average” mean in math?
Bellwork 1. Order the test scores from least to greatest: 89, 93, 79, 87, 91, 88, Find the median of the test scores. 79, 87, 88, 89, 91, 92, 93.
EXPLORATORY DATA ANALYSIS and DESCRIPTIVE STATISTICS
Introduction To compare data sets, use the same types of statistics that you use to represent or describe data sets. These statistics include measures.
Chapter 5 : Describing Distributions Numerically I
How do I graphically represent data?
Statistics Unit Test Review
Unit 2 Section 2.5.
Averages and Variation
6th Grade Math Lab MS Jorgensen 1A, 3A, 3B.
Description of Data (Summary and Variability measures)
Analyze Data: IQR and Outliers
Unit 4 Statistics Review
Box and Whisker Plots Algebra 2.
Dot Plots & Box Plots Analyze Data.
Warm-up 8/25/14 Compare Data A to Data B using the five number summary, measure of center and measure of spread. A) 18, 33, 18, 87, 12, 23, 93, 34, 71,
Quartile Measures DCOVA
The absolute value of each deviation.
11.2 box and whisker plots.
How to create a Box and Whisker Plot
Organizing, Summarizing, &Describing Data UNIT SELF-TEST QUESTIONS
Warm Up # 3: Answer each question to the best of your knowledge.
Mean As A Balancing Point
MCC6.SP.5c, MCC9-12.S.ID.1, MCC9-12.S.1D.2 and MCC9-12.S.ID.3
MCC6.SP.5c, MCC9-12.S.ID.1, MCC9-12.S.1D.2 and MCC9-12.S.ID.3
Lesson – Teacher Notes Standard:
. . Box and Whisker Measures of Variation Measures of Variation 8 12
Describing Data Coordinate Algebra.
Statistics Vocab Notes
Lesson Plan Day 1 Lesson Plan Day 2 Lesson Plan Day 3
Analyze Data: IQR and Outliers
Presentation transcript:

Analyze Data USE MEAN & MEDIAN TO COMPARE THE CENTER OF DATA SETS. IDENTIFY OUTLIERS AND THEIR EFFECT ON DATA SETS.

43210 In addition to level 3.0 and above and beyond what was taught in class, the student may: · Make connection with other concepts in math · Make connection with other content areas. The student will summarize, represent, and interpret data on a single count or measurement variable. - Comparing data includes analyzing center of data (mean/median), interquartile range, shape distribution of a graph, standard deviation and the effect of outliers on the data set. - Read, interpret and write summaries of two-way frequency tables which includes calculating joint, marginal and relative frequencies. The student will be able to: - Make dot plots, histograms, box plots and two-way frequency tables. - Calculate standard deviation. - Identify normal distribution of data (bell curve) and convey what it means. With help from the teacher, the student has partial success with summarizing and interpreting data displayed in a dot plot, histogram, box plot or frequency table. Even with help, the student has no success understandin g statistical data. Focus 6 Learning Goal – (HS.S-ID.A.1, HS.S-ID.A.2, HS.S-ID.A.3, HS.S-ID.B.5) = Students will summarize, represent and interpret data on a single count or measurement variable.

Reminder:  To find the median, you put all the numbers in order from least to greatest. The middle number is the median.  To find the mean, you add up all of the numbers then divide by how many numbers are in the data set.

Measure of Central Tendency  Mean & median are both measures of central tendency. This means they identify the “middle” of the data.  This measure attempts to describe the whole set of data with a single value that represents the middle or center of its distribution.  Median:  Advantage: Is less affected by outliers and skewed data. It is the preferred measure of center when the distribution is not symmetrical.  Mean:  Advantage: Can be used for both continuous and discrete numeric data.  Limitations: Is influenced by outliers and skewed distribution.

The shape of data distributions.  Normal distribution is mound shaped, symmetric.  If the mean and median are equal, then the data is symmetric.  If the mean is greater than the median, the data is skewed right.  If the mean is less than the median, the data is skewed left.

Test your memory…  The mean of a data set is 12 and the median is 12. What are the possible shapes for this data set?  A. Mound B. Symmetric C. Skewed Right D. Skewed Left E. Both A & B  The mean of a data set is 12 and the median is 10. What is the data shape?  A. Octagonal B. Symmetric C. Skewed Right D. Skewed Left

Outliers  The shape of the data helps us find and identify outliers.  An outlier is something that sticks out from the rest of the data.  It is a data point that has an “extreme value” when compared with the rest of the data set.  Mathematically speaking, an outlier is defined as any point that falls 1.5 times the IQR below the lower quartile or 1.5 times the IQR above the upper quartile.

Data: 37, 37, 38, 38, 40, 40, 42, 42, 42, 62  The median is:  Q1:  Q3:  IQR = Q3 – Q1=  The box plot looks like this: – 38 = 4  The lower limit on outliers is Q1 – (1.5)(IQR).  38 – (1.5)(4) = 32  This means an outlier would be any number less than 32.  The upper limit on outliers is Q3 + (1.5)(IQR).  42 + (1.5)(4) = 48  This means an outlier would be any number greater than 48.

Data: 37, 37, 38, 38, 40, 40, 42, 42, 42, 62  The outlier for this data set is 62.  It surpasses the cut off of 48.  When there is an outlier on one side of the data set, we can chop off the “whisker” at the limit and then record the outlier as data points.  The final box plot would look like this.  Calculate the mean of the data set.  Calculate the mean of the data set without the outlier.  Removing the outlier changes the mean significantly.  Removing the outlier does not change the median significantly

Going Fishing  A fisherman records the length, in centimeters of 10 bass caught in a stream:  He wants to know the average length of a fish he can catch.  Determine the mean and median of the data.  Mean: 248 ÷ 10 = 24.8 cm  Median:  18.5 cm

Going Fishing  Are there any outliers?  Divide the data into quarters to find the IQR.  Q3 Q1  IQR = 27 – 18 = 9  The lower limit on outliers is Q1 – (1.5)(IQR).  18 – (1.5)(9) = 4.5  The upper limit on outliers is Q3 + (1.5)(IQR).  27 + (1.5)(9) = 40.5 Any number less than 4.5 or greater than 40.5 are outliers. 45 and 51 are outliers.

Going Fishing  Remove the outliers and recalculate the mean and median.   Mean: 152 ÷ 8 = 19 cm  Median: 18 cm  With the outliers removed, the mean is now closer to the center of the data.  The average length of a fish caught in this stream is ________.