Unit 4: Describing Data.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

Dot Plots & Box Plots Analyze Data.
Measures of Central Tendency and Variation 11-5
Unit 1.1 Investigating Data 1. Frequency and Histograms CCSS: S.ID.1 Represent data with plots on the real number line (dot plots, histograms, and box.
Statistics Unit 6.
Warm-Up 4/15/2017 In a golf tournament, the top 6 men’s and women’s scores are given. Calculate the mean, median, mode, range, and IQR for each data.
Unit 4 – Probability and Statistics
Statistics: Use Graphs to Show Data Box Plots.
CONFIDENTIAL 1 Grade 8 Algebra1 Data Distributions.
7.7 Statistics & Statistical Graphs p.445. What are measures of central tendency? How do you tell measures of central tendency apart? What is standard.
Objectives Describe the central tendency of a data set.
Analyze Data USE MEAN & MEDIAN TO COMPARE THE CENTER OF DATA SETS. IDENTIFY OUTLIERS AND THEIR EFFECT ON DATA SETS.
Statistics: Mean of Absolute Deviation
Warm Up Find the mean, median, mode, range, and outliers of the following data. 11, 7, 2, 7, 6, 12, 9, 10, 8, 6, 4, 8, 8, 7, 4, 7, 8, 8, 6, 5, 9 How does.
7.7 Statistics and Statistical Graphs. Learning Targets  Students should be able to… Use measures of central tendency and measures of dispersion to describe.
Unit 4 Day 1 Vocabulary. Mean The average value of a data set, found by summing all values and dividing by the number of data points Example:
7.7 Statistics & Statistical Graphs p.445. An intro to Statistics Statistics – numerical values used to summarize & compare sets of data (such as ERA.
Summary Statistics and Mean Absolute Deviation MM1D3a. Compare summary statistics (mean, median, quartiles, and interquartile range) from one sample data.
Chapter 12 Objectives: SWBAT make and interpret frequency tables and histograms SWBAT find mean, median, mode, and range SWBAT make and interpret box-and-
What are the effects of outliers on statistical data?
Warm Up Simplify each expression
Summary Statistics, Center, Spread, Range, Mean, and Median Ms. Daniels Integrated Math 1.
Introductory Statistics Lesson 2.5 A Objective: SSBAT find the first, second and third quartiles of a data set. SSBAT find the interquartile range of a.
CCGPS Advanced Algebra Day 1 UNIT QUESTION: How do we use data to draw conclusions about populations? Standard: MCC9-12.S.ID.1-3, 5-9, SP.5 Today’s Question:
Holt McDougal Algebra Data Distributions Warm Up Identify the least and greatest value in each set Use the data below to make a stem-and-
Unit 4 Describing Data Standards: S.ID.1 Represent data on the real number line (dot plots, histograms, and box plots) S.ID.2 Use statistics appropriate.
Chapter 4 Measures of Central Tendency Measures of Variation Measures of Position Dot Plots Stem-and-Leaf Histograms.
Holt McDougal Algebra Measures of Central Tendency and Variation Recall that the mean, median, and mode are measures of central tendency—values.
StatisticsStatistics Unit 5. Example 2 We reviewed the three Measures of Central Tendency: Mean, Median, and Mode. We also looked at one Measure of Dispersion.
Probability & Statistics Box Plots. Describing Distributions Numerically Five Number Summary and Box Plots (Box & Whisker Plots )
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
Chapter 4 Histograms Stem-and-Leaf Dot Plots Measures of Central Tendency Measures of Variation Measures of Position.
Holt McDougal Algebra 1 Data Distributions Holt Algebra 1 Warm Up Warm Up Lesson Presentation Lesson Presentation Lesson Quiz Lesson Quiz Holt McDougal.
Graphically Representing Data. Objectives: To represent and interpret data displayed on dot plots To represent and interpret data displayed on histograms.
Statistics Vocab Notes Unit 4. Mean The average value of a data set, found by adding all values and dividing by the number of data points Example: 5 +
Statistics Unit 6.
Measures of Central Tendency and Variation
6.1 - Measures of Central Tendency and Variation
10-3 Data Distributions Warm Up Lesson Presentation Lesson Quiz
Please copy your homework into your assignment book
Chapter 5 : Describing Distributions Numerically I
How do I graphically represent data?
Statistics Unit Test Review
Unit 2 Section 2.5.
10-3 Data Distributions Warm Up Lesson Presentation Lesson Quiz
U4D3 Warmup: Find the mean (rounded to the nearest tenth) and median for the following data: 73, 50, 72, 70, 70, 84, 85, 89, 89, 70, 73, 70, 72, 74 Mean:
3-3: Measures of Position
Analyze Data: IQR and Outliers
Unit 4 Statistics Review
Percentiles and Box-and- Whisker Plots
Statistics Unit 6.
Dot Plots & Box Plots Analyze Data.
Dot Plots & Box Plots Analyze Data.
Vocabulary box-and-whisker plot lower quartile upper quartile
The absolute value of each deviation.
Tuesday, February 18th What is the range of the upper 75%?
Advanced Placement Statistics Ch 1.2: Describing Distributions
Measures of Central Tendency
Unit 4 Day 1 Vocabulary.
Unit 4 Day 1 Vocabulary.
Warm Up # 3: Answer each question to the best of your knowledge.
Day 52 – Box-and-Whisker.
10-3 Data Distributions Warm Up Lesson Presentation Lesson Quiz
MCC6.SP.5c, MCC9-12.S.ID.1, MCC9-12.S.1D.2 and MCC9-12.S.ID.3
MCC6.SP.5c, MCC9-12.S.ID.1, MCC9-12.S.1D.2 and MCC9-12.S.ID.3
Please copy your homework into your assignment book
14.2 Measures of Central Tendency
Describing Data Coordinate Algebra.
Statistics Vocab Notes
Analyze Data: IQR and Outliers
Presentation transcript:

Unit 4: Describing Data

Central Tendency S.ID.2: Use statistics appropriate to the shape of the data distribution to compare center and spread of two or more different data sets.

Essential Question: How do I find the mean, median, mode, and range in a numerical set of data?

Measure of Dispersion: Describes the dispersion, or spread, of data. Vocabulary: Mean: the average value from a numerical set of data. The symbol is x, which is read as “x-bar”. Median: the middle number from a numerical set of data when written in order. If the data set has an even number of values, the median is the mean of the two middle values. Also known as the “quartile”. Mode: The mode is the value that occurs most frequently in a data set. There can be one mode, or many. There can also be no mode in a data set. Range: In a numerical set of data, the range is the difference between the highest value and the lowest value. Measure of Dispersion: Describes the dispersion, or spread, of data. Deviation from the Mean: The difference of a data value and the mean of the data set.

Example 1: TEST SCORES: The test scores received by students on a history exam are listed below. Find the mean, median, mode, and range: 65, 68, 71, 77, 81, 82, 86, 88, 93, 93, 95, and 97 MEAN = MEDIAN = MODE = RANGE =

Which measure(s) best represent the data? 65, 68, 71, 77, 81, 82, 86, 88, 93, 93, 95, and 97 MEAN = 83 MEDIAN = 84 MODE = 93 RANGE = 32 Mean and Median BEST represent the data given.

Class work: Red Math 1 books that are under your desk. Pg. 365, #’s 1-4 (find the mean, median, mode, and range) Lastly, do # 15 a, b, and c

1. Mean = 5 Median = 5 Modes = 1, 5 Range = 10 Answers: 1. Mean = 5 Median = 5 Modes = 1, 5 Range = 10 2. Mean = 19, Med = 22, Mode = 25, Range = 18 3. Mean = 16, Med = 13.5, Modes = 8, 28 Range = 22 4. Mean = 2.8, Med = 2.8, Mode = none Range=3.1 15a. 24 15b. Mean = 41.25, Med = 41.5, Mode = none 15c. Both mean and median represent the data well – they are both close to all data points.

Homework: Worksheet

HW Answers: 1. Mean: 58 2. Mean: 56 Median: 59 Median: 55 Mode: none Mode: none Range: 85 Range: 78 3. Mean: 45 4. Mean: 56 Median: 36 Median: 50 Range: 85 Range: 75 5. Mean: 61 6. Mean: 66 Median: 65 Median: 79.5 Mode: none Mode: 80, 81 Range: 86 Range: 82 7. Mean: 45 8. Mean: 75 Median: 41 Median: 80 Range: 87 Range: 81 9. Mean: 48 10. Mean: 55 Median: 43.5 Median: 53 Mode: none Mode: 96 Range: 85 Range: 83

Essential Question: How do I find the mean absolute deviation in a numerical set of data? Standard: S.ID.2

Finding the mean absolute deviation: Find the mean of the data set Subtract the mean from each individual piece of data. Take the absolute value of your answer (MAKE IT POSITIVE!). Add all of those numbers together and divide by the total number of data entries. Round to the nearest hundredth. This value is the mean absolute deviation.

Example 2: Find the mean absolute deviation of the data set: 67, 69, 69, 71, 74, 76 MEAN =

Class work: Textbook pg. 365, #’s 9-14 (just find the mean absolute deviation)

9. 1.68 10. 2.88 11. 4.67 12. 2.25 13. 2.44 14. 0.6

HW: Worksheet

Quartiles Essential Questions: How do I compare summary statistics? How do I find the lower (Q1) and upper quartile (Q3) and the interquartile range? How do I compare summary statistics?

Vocabulary: Quartile: the median of an ordered data set (Q2) Upper quartile: the median of the upper half of an ordered data set (Q3) Lower quartile: the median of the lower half of an ordered data set (Q1) Interquartile Range: the difference of the upper quartile and lower quartile of an ordered data set

Example 1: The data sets below give the number of home runs by each player on the Bears and the Wildcats during a season of the Oakmont Baseball League. Compare the data using the mean, median, range and interquartile range. Bears: 28, 25, 21,19, 18, 14, 10, 8, 7 , 5, 3, 2 Wildcats: 20, 19, 18, 16, 15, 15, 12, 11, 9, 8, 6, 5, 4

Solution: Bears: Wildcats: Mean: 13.33 Mean: 12.15 Median: 12 Median: 12 Range: 26 Range: 16 Lower Quartile: 6 Lower Quartile: 7 Upper Quartile: 20 Upper Quartile: 17 Interquartile Range: 14 Interquartile Range: 10

The Bears’ mean is greater than the Wildcats’ mean so they averaged more homeruns per player than the Wildcats. The Wildcats’ range is less than the Bears’, so their data is less spread out than the Bears’ data. The WC interquartile range is less than the Bears’ interquartile range, so the WC middle 50% of the data showed less variation than the middle 50% of the Bears’ data.

Solution: Average Mean = 10.5 + 11.6 + 13.4 = 3 11.83 Average median = 12.17 Average Range: 25.33 Average Interquartile Range: 15.33

What does the data mean??

Class work: Red Mathematics 1 Textbook: pg. 371, #’s 1-4, 6, 8-9

Answers: 1. Mean= 8 Med = 8 R = 9 LQ = 5.5 UQ = 11 IQR= 5.5 2. Mean= 60.3 Med = 61 R = 55 LQ = 49 UQ = 71 IQR= 22 3. Mean= 4.7 Med = 4.9 R = 5.2 LQ = 3.1 UQ = 6.3 IQR= 3.2 4. Mean= 115 Med = 110 R = 119 LQ = 77 UQ = 148.5 IQR= 71.5 6. Answers may vary 8. Avg. Mean = 86.6 Avg. Median = 87.4 Avg. Range = 34.6 Avg. LQ = 80.2 Avg. UQ = 92.4 Avg. IQR = 12.2 Avg. Mean is greater than pop. The avg. median, range, and IRQ are less. 9. Avg. Mean = increases to 87.3 Avg. Median = increases to 88.75 and is now greater than the population median. Avg. Range = decreases Avg. IQR = decreases

Central Tendency Lab Instructions: Form groups containing 5-6 people per group (move desks accordingly). Measure the height of each group member in inches with a tape measure or ruler (I will provide those). Record those heights (including yours) in the table provided on problem # 3. Complete the worksheet front and back. Each student will be turning in a completed worksheet.

Lab sheet answers:

Essential Questions: How do I create a dot plot from a numerical data set? How do I create a box and whisker plot from a numerical data set?

Discrete data - data that has only a finite number of values. Dot Plot - a graph that shows how discrete data are distributed using a number line. Data distribution - the way in which the data is spread out or clustered together. symmetric - the left and right halves of the graph are nearly mirror images of each other. skewed right - the peak of the data is to the left side of the graph. skewed left - the peak of the data is to the right side of the graph.

Twenty high school students were randomly selected from a very large high school. They were asked to keep a record for a week of the number of hours they slept each night. These seven values were averaged to obtain an average night of sleep for each. The results are as follows: 9, 8, 8, 7.5, 6, 6, 4, 5.5, 7, 8, 5, 7.5, 6.5, 10, 8.5, 6.5, 5, 5.5, 7, and 7.5 hours. Create a dotplot of these data and discuss what the display implies about the data.

The ages of the Oscar winners for best actor and best actress (at the time they won the award) from 1996 to 2004 are as follows: 45, 39, 59, 33, 45, 25, 42, 24, 35, 32, 46, 32, 28, 34, 42, 27, 36, and 30. Create a doplot of these data and discuss what the display implies about the data.

box-and-whiskers plot - displays the data distribution on a five number summary. The five number summary consists of: minimum value Q1 (the first quartile) median Q3 (the third quartile) maximum value

Construct a box and whiskers plot for the data set: {5, 2, 16, 9, 13, 7, 10}

Construct a box and whiskers plot for the data set: 5 9 13 9 9 14 7 3 6 8 8 6 9 8 5

Assignment: Pg. 453-458, #'s: ALL OMIT #'S 7 AND 8 ON PG. 456!!!!!

Answers:

Find the mean, median, mode, range, IQR, make a dot plot and a box and whisker plot for the following data set: 105, 101, 102, 99, 110, 90, 100, 100, 108, 120, 32

Mean: 97 Median: 101 Mode: 100 Range: 88 IQR: 9 32, 90, 99, 100, 100, 101, 102, 105, 108, 110, 120 Five # Summary: Minimum: 32 Q1: 99 Q3: 108 Maximum: 120

Outliers EQ: How do I find outliers and what effects do they have on the context of a data set? Standard: S.ID.3 - Interpret differences in shape, center, and spread in the context of the data sets, accounting for possible effects of extreme data points (outliers).

Outlier: a data value that is significantly greater or lesser than other data values in a data set. You are determining a lower and upper limit for the data. Any value outside of these limits is an outlier. The lower limit is called the "lower fence". The upper limit is called the "upper fence".

Calculating the Fences: lower fence: Q1 - (IQR x 1.5) upper fence: Q3 + (IQR x 1.5)

Example 1: Make a box and whisker plot (find the five number summary) then calculate the upper and lower fences to determine if there are any outliers in the data set. 2, 5, 6, 6, 7, 9, 10, 11, 12, 12, 14, 28, 30 Five # Summary Minimum Value: Q1: Median (Q2): Q3: Maximum Value: Box and Whisker Plot: Lower Fence: Q1 - (IQR x 1.5) Upper Fence: Q3 + (IQR x 1.5) Are there any outliers?

The idea is if any outliers are discovered, that you would remove them and recalculate the values of central tendency to decrease the spread of your data.

Example 2: Same directions as example 1. 10, 13, 17, 20, 22, 24, 24, 27, 28, 29, 35 Five # Summary Minimum Value: Q1: Median (Q2): Q3: Maximum Value: Box and Whisker Plot: Lower Fence: Q1 - (IQR x 1.5) Upper Fence: Q3 + (IQR x 1.5) Are there any outliers?

Example 3: Same directions as example 1. 0, 7, 17, 17, 18, 24, 24, 24, 25, 27, 45 Five # Summary Minimum Value: Q1: Median (Q2): Q3: Maximum Value: Box and Whisker Plot: Lower Fence: Q1 - (IQR x 1.5) Upper Fence: Q3 + (IQR x 1.5) Are there any outliers?

Assignment: Pg. 467, #'s 1-4 AND pg. 468-473, #'s 1-12

HW answers: 1. b 2. d 3. a 4. c 2. IQR = 10 LF: 1 UF: 41 0 is an outlier 3. IQR = 12 LF: 10 UF: 58 9 and 59 are outliers 4. IQR = 11 LF: 15.5 UF: 59.5 no outliers 5. IQR = 6.5 LF: 8.75 UF: 34.75 8 is an outlier 6. IQR = 22.5 LF: 21.25 UF: 111.25 15, 20, and 115 are outliers 8. IQR = 4 LF: 10 UF: 26 9. IQR = 15 LF: 22.5 UF: 82.5 At least 1 outlier on the lower side and at least 1 outlier on the upper side. 10. IQR = 5 LF: -.5 UF: 19.5 At least 1 outlier greater than the upper fence. 11. IQR = 200 LF: 50 UF: 850 At least 1 outlier less than the lower fence. 12. IQR = 6 LF: -5 UF: 19 No outliers.

Class work assignment: Pg. 103-105 ALL (thin books) **problem 1 d. wants you to reconstruct the box and whisker plot by removing the outliers you found and then recalculating your five number summary to create a new box and whisker plot.

Answers:

EQ: How do I use and analyze histograms?

Histogram: a graphical way to display quantitative data using vertical bars. Bin: intervals of data along the horizontal axis of a histogram Frequency: the height of each bar (vertical axis) which is the number of data values included in each bin

Bin Intervals: bin intervals can be written as compound inequalities. For example: 0, 5, 10, 15, 20, 25 across x-axis Data points: 2, 4, 5, 6, 15, 23, 10, 11, 14, 12, 24

Get a Student Text Volume 2 book... Turn to page 463 and look at problem #3. We are going to walk through some examples together. Please draw the histogram at the bottom of pg. 463 in your notes.

Histogram Problem A number of students were asked to calculate their average commute time one-way to school. Their averages (in minutes) are listed in the table below. 2.3 1.8 10.2 7.0 29.4 12.4 15.3 21.1 5.2 12.1 41.9 22.1 9.3 15.2 18.4 21.6 12.0 10.8 31.5 15.9 6.2 30.0 23.6 17.6 12.3 19.4 8.1 32.5 29.1 21.4 16.3 22.7 Create a histogram with bin intervals of 5 minutes. How many modes are their? List all modal groups. Describe the shape of the histogram. Make some conclusions about driving times based on your graph.

Class work / Homework: pg. 97-100 (thin book), #'s 1-2 ALL

Answers: