Presentation on theme: "Statistics 4. Cumulative Frequency and Box Plots"— Presentation transcript:
1 Statistics 4. Cumulative Frequency and Box Plots Mr F’s Maths NotesStatistics4. Cumulative Frequency and Box Plots
2 4. Cumulative Frequency and Box Plots Why do we bother with Statistical Diagrams?The answer to this question is similar to the one for: “why do we bother working out averages and measures of spreads?”.We live in a world jam-packed full of statistics, and if we were forced to look at all the facts and figures in their raw, untreated form, not only would we probably not be able to make any sense out of them, but there is also a very good chance our heads would explode.Statistical Diagrams – if they are done properly - present those figures in a clear, concise, visually pleasing way, allowing us to make some sense out of the figures, summarise them, and compare them to other sets of data.1. What is Cumulative Frequency?Cumulative is just a posh way of saying “add up as you go along”Frequency is just a posh word for “total”So… if you put them together, you get a very posh way of saying “add the totals up as you go along”
3 2. Adding a Cumulative Frequency Column Big ExampleTo the right is a table showing the length of time a group of 40 Year 10 students spent playing on the Nintendo Wii on a gloomy week in January. Draw a Cumulative Frequency Curve, use it to estimate the Median and Inter-Quartile Range, and construct a Box PlotHours spent playingFrequency0 < h ≤ 121 < h ≤ 252 < h ≤ 3103 < h ≤ 4154 < h ≤ 66 < h ≤ 1032. Adding a Cumulative Frequency ColumnBefore you can even start thinking about drawing a Cumulative Frequency Curve, you need to be able to add a Cumulative Frequency column to your Frequency table.Remember, Cumulative Frequency just means that you add up the frequencies as you go along, so that is exactly what you do!This is the number of people who play for 1 hour or lessHours spent playingFrequencyCumulative Freq0 < h ≤ 121 < h ≤ 2572 < h ≤ 310173 < h ≤ 415324 < h ≤ 6376 < h ≤ 10340This is the number of people who play for 2 hours or less (5 + 2)This is the number of people who play for 3 hours or less ( )Check: This final entry should always equal the total frequency!
4 3. Drawing the Cumulative Frequency Curve Remember: we plot Cumulative Frequency (y axis) against the upper boundary of each group (x axis)So… for group one it’s 1 on the x axis and 2 on the yand for group two, it’s 2 on the x axis and 7 on the y…Hours spent playingFrequencyCumulative Freq0 < h ≤ 121 < h ≤ 2572 < h ≤ 310173 < h ≤ 415324 < h ≤ 6376 < h ≤ 10340Things to notice about the Cumulative Frequency Curve:1. When you have finished plotting the points, join them up with a smooth curve.2. Native the curve starts at (0, 0). This is because there is nobody playing less than 0 hours a week!3. You must label your axis correctly, or you lose very easy marks!
6 4. Estimating the Median and Inter-Quartile Range We have spent a while drawing our cumulative frequency curve, so we may as well use it. Very quickly we can come up with estimates for the Median and the Inter-Quartile RangeMedianAs you hopefully remember, the Median is the MIDDLE value.To find it we:1. Work out what is 50% of our total frequency (half way up the y axis)2. Draw a horizontal line across until it hits our curve3. When it hits the curve, draw a vertical line down to the x axis4. The value on the x axis is our Median(b) Inter-Quartile RangeFor this we need to work out the upper quartile (UQ) and the lower quartile (LQ), and then calculate: UQ - LQTo find the Upper Quartile:1. Work out what is 75% of our total frequency (three-quarters of the way up the y axis)2. Draw a horizontal line across until it hits our curve3. When it hits the curve, draw a vertical line down to the x axis4. The value on the x axis is our Upper QuartileThe Lower Quartile is the same, but 25% (one-quarter) of the way up!
7 Median:50% of 40 = 20Median = 3.2 hoursUpper Quartile75% of 40 = 30UQ = 3.8 hoursLower Quartile25% of 40 = 10LQ = 2.4 hoursInter-Quartile Range= UQ – LQ= 3.8 – 2.4= 1.4 hoursRemember: The Median is a form of average, and just like the Range, The Inter-Quartile Range is a measure of consistency
8 5. Drawing Box Plots Lowest value Median Highest value Lower Quartile Box Plots are another way of representing all the same information that can be found on a Cumulative Frequency graph.Top Tip: if you have the chance, draw your box plot directly below your cumulative frequency graph, using the same scale on the x axis, and you can just extend the vertical lines downwards and save yourself a lot of time!Lowest valueMedianHighest valueLower QuartileUpper QuartileInter-Quartile RangeRangeNote: The minimum value is the lowest possible value of your first group, and the maximum value is the highest possible value of your last group
9 Min Value = 0LQ = 2.6Median = 3.2UQ = 3.8Max Value = 10