Determining the Best Measure of Center

Slides:



Advertisements
Similar presentations
CHAPTER 1 Exploring Data
Advertisements

Analyzing Data Sets For One Variable
Describing distributions with numbers
Objective To understand measures of central tendency and use them to analyze data.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
The Central Tendency is the center of the distribution of a data set. You can think of this value as where the middle of a distribution lies. Measure.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
1 Review Sections 2.1, 2.2, 1.3, 1.4, 1.5, 1.6 in text.
1.3 Describing Quantitative Data with Numbers Pages Objectives SWBAT: 1)Calculate measures of center (mean, median). 2)Calculate and interpret measures.
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
Graphically Representing Data. Objectives: To represent and interpret data displayed on dot plots To represent and interpret data displayed on histograms.
Describing Distributions of Quantitative Data
Chapter 1: Exploring Data
Bellwork 1. Order the test scores from least to greatest: 89, 93, 79, 87, 91, 88, Find the median of the test scores. 79, 87, 88, 89, 91, 92, 93.
Chapter 1: Exploring Data
Chapter 3 Describing Data Using Numerical Measures
CHAPTER 2: Describing Distributions with Numbers
Statistics Unit Test Review
CHAPTER 2: Describing Distributions with Numbers
Chapter 3 Describing Data Using Numerical Measures
CHAPTER 1 Exploring Data
Means & Medians Chapter 4.
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Means & Medians Chapter 4.
Homework: Frequency & Histogram worksheet
Please take out Sec HW It is worth 20 points (2 pts
Means & Medians Chapter 5.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Calculating IQR and Identifying Outliers
Good research questions
CHAPTER 1 Exploring Data
Means & Medians Chapter 4.
Describing Quantitative Data with Numbers
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 2: Describing Distributions with Numbers
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Warm Up # 3: Answer each question to the best of your knowledge.
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Myers Chapter 1 (F): Statistics in Psychological Research: Measures of Central Tendency A.P. Psychology.
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Means & Medians.
Advanced Algebra Unit 1 Vocabulary
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Compare and contrast histograms to bar graphs
Chapter 1: Exploring Data
Describing Data Coordinate Algebra.
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Presentation transcript:

Determining the Best Measure of Center

Objectives: To calculate and interpret the mean and median of a data set To estimate the mean and median of a data set from its data distribution To determine which measure of central tendency (mean or median) is best to use for a data set

the sum of the data values the number of data values Previously you analyzed a data set by creating a graphical representation of the data. However, you can also analyze a data set by describing numerical characteristics, or STATISTIC, of the data. A statistic that describes the β€œcenter” of a data set is called a measure of central tendency. A MEASURE OF CENTRAL TENDENCY is the numerical values used to describe the overall clustering of data in a set. Two measures of central tendency that are typically used to describe a set of data are the mean and the median. The arithmetic mean, or mean, represents the sum of the data values divided by the number of values. A common notation for the mean is π‘₯ , which is read β€œπ‘₯ bar.” The formula shown represents the mean of a data set: π‘₯ = π‘₯ 𝑛 the sum of the data values mean the number of data values

Use the table on page 471 to complete Question 1: 1. Represent the sugar amount in different cereals using the formula for the mean. Then determine the mean of the data set.

π‘₯ Ξ£π‘₯ 𝑛 How would we use DESMOS to help? 2. For each given symbol, state what it represents and its calculated value. π‘₯ Ξ£π‘₯ 𝑛

Now answer question 4 on page 472: 4. Determine the median sugar amount in grams in one serving of cereal. Interpret the meaning in terms of this problem situation. How would we use DESMOS to help? Now look at question 5 on page 473. Locate and label the mean and median values. Now answer question 6. Which measure best represents the data set?

to enter the data and obtain min, max, median, Q1, and Q3 Many graphing calculators will create a box-and-whisker plot for you. Unfortunately DESMOS is not a great resource for constructing a box-and-whisker plot. I recommend using the following websites in conjunction with each other: http://www.shodor.org/interactivate/activities/BoxPlot/ to enter the data and obtain min, max, median, Q1, and Q3 followed by http://www.imathas.com/stattools/boxplot.html to create a clean graph Use these websites to create the box-and whisker plot for the sugar in breakfast cereal data from the table on page 471. How does it compare to the graph given in question 5?

Let’s consider the data set without the value of 22. Now use these websites to complete Question 7 on Pages 473 & 474: Let’s consider the data set without the value of 22. Remove the value of 22 from the data set. Use your graphing calculator to create a box-and-whisker plot for the new data set. Plot above the given box-and-whisker plot your new box-and-whisker plot on the same graph in Question 5. How does the removal of the value 22 affect the distribution of the data set? Did the mean and median change with the removal of the value 22? Does your choice for the best measure of center from Question 6 still hold true?

Let’s review the homework from Pages 474 – 476, PROBLEM 2: When the distribution of data is approximately symmetric, the mean is generally the more appropriate measure of center to use. When the distribution of data is skewed left or skewed right, the median is the more appropriate measure of center to use. The reason why the mean is more appropriate in a symmetric data distribution is due to the fact that most data points are close to the mean. There are not many if any data values that are much greater or lesser than the mean. In a skewed left or right distribution, most data values are closer to the median with few data points being much greater or lesser than the median. Therefore, the median is not affected by these values.

To create a histogram, I recommend the following website: http://www.shodor.org/interactivate/activities/Histogram/ Use this website to create a histogram for the sugar amounts in one serving of cereal, from the table on page 471. Use an interval size of 4 and minimums of zero for the x- and y- axes.