Descriptive Statistics

Presentation transcript:

Descriptive Statistics: Advanced Higher Geography

Descriptive statistics include: types of data, measures of central tendency, and measures of dispersion.

Types of data (1) Nominal data: data that has names, e.g. rock types (sedimentary, igneous or metamorphic). Ordinal data: data that can be placed in ascending or descending order, e.g. settlement type (city, town, village and hamlet).

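Types of data (2) Interval data: numerical data measured on a scale with equal intervals but no true zero, e.g. temperature in °C. Very uncommon, so don't worry about it. Ratio data: numerical data with a true zero; this covers most numerical data.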

Central Tendency When you calculate the central tendency of a data set you calculate its average. The measurements used for calculating central tendency include the mean, the mode and the median.

The Mean The mean is one of the most commonly used statistics in geography. It is found by totalling the values for all observations (∑x) and dividing by the total number of observations (n). The formula for finding the mean is: Mean = ∑x / n

The Median The median is the middle value when all of the data is placed in ascending / descending order. Where there are two middle values we take the average of these.

The Mode The mode is the number that occurs the most often. Sometimes there are two (or more) modes. Where there are two modes the data is said to be bi-modal.

Task (5 mins) Find the mean, median and mode of the following data. The weekly pocket money for 9 first year pupils was found to be: 3 – 12 – 4 – 6 – 1 – 4 – 2 – 5 – 8
Answers: Mean 5, Median 4, Mode 4
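The three measures can be checked in a few lines of Python. This is only a minimal sketch using the standard-library `statistics` module and the pocket money figures from the task; the variable names are illustrative.

```python
import statistics

# Weekly pocket money for the 9 first year pupils (from the task)
pocket_money = [3, 12, 4, 6, 1, 4, 2, 5, 8]

# Mean = sum of the values divided by the number of values
mean_value = sum(pocket_money) / len(pocket_money)

# Median = middle value once the data are placed in order
median_value = statistics.median(pocket_money)

# multimode returns every most-frequent value, so bi-modal data gives two results
modes = statistics.multimode(pocket_money)

print(mean_value, median_value, modes)
```

Using `multimode` rather than `mode` is a design choice here: it reports all modes, which matches the bi-modal case described on the mode slide.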

Groups of data Sometimes the data we collect are in group form. Finding the mean is then slightly more difficult: we use the midpoint of each group and multiply this by the frequency.

Slope Angle (°)   Midpoint (x)   Frequency (f)   Midpoint x frequency (fx)
0-4               2              6               12
5-9               7              12              84
10-14             12             7               84
15-19             17             5               85
20-24             22             0               0
Total                            n = 30          ∑(fx) = 265

Calculating the mean For the slope angle data, n = 30 and ∑(fx) = 265. The mean is: ∑(fx)/n = 265 / 30 = 8.8, which is in the 5-9 group.

Calculating the mode We cannot find the mode for grouped data but we can find the modal group. The modal group is the group that occurs most frequently (i.e. the 5-9 group in the slope angle data).
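The grouped-data mean and modal group can be sketched in Python as follows. This is an illustration under stated assumptions: the frequencies for the 5-9, 10-14 and 20-24 groups are inferred from the fx products and the totals given on the slide (n = 30, ∑(fx) = 265), and the variable names are hypothetical.

```python
# Slope angle groups as (lower limit, upper limit, frequency).
# Assumption: frequencies for 5-9, 10-14 and 20-24 are inferred so that
# the totals match the slide (n = 30, sum of fx = 265).
groups = [(0, 4, 6), (5, 9, 12), (10, 14, 7), (15, 19, 5), (20, 24, 0)]

# n is the total number of observations (sum of the frequencies)
n = sum(f for _, _, f in groups)

# Multiply each group's midpoint by its frequency and total the products
sum_fx = sum(((low + high) / 2) * f for low, high, f in groups)

# Grouped mean = sum of fx divided by n
grouped_mean = sum_fx / n

# Modal group = the group with the highest frequency
modal_group = max(groups, key=lambda group: group[2])

print(n, sum_fx, round(grouped_mean, 1), modal_group[:2])
```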

Your turn Read pages 25-29 of 'Geographical Measurements and Techniques: Statistical Awareness', LT Scotland, June 2000. Answer questions 1 & 2 from Task 4 in this book. 10 mins

The Interquartile Range The interquartile range consists of the middle 50% of the values in a distribution; 25% each side of the median (middle value). This calculation is useful because it shows how closely the values are grouped around the median.

The benefits: it is easy to calculate; it is unaffected by extreme values; it is a useful way of comparing sets of similar data.

Interquartile range We know that the median divides the data into two halves. We also know that for a set of n ordered numbers the median is the (n + 1) ÷ 2 th value. Similarly, the lower quartile divides the bottom half of the data into two halves, and the upper quartile divides the upper half of the data into two halves. The lower quartile is the (n + 1) ÷ 4 th value. The upper quartile is the 3 (n + 1) ÷ 4 th value. The interquartile range is the upper quartile minus the lower quartile.
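The quartile positions can be turned into a short Python sketch. The helper name `value_at_position` is hypothetical, and the slides do not say what to do when a position falls between two values, so linear interpolation between the neighbouring values is assumed here (for a position ending in .5 this is the same as averaging the two values). The data are the pocket money figures from the earlier task, and the sketch assumes at least 3 values.

```python
def value_at_position(ordered, position):
    """Return the value at a 1-based position in ordered data.

    Assumption: fractional positions are resolved by linear
    interpolation between the two neighbouring values.
    """
    whole = int(position)          # position of the lower neighbour
    fraction = position - whole    # how far towards the upper neighbour
    lower_value = ordered[whole - 1]
    if fraction == 0:
        return lower_value
    upper_value = ordered[whole]
    return lower_value + fraction * (upper_value - lower_value)


data = sorted([3, 12, 4, 6, 1, 4, 2, 5, 8])
n = len(data)

lower_quartile = value_at_position(data, (n + 1) / 4)
median = value_at_position(data, (n + 1) / 2)
upper_quartile = value_at_position(data, 3 * (n + 1) / 4)
interquartile_range = upper_quartile - lower_quartile

print(lower_quartile, median, upper_quartile, interquartile_range)
```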

Box and whisker diagrams A box and whisker plot is used to display information about the range, the median and the quartiles. It is usually drawn alongside a number line. [Figure: box and whisker diagram]
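The five values a box and whisker diagram displays can be gathered with a small Python sketch. The function name `five_number_summary` is hypothetical, and the example reuses the pocket money figures from the earlier task. It assumes Python 3.8 or later, where `statistics.quantiles` with `method="exclusive"` places the quartiles at the (n + 1)-based positions and interpolates between neighbouring values.

```python
import statistics


def five_number_summary(values):
    """Return the five values needed to draw a box and whisker diagram."""
    ordered = sorted(values)
    # n=4 splits the data into quarters and returns the three cut points
    lower_q, median, upper_q = statistics.quantiles(
        ordered, n=4, method="exclusive"
    )
    return {
        "minimum": ordered[0],        # end of the lower whisker
        "lower quartile": lower_q,    # left edge of the box
        "median": median,             # line inside the box
        "upper quartile": upper_q,    # right edge of the box
        "maximum": ordered[-1],       # end of the upper whisker
    }


summary = five_number_summary([3, 12, 4, 6, 1, 4, 2, 5, 8])
for label, value in summary.items():
    print(f"{label}: {value}")
```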

Drawbacks: It can be a laborious process to calculate the location of the quartiles, especially when there is a large number of data within the set. It does not give any indication of how the entire data set is distributed, just the limits of the middle 50% of the data. Not all values are considered, and hence a false impression may be given of the data set being analysed.

Standard Deviation You could have 2 sets of data that produce the same mean, but the data may have a very different range of values within them. Standard Deviation is a tool that produces a figure indicating the extent to which the data is clustered around the mean.
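A short Python sketch illustrates the point. The two data sets are hypothetical, chosen only so that they share the same mean. The slides do not give a formula, so the population standard deviation (`statistics.pstdev`, which divides by n) is assumed; `statistics.stdev` would give the sample version instead.

```python
import statistics

# Two hypothetical data sets chosen to share the same mean
set_a = [4, 5, 5, 5, 6]   # values tightly clustered around the mean
set_b = [1, 3, 5, 7, 9]   # values widely spread around the mean

for name, values in (("A", set_a), ("B", set_b)):
    mean_value = statistics.mean(values)
    # Assumption: the values are treated as the whole population
    sd = statistics.pstdev(values)
    print(f"Set {name}: mean = {mean_value}, SD = {sd:.2f}")
```

Both sets have a mean of 5, but set B has the larger standard deviation because its values lie further from the mean.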

The Normal distribution curve

The normal curve assumes that the data in your sample follow a symmetrical, bell-shaped distribution around the mean. The standard deviation gives important information as it indicates the shape of the normal curve. If the SD is large, it suggests a wide spread of data around the mean and a flatter, wider normal distribution curve. If the SD is small, it suggests a narrow spread around the mean and a steep, narrow normal distribution curve.

A smaller SD suggests a more reliable mean. There are likely to be few extreme values. It is also useful for comparing two sets of data that may have similar means but quite different ranges of data within each set.