Quantitative Analysis (Statistics Week 8)

Slides:



Advertisements
Similar presentations
1 A B C
Advertisements

Measures of Location and Dispersion
Simplifications of Context-Free Grammars
Statistics Part II Math 416. Game Plan Creating Quintile Creating Quintile Decipher Quintile Decipher Quintile Per Centile Creation Per Centile Creation.
AP STUDY SESSION 2.
STATISTICS INTERVAL ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
STATISTICS POINT ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
David Burdett May 11, 2004 Package Binding for WS CDL.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 5- 1.
Describing Data: Measures of Dispersion
4-4 Variability Objective: Learn to find measures of variability.
CALENDAR.
Lecture Slides Elementary Statistics Tenth Edition
Mean, Median, Mode & Range
Copyright © 2010 Pearson Education, Inc. Slide The number of sweatshirts a vendor sells daily has the following probability distribution. Num of.
Lecture 7 THE NORMAL AND STANDARD NORMAL DISTRIBUTIONS
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
CS1512 Foundations of Computing Science 2 Lecture 20 Probability and statistics (2) © J R W Hunter,
Multiple-choice example
Chapter 7 Sampling and Sampling Distributions
Box and Whiskers with Outliers. Outlier…… An extremely high or an extremely low value in the data set when compared with the rest of the values. The IQR.
Biostatistics Unit 5 Samples Needs to be completed. 12/24/13.
Break Time Remaining 10:00.
Factoring Quadratics — ax² + bx + c Topic
Elementary Statistics
PP Test Review Sections 6-1 to 6-6
The Frequency Table or Frequency Distribution Table
Frequency Tables and Stem-and-Leaf Plots 1-3
Chapter 6 Normal Distributions Understandable Statistics Ninth Edition
Data Analysis 53 Given the two histograms below, which of the following statements are true?
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
Introduction Our daily lives often involve a great deal of data, or numbers in context. It is important to understand how data is found, what it means,
Chapter 1: Expressions, Equations, & Inequalities
Adding Up In Chunks.
MaK_Full ahead loaded 1 Alarm Page Directory (F11)
When you see… Find the zeros You think….
Slide R - 1 Copyright © 2009 Pearson Education, Inc. Publishing as Pearson Prentice Hall Active Learning Lecture Slides For use with Classroom Response.
AQA - Business Statistics , Quantitative Analysis Peter Matthews
Subtraction: Adding UP
Copyright © 2011 Pearson Education, Inc. Putting Statistics to Work.
Take out the homework from last night then do, Warm up #1
1 hi at no doifpi me be go we of at be do go hi if me no of pi we Inorder Traversal Inorder traversal. n Visit the left subtree. n Visit the node. n Visit.
Statistical Inferences Based on Two Samples
© The McGraw-Hill Companies, Inc., Chapter 10 Testing the Difference between Means and Variances.
A First Course Fifth Edition
Measures of Dispersion. Here are two sets to look at A = {1,2,3,4,5,6,7} B = {8,9,10,11,12,13,14} Do you expect the sets to have the same means? Median?
Basic Statistics Measures of Central Tendency.
Tutorial: Understanding the normal curve. Gauss Next mouse click.
Clock will move after 1 minute
PSSA Preparation.
Copyright © 2013 Pearson Education, Inc. All rights reserved Chapter 11 Simple Linear Regression.
Chapter 14 Nonparametric Statistics
Select a time to count down from the clock above
Copyright Tim Morris/St Stephen's School
9. Two Functions of Two Random Variables
1 Dr. Scott Schaefer Least Squares Curves, Rational Representations, Splines and Continuity.
Commonly Used Distributions
Your next mouse click will display a new screen.
Statistics for the Social Sciences
Calculating & Reporting Healthcare Statistics
Normal Distribution. Objectives The student will be able to:  identify properties of normal distribution  apply mean, standard deviation, and z -scores.
Statistics Recording the results from our studies.
Math II UNIT QUESTION: Can real world data be modeled by algebraic functions? Standard: MM2D1, D2 Today’s Question: How is a normal distribution used to.
C Del Siegle This tutorial is intended to assist students in understanding the normal curve. Adapted from work created by Del Siegle University of.
CHAPTER 11 Mean and Standard Deviation. BOX AND WHISKER PLOTS  Worksheet on Interpreting and making a box and whisker plot in the calculator.
Different Types of Data
Numerical Descriptive Measures
Presentation transcript:

Quantitative Analysis (Statistics Week 8) FDA B&M Level C Peter Matthews matthewsp@bpc.ac.uk

Calculating Standard Deviation 1. Find the mean average of the set 2. For each number, subtract the average from it. 3. Square each of the differences. 4. Add up all the results from Step 3. 5. Divide the sum of squares (Step 4) by the count of numbers in the data set , minus one (n–1). 6. Take the Square Root now gives standard deviation.

What is the standard deviation (in English) By far the most commonly used measure of variability is the standard deviation. The standard deviation of a data set, denoted by s, represents the typical distance from any point in the data set to the centre It’s roughly the average distance from the centre, and in this case, the centre is the average. Most often, you don’t hear a standard deviation given just by itself; if it’s reported (and it’s not reported nearly enough) it’s usually in the fine print, in parentheses, like “(s = 2.68).”

Standard Deviation symbol σ (Population) S (sample) Stdev (if in doubt)

Here are some properties that can help you when interpreting a standard deviation: ✓ The standard deviation can never be a negative number. ✓ The smallest possible value for the standard deviation is 0 (when every number in the data set is exactly the same). ✓ Standard deviation is affected by outliers, as it’s based on distance from the mean, which is affected by outliers. ✓ The standard deviation has the same units as the original data, while variance is in square units.

As we continued to plot foot lengths, a pattern would begin to emerge. Suppose we measured the right foot length of 30 Students and graphed the results. Assume the first person had a Size 10 foot. We could create a bar graph and plot that person on the graph. If our second subject had a size 9 foot, we would add them to the graph. As we continued to plot foot lengths, a pattern would begin to emerge. 8 7 6 5 4 3 2 1 Number of People with that Shoe Size 4 5 6 7 8 9 10 11 12 13 14 Size of Right Foot

Notice how there are more people (n=6) with a size 10 right foot than any other length. Notice also how as the length becomes larger or smaller, there are fewer and fewer people with that measurement. This is a characteristics of many variables that we measure. There is a tendency to have most measurements in the middle, and fewer as we approach the high and low extremes. If we were to connect the top of each bar, we would create a frequency polygon. 8 7 6 5 4 3 2 1 Number of People with that Shoe Size 4 5 6 7 8 9 10 11 12 13 14 Size of Right Foot

You will notice that if we smooth the lines, our data almost creates a bell shaped curve. 8 7 6 5 4 3 2 1 Number of People with that Shoe Size 4 5 6 7 8 9 10 11 12 13 14 Size of Right Foot

You will notice that if we smooth the lines, our data almost creates a bell shaped curve. This bell shaped curve is known as the “Bell Curve” or the “Normal Distribution Curve.” 8 7 6 5 4 3 2 1 Number of People with that Shoe Size 4 5 6 7 8 9 10 11 12 13 14 Length of Right Foot

Tip of the day : Whenever you see a normal curve, you should imagine the bar graph within it. 12 13 14 15 16 17 18 19 20 21 22 Points on a Quiz Number of Students 9 8 7 6 5 4 3 2 1

Now lets look at quiz scores for 51 students. 12 13 14 15 16 17 18 19 20 21 22 Points on a Quiz Number of Students 9 8 7 6 5 4 3 2 1

Mean=17, 12+13+13+14+14+14+14+15+15+15+15+15+15+16+16+16+16+ 16+16+16+16+17+17+17+17+17+17+17+17+17+18+18+18+18+ 18+18+18+18+19+19+19+19+ 19+ 19+20+20+20+20+ 21+21+22 = 867 867 / 51 = 17 12 13 14 15 16 17 18 19 20 21 22 Points on a Quiz Number of Students 9 8 7 6 5 4 3 2 1

Mean=17, Mode=17, 12 13 13 14 14 14 14 15 15 15 15 15 15 16 16 16 16 16 16 16 16 17 17 17 17 17 17 17 17 17 18 18 18 18 18 18 18 18 19 19 19 19 19 19 20 20 20 20 21 21 22 12 13 14 15 16 17 18 19 20 21 22 Points on a Quiz Number of Students 9 8 7 6 5 4 3 2 1

Median=17, Mean=17, Mode=17, 12, 13, 13, 14, 14, 14, 14, 15, 15, 15, 15, 15, 15, 16, 16, 16, 16, 16, 16, 16, 16, 17, 17, 17, 17, 17, 17, 17, 17, 17, 18, 18, 18, 18, 18, 18, 18, 18, 19, 19, 19, 19, 19, 19, 20, 20, 20, 20, 21, 21, 22 12 13 14 15 16 17 18 19 20 21 22 Points on a Quiz Number of Students 9 8 7 6 5 4 3 2 1

Will all fall on the same spot for normal distribution Median=17, Mean=17, Mode=17, Will all fall on the same spot for normal distribution 12 13 14 15 16 17 18 19 20 21 22 Points on a Quiz Number of Students 9 8 7 6 5 4 3 2 1

Normal distributions (bell shaped) are a family of distributions that have the same general shape. They are symmetric (the left side is an exact mirror of the right side) with scores more concentrated in the middle than in the tails. Examples of normal distributions are shown below. Notice that they differ in how spread out they are. The area under each curve is the same.

The Normal Distribution f(X) Changing σ/s (stdev) increases or decreases the spread. σ Mean Median Mode X

Lower Stdev (narrower) Higher Stdev (wider)

+/- 1 standard deviation Empirical Rule For data having a bell-shaped distribution: of the values of a normal random variable are within of its mean. 68.26% +/- 1 standard deviation of the values of a normal random variable are within of its mean. 95.44% +/- 2 standard deviations of the values of a normal random variable are within of its mean. 99.72% +/- 3 standard deviations

If your data fits a normal distribution, approximately 68% of your subjects will fall within one standard deviation of the mean. Approximately 95% of your subjects will fall within two standard deviations of the mean. Over 99% of your subjects will fall within three standard deviations of the mean.

Points on a Different Test The number of points that one standard deviations equals varies from distribution to distribution. On one maths test, a standard deviation = 7 points. The mean = 45, then we would know that 68% of the students scored from 38 to 52. 31 38 45 52 59 Points on Math Test 30 35 40 45 50 55 60 Points on a Different Test On another test, Standard deviation = 5 points. The mean is still 45, then 68% of the students would score from 40 to 50 points.

The mean and standard deviation are useful ways to describe a set of scores. If the scores are grouped closely together, they will have a smaller standard deviation than if they are spread farther apart. Small Standard Deviation Large Standard Deviation Comparing Data Different Means Different Standard Deviations Same Standard Deviations Same Means

Skew refers to the tail of the distribution. Data do not always form a normal distribution. When most of the scores are high, the distributions is not normal, but negatively (left) skewed. Skew refers to the tail of the distribution. Because the tail is on the negative (left) side of the graph, the distribution has a negative (left) skew. 8 7 6 5 4 3 2 1 Number of People with that Shoe Size 4 5 6 7 8 9 10 11 12 13 14 Size of Right Foot

When most of the scores are low, the distributions is not normal, but positively (right) skewed. Because the tail is on the positive (right) side of the graph, the distribution has a positive (right) skew. 8 7 6 5 4 3 2 1 Number of People with that Shoe Size 4 5 6 7 8 9 10 11 12 13 14 Size of Right Foot

When data are skewed, they do not possess the characteristics of the normal curve (distribution). For example, 68% of the subjects do not fall within one standard deviation above or below the mean. The mean, mode, and median do not fall on the same score. The mode will still be represented by the highest point of the distribution, but the mean will be toward the side with the tail and the median will fall between the mode and mean.

Class test 33,17,19,25,28,56,12,45,95,52,65,55,62,42,53,61,19,18,25 Calculate Mean, Median, Mode are these numbers a : Or Something different

Questions ? matthewsp@bpc.ac.uk Next week – scattergraphs, trends, correleation in others words Question 2