IB Math Studies – Topic 6 Statistics.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

DESCRIBING DISTRIBUTION NUMERICALLY
Chapter 2 Exploring Data with Graphs and Numerical Summaries
Material Taken From: Mathematics for the international student Mathematical Studies SL Mal Coad, Glen Whiffen, John Owen, Robert Haese, Sandra Haese and.
Statistics Unit 6.
IB Math Studies – Topic 6 Statistics.
Statistical Tests Karen H. Hagglund, M.S.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
1 Economics 240A Power One. 2 Outline w Course Organization w Course Overview w Resources for Studying.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Describing Distributions Numerically
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
Statistics: Use Graphs to Show Data Box Plots.
IB Math Studies – Topic 6 Daniela and Megan. IB Course Guide Description.
Describing Data: Numerical
Describing distributions with numbers
Chapter 1 Descriptive Analysis. Statistics – Making sense out of data. Gives verifiable evidence to support the answer to a question. 4 Major Parts 1.Collecting.
Chapter 2 Describing Data with Numerical Measurements General Objectives: Graphs are extremely useful for the visual description of a data set. However,
Objectives 1.2 Describing distributions with numbers
Material Taken From: Mathematics for the international student Mathematical Studies SL Mal Coad, Glen Whiffen, John Owen, Robert Haese, Sandra Haese and.
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 1 Chapter 4 Numerical Methods for Describing Data.
© 2008 Brooks/Cole, a division of Thomson Learning, Inc. 1 Chapter 4 Numerical Methods for Describing Data.
Do Now. Chapter 5 Section E and F Vocabulary Relative frequency- frequency expressed as a fraction of the total frequency Cumulative frequency- sum of.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Chapter 2 Describing Data.
By: Amani Albraikan 1. 2  Synonym for variability  Often called “spread” or “scatter”  Indicator of consistency among a data set  Indicates how close.
Describing distributions with numbers
Lecture 3 Describing Data Using Numerical Measures.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
The Statistical Analysis of Data. Outline I. Types of Data A. Qualitative B. Quantitative C. Independent vs Dependent variables II. Descriptive Statistics.
Numerical Measures of Variability
Material Taken From: Mathematics for the international student Mathematical Studies SL Mal Coad, Glen Whiffen, John Owen, Robert Haese, Sandra Haese and.
1 Chapter 4 Numerical Methods for Describing Data.
ContentFurther guidance  Hypothesis testing involves making a conjecture (assumption) about some facet of our world, collecting data from a sample,
Box Plots & Cumulative Frequency Graphs
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Measures of Center & Spread. Measures of Center.
CHAPTER 1 Basic Statistics Statistics in Engineering
Material Taken From: Mathematics for the international student Mathematical Studies SL Mal Coad, Glen Whiffen, John Owen, Robert Haese, Sandra Haese and.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
StatisticsStatistics Did you hear about the statistician who put her head in the oven and her feet in the refrigerator? She said, "On average, I feel just.
Measures of the Central of a Distribution: M ean –i–i s what we know as the average –s–s ymbol:, x bar –i–i s generally a good representative of the data.
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
Unit 3 Guided Notes. Box and Whiskers 5 Number Summary Provides a numerical Summary of a set of data The first quartile (Q 1 ) is the median of the data.
1 By maintaining a good heart at every moment, every day is a good day. If we always have good thoughts, then any time, any thing or any location is auspicious.
Statistics Vocabulary. 1. STATISTICS Definition The study of collecting, organizing, and interpreting data Example Statistics are used to determine car.
MM150 ~ Unit 9 Statistics ~ Part II. WHAT YOU WILL LEARN Mode, median, mean, and midrange Percentiles and quartiles Range and standard deviation z-scores.
Chapter 2 Describing Data: Numerical
Business and Economics 6th Edition
Statistics 1: Statistical Measures
Material Taken From: Mathematics for the international student Mathematical Studies SL Mal Coad, Glen Whiffen, John Owen, Robert Haese, Sandra Haese.
Unit 2 Section 2.5.
Description of Data (Summary and Variability measures)
Ch. 18- Descriptive Statistics.
Correlation At a tournament, athletes throw a discus. The age and distance thrown are recorded for each athlete: Do you think the distance an athlete.
Chapter 3 Describing Data Using Numerical Measures
Unit 7: Statistics Key Terms
Topic 5: Exploring Quantitative data
Consider this table: The Χ2 Test of Independence
6F Measuring the Spread of Data, 6G Box and Whisker Plots
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Descriptive Statistics
Basic Biostatistics Measures of central tendency and dispersion
Presentation transcript:

IB Math Studies – Topic 6 Statistics

IB Course Guide Description

IB Course Guide Description

IB Course Guide Description

Describing Data Types of Data Categorical Data – Describes a particular quality or characteristic. It can be divided into categories. i.e. color of eyes or types of ice cream Quantitative Data – Contains a numerical value. The information collected is termed numerical data. Discrete – Takes exact number values and is often the result of counting. i.e. number of TVs or number of houses on a street Continuous – Takes numerical values within a certain range and is often a result of measuring. i.e. the height of seniors or the weight of freshman

Types of Distribution Symmetric Distribution Positively Skewed Distribution Negatively Skewed Distribution

Example 1: Describing Data 24 families were surveyed to find the number of people in the family. The results are: 5, 9, 4, 4, 4, 5, 3, 4, 6, 8, 8, 5, 7, 6, 6, 8, 6, 9, 10, 7, 3, 5, 6, 6 Is this data discrete or continuous? Construct a frequency table for the data. Display the data using a column graph. Describe the shape of the distribution. Are there any outliers? What percentage of families have 5 or fewer people in them?

Standard Deviation Formula x is any score is the mean n is the number of scores

Calculate the standard deviation Values 2 4 5 6 7 35 Calculate the mean Subtract the mean from each value Square these Add them Divide by n Take the square root

Standard Deviation on the GDC Type data in List 1 1-Var Stats L1 On paper you’ll see ‘s’ being used to standard for standard deviation. But you should use the σ measurement from the calculator.

Measuring the Spread of Dara The median is the second quartile, Q2 or 50th percentile The lower quartile, Q1, is the median of the lower half of the data or 25th percentile The upper quartile, Q3, is the median of the upper half of the data or 75th percentile The inter-quartile range is the difference in the upper quartile and the lower quartile. IQR = Q3 – Q1

Box Plots The inter-quartile range is the width of the box. The maximum length of each whisker is 1.5 times the inter-quartile range. Any data value that is larger than (or smaller than) 1.5 × IQR is marked as an outlier.

To Create a Box-and-Whisker Plot: Make a number line. Create the box between Q1 and Q3. Draw in Q2. Determine any outliers: Upper boundary = Q3 + 1.5(IQR) Lower boundary = Q1 – 1.5(IQR) Plot any outliers. Extend the whiskers to the maximum & minimum (provided they’re not outliers).

Example :Box and Whisker Plots A hospital is trialing a new anesthetic drug and has collected data on how long the new and old drugs take before the patient becomes unconscious. They wish to know which drug acts faster and which is more reliable. Old drug times: 8, 12, 9, 8, 16, 10, 14, 7, 5, 21, 13, 10, 8, 10 11, 8, 11, 9, 11, 14 New drug times: 8, 12, 7, 8, 12, 11, 9, 8, 10, 8, 10, 9, 12, 8, 8, 7, 10, 7, 9, 9 Prepare a parallel box plot for the data sets and use it to compare the two drugs for speed and reliability.

FORMULA Pearson’s Correlation Coefficient: r

Correlation Coefficient on the GDC Turn on your Diagnostics Enter the data in L1 and L2 LinReg L1, L2

Example 1: Correlation Coefficient In an experiment a vertical spring was fixed at its upper end. It was stretched by hanging different weights on its lower end. The length of the spring was then measured. The following readings were obtained. Load (kg) x 1 2 3 4 5 6 7 8 Length (cm) y 23.5 25 26.5 27 28.5 31.5 34.5 36 37.5 (i) Write down the mean value of the load, (ii) Write down the standard deviation of the load. (iii) Write down the mean value of the length, (iv) Write down the standard deviation of the length. Nov 2006, paper 2, question 3 It is given that the covariance Sxy is 12.17. (d) (i) Write down the correlation coefficient, r, for these readings. (ii) Comment on this result.

Average speed in the metropolitan area and age of drivers Example 2: Correlation Coefficient Average speed in the metropolitan area and age of drivers The r-value for this association is 0.027. Describe the association.

Drawing the Line of Best Fit Calculate mean of x values , and mean of y values Mark the mean point on the scatter plot Draw a line through the mean point that is through the middle of the data equal number of points above and below line

Least Squares Regression Line Consider the set of points below. Square the distances and find their sum. we want that sum to be small. The regression line is used for prediction purposes. The regression line is less reliable when extended far beyond the region of the data.

Line of Regression using GDC LinReg(ax +b) Test, L1, L2 where L1 contains your independent data. and L2 contains your dependent data

Example 3: Line of Regression The table shows the annual income and average weekly grocery bill for a selection of families Construct a scatter plot to illustrate the data. Use technology to find the line of best fit. Estimate the weekly grocery bill for a family with an annual income of £95000. Comment on whether this estimate is likely to be reliable. Example 7 on page 610 of second edition

X2 Test of Independence The variables may be dependent: Females may be more likely to exercise regularly than males. The variables may be independent: Gender has no effect on whether they exercise regularly. A chi-squared test is used to determine whether two variables from the same sample are independent.

How to do it: Write the null hypothesis (H0) and the alternate hypothesis (H1). Create contingency tables for observed and expected values. Calculate the chi-square statistic and degrees of freedom. Find the chi-squared critical value (booklet). Depends on the level of significance (p) and the degrees of freedom (v). Determine whether or not to accept the null hypothesis.

Contingency Tables Observed Frequencies Expected Frequencies Column1 Totals Row1 a b sum row1 Row2 c d sum row2 Sum column1 Sum column2 total Column1 Column2 Totals Row1 sum row1 Row2 sum row2 sum column 1 sum column 2 total

Χ2 Statistic on the GDC On the calculator: Put your contingency table in matrix A STAT TESTS C: χ2 Test Observed: [A] Expected: [B] (this is where you want to go) Calculate Output: Χ2  Χ2 calculated value df  degrees of freedom in Matrix B  expected values

Find the Critical Value Get this from the formula booklet. Significance level (p) is always given in the problem. A 5% significance level = 95% confidence level Degrees of freedom: v = (c - 1)(r – 1) where c = number of columns in table and r = number of rows in table

Accepting the Null Hypothesis If X2calc < Critical Value ACCEPT the null hypothesis If X2calc > Critical Value REJECT the null hypothesis Hoffman’s train of thought: Rejection is bad Want calc value to be smaller than booklet value (the calculator is physically smaller than the booklet)

Important IB Notes: In examinations: the value of sxy will be given if required. sx represents the standard deviation of the variable X; sxy represents the covariance of the variables X and Y. A GDC can be used to calculate r when raw data is given. For the EXAM students do NOT need to know how to find the covariance. But, for their project if they’re doing regression, then they DO need to do covariance by hand so they can do the r by hand so they can get points for using a sophisticated math process.