Statistics and Data Analysis

Chapter 14: Statistics and Data Analysis

Data Analysis Chart Types: Line Plot. Uses a symbol to show frequency.

Data Analysis Chart Types: Bar Graph. Uses bars to indicate frequency.

Data Analysis Chart Types: Back-to-Back Bar Graph. A special bar graph that compares two sets of related data.

Data Analysis Chart Types: Three-Dimensional Bar Graph. Used to show three aspects of a set of data at the same time.

Data Analysis Chart Types: Stem-and-Leaf Plot. Used to organize a large number of data values. Stem: the column on the left, usually the digits in the greatest common place value of the data. Leaf: the column on the right, one-digit numbers in the next greatest place value after the stem.

Data Analysis Chart Types: Create a stem-and-leaf plot for the data below. The following are the grades scored on a quiz with 50 possible points: 42, 49, 36, 32, 10, 19, 38, 40, 41, 50, 40, 49, 30, 20, 48, 47, 40, 41, 32, 37, 25, 41, 43, 37, 39. What is the first thing you need to do? Write the data in numerical order.
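The stem-and-leaf exercise above can be sketched in Python. This is a minimal illustration, not part of the original slides; the function name `stem_and_leaf` is hypothetical, and it assumes two-digit data so that the tens digit is the stem and the ones digit is the leaf.

```python
from collections import defaultdict

def stem_and_leaf(values):
    """Group values into rows: tens digit as stem, ones digit as leaf."""
    rows = defaultdict(list)
    for v in sorted(values):          # first step: write the data in numerical order
        rows[v // 10].append(v % 10)  # stem = tens digit, leaf = ones digit
    return {stem: rows[stem] for stem in sorted(rows)}

grades = [42, 49, 36, 32, 10, 19, 38, 40, 41, 50, 40, 49, 30,
          20, 48, 47, 40, 41, 32, 37, 25, 41, 43, 37, 39]

plot = stem_and_leaf(grades)
for stem, leaves in plot.items():
    print(f"{stem} | {' '.join(str(leaf) for leaf in leaves)}")
```

Run as written, this prints:

```
1 | 0 9
2 | 0 5
3 | 0 2 2 6 7 7 8 9
4 | 0 0 0 1 1 1 2 3 7 8 9 9
5 | 0
```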

Data Analysis Chart Types: Histogram. The most common way of displaying frequency distributions. A type of bar graph in which the width of each bar represents a class interval and the height of the bar represents the frequency in that interval.

Data Analysis Chart Activity: Get in groups of 3 or 4. You will be making a data analysis chart to display and explain to the class. You can look at things like: brothers and sisters; how many days each week you work out, go to the beach, read a book, play a sport, etc.; states visited. Be creative!

Measures of Central Tendency: Measures of average (mean, median, mode). Arithmetic mean, $\bar{X}$: add the values in the set of data and divide by the number of values.

Measures of Central Tendency: General Formula. $\bar{X} = \frac{1}{n}\sum_{i=1}^{n} X_i$. Find the mean of (36.8, 29.5, 29.1, 33.3, 30.0, 20.7, 39.5): 218.9 / 7, about 31.3.

Measures of Central Tendency: Median. The middle value of the ordered data. If there are two middle values, the median is the mean of the two. What is the median of (5, 6, 8, 11, 14)? 8. What is the median of (3, 4, 6, 7, 8, 10)? (6 + 7)/2 = 6.5. The median does not have to be part of the original data set.

Measures of Central Tendency: Mode. The most frequent value. Some sets have multiple modes and others have none. Data with two modes are called "bimodal". The mode, unlike the mean and median, has to be part of the data set.

Example: What are the mean, median, and mode of the data?

Year    # of HR
1918    11
1919    29
1920    54
1921    59
1922    41
1923    46
1924    47
1925    60
1926
1927
1928    49
1929

Mean: 45.2. Median: 46.5. Mode: 46.

Measures of Central Tendency: Recall this example from Lesson 1. The following are the grades scored on a quiz with 50 possible points: 42, 49, 36, 32, 10, 19, 38, 40, 41, 50, 40, 49, 30, 20, 48, 47, 40, 41, 32, 37, 25, 41, 43, 37, 39. Now use your stem-and-leaf plot to help find the mean, median, and mode for the data. Mean: 37.04. Median: 40. Mode: 40 and 41.
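The three measures for the quiz grades can be checked with Python's standard `statistics` module. This is only a sketch for verification; `multimode` (Python 3.8 or later) is used because the data set has more than one mode.

```python
import statistics

grades = [42, 49, 36, 32, 10, 19, 38, 40, 41, 50, 40, 49, 30,
          20, 48, 47, 40, 41, 32, 37, 25, 41, 43, 37, 39]

mean = statistics.mean(grades)        # sum of the values / number of values
median = statistics.median(grades)    # middle value of the sorted data
modes = statistics.multimode(grades)  # every value tied for most frequent

print(mean, median, modes)  # 37.04 40 [40, 41]
```

The sum of the 25 grades is 926, so the mean is 926 / 25 = 37.04. The 13th value in sorted order is 40, and both 40 and 41 occur three times.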

Frequency Distribution Activity: Get into partners and complete the following with your specific data set: find the mean, median, and mode.

Box and Whisker Plot: Measures of Variability. Range of a data set. Quartiles: Q1, Q2, Q3. Which quartile is the median of the data? Q2. Interquartile range: Q3 - Q1. Semi-interquartile range: (Q3 - Q1)/2.

Box and Whisker Plot: Find the interquartile range of the following test scores: 82, 78, 94, 68, 74, 88, 64, 42, 72, 82, 79, 99. Write the scores in order first. What are the mean, median, and mode?

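Box and Whisker Plot: 82, 78, 94, 68, 74, 88, 64, 42, 72, 82, 79, 99. In order: 42, 64, 68, 72, 74, 78, 79, 82, 82, 88, 94, 99. Mean: about 76.8. Median: (78 + 79)/2 = 78.5. Mode: 82. What are Q1, Q2, Q3? Interpolating at positions (n + 1)/4, (n + 1)/2, and 3(n + 1)/4 of the ordered data gives Q1 = 69, Q2 = 78.5, Q3 = 86.5 (other quartile conventions give slightly different values). Interquartile range: 17.5. Semi-interquartile range: 8.75.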
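As a rough check, the quartiles of these test scores can be computed with Python's standard `statistics` module. The sketch rests on one assumption: the `exclusive` method, which interpolates at positions i(n + 1)/4 in the sorted data. Textbooks differ on quartile conventions, so other methods may give slightly different values.

```python
import statistics

scores = [82, 78, 94, 68, 74, 88, 64, 42, 72, 82, 79, 99]

# quantiles() sorts the data itself; n=4 asks for the three quartile cut points.
q1, q2, q3 = statistics.quantiles(scores, n=4, method="exclusive")

iqr = q3 - q1        # interquartile range
semi_iqr = iqr / 2   # semi-interquartile range

print(q1, q2, q3, iqr, semi_iqr)  # 69.0 78.5 86.5 17.5 8.75
```

Q2 equals the median, (78 + 79)/2 = 78.5, since the data set has an even number of values.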

Box and Whisker Plot: Box-and-whisker plots are used to summarize data and illustrate the variability of the data. They display the median, quartiles, interquartile range, and extreme values. The box spans quartiles 1 and 3. The whiskers stop at the extreme values of the set. Outliers: values that are more than 1.5 times the interquartile range beyond the upper or lower quartiles.

Box and Whisker Plot: Draw a box-and-whisker plot for the test scores in the first example: 82, 78, 94, 68, 74, 88, 64, 42, 72, 82, 79, 99.
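Before drawing the plot, it may help to compute the numbers it displays. The sketch below finds the five-number summary and applies the 1.5 × IQR outlier rule to the test scores. It assumes the `exclusive` quartile method of Python's `statistics` module; the variable names are illustrative only.

```python
import statistics

scores = sorted([82, 78, 94, 68, 74, 88, 64, 42, 72, 82, 79, 99])

q1, median, q3 = statistics.quantiles(scores, n=4, method="exclusive")
iqr = q3 - q1

# Outlier rule: more than 1.5 * IQR below Q1 or above Q3.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [x for x in scores if x < lower_fence or x > upper_fence]

# Minimum, Q1, median, Q3, maximum: the values a box-and-whisker plot shows.
five_number = (scores[0], q1, median, q3, scores[-1])

print(five_number)               # (42, 69.0, 78.5, 86.5, 99)
print(lower_fence, upper_fence)  # 42.75 112.75
print(outliers)                  # [42]
```

Under this convention the lowest score, 42, falls just below the lower fence and counts as an outlier.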

Measures of Variability in Data Set: Mean Deviation. The average absolute distance of each piece of data from the mean. Formula: $MD = \frac{1}{n}\sum_{i=1}^{n} |X_i - \bar{X}|$. What is the mean deviation of our example?

Mean Deviation Example: Recall the previous box-and-whisker example: 82, 78, 94, 68, 74, 88, 64, 42, 72, 82, 79, 99. Find the mean deviation.
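A minimal sketch of the mean deviation calculation for these scores, taking the definition literally (the average absolute distance from the mean). The function name `mean_deviation` is hypothetical.

```python
def mean_deviation(values):
    """Average absolute distance of each value from the mean."""
    mean = sum(values) / len(values)
    return sum(abs(x - mean) for x in values) / len(values)

scores = [82, 78, 94, 68, 74, 88, 64, 42, 72, 82, 79, 99]

md = mean_deviation(scores)
print(round(md, 2))  # 10.69
```

The mean is 922 / 12, about 76.83, and the absolute deviations sum to about 128.33, so the mean deviation is about 10.69.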

Frequency Distribution Activity Get into partners and complete the following with your specific data set: Make a Box and Whisker Plot with all necessary information for your specific data set. Find the mean deviation for your data set.

Measures of Variability in Data Set: Standard Deviation. A measure of the average amount each piece of data deviates from the mean. Formula: $\sigma = \sqrt{\frac{1}{n}\sum_{i=1}^{n} (X_i - \bar{X})^2}$.

Measures of Variability in Data Set: Variance. Describes the spread of the data. It is the mean of the squares of the deviations from the average: $\sigma^2 = \frac{1}{n}\sum_{i=1}^{n} (X_i - \bar{X})^2$. Therefore the standard deviation is the positive square root of the variance.

Measures of Variability in Data Set: What are the variance and standard deviation for our test score example? Find the variance first, then the standard deviation.
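One way to check the answer for the test scores (82, 78, 94, 68, 74, 88, 64, 42, 72, 82, 79, 99) is sketched below. It assumes the population form, which divides by n and matches the definition "mean of the squares of the deviations". The sample versions (`statistics.variance`, `statistics.stdev`) divide by n - 1 and give larger values.

```python
import statistics

scores = [82, 78, 94, 68, 74, 88, 64, 42, 72, 82, 79, 99]

variance = statistics.pvariance(scores)  # mean of the squared deviations
std_dev = statistics.pstdev(scores)      # positive square root of the variance

print(round(variance, 2), round(std_dev, 2))  # 204.81 14.31
```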

Frequency Distribution Activity Get into partners and complete the following with your specific data set: Variance and Standard deviation for your data set. Reflect on what these measures tell you about the data.

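Measures of Variability in Frequency Distribution: Standard Deviation of the Data in a Frequency Distribution. $\sigma = \sqrt{\frac{\sum_{i=1}^{k} f_i (X_i - \bar{X})^2}{\sum_{i=1}^{k} f_i}}$, where $X_i$ is the class mark and $f_i$ is the frequency of class $i$.

Measures of Variability in Frequency Distribution: Variance of the Data in a Frequency Distribution. $\sigma^2 = \frac{\sum_{i=1}^{k} f_i (X_i - \bar{X})^2}{\sum_{i=1}^{k} f_i}$.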

Measures of Variability in Frequency Distribution: Make a frequency distribution for the test score example from the box-and-whisker plot lesson: 82, 78, 94, 68, 74, 88, 64, 42, 72, 82, 79, 99. What are the variance, standard deviation, and mean deviation from this frequency distribution?
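The exercise can be sketched as follows. The class intervals are an assumption, since the slides do not specify them: width 10, starting at 40 (40-49, 50-59, ..., 90-99), with each class represented by its class mark (midpoint). A different choice of intervals would change the results.

```python
from collections import Counter
from math import sqrt

scores = [82, 78, 94, 68, 74, 88, 64, 42, 72, 82, 79, 99]
WIDTH = 10  # assumed class width, not given in the slides

# Frequency of each class, keyed by its lower class limit (40, 60, 70, ...).
freq = Counter((x // WIDTH) * WIDTH for x in scores)

# Class mark: midpoint of the integer class, e.g. 40-49 -> 44.5.
marks = {low: low + (WIDTH - 1) / 2 for low in freq}

n = sum(freq.values())
mean = sum(f * marks[low] for low, f in freq.items()) / n
variance = sum(f * (marks[low] - mean) ** 2 for low, f in freq.items()) / n
std_dev = sqrt(variance)
mean_dev = sum(f * abs(marks[low] - mean) for low, f in freq.items()) / n

for low in sorted(freq):
    print(f"{low}-{low + WIDTH - 1}: {freq[low]}")
print(round(mean, 2), round(variance, 2), round(std_dev, 2), round(mean_dev, 2))
```

With these assumed classes the grouped mean is about 76.17, the variance about 180.56, the standard deviation about 13.44, and the mean deviation about 10.28. The values differ from the ungrouped results because each score is replaced by its class mark.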

The Normal Distribution: A frequency distribution that often occurs when a set of data contains a large number of values. It looks like a symmetric, bell-shaped curve called a normal curve. The shape comes from a large number of frequencies falling in the middle of the distribution; only a small percentage fall at the extreme values.

The Normal Distribution: About 68.3% of the data are within 1 standard deviation of the mean. About 95.4% of the data are within 2 standard deviations of the mean. About 99.7% of the data are within 3 standard deviations of the mean.

The Normal Distribution: [Figure: normal curve with the mean value marked at the center. One labeled region represents the values that fall between two and three standard deviations below the mean; another represents the values that fall between one and two standard deviations above the mean.]

The Normal Distribution: The average healing time of a certain type of incision is 240 hours with a standard deviation of 20 hours. What does the normal curve look like? First put in the mean, then figure out each interval. How many patients healed in the 220-260 hour interval if there were a total of 2000 patients? This interval is within 1 standard deviation of the mean: 68.3% × 2000 = 1366. How many patients healed in the 180-300 hour interval if there were a total of 2000 patients? This interval is within 3 standard deviations of the mean: 99.7% × 2000 = 1994.
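The healing-time example can be reproduced with a short sketch. The counts use the rounded percentages 68.3% and 99.7%. The helper `fraction_within` is a hypothetical name; it computes the underlying fractions from the standard normal distribution using `math.erf`, so the rounded figures can be compared with unrounded ones.

```python
from math import erf, sqrt

def fraction_within(k):
    """Fraction of a normal distribution within k standard deviations of the mean."""
    return erf(k / sqrt(2))

MEAN_HOURS, SD_HOURS, PATIENTS = 240, 20, 2000

# Counts based on the rounded percentages 68.3% and 99.7%.
healed_220_260 = round(0.683 * PATIENTS)  # within 1 SD: 240 +/- 20
healed_180_300 = round(0.997 * PATIENTS)  # within 3 SD: 240 +/- 60

for k in (1, 2, 3):
    low, high = MEAN_HOURS - k * SD_HOURS, MEAN_HOURS + k * SD_HOURS
    print(f"{low}-{high} hours: {fraction_within(k):.1%} of patients")

print(healed_220_260, healed_180_300)  # 1366 1994
```

Using the unrounded fractions instead of the rounded percentages can shift a count by one patient.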

Review 14.3: (1) Find the variance and standard deviation for the data set below: 12, 22, 25, 27, 15, 18. (2) Put the following data into a frequency distribution and then find the variance and standard deviation: 11, 16, 18, 25, 29, 22, 24, 5, 9, 2.