What is Statistics? Statistics is the science of collecting, analyzing, and drawing conclusions from data –Descriptive Statistics Organizing and summarizing.

Slides:



Advertisements
Similar presentations
Click the mouse button or press the Space Bar to display the answers.
Advertisements

Analyzing Data (C2-5 BVD) C2-4: Categorical and Quantitative Data.
Histograms & Stemplots for Quantitative Data. Describing Data using Summary Features of Quantitative Variables Center — Location in middle of all data.
Quantitative Data Continued Histograms. Used with numerical data Bars touch on histograms Two types – Discrete Bars are centered over discrete values.
Displaying & Summarizing Quantitative Data
Chapter 1 & 3.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
AP Statistics Overview
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
AP STATISTICS Section 1.1: Displaying Distributions.
Chapter 4: Displaying & Summarizing Quantitative Data
Chapter 1 Exploring Data
Chapter 4 Displaying Quantitative Data. Graphs for Quantitative Data.
1.1 Displaying Distributions with Graphs
Have out your calculator and your notes! The four C’s: Clear, Concise, Complete, Context.
Chapter 4 Displaying and Summarizing Quantitative Data Math2200.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
Warm Up – Find the mean, median & mode of each set. Data Set I Data Set II.
Lesson Describing Distributions with Numbers adapted from Mr. Molesky’s Statmonkey website.
Visual Displays for Quantitative Data
Categorical vs. Quantitative…
Displaying Quantitative Data Graphically and Describing It Numerically AP Statistics Chapters 4 & 5.
Bellwork 1. If a distribution is skewed to the right, which of the following is true? a) the mean must be less than the.
Unit 4 Statistical Analysis Data Representations.
MMSI – SATURDAY SESSION with Mr. Flynn. Describing patterns and departures from patterns (20%–30% of exam) Exploratory analysis of data makes use of graphical.
Section 1-1 Day One Types of Data Bar Graphs, Pie Charts Dots Plots, Stem and leaf plots, Histograms.
Organizing Data AP Stats Chapter 1. Organizing Data Categorical Categorical Dotplot (also used for quantitative) Dotplot (also used for quantitative)
Notes Unit 1 Chapters 2-5 Univariate Data. Statistics is the science of data. A set of data includes information about individuals. This information is.
Histograms & Stemplots for Quantitative Data Describing Data using Summary Features of Quantitative Variables Center — Location in middle of all data.
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
MATH 2311 Section 1.5. Graphs and Describing Distributions Lets start with an example: Height measurements for a group of people were taken. The results.
Class Two Before Class Two Chapter 8: 34, 36, 38, 44, 46 Chapter 9: 28, 48 Chapter 10: 32, 36 Read Chapters 1 & 2 For Class Three: Chapter 1: 24, 30, 32,
What is Statistics?. Statistics 4 Working with data 4 Collecting, analyzing, drawing conclusions.
Unit 1 - Graphs and Distributions. Statistics 4 the science of collecting, analyzing, and drawing conclusions from data.
Analyzing Data Week 1. Types of Graphs Histogram Must be Quantitative Data (measurements) Make “bins”, no overlaps, no gaps. Sort data into the bins.
Describe Quantitative Data with Numbers. Mean The most common measure of center is the ordinary arithmetic average, or mean.
UNIT ONE REVIEW Exploring Data.
Quantitative Data Continued
CHAPTER 1 Exploring Data
Unit 4 Statistical Analysis Data Representations
Choosing the “Best Average”
Chapter 1 & 3.
Objective: Given a data set, compute measures of center and spread.
1st Semester Final Review Day 1: Exploratory Data Analysis
MATH 2311 Section 1.5.
Bell Ringer Create a stem-and-leaf display using the Super Bowl data from yesterday’s example
Laugh, and the world laughs with you. Weep and you weep alone
Distributions and Graphical Representations
Unit 1 - Graphs and Distributions
Basics of Statistics.
Frequency Distributions
Measure of Center And Boxplot’s.
Measure of Center And Boxplot’s.
Displaying Distributions with Graphs
Displaying and Summarizing Quantitative Data
POPULATION VS. SAMPLE Population: a collection of ALL outcomes, responses, measurements or counts that are of interest. Sample: a subset of a population.
Organizing Data AP Stats Chapter 1.
Means & Medians.
What is Statistics? Day 2..
Describing Data (Univariate Data) Ch. 1
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 3rd Edition
Honors Statistics Review Chapters 4 - 5
8/21/2017 Homework: pg. 46 #3-6 3.) A. Stemplot. The dots are too spread out to identify the shape of the distribution. B. Cumulative counts of observations.
Probability and Statistics
Section 1.1: Displaying Distributions
Displaying and Summarizing Quantitative Data
Types of variables. Types of variables Categorical variables or qualitative identifies basic differentiating characteristics of the population.
Describing Data (Univariate Data) Ch. 1
MATH 2311 Section 1.5.
Presentation transcript:

What is Statistics? Statistics is the science of collecting, analyzing, and drawing conclusions from data –Descriptive Statistics Organizing and summarizing –Inferential Statistics Generalizing from a sample to the population from which it was selected

What kind of data is there? How can it be graphed for visual comparison? How can it be described verbally? How can it be analyzed numerically? Describing Data

Data--Types of Variables CategoricalGroup or category names w/no order Eye Color (brown, blue, green) QuantitativeNumerical values (in order, can be averaged, etc.) Weight (117 lbs, 170 lbs, 253 lbs)

Types of Quantitative (Numerical) Data Discrete Takes on only certain values Example: Number of siblings, number of pockets in a pair of jeans, number of free throws made in a season,… Continuous Takes on any of an infinite number of values Example: Time, Weight, Height, …because of our limitations of measurement accuracy we often round to the nearest second, ounce, inch,…

Describing Univariate Data The distribution of a variable tells us what values the variable takes, how often it takes those values, and shows the pattern of variation Categorical –Bar graph –Segmented Bar Graph –Pie chart Quantitative –Dotplot –Stemplot (Stem & leaf) –Histogram (Frequency distribution) –Ogive: Cumulative relative frequency plot –Boxplot

Bar, Segmented Bar, & Pie Charts

What’s misleading about this graph?

Source: Marist Institute for Public Opinion How is this graph misleading?

Describing Data using Summary Features of Quantitative Variables Center — Location in middle of all data Unusual features - Outliers, gaps, clusters Spread—Measure of variability, range Shape—Distribution pattern: symmetric, skewed, uniform, bimodal, etc. Always CUSS in context!

Dotplot for Univariate Quantitative Data

1.11 Stemplot Answer (c) The distribution is skewed to the right. The spread is approximately 90 (3 to 93). The center of the distribution is at approximately $28. There are several moderate outliers visible in the split- stem plot; specifically, the five amounts of $70 or more. While most shoppers spent small to moderate amounts of money around $30, a “cluster” of shoppers spent larger amounts ranging from $70 to $93. a) 1| 9 represents $19 spent at store b) Stemplots: Stems & Leaves in order Leave stem blank if no leaf Split stems if too few stems

Back-to-back Stemplot Babe Ruth Roger Maris | 0 | 8 | 1 | 3, 4, 6 5, 2 | 2 | 3, 6, 8 5, 4 | 3 | 3, 9 9, 7, 6, 6, 6, 1, 1 | 4 9, 4, 4 | 5 | 0 | 6 | 1 Number of home runs in a season When comparing data, use comparative language! (higher, more than, etc.)

Histogram of Discrete Data: Rolling a fair six-sided die 300 times

1.14 Answer Histogram of Continuous Data The center is located at 350 ($350,000). There appears to be one outlier of $1,103,000. The distribution is skewed to the right with a peak in the $200,000s. The spread is approximately $1,082,000 ($21,000 to $1,103,000) Which bars did the $200,000 and $300,000 salaries go? Border values always go in the bar on the right! (First bar is salaries of at least 0 to less than $100,000.)

Histograms on the calculator Enter data into List1 by going to Stat, 1:Edit Turn StatPlot on and choose histogram option. Set Xlist to the list you used to enter in the data. Choose 1 for Freq or a 2nd list if data is stored in two lists (values in one, frequency in another) Press Zoom 9:Statplot to set window to the data initially Check the window and set reasonable, pretty values of min & max for both x (values) and y (frequency count). The Xscl will set the width of the bins – make this is a “pretty” number also! Then press graph to see the adjusted graph Press trace to see details of the graph

Histogram of People’s Weights

Data from Histogram Weight Class IntervalFrequency Relative Frequency Cumulative Relative Frequency 100 to < to < to < to < to < to < to < 2404 Total

Ogive: Cumulative Relative Frequency Graph Weight (in pounds) Cumulative Relative Frequency

5 Number Summary Minimum Q1 (lower quartile) is the 25 th percentile of ordered data or median of lower half of ordered data Median (Q2) is 50 th percentile, or middle number of ordered data (average the two middle numbers if there is an even number of #s) Q3 (upper quartile) is the 75 th percentile of ordered data or median of upper half of ordered data Maximum Range = Maximum – minimum IQR(Interquartile Range) = Q3 – Q1 Outlier Formula: Any point that falls below Q1- 1.5(IQR) or above Q (IQR) is considered an outlier.

Boxplot – using the 5 # summary Salaries from 1.14 – Enter in calc and press stat, calc, 1-var stats Min 21 Q1 250 Median 350 Q3 543 Max 1103 Check for outliers: IQR = Q3 – Q1 = =293 Low boundary: Q (IQR) = 250 – 1.5(293) = no outliers on low end since no salaries are less than this High boundary: Q (IQR) = (293) = one outlier on high end (1103) since it is higher than Max value that’s not an outlier

Scatterplot—Bivariate quantitative data