Introduction to Summary Statistics

Presentation transcript:

Introduction to Summary Statistics

Statistics
The collection, evaluation, and interpretation of data. Statistical analysis of measurements can help verify the quality of a design or process.

Summary Statistics
Central Tendency: the "center" of a distribution
- Mean, median, mode
Variation: the spread of values around the center
- Range, standard deviation, interquartile range
Distribution: a summary of the frequency of values
- Frequency tables, histograms, normal distribution

Central Tendency: Mean
The mean (average) is the sum of the values in a data set divided by the number of values in that data set.

Sum of the values = 243
Number of values = 11
Mean = (sum of the values) / (number of values) = 243 / 11 = 22.0909...

A Note about Rounding in Statistics
General rule: don't round until the final answer.
- If you are writing intermediate results you may round values, but keep the unrounded number in memory.
Mean: round to one more decimal place than the original data.

Mean: Rounding
Sum of the values = 243
Number of values = 11
Mean = 243 / 11 = 22.0909...
Reported: Mean = 22.1
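The rounding rule can be sketched in a few lines of Python. This is only an illustration: it uses the sum and count quoted on the slide, and it assumes the original data were whole numbers, so "one more decimal place" means one decimal. The variable names are made up for the example.

```python
# Values quoted on the slide: sum of the values and number of values.
total = 243
count = 11

# Keep the unrounded result in memory for any further calculation.
mean = total / count

# Assumption: the original data are whole numbers, so the mean is
# reported with one decimal place (one more than the data).
reported_mean = round(mean, 1)

print(mean)           # unrounded value, roughly 22.0909
print(reported_mean)  # value to report
```

Only `reported_mean` is written down; later steps should keep using `mean`.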

Central Tendency: Mode
A measure of central tendency. The most frequently occurring value in a set of data is the mode. Symbol is M.

For the example data set on the slide: Mode = M = 21

The most frequently occurring value in a set of data is the mode.
Bimodal data set: two numbers of equal frequency stand out.
Multimodal data set: more than two numbers of equal frequency stand out.

Determine the mode of 48, 63, 62, 49, 58, 2, 63, 5, 60, 59, 55
Mode = 63

Determine the mode of 48, 63, 62, 59, 58, 2, 63, 5, 60, 59, 55
Mode = 63 and 59 (bimodal)

Determine the mode of 48, 63, 62, 59, 48, 2, 63, 5, 60, 59, 55
Mode = 63, 59, and 48 (multimodal)
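The three examples above can be checked with a short Python sketch. The helper name `modes` is hypothetical; it counts each value and returns every value tied for the highest frequency.

```python
from collections import Counter


def modes(data):
    """Return every value tied for the highest frequency, in ascending order."""
    counts = Counter(data)
    highest = max(counts.values())
    return sorted(value for value, n in counts.items() if n == highest)


one_mode = modes([48, 63, 62, 49, 58, 2, 63, 5, 60, 59, 55])
bimodal = modes([48, 63, 62, 59, 58, 2, 63, 5, 60, 59, 55])
multimodal = modes([48, 63, 62, 59, 48, 2, 63, 5, 60, 59, 55])

print(one_mode)    # [63]
print(bimodal)     # [59, 63]
print(multimodal)  # [48, 59, 63]
```

Note that if no value repeats, this helper returns every value, so a caller may want to treat that case as "no mode".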

Central Tendency: Median
A measure of central tendency. The median is the value that occurs in the middle of a set of data that has been arranged in numerical order. Symbol is x̃ (x with a tilde).

A data set that contains an odd number of values always has a single middle value, and that value is the median.

For a data set that contains an even number of values, the two middle values are averaged, and the result is the median.
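A minimal Python sketch of the odd and even cases follows. The function name `median` and both data sets are illustrative choices, not the data sets from the slides (those were not preserved in the transcript).

```python
def median(data):
    """Middle value of the sorted data; average of the two middle values if even."""
    ordered = sorted(data)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd count: exactly one middle value.
        return ordered[mid]
    # Even count: average the two middle values.
    return (ordered[mid - 1] + ordered[mid]) / 2


# Illustrative data: nine values (odd), then the same set with one 8 removed (even).
odd_median = median([1, 7, 15, 4, 8, 8, 5, 12, 10])
even_median = median([1, 7, 15, 4, 8, 5, 12, 10])

print(odd_median)   # 8
print(even_median)  # 7.5
```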

Variation: Range
A measure of data variation. The range is the difference between the largest and smallest values that occur in a set of data. Symbol is R.
Range = R = maximum value - minimum value
For the example data set on the slide: R = 44 - 3 = 41
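As a quick illustration, the range formula translates directly into Python. The data set below is hypothetical, chosen only so that its minimum (3) and maximum (44) match the values quoted on the slide; the function name `data_range` is also an assumption.

```python
def data_range(data):
    """Range R = maximum value - minimum value."""
    return max(data) - min(data)


# Hypothetical data: only the minimum (3) and maximum (44) come from the slide.
sample = [3, 12, 27, 44]
r = data_range(sample)

print(r)  # 41
```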

Distribution: Histogram
A histogram is a common data distribution chart that is used to show the frequency with which specific values, or values within ranges, occur in a set of data. An engineer might use a histogram to show the variation of a dimension that exists among a group of parts that are intended to be identical.

Large sets of data are often divided into a limited number of groups. These groups are called class intervals.
[Figure: histogram with class intervals such as "-5 to 5" labeled along the horizontal axis]

The number of data elements in each class interval is shown by the frequency, which is indicated along the Y-axis of the graph.
[Figure: histogram; vertical axis: Frequency; horizontal axis: class intervals]

Example
Data: 1, 7, 15, 4, 8, 8, 5, 12, 10
Sorted: 1, 4, 5, 7, 8, 8, 10, 12, 15

Class interval    Boundaries          Values         Frequency
1 to 5            0.5 < x ≤ 5.5       1, 4, 5        3
6 to 10           5.5 < x ≤ 10.5      7, 8, 8, 10    4
11 to 15          10.5 < x ≤ 15.5     12, 15         2

The height of each bar in the chart indicates the number of data elements, or frequency of occurrence, within each range.
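The histogram example can be reproduced with a short Python sketch. It uses the nine example values and assumes three class intervals of width 5 with half-unit boundaries (0.5, 5.5, 10.5, 15.5); only the 15.5 upper limit is legible in the transcript, so the other boundaries are an inference. The text bars stand in for the chart.

```python
# Example data from the slide.
data = [1, 7, 15, 4, 8, 8, 5, 12, 10]

# Assumed class boundaries: low < x <= high for each interval.
boundaries = [(0.5, 5.5), (5.5, 10.5), (10.5, 15.5)]

# Frequency = number of data elements that fall inside each interval.
frequencies = [
    sum(1 for x in data if low < x <= high)
    for low, high in boundaries
]

# Print a simple text histogram: one '*' per data element.
for (low, high), freq in zip(boundaries, frequencies):
    print(f"{low} < x <= {high}: {'*' * freq} ({freq})")
```

Every value lands in exactly one interval because the boundaries use a strict lower limit and an inclusive upper limit.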

[Figure: histogram of a part dimension with the minimum and maximum sizes marked in inches; the size constraint is written as a compound inequality of the form minimum < x ≤ maximum]

Distribution: Dot Plot
[Figure: dot plot; vertical axis: Frequency]

Distribution: Normal Distribution
Bell-shaped curve.
[Figure: normal distribution; horizontal axis: Data Elements; vertical axis: Frequency]