Statistics It is the science of planning studies and experiments, obtaining sample data, and then organizing, summarizing, analyzing, interpreting data,

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

Created by Tom Wegleitner, Centreville, Virginia
STATISTICS ELEMENTARY MARIO F. TRIOLA EIGHTH EDITION.
Population Population
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 1 Introduction to Statistics 1-1 Overview 1-2 Types of Data 1-3 Critical Thinking 1-4 Design of.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
EDU 660 Methods of Educational Research Descriptive Statistics John Wilson Ph.D.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Created by Tom Wegleitner, Centreville, Virginia Section 3-1.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Section 1-3 Types of Data.
Measures of Relative Standing and Boxplots
Introduction to Statistics
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2004 Pearson Education, Inc.
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Lecture Slides Elementary Statistics Twelfth Edition
Slide Slide 1 Baby Leo’s 4-month “Healthy Baby” check-up reported the following: 1)He is in the 90 th percentile for weight 2)He is in the 95 th percentile.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Created by Tom Wegleitner, Centreville, Virginia Section 3-1 Review and.
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Slide 1 Statistics Workshop Tutorial 6 Measures of Relative Standing Exploratory Data Analysis.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Elementary Statistics Eleventh Edition Chapter 3.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
1 Chapter 1. Section 1-1 and 1-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Chapter 2 Describing Data.
1  Specific number numerical measurement determined by a set of data Example: Twenty-three percent of people polled believed that there are too many polls.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Measures of Center.
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Essentials of Statistics Third Edition by Mario.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 1-3 Types of Data.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
1 Measures of Center. 2 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Notes Unit 1 Chapters 2-5 Univariate Data. Statistics is the science of data. A set of data includes information about individuals. This information is.
Honors Statistics Chapter 3 Measures of Variation.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Biostatistics Introduction Article for Review.
Chapter 1 Introduction to Statistics 1-1 Overview 1-2 Types of Data 1-3 Critical Thinking 1-4 Design of Experiments.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Measures of Center.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Lecture Slides Elementary Statistics Twelfth Edition
STATISTICS ELEMENTARY MARIO F. TRIOLA EIGHTH EDITION.
Midrange (rarely used)
statistics Specific number
Lecture Slides Essentials of Statistics 5th Edition
Lecture Slides Elementary Statistics Twelfth Edition
Chapter 3 Statistics for Describing, Exploring, and Comparing Data
Lecture Slides Elementary Statistics Twelfth Edition
Numerical Descriptive Measures
Chapter 3 Statistics for Describing, Exploring, and Comparing Data
statistics Specific number
Quartile Measures DCOVA
Statistics Workshop Tutorial 1
Population Population
Population Population
Chapter 1 Introduction to Statistics
Lecture Slides Elementary Statistics Eleventh Edition
Lecture Slides Elementary Statistics Eleventh Edition
Lecture Slides Essentials of Statistics 5th Edition
Presentation transcript:

Statistics It is the science of planning studies and experiments, obtaining sample data, and then organizing, summarizing, analyzing, interpreting data, and drawing conclusions about a population based on the data.

Sample Data Sample data are the collection of observations (such as measurements, genders, survey responses---data values)

Population It is the complete collection of all individuals (scores, people, measurements, and so on) to be studied; the collection is complete in the sense that it includes all of the individuals to be studied.

Statistical Values The subject of statistics is largely about using sample data to make inferences (or generalizations) about an entire population. It is essential to know and understand the definitions and the distinction between “parameter” and “statistic.”

Parameter population parameter It is a numerical measurement describing some characteristic of a population. population parameter

Statistic sample statistic It is a numerical measurement describing some characteristic of sample data. sample statistic

Categorical Data Categorical (or qualitative or attribute) data consists of names or labels (representing categories) Example: The genders (male/female) of professional athletes Example: Shirt numbers on professional athletes uniforms - substitutes for names.

Nominal Level Nominal level of measurement characterized by data that consist of names, labels, or categories only, and the data cannot be arranged in an ordering scheme (such as low to high) Example: Survey responses yes, no, undecided

Ordinal Level Ordinal level of measurement involves data that can be arranged in some order, but differences between data values either cannot be determined or are meaningless Example: Course grades A, B, C, D, or F

Quantitative Data Quantitative (or numerical) data consists of numbers representing counts or measurements. Example: The weights of supermodels Example: The ages of respondents

Discrete Data Discrete data result when the number of possible values is either a finite number or a ‘countable’ number (i.e. the number of possible values is 0, 1, 2, 3, . . .) Example: The number of eggs that a hen lays

Continuous Data Continuous (numerical) data result from infinitely many possible values that correspond to some continuous scale that covers a range of values without gaps, interruptions, or jumps. Example: The amount of milk that a cow produces; e.g. 2.343115 gallons per day

Frequency Distribution It shows how a data set is partitioned among all of several categories (or classes) by listing all of the categories along with the number of data values in each of the categories.

Histogram A graph consisting of bars of equal width drawn adjacent to each other (without gaps). The horizontal scale represents the classes of quantitative data values and the vertical scale represents the frequencies. The heights of the bars correspond to the frequency values.

Arithmetic Mean It is the measure of center obtained by adding the values and dividing the total by the number of values. What most people call an average. The symbol for mean is pronounced ‘x-bar’ and denotes the mean of a set of sample values 27.5 = 24 + 31 + 28 + 22 + 34 + 29 + 27 + 25 8

Mean Advantages It is relatively reliable, means of samples drawn from the same population don’t vary as much as other measures of center. Takes every data value into account. Disadvantage It is sensitive to every data value, one extreme value can affect it dramatically; it is not a resistant measure of center.

(in order - even number of values – no exact middle Median It is the middle value when the original data values are arranged in order of increasing (or decreasing) magnitude. It is not affected by an extreme value - is a resistant measure of the center. 5.40 1.10 0.42 0.73 0.48 1.10 0.42 0.48 0.73 1.10 1.10 5.40 (in order - even number of values – no exact middle shared by two numbers) 0.73 + 1.10 MEDIAN is 0.915 2

Measures of Center

Range The range of a set of data values is the difference between the maximum data value and the minimum data value. Range = (maximum value) – (minimum value) It is very sensitive to extreme values; therefore not as useful as other measures of variation.

Standard Deviation The standard deviation of a set of sample values, denoted by s, is a measure of variation of values about the mean.

Properties of the Standard Deviation (part 1) Measures the variation among data values Values close together have a small standard deviation, but values with much more variation have a larger standard deviation Has the same units of measurement as the original data

Empirical (or 68-95-99.7) Rule For data sets having a distribution that is approximately bell shaped, the following properties apply: About 68% of all values fall within 1 standard deviation of the mean. About 95% of all values fall within 2 standard deviations of the mean. About 99.7% of all values fall within 3 standard deviations of the mean.

Histogram with Bell Shape

The Empirical Rule

Properties of the Standard Deviation (part 2) For many data sets, a value is unusual if it differs from the mean by more than two standard deviations Compare standard deviations of two different data sets only if the they use the same scale and units, and they have means that are approximately the same

Coefficient of Variation The coefficient of variation (or CV) for a set of nonnegative sample or population data, expressed as a percent, describes the standard deviation relative to the mean.

Quartiles Are measures of location, denoted Q1, Q2, and Q3, which divide a set of data into four groups with about 25% of the values in each group. Q1 (First Quartile) separates the bottom 25% of sorted values. Q2 (Second Quartile) same as the median; separates the bottom 50% of sorted values. Q3 (Third Quartile) separates the bottom 75% of sorted values.

5-Number Summary For a set of data, the 5-number summary consists of the minimum value Q0; the first quartile Q1; the median (or second quartile Q2); the third quartile, Q3; and the maximum value Q4. 25% Q3 Q2 Q1 (minimum) (maximum) (median)

Interquartile Range It is used for the measure of variation, and defined as the difference between Q1 and Q3: IQR = Q3 - Q1

Boxplot A boxplot (or box-and-whisker-diagram) is a graph of a data set that consists of a line extending from the minimum value to the maximum value, and a box with lines drawn at the first quartile, Q1; the median; and the third quartile, Q3.

Outliers An outlier is a value that lies very far away from the vast majority of the other values in a data set.

Outliers for Modified Boxplots For purposes of constructing modified boxplots, we can consider outliers to be data values meeting specific criteria. In modified boxplots, a data value is an outlier if it is . . . above Q3 by an amount greater than 1.5  IQR or below Q1 by an amount greater than 1.5  IQR

Modified Boxplot Construction A modified boxplot is constructed with these specifications: A special symbol (such as an asterisk) is used to identify outliers. The solid horizontal line extends only as far as the minimum data value that is not an outlier and the maximum data value that is not an outlier.