Topic 1: Descriptive Statistics CEE 11 Spring 2001 Dr. Amelia Regan These notes draw liberally from the class text, Probability and Statistics for Engineering.

Slides:



Advertisements
Similar presentations
Agricultural and Biological Statistics
Advertisements

Introduction to Summary Statistics
Topic 6: Introduction to Hypothesis Testing
Calculating & Reporting Healthcare Statistics
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Intro to Descriptive Statistics
Biostatistics Unit 2 Descriptive Biostatistics 1.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
CHAPTER 3: Statistical Description of Data
MEASURES of CENTRAL TENDENCY.
Think of a topic to study Review the previous literature and research Develop research questions and hypotheses Specify how to measure the variables in.
MGQ 201 WEEK 4 VICTORIA LOJACONO. Help Me Solve This Tool.
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
SECTION 3.2 MEASURES OF SPREAD Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Chapter 3 – Descriptive Statistics
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 1 Overview and Descriptive Statistics.
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
Worked examples and exercises are in the text STROUD (Prog. 28 in 7 th Ed) PROGRAMME 27 STATISTICS.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
1 1 Slide © 2003 Thomson/South-Western. 2 2 Slide © 2003 Thomson/South-Western Chapter 3 Descriptive Statistics: Numerical Methods Part A n Measures of.
Worked examples and exercises are in the text STROUD PROGRAMME 27 STATISTICS.
Descriptive Statistics: Numerical Methods
Statistics Measures Chapter 15 Sections
Chapter 2 Describing Data.
STATISTICS. Statistics * Statistics is the area of science that deals with collection, organization, analysis, and interpretation of data. * A collection.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Three Averages and Variation.
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
Subbulakshmi Murugappan H/P:
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Chapter 9 Statistics.
FARAH ADIBAH ADNAN ENGINEERING MATHEMATICS INSTITUTE (IMK) C HAPTER 1 B ASIC S TATISTICS.
Topic 5: Continuous Random Variables and Probability Distributions CEE 11 Spring 2002 Dr. Amelia Regan These notes draw liberally from the class text,
Measures of Variation Range Standard Deviation Variance.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-1 Chapter 2: Displaying and Summarizing Data Part 2: Descriptive Statistics.
Topic 6: The distribution of the sample mean and linear combinations of random variables CEE 11 Spring 2002 Dr. Amelia Regan These notes draw liberally.
Quick review of some key ideas CEE 11 Spring 2002 Dr. Amelia Regan These notes draw liberally from the class text, Probability and Statistics for Engineering.
CHAPTER 2: Basic Summary Statistics
Chapter ( 2 ) Strategies for understanding the meanings of Data : Learning outcomes Understand how data can be appropriately organized and displayed Understand.
Measures of Variation. Range, Variance, & Standard Deviation.
Statistics -Descriptive statistics 2013/09/30. Descriptive statistics Numerical measures of location, dispersion, shape, and association are also used.
Data Descriptions.
STATISTICS AND PROBABILITY IN CIVIL ENGINEERING
INTRODUCTION TO STATISTICS
Chapter 3 Describing Data Using Numerical Measures
Introduction to Summary Statistics
St. Edward’s University
Central Tendency and Variability
Introduction to Summary Statistics
Descriptive Statistics: Presenting and Describing Data
Descriptive Statistics
Characteristics of the Mean
Introduction to Summary Statistics
Introduction to Summary Statistics
Describing Data: Numerical Measures
Introduction to Summary Statistics
Numerical Descriptive Measures
Introduction to Summary Statistics
Descriptive Statistics
Statistics 1: Introduction to Probability and Statistics
Numerical Descriptive Measures
Introduction to Summary Statistics
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
Introduction to Summary Statistics
CHAPTER 2: Basic Summary Statistics
Introduction to Summary Statistics
Descriptive statistics for groups:
Presentation transcript:

Topic 1: Descriptive Statistics CEE 11 Spring 2001 Dr. Amelia Regan These notes draw liberally from the class text, Probability and Statistics for Engineering and the Sciences by Jay L. Devore, Duxbury 1995 (4th edition)

definitions n A population consists of all objects of a certain type that are relevant to a particular study or analysis. u all students at UCI represent a population n A sample is a subset or portion of the population u students in this class represent a sample of the population of students at UCI

frequency distributions and histograms n A frequency is a count, the number of occurrences in the sample of a particular value which are within a particular class. n Classes must be mutually exclusive (no overlap allowed) and collectively exhaustive (the full range of the data must be covered). n A histogram is a bar chart of the frequency distribution.

guidelines for forming class intervals n Use intervals of equal length with midpoints at convenient round numbers. n For large data sets use more intervals n For small data sets use a small number of intervals

Example n 30 students are asked to submit their weights with these results Men (18 in sample) Women (12 in sample)

Example n We might break the sample into classes and construct the following frequency table class 100-< < < < <200 frequency rel freq Class mid. pt

From the table we can easily construct a histogram for the sample

mean n The mean of a sample or data set is simply the arithmetic average of the values in the set, obtained by summing the values and dividing by the number of values. The mean of the sample of weights is pounds

mean of a frequency distribution n When we summarize a data set in a frequency distribution, we are approximating the data set by "rounding" each value in a given class to the class mark. The mean of the weight data obtained in this way is

median n The median is the value that is roughly in the middle of the data set. If n is odd, the median is the single value in the middle, namely the value with rank (n + 1)/2. n If n is even, there is not a single value in the middle, so the median is defined to be the average of the two middle values, namely the values with ranks n/2 and n/ The median for our example is ( )/2 = lbs.

mode n The mode of a data set is the value that appears most often. The modal values for our sample are 130 and the mode need not be a single value n If data are broken into classes, the modal class is the class with the most members. The modal class for our sample is 140-<160

range n The range or spread of of a data set is the difference between its largest and smallest values The range for the weight data is 102 to 190 or 88 lbs

variance n The variance of a population is the average of the squared deviations from the mean n The variance of a sample is approximately the average of the squared deviations from the mean (note that we divide the sum of the squared deviations by n-1 rather than n)

standard deviation n The standard deviation is the square root of the variance n The standard deviation is useful because it is in the same units as the mean (and the original data) therefore it provides better insight into the relative variability a sample. n The variance and standard deviation of the weight data are lbs 2 and lbs

coefficient of variation n The coefficient of variation is the standard deviation divided by the mean n The coefficient of variation is used to examine the relative variability of more than one data set for the weight data the coefficient of variation is 0.163

shortcut formula for the variance n Its sometimes more convenient to use the following formula for the variance

Class exercise n The national weather service maintains and publishes historical weather data for 100 US cities. The average annual rain fall in inches for the cities in the data base beginning with A are listed below. n Calculate the mean, median, range, variance and standard deviation for the following data

properties of S 2 n Let x 1, x 2, x,...,x n be a sample and c be any nonzero constant. n If y 1 = x 1 + c, y 2 = x 2 + c,...,y n = x n + c, then S 2 y = S 2 x n If y 1 = cx 1, y 2 = cx 2,...,y n = cx n, then S 2 y = c 2 S 2 x, S y = |c|S 2 x In other words -- if we add a constant to a sample we do not increase the variance -- if we multiply by a constant we increase the variance by the square of the constant

related properties of the sample mean n Let x 1, x 2, x,...,x n be a sample and c be any nonzero constant. n If y 1 = x 1 + c, y 2 = x 2 + c,...,y n = x n + c then n If y 1 = cx 1, y 2 = cx 2,...,y n = cx n, then In other words if we add or multiply the sample by a constant we add or multiply the mean by the same constant

Class exercise n Without using your calculators, calculate the mean and variance of the following data Xi | fi | n Hint, shift the observations “to the left” by subtracting a constant and then divide by another constant