Engineering 1811.01 College of Engineering Engineering Education Innovation Center Analyzing Measurement Data Rev: 20130604, MCAnalyzing Data1.

Slides:



Advertisements
Similar presentations
Appendix A. Descriptive Statistics Statistics used to organize and summarize data in a meaningful way.
Advertisements

Introduction to Summary Statistics
Introduction to Summary Statistics
Analyzing Measurement Data ENGR 1181 Class 8. Analyzing Measurement Data in the Real World As previously mentioned, data is collected all of the time,
Measures of Central Tendency. Central Tendency “Values that describe the middle, or central, characteristics of a set of data” Terms used to describe.
Calculating & Reporting Healthcare Statistics
Descriptive Statistics A.A. Elimam College of Business San Francisco State University.
Statistics Intro Univariate Analysis Central Tendency Dispersion.
Statistics Intro Univariate Analysis Central Tendency Dispersion.
1 Basic statistics Week 10 Lecture 1. Thursday, May 20, 2004 ISYS3015 Analytic methods for IS professionals School of IT, University of Sydney 2 Meanings.
Edpsy 511 Homework 1: Due 2/6.
Today: Central Tendency & Dispersion
Programming in R Describing Univariate and Multivariate data.
Objective To understand measures of central tendency and use them to analyze data.
What is statistics? STATISTICS BOOT CAMP Study of the collection, organization, analysis, and interpretation of data Help us see what the unaided eye misses.
Psy302 Quantitative Methods
Numerical Descriptive Techniques
1.3 Psychology Statistics AP Psychology Mr. Loomis.
2011 Summer ERIE/REU Program Descriptive Statistics Igor Jankovic Department of Civil, Structural, and Environmental Engineering University at Buffalo,
Data Handbook Chapter 4 & 5. Data A series of readings that represents a natural population parameter A series of readings that represents a natural population.
Statistics Recording the results from our studies.
Statistical Tools in Evaluation Part I. Statistical Tools in Evaluation What are statistics? –Organization and analysis of numerical data –Methods used.
Biostatistics: Measures of Central Tendency and Variance in Medical Laboratory Settings Module 5 1.
Data Collection and Analysis ENGR 1181 Class 7. Data Collection in the Real World Data is collected all of the time, just think about it. When you are.
Introduction to Summary Statistics. Statistics The collection, evaluation, and interpretation of data Statistical analysis of measurements can help verify.
And the Rule THE NORMAL DISTRIBUTION. SKEWED DISTRIBUTIONS & OUTLIERS.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
MATH IN THE FORM OF STATISTICS IS VERY COMMON IN AP BIOLOGY YOU WILL NEED TO BE ABLE TO CALCULATE USING THE FORMULA OR INTERPRET THE MEANING OF THE RESULTS.
Skewness & Kurtosis: Reference
A tour of fundamental statistics introducing Basic Statistics.
Measures of Central Tendency: The Mean, Median, and Mode
Statistics for Psychology!
 Two basic types Descriptive  Describes the nature and properties of the data  Helps to organize and summarize information Inferential  Used in testing.
1 Review Sections 2.1, 2.2, 1.3, 1.4, 1.5, 1.6 in text.
Psychology and Statistics Interpreting Data (Ch. 1 Myers and Ch. 2 Barron’s)
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
RESEARCH & DATA ANALYSIS
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Experimental Methods: Statistics & Correlation
LIS 570 Summarising and presenting data - Univariate analysis.
DO NOW1/29/14 Use the gas price data to answer the following questions. {$4.79, $4.60, $4.75, $4.66, $4.60} 1.Find the mean of the data (hint: average)
Why do we analyze data?  It is important to analyze data because you need to determine the extent to which the hypothesized relationship does or does.
Data Analysis. Statistics - a powerful tool for analyzing data 1. Descriptive Statistics - provide an overview of the attributes of a data set. These.
THE ROLE OF STATISTICS IN RESEARCH. Reading APPENDIX A: Statistics pp
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
AP PSYCHOLOGY: UNIT I Introductory Psychology: Statistical Analysis The use of mathematics to organize, summarize and interpret numerical data.
STATS DAY First a few review questions. Which of the following correlation coefficients would a statistician know, at first glance, is a mistake? A. 0.0.
Chapter 11 Summarizing & Reporting Descriptive Data.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 10 Descriptive Statistics Numbers –One tool for collecting data about communication.
Introductory Psychology: Statistical Analysis
Statistical Methods Michael J. Watts
Statistical Methods Michael J. Watts
Statistics.
Introduction to Summary Statistics
Statistical Reasoning in Everyday Life
Statistics in AP Psychology
Description of Data (Summary and Variability measures)
Univariate Descriptive Statistics
Univariate Descriptive Statistics
STATS DAY First a few review questions.
Measures of Central Tendency
MEASURES OF CENTRAL TENDENCY
Central Tendency Central Tendency – measures of location for a distribution Mode – the commonly occurring number in a data set Median – the middle score.
Module 8 Statistical Reasoning in Everyday Life
Descriptive Statistics
Psychology Statistics
Making Sense of Measures of Center Investigation 2
Good morning! Please get out your homework for a check.
Describing Data Coordinate Algebra.
Presentation transcript:

Engineering College of Engineering Engineering Education Innovation Center Analyzing Measurement Data Rev: , MCAnalyzing Data1

Engineering Example Rev: , AMAnalyzing Data2

Engineering Example Rev: , AMAnalyzing Data 3 Most values fall between 14 and 20 m. This data contains an outlier of 45.2 m.

Engineering Represent the Data with a Histogram First, determine an appropriate bin size. The bin size [k] can be assigned directly or can be calculated from a suggested number of bins [h]: Let’s try the most commonly used formula first: Rev: , AMAnalyzing Data4 If you have this many data points [n] Use this number of bins [h] Less than 505 to 7 50 to 996 to to 2507 to 12 More than to 20

Engineering Histogram - Example Rev: , AMAnalyzing Data5 Is this the best way to represent this data? By changing our bin size, [k], we can improve the representation. Bin SizeFrequency

Engineering Histogram - Example Rev: , AMAnalyzing Data6 All 3 histograms represent the exact same data set, but the bin width and number of bins for the two shown above were selected manually. Which one is most descriptive?

Engineering Dealing with outliers Engineers must carefully consider any outliers when analyzing data. It is up to the engineer to determine whether the outlier is a valid data point or if it is invalid and should be discarded. Invalid data points can result from measurement errors or recording the data incorrectly. Rev: , AMAnalyzing Data7

Engineering Characterizing the data Statistics allows us to characterize the data numerically as well as graphically. We characterize data in two ways: –Central Tendency –Variation Rev: , AMAnalyzing Data8

Engineering Central Tendency (Expected Value) Central tendency is a single value that best represents the data. But which number do we choose? Mean Median Mode –Note: For most engineering applications, mean and median are most relevant. Rev: , AMAnalyzing Data9

Engineering Central Tendency - Mean Rev: , AMAnalyzing Data10 Is the mean value a good depiction of the data? How does the outlier affect the mean?

Engineering Central Tendency - Mean Problem: Outliers may decrease the usefulness of the mean as a central value. Observe how outliers can affect the mean for this simple data set: Rev: , AMAnalyzing Data Without outliers Changing 3 to -112 Outlier: -112 Changing 44 to 212 Outlier: 212 Solution: Look at the median.

Engineering Central Tendency - Median Rev: , AMAnalyzing Data12 n = 20  even number of data points. Must take the average of the 2 middle values Which value looks like a better representation of the data? Mean (18.47) or median (17.4)? Why?

Engineering Central Tendency Median Rev: , AMAnalyzing Data13 Using the simple data set, observe how the median reduces the impact of outliers on the central tendency. Median = 21

Engineering Central Tendency – Mean and Median Which value, the mean (18.47 m) or the median (17.4) is a better representation of the data? Rev: , AMAnalyzing Data14

Engineering Characterizing the data We can select a value of central tendency to represent the data, but is one number enough? It is also important to know how much variation there is in the data set. Variation refers to how the data is distributed around the central tendency value. Rev: , AMAnalyzing Data15

Engineering Variation As with central tendency, there are multiple ways to represent the variation of a set of data. ± (“Plus, Minus”) gives the range of the values. Standard Deviation provides a more sophisticated look at how the data is distributed around the central value. Rev: , AMAnalyzing Data16

Engineering Variation - Standard Deviation Definition: how closely the values cluster around the mean; how much variation there is in the data Equation: Rev: , AMAnalyzing Data17

Engineering Standard Deviation Example Rev: , MCAnalyzing Data18 mean = ∑ =

Engineering Standard Deviation: Interpretation Rev: , AMAnalyzing Data19 These curves describe the distribution of students’ exam grades. The average value is an 83%. Which class would you rather be in? Curve B Curve A AA BB

Engineering Data that is normally distributed occurs with greatest frequency around the mean. Normal distributions are also frequently referred to as Gaussian distributions or bell curves Normal Distribution Rev: , AMAnalyzing Data20 Frequency Bins mean

Engineering Normal Distribution Rev: , AMAnalyzing Data21 Mean = Median = Mode -68% of values fall within 1 SD -95% of values fall within 2 SDs

Engineering Other Distributions Rev: , AMAnalyzing Data22 Skewed distributions: Multimodal distribution: Uniform distribution:

Engineering What we’ve learned This lecture has introduced some basic statistical tools that engineers use to analyze data. Histograms are used to represent data graphically. Engineers use both central tendency and variation to numerically describe data. Rev: , AMAnalyzing Data23