Statistics: Dealing With Uncertainty

Presentation transcript:

Statistics Dealing With Uncertainty

Objectives: Describe the difference between a sample and a population. Learn to use descriptive statistics (data sorting, central tendency, etc.). Learn how to prepare and interpret histograms. State what is meant by normal distribution and standard normal distribution. Use Z-tables to compute probability.

Statistics “There are lies, d#$& lies, and then there’s statistics.” Mark Twain

Statistics is a standard method for collecting, organizing, summarizing, presenting, and analyzing data; drawing conclusions; and making decisions based upon the analyses of these data. It is used extensively by engineers (e.g., quality control).

Populations and Samples Population - the complete set of all of the possible instances of a particular object (e.g., the entire class). Sample - a subset of the population (e.g., a team). We use samples to draw conclusions about the parent population.

Why use samples? The population may be large (all people on earth, all stars in the sky). The population may be dangerous to observe (automobile wrecks, explosions, etc.). The population may be difficult to measure (subatomic particles). Measurement may destroy the sample (bolt strength).

Team Exercise: Sample Bias To three significant figures, estimate the average age of the class based upon your team. When would a team not be a representative sample of the class?

Measures of Central Tendency If you wish to describe a population (or a sample) with a single number, what do you use? Mean - the arithmetic average Mode - most likely (most common) value. Median - “middle” of the data set.

What is the Mean? The mean is the sum of all data values divided by the number of values.

Sample Mean $\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i$ Where: $\bar{x}$ is the sample mean, $x_i$ are the data points, and $n$ is the sample size.

Population Mean $\mu = \frac{1}{N}\sum_{i=1}^{N} x_i$ Where: $\mu$ is the population mean, $x_i$ are the data points, and $N$ is the total number of observations in the population.

What is the Mode? Mode - the value that occurs most often in discrete data (or data that have been grouped into discrete intervals). For example, students in this class are most likely to get a grade of B.

Mode continued Example of a grade distribution with mean C, mode B

What is the Median? Median - for sorted data, the median is the middle value (for an odd number of points) or the average of the two middle values (for an even number of points). It is useful for characterizing data sets with a few extreme values that would distort the mean (e.g., house prices, family incomes).
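The three measures of central tendency can be compared on a small data set. The following is a minimal sketch using Python's standard `statistics` module; the house prices are hypothetical values chosen only to show how one extreme value pulls the mean away from the median.

```python
import statistics

# Hypothetical data: house prices in thousands of dollars,
# including one extreme value (950) that distorts the mean.
prices = [150, 160, 160, 175, 180, 190, 950]

mean_price = statistics.mean(prices)      # sum of values / number of values
median_price = statistics.median(prices)  # middle value of the sorted data
mode_price = statistics.mode(prices)      # most common value

print(f"mean   = {mean_price:.1f}")
print(f"median = {median_price}")
print(f"mode   = {mode_price}")
```

In this made-up set the mean lands well above every price but the extreme one, while the median stays near the typical values.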

What Is the Range? Range - the difference between the lowest and highest values in the set. For example, driving time to Houston is 2 hours +/- 15 minutes. Therefore: Minimum = 105 minutes, Maximum = 135 minutes, Range = 30 minutes.

Standard Deviation Gives a single-number measure of the scatter in the data about the mean.

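Standard Deviation Population: $\sigma = \sqrt{\frac{1}{N}\sum_{i=1}^{N}(x_i - \mu)^2}$, Variance = $\sigma^2$. Sample: $s = \sqrt{\frac{1}{n-1}\sum_{i=1}^{n}(x_i - \bar{x})^2}$, Variance = $s^2$.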

The Subtle Difference Between s and σ: N versus n-1. Dividing by n-1 is needed to get a better estimate of the population σ from the sample s. Note: for large n, the difference is trivial.
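The effect of dividing by N versus n-1 can be checked numerically. This is a rough sketch using the standard `statistics` module, where `pstdev` divides by N (population) and `stdev` divides by n-1 (sample); the drive times are hypothetical numbers, not data from the class.

```python
import math
import statistics

# Hypothetical measurements: drive times in minutes.
data = [105, 112, 118, 120, 121, 125, 135]
n = len(data)

data_range = max(data) - min(data)   # highest minus lowest value

sigma = statistics.pstdev(data)      # population form: divides by N
s = statistics.stdev(data)           # sample form: divides by n - 1

# The two results differ only by the factor sqrt(n / (n - 1)),
# which approaches 1 as n grows.
ratio = math.sqrt(n / (n - 1))

print(f"range = {data_range} min")
print(f"sigma (N)     = {sigma:.3f}")
print(f"s     (n - 1) = {s:.3f}")
print(f"s / sigma = {s / sigma:.4f}, sqrt(n/(n-1)) = {ratio:.4f}")
```

With only seven points the two values differ noticeably; with hundreds of points they would agree to several digits.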

A Valuable Tool Gauss developed the idea behind standard deviation circa 1800 to explain the error observed in measured star positions. Today it is used in everything from quality control to measuring financial risk.

Team Exercise In your team’s bag of M&M candies, count the number of candies for each color and the total number of candies in the bag. When you are done counting, have a representative from your team enter your data on the board. Using Excel, enter the data gathered by the entire class.

Team Exercise (con’t) For each color, and the total number of candies, determine the following: maximum, minimum, range, mean, mode, median, standard deviation, variance.

Individual Exercise: Histograms Flip a coin EXACTLY ten times. Count the number of heads YOU get. Report your result to the instructor, who will post all the results on the board. Open Excel. Using the data from the entire class, create bar graphs showing the number of classmates who get one head, two heads, three heads, etc.
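The same exercise can be simulated when no class data are available. The sketch below is only an illustration: the class size of 30 and the fixed random seed are assumptions, and the text bars stand in for the Excel bar graph.

```python
import random
from collections import Counter

random.seed(0)       # fixed seed so the sketch is repeatable

NUM_STUDENTS = 30    # hypothetical class size
FLIPS = 10           # each student flips exactly ten times

# Each simulated student flips a fair coin FLIPS times and counts heads.
heads_counts = [
    sum(random.random() < 0.5 for _ in range(FLIPS))
    for _ in range(NUM_STUDENTS)
]

# Frequency histogram: number of students for each possible head count.
histogram = Counter(heads_counts)

for heads in range(FLIPS + 1):
    count = histogram[heads]
    print(f"{heads:2d} heads | {'#' * count} ({count})")
```

Increasing `NUM_STUDENTS` should make the bars look progressively more bell-shaped around five heads.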

Data Distributions The “shape” of the data is described by its frequency histogram. Data that behave “normally” exhibit a “bell-shaped” curve, or the “normal” distribution. Gauss found that star position errors tended to follow a “normal” distribution.

The Normal Distribution The normal distribution is sometimes called the “Gauss” curve. [Figure: bell-shaped curve of relative frequency (RF) versus x, centered on the mean.]

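Standard Normal Distribution Define: $z = \frac{x - \mu}{\sigma}$. Then the normal curve, plotted against z, has mean 0, standard deviation 1, and Area = 1.00. [Figure: standard normal curve versus z.]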

Some handy things to know. 50% of the area lies on each side of the mid-point for any normal curve. A standard normal distribution (SND) has a total area of 1.00. “z-Tables” show the area under the standard normal distribution, and can be used to find the area between any two points on the z-axis.

Using Z Tables (Appendix C, p. 624) Question: Find the area between z = -1.0 and z = 2.0. From table, for z = 1.0, area = 0.3413. By symmetry, for z = -1.0, area = 0.3413. From table, for z = 2.0, area = 0.4772. Total area = 0.3413 + 0.4772 = 0.8185. “Tails” area = 1.0000 - 0.8185 = 0.1815.
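A z-table lookup can be cross-checked in software. The sketch below uses `statistics.NormalDist` from the Python standard library in place of the printed table; the helper name `area_between` is an assumption introduced for illustration, and the results can differ from table arithmetic in the last digit because the table entries are rounded.

```python
from statistics import NormalDist

# Standard normal distribution: mean 0, standard deviation 1.
snd = NormalDist(mu=0.0, sigma=1.0)

def area_between(z_low, z_high):
    """Area under the standard normal curve between two z values."""
    return snd.cdf(z_high) - snd.cdf(z_low)

area_0_to_1 = area_between(0.0, 1.0)   # table-style entry for z = 1.0
area_0_to_2 = area_between(0.0, 2.0)   # table-style entry for z = 2.0
total = area_between(-1.0, 2.0)        # area between z = -1.0 and z = 2.0
tails = 1.0 - total                    # area outside that interval

print(f"area 0 to 1.0  = {area_0_to_1:.4f}")
print(f"area 0 to 2.0  = {area_0_to_2:.4f}")
print(f"area -1 to 2.0 = {total:.4f}")
print(f"tails          = {tails:.4f}")
```

To apply this to raw data, convert each x to z = (x - μ)/σ first and then pass the z values to `area_between`.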

“Quick and Dirty” Estimates of μ and σ: μ ≈ (lowest + 4*mode + highest)/6. For a normal curve, 99.7% of the area is contained within ± 3σ from the mean. Define “highest” = μ + 3σ. Define “lowest” = μ - 3σ. Therefore, σ ≈ (highest - lowest)/6.

Example: Drive time to Houston. Lowest = 1 h, Most likely = 2 h, Highest = 4 h (including a flat tire, etc.). μ = (1 + 4*2 + 4)/6 = 2.17 h (2 h 10 min). σ = (4 - 1)/6 = 0.5 h. This technique (Delphi) was used to plan the moon flights.
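The quick estimate is easy to wrap in a small function. This is a minimal sketch; the function name `three_point_estimate` is hypothetical, and the inputs are the lowest, most likely, and highest drive times from the example.

```python
def three_point_estimate(lowest, most_likely, highest):
    """Quick estimates of the mean and standard deviation.

    mean  ~ (lowest + 4 * most_likely + highest) / 6
    sigma ~ (highest - lowest) / 6, treating the extremes as +/- 3 sigma
    """
    mu = (lowest + 4 * most_likely + highest) / 6
    sigma = (highest - lowest) / 6
    return mu, sigma

# Drive time to Houston, in hours.
mu, sigma = three_point_estimate(lowest=1, most_likely=2, highest=4)

# Convert the fractional hours to whole minutes for readability.
hours = int(mu)
minutes = round((mu - hours) * 60)

print(f"mu = {mu:.2f} h ({hours} h {minutes} min)")
print(f"sigma = {sigma:.2f} h")
```

Because the most likely value is weighted four times as heavily as either extreme, a long upper tail (the flat tire) shifts the estimated mean only modestly above the most likely time.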

Team Exercise You want to put a scale on your rubber-band car to relate a given scale setting to an expected distance traveled. Design an experiment to establish a scale for your car.

Team Exercise continued. Some issues to consider: sample size, range of distances, desired accuracy.

Review Central tendency: mean, mode, median. Scatter: range, variance, standard deviation. Normal distribution.