Ex St 801 Statistical Methods Introduction. Basic Definitions STATISTICS : Area of science concerned with extraction of information from numerical data.

Slides:



Advertisements
Similar presentations
Brought to you by Tutorial Support Services The Math Center.
Advertisements

Describing Quantitative Variables
Chapter 2 Exploring Data with Graphs and Numerical Summaries
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 14 Descriptive Statistics 14.1Graphical Descriptions of Data 14.2Variables.
Class Session #2 Numerically Summarizing Data
1 Chapter 1: Sampling and Descriptive Statistics.
Descriptive Statistics
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
1 Economics 240A Power One. 2 Outline w Course Organization w Course Overview w Resources for Studying.
ISE 261 PROBABILISTIC SYSTEMS. Chapter One Descriptive Statistics.
Introduction to Educational Statistics
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Chapter Two Descriptive Statistics McGraw-Hill/Irwin Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Edpsy 511 Homework 1: Due 2/6.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 2 Describing Data with Numerical Measurements
Describing distributions with numbers
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 2 Describing Data with Numerical Measurements General Objectives: Graphs are extremely useful for the visual description of a data set. However,
Descriptive Statistics
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Sta220 - Statistics Mr. Smith Room 310 Class #3. Section
Methods for Describing Sets of Data
2011 Summer ERIE/REU Program Descriptive Statistics Igor Jankovic Department of Civil, Structural, and Environmental Engineering University at Buffalo,
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Instrumentation (cont.) February 28 Note: Measurement Plan Due Next Week.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Chapter 2: Methods for Describing Sets of Data
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Table of Contents 1. Standard Deviation
Chapter 2 Describing Data.
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
1 Elementary Statistics Larson Farber Descriptive Statistics Chapter 2.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Three Averages and Variation.
7.7 Statistics and Statistical Graphs. Learning Targets  Students should be able to… Use measures of central tendency and measures of dispersion to describe.
Displaying Distributions with Graphs. the science of collecting, analyzing, and drawing conclusions from data.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Numerical Measures. Measures of Central Tendency (Location) Measures of Non Central Location Measure of Variability (Dispersion, Spread) Measures of Shape.
Summary Statistics: Measures of Location and Dispersion.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Descriptive Statistics Tabular and Graphical Displays –Frequency Distribution - List of intervals of values for a variable, and the number of occurrences.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
INEN 270 ENGINEERING STATISTICS Fall 2011 Introduction.
Basic Statistics  Statistics in Engineering (collect, organize, analyze, interpret)  Collecting Engineering Data  Data Presentation and Summary  Types.
2-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
An Introduction to Statistics
COMPLETE BUSINESS STATISTICS
Exploratory Data Analysis
Methods for Describing Sets of Data
a graphical presentation of the five-number summary of data
ISE 261 PROBABILISTIC SYSTEMS
Chapter 6 ENGR 201: Statistics for Engineers
NUMERICAL DESCRIPTIVE MEASURES
Description of Data (Summary and Variability measures)
Descriptive Statistics
An Introduction to Statistics
Numerical Measures: Skewness and Location
Statistics: The Interpretation of Data
Day 52 – Box-and-Whisker.
NUMERICAL DESCRIPTIVE MEASURES
Presentation transcript:

Ex St 801 Statistical Methods Introduction

Basic Definitions STATISTICS : Area of science concerned with extraction of information from numerical data and its use in making inference about a population from data that are obtained from a sample.

Basic Definitions (cont.) POPULATION : set representing all measurements of interest to the investigator. PARAMETER : an unknown population characteristic of interest to the investigator.

Basic Definitions (cont.) SAMPLE : subset of measurements selected from the population of interest. STATISTIC: a sample characteristic of interest to the investigator.

Some Frequently Used Statistics and Parameters 

Basic Definitions (cont.) STATISTICAL INFERENCE : making an "INFORMED GUESS" about a parameter based on a statistic. ( This is the main objective of statistics. )

STATISTICAL INFERENCE GATHER DATA MAKE INFERENCES POPULATION SAMPLE PARAMETERS SAMPLE STATISTICS

More Basic Definitions A VARIABLE is a characteristic of an individual or object that may vary for different observations. A QUANTITATIVE VARIABLE measures a variable scale. A QUALITATIVE VARIABLE categorizes the values of the variable.

RAISIN BRAN EXAMPLE A cereal company claims that the average amount of raisins in its boxes of raisin bran is two scoops. A random sample of five boxes was taken off the production line, and an analysis revealed an average of 1.9 scoops per box.

Components of the Problem Identify the population Identify the sample Identify the symbol for the parameter Identify the symbol for the statistic Is the variable quantitative or qualitative?

ASPIRIN AND HEART ATTACKS 1 Twenty thousand doctors participated in a study to determine if taking an aspirin every other day would result in a reduction of heart attacks.

ASPIRIN AND HEART ATTACKS 2 The physicians were randomly divided into two groups. The first group (called the treatment group) received an aspirin every other day, while the other group (called the control group) received a placebo.

ASPIRIN AND HEART ATTACKS 3 At the end of the study, there had been 104 heart attacks in the treatment group and 189 heart attacks in the control group.

Identifying Components of the Problem Identify the population Identify the sample Identify the symbol for the parameter Identify the symbol for the statistic Is the variable quantitative or qualitative?

Five Steps in a Statistical Study : 1. Stating the problem 2. Gathering the data 3. Summarizing the data 4. Analyzing the data 5. Reporting the results

Stating the Problem Specifically identifying the population to be sampled Identifying the parameter(s) being studied

Stating the Problem Example A researcher wanted to determine if a vitamin supplement would reduce the rate of certain cancers. A large study was conducted in China and the results indicated that people who had the vitamin supplement had a significantly lower cancer rate. Do the results of this study apply to Americans? Why or why not?

Gathering the Data SURVEYS –Random Sampling –Stratified Sampling –Cluster Sampling –Systematic sampling

Gathering the Data EXPERIMENTS –Completely Randomized Design –Randomized Block Design –Factorial Design

More Definitions DESCRIPTIVE STATISTICS: Organizing and describing sample information. (Descriptive Statistics describe how things are.)

Graphical Displays for Qualitative Data PIE CHART BAR CHART

Major Volcanoes in the World

Graphical Displays for Quantitative Data HISTOGRAM STEM AND LEAF DISPLAY

Histogram of Major Volcanoes in the World

Life Expectancies in 33 Developed Nations

Histogram of Life Expectancies in 33 Developed Nations

Stem-Leaf Display for Elevation KEY: UNIT = | 2 REPRESENTS STEM LEAF

Construction of a Stem-Leaf Display List the stem values, in order, in a vertical column Draw a vertical line to the right of the stem values For each observation, record the leaf portion of the observation in the row corresponding to the appropriate stem Reorder the leaves from the lowest to highest within each stem row

Construction of a Stem-Leaf Display (cont.) If the number of leaves appearing in each stem is too large, divide the stems into two groups, the first corresponding to leaves 0 through 4, and the second corresponding to leaves 5 through 9. (This subdivision can be increased to five groups if necessary). Provide a key to your stem and leaf coding, so the reader can reconstruct the actual measurements.

Numerical Measures for Summarizing Data TYPES: 1. Measures of CENTRAL TENDENCY 2. Measures of VARIABILITY 3. Measures OF RELATIVE LOCATION

The Arithmetic Mean The ARITHMETIC MEAN of a set of n measurements (y 1, y 2,..., y n ) is equal to the sum of the measurements divided by n.

The mathematical notation for the ARITHMETIC MEAN is:

The Median The MEDIAN of a set of n measurements (y 1, y 2,..., y n ) is the value that falls in the middle position when the measurements are ordered from the smallest to the largest.

RULE FOR CALCULATING THE MEDIAN 1 Order the measurements from the smallest to the largest. 2 A)If the sample size is odd, the median is the middle measurement. B)If the sample size is even, the median is the average of the two middle measurements.

Example A random sample of six values were taken from a population. These values were: y 1 =7, y 2 =1, y 3 =10, y 4 =8, y 5 =4, and y 6 =12. What are the sample mean and sample median for these data?

Sample Mean

CALCULATIONS FOR THE SAMPLE MEDIAN ( Ordered Sample) MEDIAN = ( ) / 2 = 7.5 y 2 =1, y 5 =5, y 1 =7, y 4 =8, y 3 =10, y 6 =12

Consider the following sample: Which measure of central tendency best describes the central location of the data: THE SAMPLE MEAN OR SAMPLE MEDIAN?

STEM LEAF

MEASUREMENTS OF VARIABILITY RANGE VARIANCE STANDARD DEVIATION

Deviation The DEVIATION of an observation y i from the sample mean is equal to: Deviations to the left of the sample mean are negative and deviations to the right of the sample mean are positive. Also, notice that the larger the squared deviation, the further away the observation is from the mean.

Formula for the Sample Variance

Obs Y (Y-Y) (Y-Y) 2 Obs. Y Y 2

Calculation of Sample Variance

THE EMPIRICAL RULE Given a large set of measurements possessing a mound-shaped histogram, then the interval contains approximately 68% of the measurements. the interval contains approximately 95% of the measurements. the interval contains approximately 99.7% of the measurements.

Major Volcanoes in the World

TWO MEASURES OF RELATIVE STANDING Percentile Quartile

The P th Percentile is the value Xp such that p% of the measurements will fall below that value and (100-p)% of the measurements will fall above that value. p% (100-p)% XpXp

Quartiles divide the measurements into four parts such that 25% of the measurements are contained in each part. The first quartile (Lower Quartile) is denoted by Q 1, the second by Q 2, and the third (Upper Quartile) by Q 3. Q1Q2Q3 25%

Box and Whisker Plot Life Expectancies in 33 Developed Nations

Calculating Fence Values Lower Inner Fence: Q (IQR) Upper Inner Fence: Q (IQR) Lower Outer Fence: Q1 - 3 (IQR) Upper Outer Fence: Q3 + 3 (IQR)

EXAMPLE: Construct a Box-and-Whisker Plot for the elevations of volcanoes in Africa 1,650 5,981 7,745 9,281 10,023 11,400 12,198 13,451 19,340 Median = Q 1 = Q 2 = IQR = Lower Inner Fence = Upper Inner Fence = Lower Outer Fence = Upper Outer Fence =

BOX AND WHISKER PLOT MAJOR VOLCANOES IN AFRICA

Ex St 801 Statistical Methods The End