CHAPTER 3 : DESCRIPTIVE STATISTIC : NUMERICAL MEASURES (STATISTICS)

Slides:



Advertisements
Similar presentations
Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
Advertisements

Psy302 Quantitative Methods
MEASURES OF DISPERSION
Calculating & Reporting Healthcare Statistics
Descriptive Statistics – Central Tendency & Variability Chapter 3 (Part 2) MSIS 111 Prof. Nick Dedeke.
Descriptive Statistics Chapter 3 Numerical Scales Nominal scale-Uses numbers for identification (student ID numbers) Ordinal scale- Uses numbers for.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Created by Tom Wegleitner, Centreville, Virginia Section 3-1.
Biostatistics Unit 2 Descriptive Biostatistics 1.
Chapter Two Descriptive Statistics McGraw-Hill/Irwin Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Measures of Dispersion
Describing Data: Numerical Measures
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 2 Describing Data with Numerical Measurements
Department of Quantitative Methods & Information Systems
Chapter 2 Describing Data with Numerical Measurements General Objectives: Graphs are extremely useful for the visual description of a data set. However,
CHAPTER 3 : DESCRIPTIVE STATISTIC : NUMERICAL MEASURES (STATISTICS)
Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.
Statistics Workshop Tutorial 3
CHAPTER 1 Basic Statistics Statistics in Engineering
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Statistics Recording the results from our studies.
CHAPTER 1 Basic Statistics Statistics in Engineering
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Descriptive Statistics: Numerical Methods
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
STATISTICS. Statistics * Statistics is the area of science that deals with collection, organization, analysis, and interpretation of data. * A collection.
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
Business Statistics (BQT 173) ІМ ќ INSTITUT MATEMATIK K E J U R U T E R A A N U N I M A P Descriptive Statistics: Numerical Measures (Statistic)
Basic Statistical Terms: Statistics: refers to the sample A means by which a set of data may be described and interpreted in a meaningful way. A method.
Measures of Central Tendency. These measures indicate a value, which all the observations tend to have, or a value where all the observations can be assumed.
Business Statistics Spring 2005 Summarizing and Describing Numerical Data.
Chapter 9 Statistics.
 Two basic types Descriptive  Describes the nature and properties of the data  Helps to organize and summarize information Inferential  Used in testing.
CHAPTER 1 Basic Statistics Statistics in Engineering
FARAH ADIBAH ADNAN ENGINEERING MATHEMATICS INSTITUTE (IMK) C HAPTER 1 B ASIC S TATISTICS.
Data Summary Using Descriptive Measures Sections 3.1 – 3.6, 3.8
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
LIS 570 Summarising and presenting data - Univariate analysis.
CHAPTER 1 Basic Statistics Statistics in Engineering
Chapter Three McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Descriptive Statistics(Summary and Variability measures)
CHAPTER 1 EQT 271 (part 1) BASIC STATISTICS. Basic Statistics 1.1Statistics in Engineering 1.2Collecting Engineering Data 1.3Data Presentation and Summary.
Summarizing Data with Numerical Values Introduction: to summarize a set of numerical data we used three types of groups can be used to give an idea about.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Numerical Measures Chapter 3.
Chapter 4: Measures of Central Tendency. Measures of central tendency are important descriptive measures that summarize a distribution of different categories.
Exploratory Data Analysis
Business and Economics 6th Edition
Measures of Dispersion
SUBTOPIC 8.3 : Measures of Location 8.4 : Measures of Dispersion
Measures of Central Tendency
Chapter 3 Created by Bethany Stubbe and Stephan Kogitz.
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
NUMERICAL DESCRIPTIVE MEASURES
Descriptive Statistics
Description of Data (Summary and Variability measures)
Lecture 5,6: Measures in Statistics
Descriptive Statistics: Numerical Methods
MEASURES OF CENTRAL TENDENCY
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Business and Economics 7th Edition
Presentation transcript:

CHAPTER 3 : DESCRIPTIVE STATISTIC : NUMERICAL MEASURES (STATISTICS)

3.1 Measures of Central Tendency/ Location There are 3 popular central tendency measures, mean, median & mode. 1) Mean  The mean of a sample is the sum of the measurements divided by the number of measurements in the set. Mean is denoted by ( ) Mean = Sum of all values / Number of values  Mean can be obtained as below :- - For raw data, mean is defined by,

- For tabular/group data, mean is defined by: Where f = class frequency; x = class mark (mid point) Example 3.1: The mean sample of CGPA (raw/ungroup) is: MLB Team 2002 Total Payroll (Million of dollars) Anaheim Angels 62 Atlanta Braves93 New York Yankees126 St. Louis Cardinals75 Tampa Bay Devil Rays34 Total390 Table 3.1

Example 3.2 : The mean sample for Table 3.2 CGPA (Class)Frequency, f Class Mark (Midpoint ), xfx Total Table 3.2

2) Median  Median is the middle value of a set of observations arranged in order of magnitude and normally is devoted by i) The median for ungrouped data. - The median depends on the number of observations in the data,. - If is odd, then the median is the th observation of the ordered observations. - If is even, then the median is the arithmetic mean of the th observation and the th observation.

ii) The median of grouped data / frequency of distribution. The median of frequency distribution is defined by: where, = the lower class boundary of the median class; = the size of the median class interval; = the sum of frequencies of all classes lower than the median class = the frequency of the median class.

Example 3.3 for ungrouped data :- The median of this data 4, 6, 3, 1, 2, 5, 7, 3 is 3.5. Proof :- - Rearrange the data in order of magnitude becomes 1,2,3,3,4,5,6,7. As n=8 (even), the median is the mean of the 4th and 5th observations that is 3.5. Example 3.4 for grouped data :- CGPA (Class)Frequency, f Cum. frequency Total50 Table 3.3

3) Mode The mode of a set of observations is the observation with the highest frequency and is usually denoted by ( ). Sometimes mode can also be used to describe the qualitative data. i) Mode of ungrouped data :- - Defined as the value which occurs most frequent. - The mode has the advantage in that it is easy to calculate and eliminates the effect of extreme values. - However, the mode may not exist and even if it does exit, it may not be unique.

*Note:  If a set of data has 2 measurements with higher frequency, therefore the measurements are assumed as data mode and known as bimodal data.  If a set of data has more than 2 measurements with higher frequency so the data can be assumed as no mode. ii) The mode for grouped data/frequency distribution data. - When data has been grouped in classes and a frequency curve is drawn to fit the data, the mode is the value of corresponding to the maximum point on the curve.

- Determining the mode using formula. where = the lower class boundary of the modal class; = the size of the modal class interval; = the difference between the modal class frequency and the class before it; and = the difference between the modal class frequency and the class after it. *Note: - The class which has the highest frequency is called the modal class.

Example 3.5 for ungrouped data : The mode for the observations 4,6,3,1,2,5,7,3 is 3. Example 3.6 for grouped data based on table : Proof :- CGPA (Class)Frequency Total50 Table 3.4 Modal Class

3.2 Measure of Dispersion  The measure of dispersion/spread is the degree to which a set of data tends to spread around the average value.  It shows whether data will set is focused around the mean or scattered.  The common measures of dispersion are: 1) range 2) variance 3) standard deviation  The standard deviation actually is the square root of the variance.  The sample variance is denoted by s 2 and the sample standard deviation is denoted by s.

1) Range  The range is the simplest measure of dispersion to calculate. Range = Largest value – Smallest value Example 3.7:- Table 3.5 gives the total areas in square miles of the four western South- Central states the United States. Solution: Range = Largest Value – Smallest Value = 267, 277 – 49, 651 = 217, 626 square miles. StateTotal Area (square miles) Arkansas53,182 Louisiana49,651 Oklahoma69,903 Texas267, 277 Table 3.4

2) Variance i) Variance for ungrouped data  The variance of a sample (also known as mean square) for the raw (ungrouped) data is denoted by s 2 and defined by: ii) Variance for grouped data  The variance for the frequency distribution is defined by:

Example 3.8 for ungrouped data : Refer example. Example 3.9 for grouped data : The variance for frequency distribution in Table 3.5 is: CGPA (Class)Frequency, f Class Mark, xfxfx Total Table 3.5

2Frequency, f Class Mark, xfxfx Total

3) Standard Deviation i) Standard deviation for ungrouped data :- ii) Standard deviation for grouped data :-

Example 3.10 (Based on example 3.8) for ungrouped data: *Refer example Example 3.11 (Based on example 3.9) for grouped data:

3.3 Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval about the mean. i) Chebyshev’s Theorem At least of the observations will be in the range of k standard deviation from mean. where k is the positive number exceed 1 or (k>1). Applicable for any distribution /not normal distribution. Steps: 1) Determine the interval 2) Find value of 3) Change the value in step 2 to a percent 4) Write statement: at least the percent of data found in step 3 is in the interval found in step 1

Example 3.12 : Consider a distribution of test scores that are badly skewed to the right, with a sample mean of 80 and a sample standard deviation of 5. If k=2, what is the percentage of the data fall in the interval from mean? Solution: 1) Determine interval 2) Find 3) Convert into percentage: 4) Conclusion: At least 75% of the data is found in the interval from 70 to 90

ii) Empirical Rule Applicable for a symmetric bell shaped distribution / normal distribution. k is a constant. k is a 1, 2 or 3 for Empirical Rule. There are 3 rules: i. 68% of the observations lie in the interval ii. 95% of the observations lie in the interval iii. 99.7% of the observations lie in the interval If k is not given, then: Formula for k =Distance between mean and each point standard deviation

ClassFrequency (f)Midpoint (x m )f. x m n=20

Arrange the data in order 209, 211, 211, 212, 213, 223, 227, 229, 240, 240 median

3. The median class is 5-6, since it contains the 5th value, (n/2 =5. From the table, Movies Showing Frequenc y, f Cummulative frequency Class Mark, xx2x2 fx fx