Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.

Slides:



Advertisements
Similar presentations
Lesson Describing Distributions with Numbers parts from Mr. Molesky’s Statmonkey website.
Advertisements

Appendix A. Descriptive Statistics Statistics used to organize and summarize data in a meaningful way.
Descriptive Statistics
Basic Business Statistics (10th Edition)
Calculating & Reporting Healthcare Statistics
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
Descriptive Statistics A.A. Elimam College of Business San Francisco State University.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
EdPsy 511 August 28, Common Research Designs Correlational –Do two qualities “go together”. Comparing intact groups –a.k.a. causal-comparative and.
1 Pertemuan 02 Ukuran Numerik Deskriptif Matakuliah: I0262-Statistik Probabilitas Tahun: 2007.
Edpsy 511 Homework 1: Due 2/6.
Measures of Dispersion
Coefficient of Variation
© 2003 Prentice-Hall, Inc.Chap 3-1 Business Statistics: A First Course (3 rd Edition) Chapter 3 Numerical Descriptive Measures.
Measures of Central Tendency
The Data Analysis Plan. The Overall Data Analysis Plan Purpose: To tell a story. To construct a coherent narrative that explains findings, argues against.
Describing Data: Numerical
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Programming in R Describing Univariate and Multivariate data.
Numerical Descriptive Measures
EPE/EDP 557 Key Concepts / Terms –Empirical vs. Normative Questions Empirical Questions Normative Questions –Statistics Descriptive Statistics Inferential.
MSE 600 Descriptive Statistics Chapter 10 in 6 th Edition (may be another chapter in 7 th edition)
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Statistics. Question Tell whether the following statement is true or false: Nominal measurement is the ranking of objects based on their relative standing.
Summary statistics Using a single value to summarize some characteristic of a dataset. For example, the arithmetic mean (or average) is a summary statistic.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Modified by ARQ, from © 2002 Prentice-Hall.Chap 3-1 Numerical Descriptive Measures Chapter %20ppts/c3.ppt.
Statistics Recording the results from our studies.
© The McGraw-Hill Companies, Inc., Chapter 3 Data Description.
Instrumentation (cont.) February 28 Note: Measurement Plan Due Next Week.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Measures of Dispersion & The Standard Normal Distribution 2/5/07.
Chapter 2 Describing Data.
Descriptive Statistics
Descriptive Statistics1 LSSG Green Belt Training Descriptive Statistics.
Measures of Dispersion & The Standard Normal Distribution 9/12/06.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
EDPSY Chp. 2: Measurement and Statistical Notation.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Measures of Central Tendency: The Mean, Median, and Mode
 Two basic types Descriptive  Describes the nature and properties of the data  Helps to organize and summarize information Inferential  Used in testing.
Statistical Analysis Quantitative research is first and foremost a logical rather than a mathematical (i.e., statistical) operation Statistics represent.
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
Statistical Analysis of Data. What is a Statistic???? Population Sample Parameter: value that describes a population Statistic: a value that describes.
Sampling (cont.) Instrumentation Measurement Plan Due 3/7.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
Why do we analyze data?  It is important to analyze data because you need to determine the extent to which the hypothesized relationship does or does.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Descriptive Statistics(Summary and Variability measures)
HMS 320 Understanding Statistics Part 2. Quantitative Data Numbers of something…. (nominal - categorical Importance of something (ordinal - rankings)
Data Description Chapter 3. The Focus of Chapter 3  Chapter 2 showed you how to organize and present data.  Chapter 3 will show you how to summarize.
Applied Quantitative Analysis and Practices LECTURE#05 By Dr. Osman Sadiq Paracha.
© 1999 Prentice-Hall, Inc. Chap Measures of Central Location Mean, Median, Mode Measures of Variation Range, Variance and Standard Deviation Measures.
Lecture 8 Data Analysis: Univariate Analysis and Data Description Research Methods and Statistics 1.
Figure 2-7 (p. 47) A bar graph showing the distribution of personality types in a sample of college students. Because personality type is a discrete variable.
Statistics.
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Description of Data (Summary and Variability measures)
STATS DAY First a few review questions.
Numerical Descriptive Measures
Descriptive Statistics
Numerical Descriptive Measures
BUSINESS MATHEMATICS & STATISTICS.
Numerical Descriptive Measures
Presentation transcript:

Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19

Shapes of Distributions ► Normal distribution ► Positive Skew  Or right skewed ► Negative Skew  Or left skewed

How is this variable distributed?

Descriptive Statistics

Statistics vs. Parameters ► A parameter is a characteristic of a population.  It is a numerical or graphic way to summarize data obtained from the population ► A statistic is a characteristic of a sample.  It is a numerical or graphic way to summarize data obtained from a sample

Types of Numerical Data ► There are two fundamental types of numerical data: 1) Categorical data: obtained by determining the frequency of occurrences in each of several categories 2) Quantitative data: obtained by determining placement on a scale that indicates amount or degree

Techniques for Summarizing Quantitative Data ► Frequency Distributions ► Histograms ► Stem and Leaf Plots ► Distribution curves ► Averages ► Variability

Summary Measures Central Tendency Arithmetic Mean Median Mode Quartile Summary Measures Variation Variance Standard Deviation Range

Measures of Central Tendency Central Tendency Average (Mean)MedianMode

Mean (Arithmetic Mean) ► Mean (arithmetic mean) of data values  Sample mean  Population mean Sample Size Population Size

Mean ► The most common measure of central tendency ► Affected by extreme values (outliers) Mean = 5Mean = 6

Mean of Grouped Frequency XffX TotalN 21

Weighted Mean A form of mean obtained from groups of data in which the different sizes of the groups are accounted for or weighted.

GroupxbarNf(xbar)

Median ► Robust measure of central tendency ► Not affected by extreme values ► In an Ordered array, median is the “middle” number  If n or N is odd, median is the middle number  If n or N is even, median is the average of the two middle numbers Median = 5

Mode ► A measure of central tendency ► Value that occurs most often ► Not affected by extreme values ► Used for either numerical or categorical data ► There may may be no mode ► There may be several modes Mode = No Mode

The Normal Curve

Different Distributions Compared

Variability ► Refers to the extent to which the scores on a quantitative variable in a distribution are spread out. ► The range represents the difference between the highest and lowest scores in a distribution. ► A five number summary reports the lowest, the first quartile, the median, the third quartile, and highest score.  Five number summaries are often portrayed graphically by the use of box plots.

Variance ► The Variance, s 2, represents the amount of variability of the data relative to their mean ► As shown below, the variance is the “average” of the squared deviations of the observations about their mean ► The Variance, s 2, is the sample variance, and is used to estimate the actual population variance,  2

Standard Deviation ► Considered the most useful index of variability. ► It is a single number that represents the spread of a distribution. ► If a distribution is normal, then the mean plus or minus 3 SD will encompass about 99% of all scores in the distribution.

Calculation of the Variance and Standard Deviation of a Distribution √ Raw ScoreMeanX – X(X – X) Variance (SD 2 ) = Σ(X – X) 2 N-1 = = Standard deviation (SD) = Σ(X – X) 2 N-1

Comparing Standard Deviations Mean = 15.5 S = Data B Data A Mean = 15.5 S = Mean = 15.5 S = 4.57 Data C

Facts about the Normal Distribution ► 50% of all the observations fall on each side of the mean. ► 68% of scores fall within 1 SD of the mean in a normal distribution. ► 27% of the observations fall between 1 and 2 SD from the mean. ► 99.7% of all scores fall within 3 SD of the mean. ► This is often referred to as the rule

Fifty Percent of All Scores in a Normal Curve Fall on Each Side of the Mean

Probabilities Under the Normal Curve

Standard Scores ► Standard scores use a common scale to indicate how an individual compares to other individuals in a group. ► The simplest form of a standard score is a Z score. ► A Z score expresses how far a raw score is from the mean in standard deviation units. ► Standard scores provide a better basis for comparing performance on different measures than do raw scores. ► A Probability is a percent stated in decimal form and refers to the likelihood of an event occurring. ► T scores are z scores expressed in a different form (z score x ).

Probability Areas Between the Mean and Different Z Scores

Examples of Standard Scores