University of Sunderland CSEM03 R.E.P.L.I. Unit 1 CSEM03 REPLI Research and the use of statistical tools.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 4. Measuring Averages.
Introduction to Summary Statistics
BHS Methods in Behavioral Sciences I April 18, 2003 Chapter 4 (Ray) – Descriptive Statistics.
Lecture 2 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Introduction to Educational Statistics
Measures of Dispersion
Data observation and Descriptive Statistics
SHOWTIME! STATISTICAL TOOLS IN EVALUATION DESCRIPTIVE VALUES MEASURES OF VARIABILITY.
Describing distributions with numbers
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 2 NUMERICAL DATA REPRESENTATION.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
EPE/EDP 557 Key Concepts / Terms –Empirical vs. Normative Questions Empirical Questions Normative Questions –Statistics Descriptive Statistics Inferential.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
1 DATA DESCRIPTION. 2 Units l Unit: entity we are studying, subject if human being l Each unit/subject has certain parameters, e.g., a student (subject)
PTP 560 Research Methods Week 8 Thomas Ruediger, PT.
Statistics Recording the results from our studies.
Statistical Tools in Evaluation Part I. Statistical Tools in Evaluation What are statistics? –Organization and analysis of numerical data –Methods used.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Variability. Statistics means never having to say you're certain. Statistics - Chapter 42.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Chapter 2 Characterizing Your Data Set Allan Edwards: “Before you analyze your data, graph your data.
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
Describing distributions with numbers
Skewness & Kurtosis: Reference
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Descriptive Statistics Descriptive Statistics describe a set of data.
Copyright © 2014 by Nelson Education Limited. 3-1 Chapter 3 Measures of Central Tendency and Dispersion.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Statistics for Psychology CHAPTER SIXTH EDITION Statistics for Psychology, Sixth Edition Arthur Aron | Elliot J. Coups | Elaine N. Aron Copyright © 2013.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Measures of Central Tendency: The Mean, Median, and Mode
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Unit 3: Averages and Variations Week 6 Ms. Sanchez.
IE(DS)1 Descriptive Statistics Data - Quantitative observation of Behavior What do numbers mean? If we call one thing 1 and another thing 2 what do we.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Statistical Analysis of Data. What is a Statistic???? Population Sample Parameter: value that describes a population Statistic: a value that describes.
Introduction to statistics I Sophia King Rm. P24 HWB
Describing Distributions Statistics for the Social Sciences Psychology 340 Spring 2010.
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
Anthony J Greene1 Central Tendency 1.Mean Population Vs. Sample Mean 2.Median 3.Mode 1.Describing a Distribution in Terms of Central Tendency 2.Differences.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Descriptive Statistics(Summary and Variability measures)
REVIEW OF BASIC STATISTICAL CONCEPTS Kerstin Palombaro PT, PhD, CAPS HSED 851 PRIVITERA CHAPTERS 1-4.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Lecture 8 Data Analysis: Univariate Analysis and Data Description Research Methods and Statistics 1.
A QUANTITATIVE RESEARCH PROJECT -
Exploratory Data Analysis
Statistical Methods Michael J. Watts
Doc.RNDr.Iveta Bedáňová, Ph.D.
Statistical Methods Michael J. Watts
Chapter 3 Describing Data Using Numerical Measures
Descriptive Statistics
Central Tendency and Variability
Description of Data (Summary and Variability measures)
STATS DAY First a few review questions.
Central Tendency.
Descriptive Statistics
Introduction to Statistics
Ms. Saint-Paul A.P. Psychology
Myers Chapter 1 (F): Statistics in Psychological Research: Measures of Central Tendency A.P. Psychology.
Descriptive Statistics
Lecture 4 Psyc 300A.
Central Tendency & Variability
Presentation transcript:

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 CSEM03 REPLI Research and the use of statistical tools

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 Objectives At the end of this lecture you will be able to: 1.Be able to plot graphs and know your way around SPSS 2.Describe the shape of normal and non- normal distributions 3.Describe the characteristics of a normal and non-normal distribution Mode, median, mean, standard deviation

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 The SPSS statistical tool Recommended text: Andy Field’s Field, A (2005) Discovering Statistics using SPSS, 2 nd Edition, Sage Publishing, London (ISBN ) You can get SPSS for your own machine for £5 from the LRC

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 Simple statistical models: mean, sum of squares, variance and standard deviation Mean We can consider the mean of a sample as one of the simplest statistical models because it represents a summary of the data. For example: How may CDs does a group of students own? If we take 5 students then the numbers of CDs owned respectively are 1,2,3,4,6. The mean is the sum of these (  ) divided by the number of students. Mean = 16 = This is a theoretical mean, you cannot have.2 of a CD. How well does mean represent the data? What are the differences between the observed and the mean?

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 Differences between the mean and the observed data –3.2 = = = = = No of CDs Student Figure 1 Differences between observed no of CDs and the mean

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 Cancelling out the errors in the data (deviation from the mean) x i - = = -2.2 where xi = first data point x 1 so x 2 - =2 – 3.2 = -1.2 etc The deviances are -2.2, -1.2, -0.2, and +2.8 Total error =  x i - (sum of deviances) = 0

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 Sum of Squared errors (SS) So there is no total between our model and the observed data. Some errors are negative, some are positive but they cancel each other out. To avoid the problem of knowing the direction of error (eg in a large dataset) we square each error (a negative squared becomes positive). This is called the Sum of Squared errors (SS) and is a good measure of the accuracy of our model. This however depends on the amount of data collected and the more data points the higher the SS. To overcome this we average the error by dividing SS by the number of observations N.

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 A more useful statistic is to use the error in the sample to estimate the error in the population – this is done by dividing SS by the no of observations – 1. This measure is known as the variance

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 Variance and Standard Deviation Variance s2 = SS =  (xi - )2 N-1 N-1  ( xi - )2 = = 14.8  (xi - )2 = 14.8 = 3.7 N-1 4 From this statistic we can derive a very useful measure called Standard Deviation (SD) SD =  Variance =  3.7 = 1.92

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 Levels of data The type of data you collect will depend on the design of your study. Data can be measured on different scales: Interval data These data are measured on a scale along which intervals are equal. For example if you record ratings of a pop video on a range of 1 to 5 the change between each number should be equal. Categorical data These are any variables that are made up of objects/ entities. For example the UK degree classification system comprises 1, 2:1, 2:2, 3, pass or fail. The interval between each class is not equal. Nominal data This is where numbers can represent names e.g. the numbers on the back of football shirts denotes the type of player ( 1 = goalkeeper).. Measurement data The objects being studied are measured on a quantitative scale. With discrete measurement data only certain values are possible The data can be discrete or continuous. Examples of continuous measurement data are age, height, cholesterol level. Ordinal data A type of categorical data where the order is important e.g Degree classification, seriousness of illness.

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 Median The median is the "Middle value" of a list. The smallest number such that at least half the numbers in the list are no greater than it. If the list has an odd number of entries, the median is the middle entry in the list after sorting the list into increasing order. If the list has an even number of entries, the median is equal to the sum of the two middle (after sorting) numbers divided by two. The median can be estimated from a histogram by finding the smallest number such that the area under the histogram to the left of that number is 50%

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 Mode For lists, the mode is the most common (frequent) value. A list can have more than one mode. In a histogram, a mode is the most frequently occurring interval (seen as a bump).

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 Normal distribution example (scores in a test)

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 Positive skew

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 Standard deviation  34.1% 13.6% 0.1% 2.1% -3  -2  -1   11 22 33 Mean 

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 Find the mean, median, mode, and range of these data. Example: Three dice are rolled 12 times. The sum of the numbers after each roll is recorded below: Numbers rolled 12, 11, 3, 7, 4, 4, 17, 13, 12, 5, 8, 12 Step 1: Rearrange the data elements. 3, 4, 4, 5, 7, 8, 11, 12,12, 12, 13, 17 Step 2: Find the mean. Add all the numbers and divide by 12 =116/12 = 9.7 Step 3: Find the median. The sample size is even, for there are 12 data elements. The median is the average value of the sixth and the seventh elements. median = (8+11)/2 = 9.5

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 Step 4: Find the mode. The number 3 occurs once. The number 4 occurs twice. The number 5 occurs once. The number 7 occurs once. The number 8 occurs once. The number 11 occurs once. The number 12 occurs three times. The number 13 occurs once. The number 17 occurs once. mode = 12 Step 5: Find the range. The highest value is 17. The lowest value is 3. range = 17-3 = 14

University of Sunderland CSEM03 R.E.P.L.I. Unit 1 Examples for practice The weekly salaries of six employees at McDonalds are £70, £100, £90, £80, £70, £100. For these six salaries, find: (a) the mean (b) the median (c) the mode List the data in order: 70,70,80,90,100,100 Mean: = 510 = Median: 60, 70, 80, 90, 90, 120 The two numbers that fall in the middle need to be averaged = 85 2 Mode: The number that appears the most is 90