Statistics. Intro to statistics Presentations More on who to do qualitative analysis Tututorial time.

Slides:



Advertisements
Similar presentations
Unit 1: Science of Psychology
Advertisements

QUANTITATIVE DATA ANALYSIS
Lesson Fourteen Interpreting Scores. Contents Five Questions about Test Scores 1. The general pattern of the set of scores  How do scores run or what.
PSY 307 – Statistics for the Behavioral Sciences
BHS Methods in Behavioral Sciences I
Biostatistics Unit 2 Descriptive Biostatistics 1.
Variability Ibrahim Altubasi, PT, PhD The University of Jordan.
Central Tendency and Variability
Chapter 4 SUMMARIZING SCORES WITH MEASURES OF VARIABILITY.
Measures of Central Tendency
The Data Analysis Plan. The Overall Data Analysis Plan Purpose: To tell a story. To construct a coherent narrative that explains findings, argues against.
6 - 1 Basic Univariate Statistics Chapter Basic Statistics A statistic is a number, computed from sample data, such as a mean or variance. The.
Describing distributions with numbers
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 2 NUMERICAL DATA REPRESENTATION.
Statistics.
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
APPENDIX B Data Preparation and Univariate Statistics How are computer used in data collection and analysis? How are collected data prepared for statistical.
Descriptive Statistics Anwar Ahmad. Central Tendency- Measure of location Measures descriptive of a typical or representative value in a group of observations.
Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
Smith/Davis (c) 2005 Prentice Hall Chapter Six Summarizing and Comparing Data: Measures of Variation, Distribution of Means and the Standard Error of the.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Inferential statistics. Why statistics are important Statistics are concerned with difference – how much does one feature of an environment differ.
Statistics. Intro to statistics Presentations More on who to do qualitative analysis Tututorial time.
Descriptive Statistics
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
Chapter 1: Science of Psychology Daily Objective (concept map): Apply basic statistical concepts to explain research findings: - Descriptive Statistics:
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
INVESTIGATION 1.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Determination of Sample Size: A Review of Statistical Theory
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
What does Statistics Mean? Descriptive statistics –Number of people –Trends in employment –Data Inferential statistics –Make an inference about a population.
Basic Statistical Terms: Statistics: refers to the sample A means by which a set of data may be described and interpreted in a meaningful way. A method.
 Two basic types Descriptive  Describes the nature and properties of the data  Helps to organize and summarize information Inferential  Used in testing.
Central Tendency. Variables have distributions A variable is something that changes or has different values (e.g., anger). A distribution is a collection.
Chapter Eight: Using Statistics to Answer Questions.
PCB 3043L - General Ecology Data Analysis. PCB 3043L - General Ecology Data Analysis.
Data Analysis.
Describing Data Descriptive Statistics: Central Tendency and Variation.
STATISTICS FOR SCIENCE RESEARCH (The Basics). Why Stats? Scientists analyze data collected in an experiment to look for patterns or relationships among.
Statistical Analysis of Data. What is a Statistic???? Population Sample Parameter: value that describes a population Statistic: a value that describes.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Organizing and Analyzing Data. Types of statistical analysis DESCRIPTIVE STATISTICS: Organizes data measures of central tendency mean, median, mode measures.
Descriptive Statistics(Summary and Variability measures)
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Data Analysis. Qualitative vs. Quantitative Data collection methods can be roughly divided into two groups. It is essential to understand the difference.
PCB 3043L - General Ecology Data Analysis Organizing an ecological study What is the aim of the study? What is the main question being asked? What are.
Chapter 2 The Mean, Variance, Standard Deviation, and Z Scores.
Statistics in Forensics
Different Types of Data
AP Biology Intro to Statistics
Introductory Statistics
Central Tendency and Variability
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Descriptive Statistics
Science of Psychology AP Psychology
Central tendency and spread
Variability.
AP Biology Intro to Statistic
AP Biology Intro to Statistic
AP Biology Intro to Statistic
Measure of Central Tendency
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
CHAPTER 2: Basic Summary Statistics
BUSINESS MARKET RESEARCH
Presentation transcript:

Statistics

Intro to statistics Presentations More on who to do qualitative analysis Tututorial time

Inferential statistics

Descriptive vs Inferential statistics Descriptive statistics like totals (how many people came?), percentages (what proportion of the total were adolescents?) and averages (how much did they enjoy it?) use numbers to describe things that happen. Descriptive data page Descriptive data page Inferential statistics infer or predict the differences and relationships between things. They also tell us how certain or confident we can be about the predictions.

Why statistics are important Statistics are concerned with difference – how much does one feature of an environment differ from another Suicide rates/100,000 people

Why statistics are important Relationships – how does much one feature of the environment change as another measure changes The response of the fear centre of white people to black faces depending on their exposure to diversity as adolescents

The two tasks of statistics Magnitude: What is the size of the difference or the strength of the relationship? Reliability. What is the degree to which the measures of the magnitude of variables can be replicated with other samples drawn from the same population.

Magnitude – what’s our measure? Suicide rates/100,000 people Raw number? Some aggregate of numbers? Mean, median, mode?

Arithmetic mean or average Mean (M or X), is the sum (  X) of all the sample values ((X 1 + X 2 + X 3.…… X 22 ) divided by the sample size (N). Mean/average =  X/N ABA*BCA*C Overall rating Gener al Unitec ___ 14___ N146 64

Compute the mean GeneralUnitec Total (  X) N14664 mean

The median median is the "middle" value of the sample. There are as many sample values above the sample median as below it. If the number (N) in the sample is odd, then the median = the value of that piece of data that is on the (N-1)/2+1 position of the sample ordered from smallest to largest value. E.g. If N=45, the median is the value of the data at the (45-1)/2+1=23 rd position If the sample size is even then the median is defined as the average of the value of N/2 position and N/2+1. If N=64, the median is the average of the 64/2 (32 nd ) and the 64/2+1(33 rd ) position

Other measures of central tendency The mode is the single most frequently occurring data value. If there are two or more values used equally frequently, then the data set is called bi- modal or tri-modal, etc The midrange is the midpoint of the sample - the average of the smallest and largest data values in the sample. (= (2+10)/2 =6 for both groups The geometric mean (log transformation) =8.46 (general) and 7.38 (Unitec) The harmonic mean (inverse transformation) =8.19 (general) and 6.94 (Unitec) Both these last measures give less weight to extreme scores

Compute the median and mode Overall ratingGeneralUnitec N14664

Means, median, mode GeneralUnitec N14664 mean median98 mode108 geometric mean harmonic mean

The underlying distribution of the data

Normal distribution

Data that looks like a normal distribution

Three things we must know before we can say events are different 1.the difference in mean scores of two or more events - the bigger the gap between means the greater the difference 2.the degree of variability in the data - the less variability the better, as it suggests that differences between are reliable

Variance and Standard Deviation These are estimates of the spread of data. They are calculated by measuring the distance between each data point and the mean variance (s 2 ) is the average of the squared deviations of each sample value from the mean = s 2 =  X-M) 2 /(N-1) The standard deviation (s) is the square root of the variance.

Calculating the Variance (s 2 ) and the Standard Deviation (s) for the Unitec sample Xn(X-Mu)(X-Mu) 2 *n Overall ratingUnitec N Mean Unitec (Mu)=7.70Variance=3.83 SD or s=1.96

All normal distributions have similar properties. The percentage of the scores that is between one standard deviation (s) below the mean and one standard deviation above is always 68.26% s

Is there a difference between Unitec and General overall OAP rating scores

Is there a significant difference between Unitec and General OAP rating scores ss

Three things we must know before we can say events are different 3.The extent to which the sample is representative of the population from which it is drawn - the bigger the sample the greater the likelihood that it represents the population from which it is drawn - small samples have unstable means. Big samples have stable means.

Estimating difference The measure of stability of the mean is the Standard Error of the Mean = standard deviation/the square root of the number in the sample. So stability of mean is determined by the variability in the sample (this can be affected by the consistency of measurement) and the size of the sample. The standard error of the mean (SEM) is the standard deviation of the normal distribution of the mean if we were to measure it again and again

Yes it’s significant. The mean of the smaller sample (Unitec) is not too variable. Its Standard Error of the Mean = *SE = 0.48 = the 95% confidence interval. The General mean falls outside this confidence interval ss

Is the difference between means significant? What is clear is that the mean of the General group is outside the area where there is a 95% chance that the mean for the Unitec Group will fall, so it is likely that the General mean comes from a different population as the Unitec mean. The convention is to say that if mean 2 falls outside of the area (the confidence interval) where 95% of mean 1 scores is estimated to be, then mean 2 is significantly different from mean 1. We say the probability of mean 1 and mean 2 being the same is less than 0.05 (p<0.05) and the difference is significant p

The significance of significance Not an opinion A sign that very specific criteria have been met A standardised way of saying that there is a There is a difference between two groups – p<0.05; There is no difference between two groups – p>0.05; There is a predictable relationship between two groups – p<0.05; or There is no predictable relationship between two groups - p>0.05. A way of getting around the problem of variability

If you argue for a one tailed test – saying the difference can only be in one direction, then you can add 2.5% error from side where no data is expected to the side where it is 2.5% of distri- bution 95% of distri- bution 2-tailed test 1-tailed test Standard deviations One and two tailed tests

T-test result t-Test: Two-Sample Assuming Unequal Variances General adultsUnitec adults Mean Variance Observations14664 t Stat for p< p one-tail0.00 t Critical one-tail1.66 p two-tail0.00 t Critical two-tail1.98

MasseyUnsworth Heights Mean Variance Observations5215 t Stat for p< p one-tail0.06 t Critical one-tail1.75 p two-tail0.12 t Critical two-tail2.12 malefemale Mean Variance Observations83125 t Stat for p< p one-tail0.07 t Critical one-tail1.65 p two-tail0.13 t Critical two-tail1.97