Final review - statistics Spring 03 Also, see final review - research design.

Slides:



Advertisements
Similar presentations
Psychology: A Modular Approach to Mind and Behavior, Tenth Edition, Dennis Coon Appendix Appendix: Behavioral Statistics.
Advertisements

Table of Contents Exit Appendix Behavioral Statistics.
Statistics. Review of Statistics Levels of Measurement Descriptive and Inferential Statistics.
Statistical Tests Karen H. Hagglund, M.S.
QUANTITATIVE DATA ANALYSIS
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Descriptive Statistics
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.
Social Research Methods
Today Concepts underlying inferential statistics
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Richard M. Jacobs, OSA, Ph.D.
Inferential Statistics
Inferential Statistics
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Choosing Statistical Procedures
Testing Hypotheses.
Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242.
AM Recitation 2/10/11.
Statistical Analysis I have all this data. Now what does it mean?
Hypothesis Testing Charity I. Mulig. Variable A variable is any property or quantity that can take on different values. Variables may take on discrete.
Fundamentals of Statistical Analysis DR. SUREJ P JOHN.
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 2: Basic techniques for innovation data analysis. Part I: Statistical inferences.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Chapter 15 Data Analysis: Testing for Significant Differences.
Statistical Analysis I have all this data. Now what does it mean?
Statistics 11 Correlations Definitions: A correlation is measure of association between two quantitative variables with respect to a single individual.
Data Analysis (continued). Analyzing the Results of Research Investigations Two basic ways of describing the results Two basic ways of describing the.
Research Process Parts of the research study Parts of the research study Aim: purpose of the study Aim: purpose of the study Target population: group whose.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Agenda Descriptive Statistics Measures of Spread - Variability.
Review Hints for Final. Descriptive Statistics: Describing a data set.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Chapter Eight: Using Statistics to Answer Questions.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Data Analysis.
Chapter 6: Analyzing and Interpreting Quantitative Data
Tuesday, April 8 n Inferential statistics – Part 2 n Hypothesis testing n Statistical significance n continued….
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
HL Psychology Internal Assessment
Chapter 13 Understanding research results: statistical inference.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Dr.Rehab F.M. Gwada. Measures of Central Tendency the average or a typical, middle observed value of a variable in a data set. There are three commonly.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Central Bank of Egypt Basic statistics. Central Bank of Egypt 2 Index I.Measures of Central Tendency II.Measures of variability of distribution III.Covariance.
Agenda n Probability n Sampling error n Hypothesis Testing n Significance level.
Data analysis Research methods.
Statistics.
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Summary descriptive statistics: means and standard deviations:
Social Research Methods
Introduction to Inferential Statistics
Research Statistics Objective: Students will acquire knowledge related to research Statistics in order to identify how they are used to develop research.
Statistical Evaluation
Introduction to Statistics
Basic Statistical Terms
Summary descriptive statistics: means and standard deviations:
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
Chapter Nine: Using Statistics to Answer Questions
Presentation transcript:

Final review - statistics Spring 03 Also, see final review - research design

Statistics Descriptive Statistics Statistics to summarize and describe the data we collected Inferential Statistics Statistics to make inferences from samples to the populations

A summary of your data Center / Central Tendencies Indicates a central value for the variable Measures of Dispersion (Variability / Spread) Indicate how much each participants’ score vary from each other Measures of Association Indicates how much variables go together (Shown in Tables, Graphs, Distributions)

Measures of Center u Mode u A value with the highest frequency u The most common value u Median u The “middle” score u Mean u Average

WHY are LEVELS / SCALE of MEASUREMENT IMPORTANT? u Because you need to match the statistic you use to the kind of variable you have

Measures of Central Tendency, Center NominalOrdinalInterval/Ratio Mode Median Mean

Summary Ratio Interval Ordinal Nominal Difference Order Equal Interval Meaningful Zero Calculate Math Info of difference among values Level of Measurement

Why “Equal Distance” Matters? u If the distance between values are equal (as in interval or ratio data), you are able to calculate (add, subtract, multiply, divide) values You can get a mean only for interval/ratio variables A wider variety of statistical tests are available for interval/ratio variables

What are the Mean, Median, and Mode for this distribution? What is this distribution shape called?

Types of Measures of Dispersion Variability / Spread u Frequencies / Percentages u Range u The distance between the highest score and the lowest score (highest – lowest) u Standard deviation / u Variance

Variance / Standard Deviation u Variance (S-squared): An approximate average of the squared deviations from the mean u Standard Deviation(S or SD): Square root of variance u The larger the variance/ SD is, the higher variability the data has or larger variation in scores, or distributions that vary widely from the mean.

Measures of Dispersion NominalOrdinal Frequency, % Range, IQR StandardDeviatn, Variance Interval/Ratio

CORRELATION u Co-relation u 2 variables tend to “go together” u Indicates how strongly and in which direction two variables are correlated with each other  *** Correlation does NOT EQUAL cause

SIGN u 0: No systematic relationship Positive correlation: As one variable increases, so does the 2nd Negative correlation: As one variable increases, the 2 nd gets smaller

Correlation Co-efficient +10 NegativePositive Stronger Weaker Perfect None

SIZE u Ranges from –1 to + 1 u 0 or close to 0 indicates NO relationship u +/ weak u +/ moderate u +/ strong u +/ very strong u +/ perfect Negative relationships are NOT weaker!

Significance Test u Correlation co-efficient also comes with significance test (p-value) u p=.05:.05 probability of no correlation in the population = 5% risk of TYPE I Error = 95% confidence level u If p<.05, reject H 0 and support Ha at 95% confidence level

1.Infer characteristics of a population from the characteristics of the samples. 2.Hypothesis Testing 3.Statistical Significance 4.The Decision Matrix

Sample Statistics X SD n Population Parameters m s N

Inferential Statistics u assess -- are the sample statistics indicators of the population parameters? u Differences between 2 groups -- happened by chance? u What effect do random sampling errors have on our results?

Random sampling error  Random sampling error: Difference between the sample characteristics and the population characteristics caused by chance  Sampling bias: Difference between the sample characteristics and the population characteristics caused by biased (non-random) sampling

Probability u Probability (p) ranges between 1 and 0 u p = 1 means that the event would occur in every trial u p = 0 means the event would never occur in any trial u The closer the probability is to 1, the more likely that the event will occur u The closer the probability is to 0, the less likely the event will occur

P >.05 means that … 95%  Means of two groups fall in 95% central area of normal distribution with one population mean Mean 1 Mean 2

P <.05 means that … 11 22  Means of two groups do NOT fall in 95% central area of normal distribution of one population mean, so it is more reasonable to assume that they belong to different populations

Null Hypothesis Says IV has no influence on DV There is no difference between the two variables. There is no relationship between the two variables.

Null Hypothesis u States there is NO true difference between the groups u If sample statistics show any difference, it is due to random sampling error u Referred as H 0 u (Research Hypothesis = Ha) u If you can reject H 0, you can support Ha u If you fail to reject H 0, you reject Ha

u Be conservative.  What are chances I would get these results if null hypothesis is true?  Only if pattern is highly unlikely (p .05) do you reject null hypothesis and support your hypothesis u Since cannot be 100% sure your conclusion is correct, you take up to 5% risk. u Your p-value tells you the risk /the probability of making TYPE I Error

Correct Wrong person to marry Type II error You think it’s the wrong person to marry Type I error True state

Correct No fire Type II error No Alarm Type I error True state

Correct H o (no fire) H a H o = null hypothesis = there is NO fire H a = alternative hyp. = there IS a FIRE Accept H o (no alarm) Type II error Type I error Reject H o True State You decide...

Easy ways to LOSE points u Use the word “prove” u Better to say support the hypothesis or consistent with the hypothesis u Tentative statements acknowledge possibility of making a Type 1 or Type 2 error u Use the word “random” incorrectly

Significance Test u Significance test examines the probability of TYPE I error (falsely rejecting H 0 ) u Significance test examines how probable it is that the observed difference is caused by random sampling error u Reject the null hypothesis if probability is <.05 (probability of TYPE I error is smaller than.05)

Principle Logic P <.05 Reject Null Hypothesis (H 0 ) Support Your Hypothesis (Ha)

Logic of Hypothesis Testing Statistical tests used in hypothesis testing deal with the probability of a particular event occurring by chance. Is the result common or a rare occurrence if only chance is operating? A score (or result of a statistical test) is “Significant” if score is unlikely to occur on basis of chance alone.

The “Level of Significance” is a cutoff point for determining significantly rare or unusual scores. Scores outside the middle 95% of a distribution are considered “Rare” when we adopt the standard “5% Level of Significance” This level of significance can be written as: p =.05 Level of Significance

Decision Rules Reject Ho (accept Ha) when the sample statistic is statistically significant at the chosen p level, otherwise accept Ho (reject Ha). Possible errors: You reject the Null Hypothesis when in fact it is true, a Type I Error, or Error of Rashness. B.You accept the Null Hypothesis when in fact it is false, a Type II Error, or Error of Caution.

Type I error Correct Data results are by chance (Null is true) Correct Data indicates something significant is happening (reject null) Type II error There is nothing happening except chance variation (accept the null) Data indicates something is happening (Null is false) True state Your decision: 

To compare two groups on Mean Scores use t-test. For more than 2 groups use Analysis of Variance (ANOVA) Can’t get a mean from nominal or ordinal data. Chi Square tests the difference in Frequency Distributions of two or more groups.

Parametric Tests u Used with data w/ mean score or standard deviation. u t-test, ANOVA and Pearson’s Correlation r. u u Use a t-test to compare mean differences between two groups (e.g., male/female and married/single). u

Parametric Tests u use ANalysis Of VAriance (ANOVA) to compare more than two groups (such as age and family income) to get probability scores for the overall group differences. u Use a Post Hoc Tests to identify which subgroups differ significantly from each other.

When comparing two groups on MEAN SCORES use the t-test.

T-test u If p<.05, we conclude that two groups are drawn from populations with different distribution (reject H 0 ) at 95% confidence level

When comparing two groups on MEAN SCORES use the t-test. Our Research Hypothesis: hair length leads to different perceptions of a person. The Null Hypothesis: there will be no difference between the pictures.

Short Hair: Mean = 2.2 SD = 1.9 n = 100 Long Hair: Mean = 4.1 SD = 1.8 n = 100 Mean scores come from different distributions. Mean scores reflect just chance differences from a single distribution. Accept Ha Accept Ho p =.03 “I think she is one of those people who quickly earns respect.”

Short Hair: Mean = 1.6 SD = 1.7 n = 100 Long Hair: Mean = 3.6 SD = 1.2 n = 100 Mean scores come from different distributions. Mean scores reflect just chance differences from a single distribution. Accept Ha Accept Ho p =.01 “In my opinion, she is a mature person.”

Short Hair: Mean = 3.7 SD = 1.8 n = 100 Long Hair: Mean = 3.9 SD = 1.5 n = 100 Mean scores are just chance differences from a single distribution. Accept Ha Accept Ho p =.89 Mean scores come from different distributions. “I think we are quite similar to one another.”

A nonsignificant result may be caused by a u A.low sample size. u B.very cautious significance level. u C.weak manipulation of independent variables. u D.true null hypothesis.

When to use various statistics u Parametric u Interval or ratio data u Non-parametric u Ordinal and nominal data

Chi-Square X 2 u Chi Square tests the difference in frequency distributions of two or more groups. u Test of Significance u of two nominal variables or u of a nominal variable & an ordinal variable u Used with a cross tabulation table