School of Computing FACULTY OF ENGINEERING MJ11 (COMP1640) Modelling, Analysis & Algorithm Design Vania Dimitrova Lecture 19 Statistical Data Analysis:

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

School of Computing FACULTY OF ENGINEERING MJ11 (COMP1640) Modelling, Analysis & Algorithm Design Vania Dimitrova Lecture 18 Statistical Data Analysis:
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Hypothesis Testing Steps in Hypothesis Testing:
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
1 Hypothesis testing. 2 A common aim in many studies is to check whether the data agree with certain predictions. These predictions are hypotheses about.
Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses Type I and Type II Errors Type I and Type II Errors.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Confidence Interval and Hypothesis Testing for:
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
 Once you know the correlation coefficient for your sample, you might want to determine whether this correlation occurred by chance.  Or does the relationship.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
PSY 307 – Statistics for the Behavioral Sciences
Chapter 14 Conducting & Reading Research Baumgartner et al Chapter 14 Inferential Data Analysis.
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
Inferences About Process Quality
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
Mann-Whitney and Wilcoxon Tests.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Chapter 9 Title and Outline 1 9 Tests of Hypotheses for a Single Sample 9-1 Hypothesis Testing Statistical Hypotheses Tests of Statistical.
Choosing Statistical Procedures
AM Recitation 2/10/11.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Statistical inference: confidence intervals and hypothesis testing.
Hypothesis Testing Charity I. Mulig. Variable A variable is any property or quantity that can take on different values. Variables may take on discrete.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Inferential Statistics 2 Maarten Buis January 11, 2006.
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
User Study Evaluation Human-Computer Interaction.
Hypothesis Testing Using the Two-Sample t-Test
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
Review Hints for Final. Descriptive Statistics: Describing a data set.
Academic Research Academic Research Dr Kishor Bhanushali M
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
Chapter 10 The t Test for Two Independent Samples
INTRODUCTION TO HYPOTHESIS TESTING From R. B. McCall, Fundamental Statistics for Behavioral Sciences, 5th edition, Harcourt Brace Jovanovich Publishers,
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Chapter 13 Understanding research results: statistical inference.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Chapter 22 Inferential Data Analysis: Part 2 PowerPoint presentation developed by: Jennifer L. Bellamy & Sarah E. Bledsoe.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 7 l Hypothesis Tests 7.1 Developing Null and Alternative Hypotheses 7.2 Type I & Type.
Chapter 9 Introduction to the t Statistic
Chapter 8 Introducing Inferential Statistics.
Hypothesis Tests l Chapter 7 l 7.1 Developing Null and Alternative
Logic of Hypothesis Testing
Lecture Nine - Twelve Tests of Significance.
Reasoning in Psychology Using Statistics
Reasoning in Psychology Using Statistics
Inferences on Two Samples Summary
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Statistical Inference about Regression
Introduction to Hypothesis Testing
Testing Hypotheses about a Population Proportion
What are their purposes? What kinds?
Reasoning in Psychology Using Statistics
Testing Hypotheses about a Population Proportion
Reasoning in Psychology Using Statistics
Presentation transcript:

School of Computing FACULTY OF ENGINEERING MJ11 (COMP1640) Modelling, Analysis & Algorithm Design Vania Dimitrova Lecture 19 Statistical Data Analysis: Hypothesis Checking November 2011

In the previous lecture Introduction to Statistical Modelling Types of data Sampling methods Descriptive statistics Normal distribution Correlation between variables Correlation coefficient r Positive, Negative, or No correlation

In this lecture Hypothesis testing - introduction Steps Significance How to use significance tables Differences between two groups t-test Man-Whitney U-test

Sample Subset of the population; some of the measurements of the characteristics of the population. population sample Measures describing population characteristics PARAMETERS Measures describing sample characteristics STATISTICS Statistics estimate the parameters

Hypothesis Statement about: the value of a population parameter (e.g. its mean), or about a particular relationship that may hold between variables (e.g. correlation), or about difference between two populations. Null hypothesis H 0 : (No effect) We aim at rejecting it, which will support its alternative. Alternative hypothesis H 1 : (positive effect) We test H 0 against H 1, if H 0 is rejected, H 1 is accepted

Research and Null Hypotheses Operationalise – find parameter to compare (e.g. mean) H 1 (research hypothesis) - H 0 (null hypothesis) - H 1 can be: Two-directional (2-tailed) One-directional (1-tailed)

Example Hypotheses Consider the example with the European Area current accounts. Notice the difference between the means. Formulate a hypothesis based on your observation. population sample Hypotheses:

Error Sample Population Use the sample to test hypothesis about the population Can we be sure that the result observed in the sample is valid for the whole population, or could this be pure chance? Could it happen that we erroneously reject H 0 ?

Statistical significance p-value The probability of an error in accepting the observed result as valid (i.e. the risk of rejecting H 0 when H 0 is valid). We want p to be as small as possible:

Hypothesis Testing Procedure Formulate Alternative & Null Hypotheses Decide which sampling method to use Decide what statistical method to use Define Significance level and sample size Find the sampling distribution under the assumption H 0 and obtain the p-value Derive conclusions about H 1

What statistical test to use? Parametric Assumptions Dependent variable is continuous Random sample Independence of observations (tricky! – e.g. people working in groups, influences on behaviour) Normal distribution Homogeneity of variance (samples are from populations with equal variance) Non-Parametric Assumptions At least one of the above is violated Powerful tests Limited power tests

Statistical tests to compare groups Two groups Multiple Groups t-tests Analysis of variance Paired samples Independent samples the same sample several different samples 1 independent var. 2 independent var. m-independent var.

Paired t-test Suppose that a new functionality is introduced to an online banking system. We want to compare whether it improves the customer satisfaction. Suppose parametric assumptions are met H 1 (research hypothesis) - H 0 (null hypothesis) -

Paired t-test calculations OldNew d Run the calculations automatically

Independent (unpaired) t-test Consider A-level students applying to a university. A random sample is obtained including students applying for BA and BSc. The A-level score for each student is calculated. A difference in the means is observed. Test whether the difference is valid for the whole population. Suppose parametric assumptions are met H 1 (research hypothesis) - H 0 (null hypothesis) -

Independent (unpaired) t-test calculations Data in unpaired-t-test-example.xls Run the calculations automatically Calculations based on a weighted average of the two sample variances and

Non-parametric equivalent to t-test Man-Whitney U-test Relaxed assumptions (discrete data) Ranking (ordering) Compares medians Counts how many elements from each sample are before the elements from the other sample Less power but can be helpful Need tables for non-parametric tests

Practical task For each of the claims below identify an appropriate sampling method and statistical test that can be used to test the claim assuming that appropriate data is collected and the parametric conditions are satisfied. Formulate the research and null hypotheses. TV advertisement is more powerful than Radio advertisement. Women are more likely to buy computer games than men. Students use social computing sites more often than staff.

Summary: Hypothesis checking Hypothesis – null & alternative Comparing two groups T-test (parametric) Man Whitney (non-parametric) Significance level & Degree of Freedom min expected value to reject H 0 Calculation using specific software Online (see links on the slides) Software package, e.g. SPSS or Matlab Decision rejecting H 0 ? Statistical significance?

References Rees D.G., Essential Statistics, Chapman & Hall/CRC, Cohen, L., Holliday, M., Practical Statistics for Students, Chapman,