Research Methodology Statistics Maha Omair Teaching Assistant Department of Statistics, College of science King Saud University.

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

SADC Course in Statistics (Session 09)
Understanding Power By Jessica Jorge.
Chapter 17/18 Hypothesis Testing
CHAPTER EIGHT TESTS OF HYPOTHESES
HYPOTHESIS TESTING. Purpose The purpose of hypothesis testing is to help the researcher or administrator in reaching a decision concerning a population.
Chapter 7 Hypothesis Testing
Introduction to Hypothesis Testing
6. Statistical Inference: Example: Anorexia study Weight measured before and after period of treatment y i = weight at end – weight at beginning For n=17.
“Students” t-test.
Chapter 15 ANOVA.
Chapter 16 Inferential Statistics
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Hypothesis Testing making decisions using sample data.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
Decision Errors and Power
Fundamentals of Forensic DNA Typing Slides prepared by John M. Butler June 2009 Appendix 3 Probability and Statistics.
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
MARE 250 Dr. Jason Turner Hypothesis Testing II To ASSUME is to make an… Four assumptions for t-test hypothesis testing: 1. Random Samples 2. Independent.
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Elementary hypothesis testing Purpose of hypothesis testing Type of hypotheses Type of errors Critical regions Significant levels Hypothesis vs intervals.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Example 10.1 Experimenting with a New Pizza Style at the Pepperoni Pizza Restaurant Concepts in Hypothesis Testing.
Business Statistics - QBM117 Introduction to hypothesis testing.
1 Dr. Jerrell T. Stracener EMIS 7370 STAT 5340 Probability and Statistics for Scientists and Engineers Department of Engineering Management, Information.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
Chapter 8 Introduction to Hypothesis Testing
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample.
Chapter 8 Introduction to Hypothesis Testing
LECTURE 19 THURSDAY, 14 April STA 291 Spring
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Agresti/Franklin Statistics, 1 of 122 Chapter 8 Statistical inference: Significance Tests About Hypotheses Learn …. To use an inferential method called.
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Lecture 18 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Issues concerning the interpretation of statistical significance tests.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
1 Where we are going : a graphic: Hypothesis Testing. 1 2 Paired 2 or more Means Variances Proportions Categories Slopes Ho: / CI Samples Ho: / CI Ho:
© Copyright McGraw-Hill 2004
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Major Steps. 1.State the hypotheses.  Be sure to state both the null hypothesis and the alternative hypothesis, and identify which is the claim. H0H0.
Chapter 9: Hypothesis Tests for One Population Mean 9.2 Terms, Errors, and Hypotheses.
Hypothesis Testing. A statistical Test is defined by 1.Choosing a statistic (called the test statistic) 2.Dividing the range of possible values for the.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter Nine Hypothesis Testing.
One-Sample Tests of Hypothesis
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Hypothesis Testing: Hypotheses
One-Sample Tests of Hypothesis
CONCEPTS OF HYPOTHESIS TESTING
Chapter 6 Hypothesis tests.
P-value Approach for Test Conclusion
One-Sample Tests of Hypothesis
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

Research Methodology Statistics Maha Omair Teaching Assistant Department of Statistics, College of science King Saud University

Why we need statistical data analysis? Investigations in diverse fields like agriculture, medicine, physics, biology, chemistry etc. require collection of “observations”. Observations are almost always subject to random error. Hence statistical methods have to be employed to collect as well as to analyze the data.

Statistical data analysis Studying a problem through the use of statistical data analysis usually involves four basic steps: 1.Defining the problem. 2.Collecting the data. 3.Analyzing the data. 4.Conclusions and recommendations.

Defining the problem An exact definition of the problem is imperative in order to obtain accurate data about it. It is extremely difficult to gather data without a clear definition of the problem.

Collecting the data The three basic principles of experimental design are: 1.Randomization. 2.Replication. 3.Blocking.

Randomization Randomization is the cornerstone underlying the use of statistical methods in experimental design. By randomization we mean that both the allocation of the experimental material and the order in which individual runs or trials of the experiment are to be performed are randomly determined.

Minitab

Replication By replication we mean a repetition of the basic experiment. Replication has two important properties: 1.It allows the experimenter to obtain an estimate of the experimental error. 2.If the sample mean is used to estimate the effect of a factor in the experiment, then replication permits the experimenter to obtain a more precise estimate of this effect.

Without replication

With replication

Choice of sample size Why would we want to plan? 1.The larger the sample sizes are, the easier it is to detect or find differences in the means. 2.The larger the sample size is, the higher the “cost” and the more likely that practically unimportant differences are to be found statistically significant.

Basic Statistics Terms In order to determine the needed sample size we first must define some basic statistics terms. Null hypothesis Ho is a hypothesis that is presumed true until statistical evidence in the form of a hypothesis test indicates otherwise.hypothesis In formulating a particular null hypothesis, we are always also formulating an alternative hypothesis Ha, which we will accept if the observed data values are sufficiently improbable under the null hypothesis.

Definition of Type I and Type II errors Sometimes our decisions will be correct and sometimes not. There are two possible errors, which we will call Type I and Type II errors, respectively. A Type I error is the error of rejecting the null hypothesis when it is true. The probability of committing a Type I error is usually denoted by . A Type II error is the error of accepting the null hypothesis when it is false. The probability of making a Type II error is usually denoted by .

Type I and Type II errors

Planning to detect any important difference Let  = smallest difference range considered important by the researcher. Specify , , ,  and r use table A.10 (Applied linear statistical models by Neter, Wasserman and Kunter) to determine the needed sample size n (=n 1 =n 2 =…=n r ).

Planning to detect any important difference Example: Let  =3,  =0.1,  =0.05,  =2 and r=4  /  =1.5, Power=1-  =0.9 Need n=14 observations at each factor level. Need 14*4=56 homogeneous units.

Planning sample size to find the best treatment Let = important difference between any two adjacent means. r=number of factor level.  = standard deviation. Specify, ,  and r use table A.11 (Applied linear statistical models by Neter, Wasserman and Kunter) to get d=  n /  and solve for n.

Planning sample size to find the best treatment Example: Let =2,  =0.05,  =3 and r=5 1-  =0.95 d= n= (3.0552*3/2) 2 =  21 We need 21 observations at each of 5 levels we need 105 experimental units.

Blocking If the experimental units are not homogeneous, considerable improvement can be achieved by blocking (grouping) together units that are homogeneous. Example: Rats coming from the same litter.

Analyzing the data Your choice of statistical analysis should be made prior to conducting an experiment. There is little sense in collecting data that you can’t analyze properly. Use the following flow chart to help you decide which statistic to use.

P-value P-value is a measure of how much evidence we have against the null hypotheses. The smaller the p-value, the more evidence we have against H0. Traditionally, researchers will reject a hypothesis if the p-value is less than Sometimes, though, researchers will use a stricter cut-off (e.g., 0.01) or a more liberal cut-off (e.g., 0.10). The general rule is that a small p-value is evidence against the null hypothesis while a large p- value means little or no evidence against the null hypothesis. P-valueInterpretation P 0.01 very strong evidence against H P 0.05 moderate evidence against H P 0.10 suggestive evidence against H P little or no real evidence against H0

ANOVA Example: In a study on the effect of nitrogen fertilization on cereal crops, plots of a particular variety of wheat were randomly given fertilizer at on of four rates: 0, 50, 100, 150. At a certain date, plants were randomly selected from the plots and the plant height (in cm) was measured [based on Ghandorah(1985a)]. Can we conclude that all 4 fertilizer rates have equal effects on the average plant height?

ANOVA

Ho: all 4 fertilizer rates have the same effect on plant height. Ha: Some of the 4 fertilizer rates have different effects on plant height. P-value = 0 We conclude that all 4 fertilizer rates have different effects on the average plant height.

ANOVA From mean separation we can see that there is no significant difference between the effect of 50, 100 and 150 fertilizer rates on plant height. We can recommend to use the 50 fertilizer rate because it is coast effective.

Conclusions and recommendations Once the data has been analyzed, the experimenter may draw conclusions or inferences about the results. The statistical inference must be physically interpreted, and the practical significance of these findings evaluated. Then recommendations concerning these findings must be made. The use of graphical display is a very effective way to present experimental results.