5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population.

Slides:



Advertisements
Similar presentations
4/4/2015Slide 1 SOLVING THE PROBLEM A one-sample t-test of a population mean requires that the variable be quantitative. A one-sample test of a population.
Advertisements

Independent t -test Features: One Independent Variable Two Groups, or Levels of the Independent Variable Independent Samples (Between-Groups): the two.
SW388R6 Data Analysis and Computers I Slide 1 Paired-Samples T-Test of Population Mean Differences Key Points about Statistical Test Sample Homework Problem.
One-sample T-Test of a Population Mean
Strategy for Complete Regression Analysis
Assumption of normality
Detecting univariate outliers Detecting multivariate outliers
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
Chi-square Test of Independence
Multiple Regression – Assumptions and Outliers
Multiple Regression – Basic Relationships
8/2/2015Slide 1 SPSS does not calculate confidence intervals for proportions. The Excel spreadsheet that I used to calculate the proportions can be downloaded.
Hypothesis Testing Using The One-Sample t-Test
Hypothesis Testing: Two Sample Test for Means and Proportions
Assumption of Homoscedasticity
SW388R6 Data Analysis and Computers I Slide 1 One-sample T-test of a Population Mean Confidence Intervals for a Population Mean.
8/9/2015Slide 1 The standard deviation statistic is challenging to present to our audiences. Statisticians often resort to the “empirical rule” to describe.
SW388R7 Data Analysis & Computers II Slide 1 Assumption of normality Transformations Assumption of normality script Practice problems.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Basic Relationships Purpose of multiple regression Different types of multiple regression.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Split Sample Validation General criteria for split sample validation Sample problems.
SW388R6 Data Analysis and Computers I Slide 1 Chi-square Test of Goodness-of-Fit Key Points for the Statistical Test Sample Homework Problem Solving the.
8/15/2015Slide 1 The only legitimate mathematical operation that we can use with a variable that we treat as categorical is to count the number of cases.
Stepwise Binary Logistic Regression
Sampling Distribution of the Mean Problem - 1
SW318 Social Work Statistics Slide 1 Estimation Practice Problem – 1 This question asks about the best estimate of the mean for the population. Recall.
Slide 1 SOLVING THE HOMEWORK PROBLEMS Simple linear regression is an appropriate model of the relationship between two quantitative variables provided.
8/20/2015Slide 1 SOLVING THE PROBLEM The two-sample t-test compare the means for two groups on a single variable. the The paired t-test compares the means.
SW388R7 Data Analysis & Computers II Slide 1 Logistic Regression – Hierarchical Entry of Variables Sample Problem Steps in Solving Problems.
8/23/2015Slide 1 The introductory statement in the question indicates: The data set to use: GSS2000R.SAV The task to accomplish: a one-sample test of a.
Inferential Statistics: SPSS
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
SW388R7 Data Analysis & Computers II Slide 1 Assumption of Homoscedasticity Homoscedasticity (aka homogeneity or uniformity of variance) Transformations.
9/18/2015Slide 1 The homework problems on comparing central tendency and variability extend the focus central tendency and variability to a comparison.
Hierarchical Binary Logistic Regression
9/23/2015Slide 1 Published reports of research usually contain a section which describes key characteristics of the sample included in the study. The “key”
Week 8 Chapter 8 - Hypothesis Testing I: The One-Sample Case.
Independent Samples t-Test (or 2-Sample t-Test)
SW388R6 Data Analysis and Computers I Slide 1 Central Tendency and Variability Sample Homework Problem Solving the Problem with SPSS Logic for Central.
Chi-Square Test of Independence Practice Problem – 1
Stepwise Multiple Regression
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
110/10/2015Slide 1 The homework problems on comparing central tendency and variability extend our focus on central tendency and variability to a comparison.
Slide 1 SOLVING THE HOMEWORK PROBLEMS Pearson's r correlation coefficient measures the strength of the linear relationship between the distributions of.
Hypothesis Testing A procedure for determining which of two (or more) mutually exclusive statements is more likely true We classify hypothesis tests in.
Slide 1 Hierarchical Multiple Regression. Slide 2 Differences between standard and hierarchical multiple regression  Standard multiple regression is.
6/2/2016Slide 1 To extend the comparison of population means beyond the two groups tested by the independent samples t-test, we use a one-way analysis.
SW388R6 Data Analysis and Computers I Slide 1 Independent Samples T-Test of Population Means Key Points about Statistical Test Sample Homework Problem.
SW388R7 Data Analysis & Computers II Slide 1 Hierarchical Multiple Regression Differences between hierarchical and standard multiple regression Sample.
6/4/2016Slide 1 The one sample t-test compares two values for the population mean of a single variable. The two-sample t-test of population means (aka.
SW388R6 Data Analysis and Computers I Slide 1 Multiple Regression Key Points about Multiple Regression Sample Homework Problem Solving the Problem with.
11/4/2015Slide 1 SOLVING THE PROBLEM Simple linear regression is an appropriate model of the relationship between two quantitative variables provided the.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Within Subjects Analysis of Variance PowerPoint.
11/16/2015Slide 1 We will use a two-sample test of proportions to test whether or not there are group differences in the proportions of cases that have.
Slide 1 The introductory statement in the question indicates: The data set to use (2001WorldFactBook) The task to accomplish (association between variables)
11/19/2015Slide 1 We can test the relationship between a quantitative dependent variable and two categorical independent variables with a two-factor analysis.
Chi-square Test of Independence
SW318 Social Work Statistics Slide 1 One-way Analysis of Variance  1. Satisfy level of measurement requirements  Dependent variable is interval (ordinal)
SW388R6 Data Analysis and Computers I Slide 1 One-way Analysis of Variance and Post Hoc Tests Key Points about Statistical Test Sample Homework Problem.
SW318 Social Work Statistics Slide 1 Percentile Practice Problem (1) This question asks you to use percentile for the variable [marital]. Recall that the.
SW388R6 Data Analysis and Computers I Slide 1 Percentiles and Standard Scores Sample Percentile Homework Problem Solving the Percentile Problem with SPSS.
SW388R7 Data Analysis & Computers II Slide 1 Detecting Outliers Detecting univariate outliers Detecting multivariate outliers.
12/23/2015Slide 1 The chi-square test of independence is one of the most frequently used hypothesis tests in the social sciences because it can be used.
1/5/2016Slide 1 We will use a one-sample test of proportions to test whether or not our sample proportion supports the population proportion from which.
1/11/2016Slide 1 Extending the relationships found in linear regression to a population is procedurally similar to what we have done for t-tests and chi-square.
Analysis of Variance STAT E-150 Statistical Methods.
2/24/2016Slide 1 The standard deviation statistic is challenging to present to our audiences. Statisticians often resort to the “empirical rule” to describe.
(Slides not created solely by me – the internet is a wonderful tool) SW388R7 Data Analysis & Compute rs II Slide 1.
When the means of two groups are to be compared (where each group consists of subjects that are not related) then the excel two-sample t-test procedure.
Assumption of normality
Multiple Regression – Split Sample Validation
Presentation transcript:

5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population means compares the population means for two groups of subjects on a single variable. The null hypothesis for this test is: there is no difference between the population mean of the variable for one group of subjects and the population mean of the same variable for a second group of subjects. In addition to our concern with the assumption of normality for each group and the number of cases in each group if we are to apply the Central Limit Theorem, but this test also requires us to examine the spread or dispersion of both groups so that the measure of standard error used in the t-test fairly represents both group. While there is a test of Equality of Variance and a formula to use when the test is satisfied and a formula to use when the test is violated, the authors of our text suggest we always use the formula that assumes the test is violated. If we use this version of the statistic when the variances are in fact equal, the results of the test are comparable to what we would obtain using the formula for equal variances. We will the authors advice and restrict our attention to the “Equal variances not assumed” row in the SPSS output table without examining the Levene test of equality of variance.

5/15/2015Slide 2 The introductory statement in the question indicates: The data set to use (GSS2000R) The variables to use in the analysis: socioeconomic index [sei] for groups of survey respondents defined by the variable sex [sex] The task to accomplish (two-sample t-test for the difference between sample means) The level of significance (0.05, two-tailed)

5/15/2015Slide 3 The first statement asks about the level of measurement. A two-sample t-test for the difference between sample means requires a quantitative dependent variable and a dichotomous independent variable.

5/15/2015Slide 4 "Socioeconomic index" [sei] is quantitative, satisfying the level of measurement requirement for the dependent variable. "Sex" [sex] is dichotomous, satisfying the level of measurement requirement for the independent variable. Mark the statement as correct.

5/15/2015Slide 5 A two-sample t-test for the difference between sample means requires that the distribution of the variable satisfy the nearly normal condition for both groups. We will operationally define the nearly normal condition as having skewness and kurtosis between -1.0 and +1.0 for both groups, and not having any outliers with standard scores equal to or smaller than -3.0 or equal to or larger than +3.0 in the distribution of scores for either group. To justify the use of probabilities based on a normal sampling distribution in testing hypotheses, either the distribution of the variable must satisfy the nearly normal condition or the size of the sample must be sufficiently large to generate a normal sampling distribution under the Central Limit Theorem.

5/15/2015Slide 6 To evaluate the variables conformity to the nearly normal condition, we will use descriptive statistics and standard scores. We will first compute the standard scores. To compute the standard scores, select the Descriptive Statistics > Descriptives command from the Analyze menu.

5/15/2015Slide 7 First, move the variable for the analysis sei to the Variable(s) list box. Third, click on the OK button to produce the output. Second, mark the check box Save standardized values as variables.

5/15/2015Slide 8 There were no outliers that had a standard score less than or equal to Sort the column Zsei in ascending order to show any negative outliers at the top of the column.

5/15/2015Slide 9 There were no outliers that had a standard score greater than or equal to Sort the column Zsei in descending order to show any positive outliers at the top of the column.

5/15/2015Slide 10 Next, we will use the Explore procedure to generate descriptive statistics for each gender.. To compute the descriptive statistics, select the Descriptive Statistics > Explore command from the Analyze menu.

5/15/2015Slide 11 First, move the dependent variable sei to the Dependent List. Second, move the group variable sex to the Factor List. Third, mark the option button to display Statistics only. Fourth, click on the OK button to produce the output.

5/15/2015Slide 12 For survey respondents who were male, "socioeconomic index" satisfied the criteria for a normal distribution. The skewness of the distribution (0.539) was between -1.0 and +1.0 and the kurtosis of the distribution (-0.852) was between -1.0 and +1.0.

5/15/2015Slide 13 For survey respondents who were female, "socioeconomic index" satisfied the criteria for a normal distribution. The skewness of the distribution (0.610) was between -1.0 and +1.0 and the kurtosis of the distribution (-0.921) was between -1.0 and +1.0.

5/15/2015Slide 14 For survey respondents who were male, "socioeconomic index" satisfied the criteria for a normal distribution. The skewness of the distribution (0.539) was between -1.0 and +1.0 and the kurtosis of the distribution (-0.852) was between -1.0 and For survey respondents who were female, "socioeconomic index" satisfied the criteria for a normal distribution. The skewness of the distribution (0.610) was between -1.0 and +1.0 and the kurtosis of the distribution (-0.921) was between -1.0 and There were no outliers that had a standard score less than or equal to -3.0 or greater than or equal to Mark the statement as correct.

5/15/2015Slide 15 To apply the Central Limit Theorem for a two-sample t-test for the difference between sample means requires that both groups defined by the independent variable have 40 or more cases. Though we have satisfied the nearly normal condition and do not need to utilize the Central Limit Theorem to justify the use of probabilities based on the normal distribution, we will still examine the sample size.

5/15/2015Slide 16 There were 110 valid cases for survey respondents who were male and 145 valid cases for survey respondents who were female.

5/15/2015Slide 17 Both groups had 40 or more cases, so the Central Limit Theorem would be applicable. However, since the distribution of "socioeconomic index" satisfied the nearly normal condition, we do not need to rely upon the Central Limit Theorem to satisfy the sampling distribution requirements of a two-sample t-test for the difference between sample means. Mark the statement as correct.

5/15/2015Slide 18 The next statement asks us to identify the mean for each group in the sample data and the standard error of the sampling distribution. To answer this question, we need to produce the output for the two-sample t-test.

5/15/2015Slide 19 To produce the two-sample t-test (which SPSS calls Independent- Samples T-Test), select the Compare Means > Independent Samples T Test command from the Analyze menu.

5/15/2015Slide 20 First, move the variable sei to the Test Variable(s) list box. Third, click on the Define Groups button to enter the group codes. Second, move the grouping variable sex to the text box. SPSS adds ?’s after the variable name to remind us that we need to specify the numeric codes for the groups.

5/15/2015Slide 21 First, enter 1 for males as Group 1. First, enter 2 for females as Group 2. Third, click on the Continue button to close the dialog box. If I did not remember the code numbers for male and female, I would look them up in the Variable View of the SPSS Data Editor.

5/15/2015Slide 22 Third, click on the OK button to produce the output. SPSS replaces the question marks with the codes I entered.

5/15/2015Slide 23 The mean "socioeconomic index" for survey respondents who were male was and the mean for survey respondents who were female was The standard error of the differences between group means was

5/15/2015Slide 24 The mean "socioeconomic index" for survey respondents who were male was and the mean for survey respondents who were female was The standard error of the differences between group means was Mark the question as correct.

5/15/2015Slide 25 The next statement asks us about the null hypothesis for the one-sample t-test. We should check to make certain the relationship is stated correctly.

5/15/2015Slide 26 The null hypothesis for the test is: there is no difference between the population mean of "socioeconomic index" for survey respondents who were male and the population mean of "socioeconomic index" for survey respondents who were female. Since the hypothesis is stated correctly, mark the question as correct.

5/15/2015Slide 27 The next statement asks us to relate the t-test to the data in our problem.

5/15/2015Slide 28 Following the convention in the text book, we will only focus on the “Equal variances not assumed” option. Within this option, the difference and standard error are correctly identified. The t-test statistic is based on the difference between the means of the two groups (2.777) relative to the standard error of the differences between sample means (2.446).

5/15/2015Slide 29 The statement is correct and contains the correct values for both the difference in means and the sampling error that we would typically expect to find in the sampling distribution for differences in means. Mark the statement as correct.

5/15/2015Slide 30 The next statement asks about the probability for the comparison made by the t-test. i.e. what is the probability that the population means for each group are not different. In the last question, the difference in means was only slightly larger than the standard error of the differences, so we should expect a ratio near one and a high value for the probability.

5/15/2015Slide 31 The probability that the population mean for survey respondents who were male (50.3) was not different from the population mean for survey respondents who were female (47.5) was p =.257.

5/15/2015Slide 32 The probability that the population mean for survey respondents who were male (50.3) was not different from the population mean for survey respondents who were female (47.5) was p =.257. Since the probability was correctly stated, mark the question as true.

5/15/2015Slide 33 When the p-value for the statistical test is less than or equal to alpha, we reject the null hypothesis and interpret the results of the test. If the p-value is greater than alpha, we fail to reject the null hypothesis and do not interpret the result.

5/15/2015Slide 34 The p-value for this test (p =.257) is larger than the alpha level of significance (p =.050) supporting the conclusion to fail to reject the null hypothesis. The check box is not marked.

5/15/2015Slide 35 The final statement asks us to interpret the result of our statistical test as a finding in the context of the problem we created. We only interpret the results when the null hypothesis is rejected.

5/15/2015Slide 36 If we had a significant p-value, we would have looked at the means of the two groups to identify the direction of the relationship.

5/15/2015Slide 37 Since we did not have a significant p-value, we cannot reject the null hypothesis and interpret the relationship. The check box is not marked.

5/15/2015Slide 38 Dependent variable is quantitative? Yes Do not mark check box. Mark statement check box. No Mark only “None of the above.” Stop. Independent variable is dichotomous? Yes No

5/15/2015Slide 39 Yes Nearly normal distribution? Do not mark check box. No Mark statement check box. Nearly normal: Skewness and kurtosis between -1.0 and +1.0 for both groups Z-scores between -3.0 and +3.0 CLT applicable (Sample size ≥ 40 in each group)? Yes Do not mark check box. Mark statement check box. Stop. CLT stands for Central Limit Theorem. No If the variable is not normal and the sample size is less than 40, the test is not appropriate.

5/15/2015Slide 40 Yes Nearly normal distribution? Do not mark check box. No Mark statement check box. Nearly normal: Skewness and kurtosis between -1.0 and +1.0 for both groups Z-scores between -3.0 and +3.0 CLT applicable (Sample size ≥ 40 in each group)? Yes Do not mark check box. Mark statement check box. Stop. CLT stands for Central Limit Theorem. No If the variable is not normal and the sample size is less than 40, the test is not appropriate. We will check the applicability of the Central Limit Theorem based on sample size, even when our data satisfies the nearly normal condition.

5/15/2015Slide 41 Yes Do not mark check box. No Mark statement check box. Sample means and standard error correct? Yes Do not mark check box. No Mark statement check box. H 0 : no difference between sample means

5/15/2015Slide 42 T-test accurately described? Yes Do not mark check box. Mark statement check box. No P-value (sig.) stated correctly? Yes Do not mark check box. Mark statement check box. No

5/15/2015Slide 43 Yes Do not mark check box. No Mark statement check box. Reject H 0 is correct decision (p ≤ alpha)? Stop. We interpret results only if we reject null hypothesis. Interpretation is stated correctly? Yes Do not mark check box. Mark statement check box. No