Presentation is loading. Please wait.

Presentation is loading. Please wait.

STAT 312 Introduction Z-Tests and Confidence Intervals for a

Similar presentations


Presentation on theme: "STAT 312 Introduction Z-Tests and Confidence Intervals for a"— Presentation transcript:

1 STAT 312 Introduction 9.1 - Z-Tests and Confidence Intervals for a
Chapter 9 - Inferences Based on Two Samples Introduction 9.1 - Z-Tests and Confidence Intervals for a Difference Between Two Population Means 9.2 - The Two-Sample T-Test and Confidence Interval 9.3 - Analysis of Paired Data 9.4 - Inferences Concerning a Difference Between Population Proportions 9.5 - Inferences Concerning Two Population Variances (Categorical Data)

2 Binary Response: P(Success) = 
“Test of Independence” “Test of Homogeneity” POPULATION TWO POPULATIONS Two random binary variables I and J Random binary variable I “Do you like olives?” “Do you like anchovies?” 1 = P(Yes to olives) 2 = P(Yes to anchovies) “Do you like Brussel sprouts?”  = P(Yes to Brussel sprouts) Null Hypothesis H0: 1 = 2 “No association exists between liking olives and anchovies.” Null Hypothesis H0: 1 = 2 “No difference in liking Brussel sprouts between two pops.” Alternative Hypothesis HA: 1 ≠ 2 “An association exists between liking olives and anchovies.” Alternative Hypothesis HA: 1 ≠ 2 “There is a difference in liking Brussel sprouts bet two pops.”

3 Binary Response: P(Success) = 
“Test of Independence” “Test of Homogeneity” POPULATION TWO POPULATIONS Two random binary variables I and J Random binary variable I “Do you like olives?” “Do you like anchovies?” 1 = P(Yes to olives) 2 = P(Yes to anchovies) “Do you like Brussel sprouts?”  = P(Yes to Brussel sprouts) Sample, size n1 Sample, size n2 Sample, size n1 Sample, size n2 (Assume “large” sample sizes.)

4 Sampling Distribution of
Sample 1, size n1 Sample 2, size n2 X1 = # Successes X2 = # Successes Problem: s.e. depends on  !! Sampling Distribution of Recall… If n  15 and n (1 –  )  15, then via the Normal Approximation to the Binomial… If n  15 and n (1 –  )  15, then via the Normal Approximation to the Binomial… Solution: Use

5 Sampling Distribution of
Sample 1, size n1 Sample 2, size n2 X1 = # Successes X2 = # Successes If n11  15 and n1 (1 – 1 )  15, then via Normal Approximation to the Binomial If n22  15 and n2 (1 – 2 )  15, then via Normal Approximation to the Binomial Sampling Distribution of Recall from section 4.1 (Discrete Models): Mean(X – Y) = Mean(X) – Mean(Y) and if X and Y are independent… Var(X – Y) = Var(X) + Var(Y)

6 Sampling Distribution of
Sample 1, size n1 Sample 2, size n2 X1 = # Successes X2 = # Successes Sampling Distribution of “Null Distribution” standard error Similar problem as “one proportion” inference s.e.! = 0 under H0 For confidence interval, replace 1 and 2 respectively, by For critical region and p-value, replace 1 and 2 respectively, by….. ???? Null Hypothesis H0: 1 = 2 standard error estimate …so replace their common value by a “pooled” estimate.

7 Example: Two Proportions (of “Success”)
Study Question: “Is there an association between liking Bruce Willis movies and gender, or not?”

8 Test of Homogeneity or Independence?
Example: Two Proportions (of “Success”) Study Question: “Is there an association between liking Bruce Willis movies and gender, or not?” Test of Homogeneity or Independence? Test of Homogeneity (between two populations) Design: Randomly select two large samples of males and females, and record their binary responses (Yes = 1, No = 0) to the question “Do you like Bruce Willis movies?” Let the discrete random variable X = “# Successes” (i.e., “Yes” responses) in each gender of the samples, and use these data to test… Null Hypothesis H0: P(“Yes” among Males) = P(“Yes” among Females), i.e., H0: π1 = π where π = P(Success) in each gender population. “No association exists.” π1 – π2 = 0, Data: Sample 1) n1 = 60 males, X1 = Sample 2) n2 = 40 females, X2 = 16 Analysis via Z-test: Point estimates NOTE: This is > 0. Therefore, REJECT H0 Interpretation: A significant association exists at the .05 level between “liking Bruce Willis movies” and gender, with males showing a 30% preference over females, on average.

9 Test of Homogeneity or Independence
Example: Two Proportions (of “Success”) Study Question: “Is there an association between liking Bruce Willis movies and gender, or not?” Test of Homogeneity or Independence Design: Randomly select two large samples of males and females, and record their binary responses (Yes = 1, No = 0) to the question “Do you like Bruce Willis movies?” Let the discrete random variable X = “# Successes” (i.e., “Yes” responses) in each gender of the samples, and use these data to test… Null Hypothesis H0: P(“Yes” among Males) = P(“Yes” among Females), i.e., H0: π1 = π where π = P(Success) in each gender population. “No association exists.” π1 – π2 = 0, Data: Sample 1) n1 = 60 males, X1 = Sample 2) n2 = 40 females, X2 = 16 Analysis via Z-test: Point estimates NOTE: This is > 0. Therefore, REJECT H0 Interpretation: A significant association exists at the .05 level; “liking Bruce Willis movies” and gender are dependent, with males showing a 30% preference over females, on average.

10 Binary Response: P(Success) = 
“Test of Independence” “Test of Homogeneity” POPULATION TWO POPULATIONS Two random binary variables I and J Random binary variable I “Do you like olives?” “Do you like anchovies?” Males Females 1 = P(Yes to olives) 2 = P(Yes to anchovies) “Do you like Bruce Willis?”  = P(Yes to Bruce Willis) Null Hypothesis H0: 1 = 2 “No association exists between liking olives and anchovies.” Null Hypothesis H0: 1 = 2 “No difference in liking Bruce Willis between two pops.” Alternative Hypothesis HA: 1 ≠ 2 “An association exists between liking olives and anchovies.” Alternative Hypothesis HA: 1 ≠ 2 “There is a difference in liking Bruce Willis betw two pops.”

11 Binary Response: P(Success) = 
“Test of Independence” “Test of Homogeneity” POPULATION TWO POPULATIONS Two random binary variables I and J Random binary variable I “Gender: Male?” “Do you like Bruce Willis?” Males Females 1 = P(Yes to Male) 2 = P(Yes to Bruce) “Do you like Bruce Willis?”  = P(Yes to Bruce Willis) Null Hypothesis H0: 1 = 2 “No association exists between gender and liking Bruce.” Null Hypothesis H0: 1 = 2 “No difference in liking Bruce Willis between two pops.” Alternative Hypothesis HA: 1 ≠ 2 “An association exists between gender and liking Bruce.” Alternative Hypothesis HA: 1 ≠ 2 “There is a difference in liking Bruce Willis betw two pops.”

12 Example: Two Proportions (of “Success”)
Study Question: “Is there an association between liking Bruce Willis movies and gender, or not?” Design: Randomly select two large samples of males and females, and record their binary responses (Yes = 1, No = 0) to the question “Do you like Bruce Willis movies?” Let the discrete random variable X = “# Successes” (i.e., “Yes” responses) in each gender of the samples, and use these data to test… Null Hypothesis H0: P(“Yes” among Males) = P(“Yes” among Females), i.e., H0: π1 = π where π = P(Success) in each gender population. “No association exists.” π1 – π2 = 0, Data: Sample 1) n1 = 60 males, X1 = Sample 2) n2 = 40 females, X2 = 16 ~ ALTERNATE METHOD ~

13 Example: Two Proportions (of “Success”)
Study Question: “Is there an association between liking Bruce Willis movies and gender, or not?” Design: Randomly select two large samples of males and females, and record their binary responses (Yes = 1, No = 0) to the question “Do you like Bruce Willis movies?” Let the discrete random variable X = “# Successes” (i.e., “Yes” responses) in each gender of the samples, and use these data to test… Null Hypothesis H0: P(“Yes” among Males) = P(“Yes” among Females), i.e., H0: π1 = π where π = P(Success) in each gender population. “No association exists.” π1 – π2 = 0, Data: Sample 1) n1 = 60 males, X1 = Sample 2) n2 = 40 females, X2 = 16 Observed Males Females Yes 42 16 58 No 18 24 60 40 100 Males Females Yes 42 16 No 60 40 Expected (under H0) Males Females Yes E11 = ? E12 = ? 58 No E21 = ? E22 = ? 42 60 40 100

14 Recall Probability Tables from Chapter 3….
J = 1 J = 2 I = 1 π11 π12 π11 + π12 I = 2 π21 π22 π21 + π22 π11 + π21 π12 + π22 1 Under the null hypothesis, the binary variable I is statistically independent of the binary variable J, i.e., P(I ∩ J) = P(I) P(J).

15  Recall Probability Tables from Chapter 3…. J = 1 J = 2 I = 1 π11 π12
π11 + π12 I = 2 π21 π22 π21 + π22 π11 + π21 π12 + π22 1 Under the null hypothesis, the binary variable I is statistically independent of the binary variable J, e.g., P(“I = 1” ∩ “J = 1”) = P(“I = 1”) P(“J = 1”). Contingency Table Probability Table J = 1 J = 2 I = 1 E11 E12 R1 I = 2 E21 E22 R2 C1 C2 n J = 1 J = 2 I = 1 E11/n E12/n R1/n I = 2 E21/n E22/n R2/n C1/n C2/n 1 Therefore… , etc.

16 “Chi-squared” Test Statistic
Example: Two Proportions (of “Success”) Study Question: “Is there an association between liking Bruce Willis movies and gender, or not?” Design: Randomly select two large samples of males and females, and record their binary responses (Yes = 1, No = 0) to the question “Do you like Bruce Willis movies?” Let the discrete random variable X = “# Successes” (i.e., “Yes” responses) in each gender of the samples, and use these data to test… Null Hypothesis H0: P(“Yes” among Males) = P(“Yes” among Females), i.e., H0: π1 = π where π = P(Success) in each gender population. “No association exists.” π1 – π2 = 0, Data: Sample 1) n1 = 60 males, X1 = Sample 2) n2 = 40 females, X2 = 16 Observed Males Females Yes 42 16 No 60 40 Males Females Yes 42 16 58 No 18 24 60 40 100 “Chi-squared” Test Statistic Expected (under H0) Check: Is the null hypothesis true? Males Females Yes E11 = ? E12 = ? 58 No E21 = ? E22 = ? 42 60 40 100 34.8 23.2 where “degrees of freedom” df = (# rows – 1)(# cols – 1), = 1 for a 2  2 table. 25.2 16.8

17 “Chi-squared” Test Statistic
Example: Two Proportions (of “Success”) Study Question: “Is there an association between liking Bruce Willis movies and gender, or not?” Design: Randomly select two large samples of males and females, and record their binary responses (Yes = 1, No = 0) to the question “Do you like Bruce Willis movies?” Expected (under H0) Observed Males Females Yes 42 16 58 No 18 24 60 40 100 Males Females Yes 34.8 23.2 58 No 25.2 16.8 42 60 40 100 “Chi-squared” Test Statistic p = ????? = on 1 df

18 The actual p-value = .0029, the same as that found using the Z-test!
Because is much greater than the α =.05 critical value of 3.841, it follows that p << .05. More precisely, < < 9.141; hence < p < .005. α =.05 Yes = c(42, 16) No = c(18, 24) Bruce = rbind(Yes, No) chisq.test(Bruce, correct = F) The actual p-value = .0029, the same as that found using the Z-test! Pearson's Chi-squared test data: Bruce X-squared = 8.867, df = 1, p-value =

19 “Chi-squared” Test Statistic
Example: Two Proportions (of “Success”) Study Question: “Is there an association between liking Bruce Willis movies and gender, or not?” Design: Randomly select two large samples of males and females, and record their binary responses (Yes = 1, No = 0) to the question “Do you like Bruce Willis movies?” Expected (under H0) Observed Males Females Yes 42 16 58 No 18 24 60 40 100 Males Females Yes 34.8 23.2 58 No 25.2 16.8 42 60 40 100 “Chi-squared” Test Statistic The α =.05 critical value is p = .0029 Recall… = on 1 df

20 Example: Two Proportions (of “Success”)
Study Question: “Is there an association between liking Bruce Willis movies and gender, or not?” Design: Randomly select two large samples of males and females, and record their binary responses (Yes = 1, No = 0) to the question “Do you like Bruce Willis movies?” Let the discrete random variable X = “# Successes” (i.e., “Yes” responses) in each gender of the samples, and use these data to test… Null Hypothesis H0: P(“Yes” in Male population) = P(“Yes” in Female population), i.e., H0: π1 = π where π = P(Success) in each gender population. “No association exists.” π1 – π2 = 0, Data: Sample 1) n1 = 60 males, X1 = Sample 2) n2 = 40 females, X2 = 16 Analysis via Z-test: Point estimates NOTE: This is > 0. Therefore, REJECT H0 Interpretation: A significant association exists at the .05 level; “liking Bruce Willis movies” and gender are dependent, with males showing a 30% preference over females, on average.

21 “Chi-squared” Test Statistic
Example: Two Proportions (of “Success”) Study Question: “Is there an association between liking Bruce Willis movies and gender, or not?” Design: Randomly select two large samples of males and females, and record their binary responses (Yes = 1, No = 0) to the question “Do you like Bruce Willis movies?” Expected (under H0) Observed Males Females Yes 42 16 58 No 18 24 60 40 100 Males Females Yes 34.8 23.2 58 No 25.2 16.8 42 60 40 100 “Chi-squared” Test Statistic The α =.05 critical value is p = .0029 NOTE: (Z-score)2 = (2.9775)2 NOTE: (Z-score)2 = (2.9775)2 Connection between Z-test and Chi-squared test ! = on 1 df = on 1 df Connection between Z-test and Chi-squared test !

22

23

24

25

26

27

28

29

30

31

32

33

34 STAT 312 Categorical Data Analysis Introduction
Chapter 14 - Goodness-of-Fit Tests and Categorical Data Analysis Introduction Goodness-of-Fit Tests When Category Probabilities Are Completely Specified Goodness-of-Fit Tests for Composite Hypotheses Two-Way Contingency Tables

35 “Chi-squared” Test Statistic for Categorical Data
“degrees of freedom” df = (# rows – 1)(# cols – 1) “Chi-squared” Test Statistic for Categorical Data 2  2 Chi-squared Test is only valid if: Null Hypothesis H0: 1 – 2 = 0. One-sided or nonzero null value  Z-test! Expected Values  5, in order to avoid “spurious significance” due to a possibly inflated Chi-squared value. Use “Fisher’s Exact Test” otherwise. Paired version of 2  2 Chi-squared Test = “McNemar’s Test” Categorical data – contingency table with any number of rows and columns Formal Null Hypothesis difficult to write mathematically in terms of 1, 2,… “Test of Independence” “Test of Homogeneity” “Goodness-of-Fit” Test Informal H0: “No association exists between rows and columns.” 80% of Expected Values  5. Can use “Fisher’s Exact Test” otherwise. CATEGORIES MUST MAKE SENSE! BEWARE OF OVER-LUMPING / OVER-SPLITTING!

36 “Goodness-of-Fit” Test
Categorical data – contingency table with any number of rows and columns “Goodness-of-Fit” Test

37 “Goodness-of-Fit” Test (Numerical: Continuous)
Categorical data – contingency table with any number of rows and columns “Goodness-of-Fit” Test (Numerical: Continuous) Let X = “Hours of sleep before Final Exam” in a population of Stat 312 students. Random sample of n = 425 students; X measured for each student. > x

38 “Goodness-of-Fit” Test (Numerical: Continuous)
Categorical data – contingency table with any number of rows and columns “Goodness-of-Fit” Test (Numerical: Continuous) Let X = “Hours of sleep before Final Exam” in a population of Stat 312 students. Random sample of n = 425 students; X measured for each student. Null Hypotheses where Pop 1 = Score  90 Pop 2 = Score < 90 But suppose H0: “All students get the same amount of sleep.” Meaningless, since there are not 425 populations, only one value of X for each student in the sample!

39 “Goodness-of-Fit” Test (Numerical: Continuous)
Categorical data – contingency table with any number of rows and columns “Goodness-of-Fit” Test (Numerical: Continuous) Let X = “Hours of sleep before Final Exam” in a population of Stat 312 students. Random sample of n = 425 students; X measured for each student. Null Hypotheses: where Pop 1 = Score  90 Pop 2 = Score < 90 But suppose H0: “All students get the same amount of sleep.” I = 1 2 3 4 5 6 Categorical (but not binary) X < 4 4  X < 5 5  X < 6 6  X < 7 7  X < 8 X  8 425 Observed 65 97 70 69 55 Expected ? 425

40 “Goodness-of-Fit” Test (Numerical: Continuous)
Categorical data – contingency table with any number of rows and columns “Goodness-of-Fit” Test (Numerical: Continuous) H0: “All students get the same amount of sleep.” Categorical (but not binary) X < 4 4  X < 5 5  X < 6 6  X < 7 7  X < 8 X  8 65 97 70 69 55 425 Observed Expected ? 425

41 “Goodness-of-Fit” Test (Numerical: Continuous)
Categorical data – contingency table with any number of rows and columns “Goodness-of-Fit” Test (Numerical: Continuous) H0: “All students get the same amount of sleep.” Categorical (but not binary) X < 4 4  X < 5 5  X < 6 6  X < 7 7  X < 8 X  8 65 97 70 69 55 425 Observed Expected 425 In this case, k = 6, so df = 6 – 1 = 5.

42 “Goodness-of-Fit” Test (Numerical: Continuous)
Categorical data – contingency table with any number of rows and columns “Goodness-of-Fit” Test (Numerical: Continuous) REJECT H0 - Not all students get the same amount of sleep! H0: “All students get the same amount of sleep.” More precisely, there is a statistically significant difference between the proportion of students in at least one of these sleep categories, and the others. Categorical (but not binary) X < 4 4  X < 5 5  X < 6 6  X < 7 7  X < 8 X  8 65 97 70 69 55 425 Observed Expected 425 p > pchisq( , 5, lower.tail = F) [1]

43 > obs = c(65, 97, 70, 69, 69, 55) > chisq.test(obs, correct = F) Chi-squared test for given probabilities data: obs X-squared = , df = 5, p-value = > G.of.Fit = chisq.test(x, correct = F) > G.of.Fit$obs [1] > G.of.Fit$exp [1] > G.of.Fit$stat X-squared > G.of.Fit$p.value [1] > obs = c(65, 97, 70, 69, 69, 55) > chisq.test(obs, correct = F) Chi-squared test for given probabilities data: obs X-squared = , df = 5, p-value =

44 (Numerical: Continuous)
What if…? Categorical data – contingency table with any number of rows and columns (Numerical: Continuous) Let X = “Hours of sleep before Final Exam” in a population of Stat 312 students. Random sample of n = 425 students; X measured for each student. Is there any association between “Number of hours of sleep” and “Final Exam score?” H0: “All students get the same amount of sleep.” Pop 1 = Score  90 Pop 2 = Score < 90 H0: “No such association exists.” Null Hypothesis Observed X < 4 4  X < 5 5  X < 6 6  X < 7 7  X < 8 X  8 65 97 70 69 55 425

45 (Numerical: Continuous)
What if…? Categorical data – contingency table with any number of rows and columns (Numerical: Continuous) Let X = “Hours of sleep before Final Exam” in a population of Stat 312 students. Random sample of n = 425 students; X measured for each student. Is there any association between “Number of hours of sleep” and “Final Exam score?” Pop 1 = Score  90 Pop 2 = Score < 90 H0: “No such association exists.” Null Hypothesis Observed Score X < 4 4  X < 5 5  X < 6 6  X < 7 7  X < 8 X  8  90 < 90 65 97 70 69 55 425 X < 4 4  X < 5 5  X < 6 6  X < 7 7  X < 8 X  8 65 97 70 69 55 425 Expected 2 “Test of Homogeneity” (between two populations)

46 (Numerical: Continuous)
What if…? Categorical data – contingency table with any number of rows and columns (Numerical: Continuous) Let X = “Hours of sleep before Final Exam” in a population of Stat 312 students. Random sample of n = 425 students; X measured for each student. Is there any association between “Number of hours of sleep” and “Final Exam score?” H0: “No such association exists.” Null Hypothesis Let Y = “Final Exam score”

47 Scatterplot, n = 425

48 Correlation and Regression
Scatterplot, n = 425 Correlation and Regression

49 (Numerical: Continuous)
What if…? Categorical data – contingency table with any number of rows and columns (Numerical: Continuous) Let X = “Hours of sleep before Final Exam” in a population of Stat 312 students. Random sample of n = 425 students; X measured for each student. Pop 1 = Score  90 Pop 2 = Score < 90 Null Hypothesis Is there any significant difference in mean number of hours of sleep between students with Score  90 and Score < 90? T-test!

50 Back to… Categorical data – contingency table with any number of rows and columns “Goodness-of-Fit” Test (Numerical: Continuous) Let X = “Hours of sleep before Final Exam” in a population of Stat 312 students. Random sample of n = 425 students; X measured for each student. Null Hypothesis H0: “All students get the same amount of sleep.” One way to interpret this is that the Uniform Distribution is a good fit to the data. But what about fitting other models? X < 4 4  X < 5 5  X < 6 6  X < 7 7  X < 8 X  8 65 97 70 69 55 425 Observed Expected 425

51 “Goodness-of-Fit” Test (Numerical: Continuous)
Back to… Categorical data – contingency table with any number of rows and columns “Goodness-of-Fit” Test (Numerical: Continuous) Let X = “Hours of sleep before Final Exam” in a population of Stat 312 students. Random sample of n = 425 students; X measured for each student. Null Hypothesis X < 4 4  X < 5 5  X < 6 6  X < 7 7  X < 8 X  8 65 97 70 69 55 425 Observed Expected ? 425

52 “Goodness-of-Fit” Test (Numerical: Continuous)
Back to… Categorical data – contingency table with any number of rows and columns “Goodness-of-Fit” Test (Numerical: Continuous) Let X = “Hours of sleep before Final Exam” in a population of Stat 312 students. Random sample of n = 425 students; X measured for each student. Null Hypothesis > obs = c(65, 97, 70, 69, 69, 55) > H0 = c(.20,.30,.20,.15,.10,.05) > chisq.test(obs, correct = F, p = H0) Chi-squared test for given probabilities data: obs X-squared = , df = 5, p-value < 2.2e-16 X < 4 4  X < 5 5  X < 6 6  X < 7 7  X < 8 X  8 65 97 70 69 55 425 Observed Expected 85.00 127.50 63.75 42.50 21.25 425

53 “Goodness-of-Fit” Test (Numerical: Continuous)
Back to… Categorical data – contingency table with any number of rows and columns “Goodness-of-Fit” Test (Numerical: Continuous) Let X = “Hours of sleep before Final Exam” in a population of Stat 312 students. Random sample of n = 425 students; X measured for each student. Null Hypothesis (new model) H0: “These sample proportions come from a normally-distributed population.” > c(mean(obs), sd(obs)) [1] X < 4 4  X < 5 5  X < 6 6  X < 7 7  X < 8 X  8 65 97 70 69 55 425 Observed Expected 425

54 “Goodness-of-Fit” Test (Numerical: Continuous)
Back to… Categorical data – contingency table with any number of rows and columns “Goodness-of-Fit” Test (Numerical: Continuous) Let X = “Hours of sleep before Final Exam” in a population of Stat 312 students. Random sample of n = 425 students; X measured for each student. Null Hypothesis (new model) H0: “These sample proportions come from a normally-distributed population.” For each bin, calculate the corresponding normal probabilities: (pnorm) Multiply each probability by n = 425 to obtain the expected values… X < 4 4  X < 5 5  X < 6 6  X < 7 7  X < 8 X  8 65 97 70 69 55 425 Observed Expected 425

55 Back to… Categorical data – contingency table with any number of rows and columns “Goodness-of-Fit” Test (Numerical: Continuous) Let X = “Hours of sleep before Final Exam” in a population of Stat 312 students. Random sample of n = 425 students; X measured for each student. Null Hypothesis (new model) H0: “These sample proportions come from a normally-distributed population.” But what if the categories (i.e., bins) are unspecified? In practice, the bins are often chosen to have equal areas, to make the probability of randomly selecting any one of them equally likely (in this case, 1/6). X < 4 4  X < 5 5  X < 6 6  X < 7 7  X < 8 X  8 65 97 70 69 55 425 Observed Expected 425

56 plot(dnorm, from=-3, to=3, lwd = 2)
lines(c(-3,3), c(0,0)) for (i in 1:5) { z = qnorm(i/6) print(z) y = dnorm(z) points(z,0, pch=19, col = "red") lines(c(z, z), c(0, y), lty = 2) }

57 Back to… Categorical data – contingency table with any number of rows and columns “Goodness-of-Fit” Test (Numerical: Continuous) Let X = “Hours of sleep before Final Exam” in a population of Stat 312 students. Random sample of n = 425 students; X measured for each student. Null Hypothesis (new model) H0: “These sample proportions come from a normally-distributed population.” But what if the categories (i.e., bins) are unspecified? In practice, the bins are often chosen to have equal areas, to make the probability of randomly selecting any one of them equally likely (in this case, 1/6). Obs Exp O1 O2 O3 O4 O5 O6 425 425


Download ppt "STAT 312 Introduction Z-Tests and Confidence Intervals for a"

Similar presentations


Ads by Google