Chapter 7: Statistical Applications in Traffic Engineering

Slides:



Advertisements
Similar presentations
Tests of Hypotheses Based on a Single Sample
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Hypothesis Testing Steps in Hypothesis Testing:
Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses Type I and Type II Errors Type I and Type II Errors.
1 1 Slide MA4704Gerry Golding Developing Null and Alternative Hypotheses Hypothesis testing can be used to determine whether Hypothesis testing can be.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
Significance Testing Chapter 13 Victor Katch Kinesiology.
Chapter 7(7b): Statistical Applications in Traffic Engineering Chapter objectives: By the end of these chapters the student will be able to (We spend 3.
Chapter Seventeen HYPOTHESIS TESTING
Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean.
Sample size computations Petter Mostad
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Hypothesis : Statement about a parameter Hypothesis testing : decision making procedure about the hypothesis Null hypothesis : the main hypothesis H 0.
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
Lec 6, Ch.5, pp90-105: Statistics (Objectives) Understand basic principles of statistics through reading these pages, especially… Know well about the normal.
Inference about a Mean Part II
Lec 7, Ch4, pp83-99: Spot Speed Studies (Objectives)
Chapter 2 Simple Comparative Experiments
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
IENG 486 Statistical Quality & Process Control
Inferences About Process Quality
Today Concepts underlying inferential statistics
BCOR 1020 Business Statistics
Inferential Statistics
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Chapter 9 Title and Outline 1 9 Tests of Hypotheses for a Single Sample 9-1 Hypothesis Testing Statistical Hypotheses Tests of Statistical.
Statistical Inference for Two Samples
AM Recitation 2/10/11.
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Overview Definition Hypothesis
Lecture 8 1 Hypothesis tests Hypothesis H 0 : Null-hypothesis is an conjecture which we assume is true until we have too much evidence against it. H 1.
Descriptive statistics Inferential statistics
1 1 Slide © 2005 Thomson/South-Western Chapter 9, Part A Hypothesis Tests Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses.
Fundamentals of Hypothesis Testing: One-Sample Tests
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Chapter 9.3 (323) A Test of the Mean of a Normal Distribution: Population Variance Unknown Given a random sample of n observations from a normal population.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Chapter 7: Statistical Applications in Traffic Engineering
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 10 Statistical Inference for Two Samples 10-1 Inference on the Difference in Means of Two Normal Distributions, Variances Known Hypothesis tests.
Testing of Hypothesis Fundamentals of Hypothesis.
4 Hypothesis & Testing. CHAPTER OUTLINE 4-1 STATISTICAL INFERENCE 4-2 POINT ESTIMATION 4-3 HYPOTHESIS TESTING Statistical Hypotheses Testing.
L Berkley Davis Copyright 2009 MER301: Engineering Reliability Lecture 9 1 MER301:Engineering Reliability LECTURE 9: Chapter 4: Decision Making for a Single.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
MeanVariance Sample Population Size n N IME 301. b = is a random value = is probability means For example: IME 301 Also: For example means Then from standard.
© Copyright McGraw-Hill 2004
Inferences Concerning Variances
What is a Hypothesis? A hypothesis is a claim (assumption) about the population parameter Examples of parameters are population mean or proportion The.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 7-9
Chapter 9 Hypothesis Testing.
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Section 11.1: Significance Tests: Basics
Presentation transcript:

Chapter 7: Statistical Applications in Traffic Engineering Chapter objectives: By the end of these chapters the student will be able to (We do skip simple descriptive stats because they were covered in CE361 and in an undergrad statistics class.): Lecture number Lecture Objectives (after these lectures you will be able to) Section 7.1 through 7.7 are reviews of basic statistics topics and not covered in this class. Assume you took Stat 201 or equivalent. Apply the basic principles of statistics contained in section 7.1 to traffic data analyses Explain the characteristics of the normal distribution and read the normal distribution table correctly (section 7.2) and get necessary values from Excel. Explain the meaning of confidence bounds and determine the confidence interval of the mean (section 7.3) Determine sample sizes of traffic data collection (section 7.4) Explain how random variables are added (section 7.5) Explain the implication of the central limit theorem (section 7.5.1) Explain the characteristics of various probabilistic distributions useful for traffic engineering studies and choose a correct distribution for the study(section 7.6) Explain the special characteristics of the Poisson distribution and its usefulness to traffic engineering studies (section 7-7) We covers these topics in section 7-8 Perform a hypothesis test correctly (two-sided, one-sided, paired test, F-test) Conduct a Chi-square test to test hypotheses on an underlying distribution f(x). This test will be covered again as part of the lecture on speed data analysis in Chapter 10.

7.8 Hypothesis testing Two distinct choices: Null hypothesis, H0 Alternative hypothesis: H1 E.g. Inspect 100,000 vehicles, of which 10,000 vehicles are “unsafe.” This is the fact given to us. H0: The vehicle being tested is “safe.” H1: The vehicle being tested is “unsafe.” In this inspection, 15% of the unsafe vehicles are determined to be safe Type II error (bad error) and 5% of the safe vehicles are determined to be unsafe  Type I error (economically bad but safety-wise it is better than Type II error.)

Types of errors Steps of the Hypothesis Testing Decision Reality State the hypothesis Select the significance level Compute sample statistics and estimate parameters Compute the test statistic Determine the acceptance and critical region of the test statistics Reject or do not reject H0 Reject H0 Accept H0 H0 is true Type I error Correct Correct Type II error H1 is true Fail to reject a false null hypothesis: Accept a false H0 Reject a correct null hypothesis: Reject a true H0 P(type I error) =  (level of significance) P(type II error ) = 

Dependence between , , and sample size n There is a distinct relationship between the two probability values  and  and the sample size n for any hypothesis. The value of any one is found by using the test statistic and set values of the other two. Given  and n, determine . Usually the  and n values are the most crucial, so they are established and the  value is not controlled. Given  and , determine n. Set up the test statistic for  and  with H0 value and an H1 value of the parameter and two different n values. (Read the handout given in class carefully to understand this. Notice we need to compute d value to find these.) Here we are comparing means; hence divide σ by sqrt(n). The t (or z) statistics is: t or z

7.8.2 Before-and-after tests with generalized alternative hypothesis The significance of the hypothesis test is indicated by , the type I error probability.  = 0.05 is most common: there is a 5% level of significance, which means that on the average a type I error (reject a true H0) will occur 5 in 100 times that H0 and H1 are tested. In addition, there is a 95% confidence level that the result is correct. 0.025 each If H1 involves a not-equal relation, no direction is given, so the significance area is equally divided between the two tails of the testing distribution. Two-sided If it is known that the parameter can go in only one direction, a one-sided test is performed, so the significance area is in one tail of the distribution. 0.05 One-sided upper

Two-sided or one-sided test These tests are done to compare the effectiveness of an improvement to a highway or street by using mean speeds. If you want to prove that the difference exists between the two data samples, you conduct a two-way test. (There is no change.) If you are sure that there is no decrease or increase, you conduct a one-sided test. (There was no decrease) Null hypothesis H0: 1 = 2 (there is no change) Alternative H1: 1 ≠ 2 Null hypothesis H0: 1 = 2 (there is no increase) Alternative H1: 1  2

Example (p.137-138) The decision point (or typically zc: Existing After improvement Sample size 55 Mean 60 min 55 min Standard Deviation 8 min For two-sided: 1.96*1.53 = 2.998 For one-sided: 1.65*1.53 =2.525 |µ1 - µ2| = |60-55| = 5 > zc By either test, H0 is rejected. At significance level  = 0.05 (See Table 7-3.) You can compute z score and compare Z computed and Z critical values.

Use of the standard normal distribution table, Tab 7-3 Z = 1.43 Excel functions: NORMSDIST(z) NORMSINV(cum prob) T.DIST.RT (Computed t, DF) T.DIST.2T (Computed, DF) Most popular one is a 95% confidence level and both sided µ ± 1.96  . See section 7.2.2 for confidence interval.

7.8.3 Other useful statistical tests The t-test (for small samples, n<=30) – Table 7.6: tc = 2.101 for two-sided α = 0.05. (See the samples in page 140)

F-test - Table 7.7 The F-Test to test if s1=s2 When the t-test and other similar means tests are conducted, there is an implicit assumption made that s1=s2. The F-test can test this hypothesis. The numerator variance > The denominator variance when you compute a F-value. If Fcomputed ≥ Ftable (n1-1, n2-1, a), then s1≠s2 at a asignificance level. If Fcomputed < Ftable (n1-1, n2-1,a), then s1=s2 at a asignificance level. Discuss the problem in p.140.

Paired difference test You perform a paired difference test only when you have a control over the sequence of data collection. e.g. Simulation  You control parameters. You have two different signal timing schemes. Only the timing parameters are changed. Use the same random number seeds. Then you can pair. If you cannot control random number seeds in simulation, you are not able to do a paired test. Table 7-8 shows an example showing the benefits of paired testing  The only thing changed is the method to collect speed data. The same vehicle’s speed was measured by the two methods.

Paired or not-paired example (Table 7.8) Method 1 Method 2 Difference Estimated mean 56.87 61.20 4.33 Estimated SD 7.74 7.26 1.496 H0: No increase in test scores (means one-sided or one-tailed) Tab 7.6 for t values Not paired: Paired: tcritical = 1.701 for df=28, a = 0.05. Computed 1.5807 < Critical t = 1.701 Hence, H0 is NOT rejected. tcritical = 1.761 for df=14, a = 0.05. Computed 11.218 < Critical t = 1.761. Hence, H0 is clearly rejected.

Chi-square (2-) goodness-of-fit test Example: Distribution of height data in Table 7-9. H0:The underlying distribution is uniform. H1: The underlying distribution is NOT uniform. The authors intentionally used the uniform distribution to make the computation simple. We will test a normal distribution in class using Excel.

Steps of Chi-square (2-) test Define categories or ranges (or bins) and assign data to the categories and find ni = the number of observations in each category i. (At least 5 bins and each should have at least 5 observations.) Compute the expected number of samples for each category (theoretical frequency), using the assumed distribution. Define fi = the number of samples for each category i. Compute the quantity:

Steps of Chi-square (2-) test (cont) 2 is chi-square distributed (see Table 7-11). If this value is lower than the critical 2 value, our null hypothesis is correct. Usually we use  = 0.05 (5% significance level or 95% confidence level). When you look up the table, the degree of freedom is f = N – 1 – g where g is the number of parameters we use in the assumed distribution. For normal distribution g = 2 because we use µ and  to describe the shape of normal distribution. (For the uniform distribution in the example g = 0 (why?). Hence, f = 10 – 1 – 0 = 9. If the computed 2 value (43.20 for this example) is smaller than the critical c2 value (16.92 for this example), we accept H0. Excel functions: CHIDIST(Computed 2 , DF)  Probability CHISQ.INV.RT (α-value, DF)  critical 2 value

What’s the Chi-square goodness of fit (2-) test testing? You need to know how to pull out values from the assumed distribution to create the expected histogram. Assumed distribution Chi-square (2-) test Expected distribution (or histogram) Actual histogram