CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Managerial Economics in a Global Economy
Chapter 7. Statistical Estimation and Sampling Distributions
Statistical Estimation and Sampling Distributions
Sampling: Final and Initial Sample Size Determination
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
SOLVED EXAMPLES.
Simple Linear Regression
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Chapter 10 Simple Regression.
Point estimation, interval estimation
Chapter 7 Sampling and Sampling Distributions
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
8 Statistical Intervals for a Single Sample CHAPTER OUTLINE
Chapter 11 Multiple Regression.
OMS 201 Review. Range The range of a data set is the difference between the largest and smallest data values. It is the simplest measure of dispersion.
Part III: Inference Topic 6 Sampling and Sampling Distributions
Chapter 2 Simple Comparative Experiments
Chapter 7 Estimation: Single Population
Inferences About Process Quality
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Inference for regression - Simple linear regression
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Chapter 7 Estimation: Single Population
Statistical Intervals for a Single Sample
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Random Sampling, Point Estimation and Maximum Likelihood.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
CHAPTER SEVEN ESTIMATION. 7.1 A Point Estimate: A point estimate of some population parameter is a single value of a statistic (parameter space). For.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Statistics Sampling Intervals for a Single Sample Contents, figures, and exercises come from the textbook: Applied Statistics and Probability for Engineers,
Chapter 8: Simple Linear Regression Yang Zhenlin.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Review of Statistical Terms Population Sample Parameter Statistic.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
STATISTICS People sometimes use statistics to describe the results of an experiment or an investigation. This process is referred to as data analysis or.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
STA302/1001 week 11 Regression Models - Introduction In regression models, two types of variables that are studied:  A dependent variable, Y, also called.
Inference about the slope parameter and correlation
The simple linear regression model and parameter estimation
ESTIMATION.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
STATISTICAL INFERENCE
Chapter 4. Inference about Process Quality
Inference for Regression
Chapter 8: Fundamental Sampling Distributions and Data Descriptions:
Simple Linear Regression - Introduction
Introduction to Instrumentation Engineering
CONCEPTS OF ESTIMATION
Simple Linear Regression
Simple Linear Regression
Chapter 8: Fundamental Sampling Distributions and Data Descriptions:
DESIGN OF EXPERIMENT (DOE)
Presentation transcript:

CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING

Software Statistical Methods Statistical Inference The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These methods utilize the information contained in a simple form the population in drawing conclusions. Statistical inference may be divided into two major areas: parameter estimation and hypothesis testing. Random Sampling Definition A population consists of the totality of the observations with which we are concerned. A sample is a subset of observations selected from a population.

Definition A statistic is any function of the observations in a random sample. Since a statistic is a random variable, it has a probability distribution. We call the probability distribution of a statistic a sampling distribution. In general, if X is a random variable with probability distribution f(x), characterized by the unknown parameter , and if X 1,X 2,…,X n is a random sample of size n from X, then the statistic  = h (X 1,X 2..X n ) is called a point estimator of . Note that  is a random variable, because it is a function of random variables. After the sample has been selected,  takes no particular numerical value  called the point estimate of . Definition A point estimate of some population parameter  is a single numerical value  of a statistic .

Estimation problems occur frequently in engineering. We often need to estimate The mean  of a single population The variance  2 (or standard deviation  ) of a single population The population p of items in a population that belong to a class of interest The difference in means of two populations,  1 -  2 The difference in two population proportions, p 1 – p 2 Reasonable point estimates of these parameters are as follows: For , the estimate is  = x, the sample mean. For  2, the estimate is  2 = s 2, the sample variance. For p, the estimate is p = x/n, the sample portion, where x is the number of items in a random sample of size n that belong to the class of interest. For  1 -  2, the estimate is  1 -  2 = x1 – x2, the difference between the sample means of two independent random samples. For p1 – p2, the estimate is p1 – p2, the difference between two sample portions computed from two independent random samples.

Properties of Estimators Unbiased Estimators The point estimator  is an unbiased estimator for the parameter  if E(  ) =  If the estimator is not unbiased, then the difference E(  ) -  is called the bias of the estimator . Definition The probability distribution of a statistic is called a sampling distribution.

Theorem If X 1,X 2,…..X n is a random sample of size n taken from a population (either finite or infinite) with mean  and finite variance  2, and if X is the sample mean, then the limiting form of the distribution of Z = as n , is the standard normal distribution. If x is the sample mean of a random sample of size n from a population with known variance  2, a 100(1 -  ) percent confidence interval on  is given by Where z  /2 is the upper 100  /2 percentage point of the standard normal distribution.

Definition If x is used as an estimate of , we can be 100(1-  ) percent confident that the error | x -  | will not exceed a specified amount E when the sample size is The 100(1-  ) percent upper-confidence interval for  is and the 100(1-  ) percent lower-confidence interval for  is Inference on the Mean of a Population, Variance Unknown

Definition (Cont.) Let X 1,X 2,…..,X n be a random sample for a normal distribution with unknown mean  and unknown variance  2. The quantity has a t distribution with n – 1 degrees of freedom. If s 2 is the sample variance from a random sample of n observations from a normal distribution with unknown variance  2 is where X 2  /2,n-1 and X 2 1-  /2,n-1 are the upper and lower 100  /2 percentage points of the chi-square distribution with n – 1 degrees of freedom, respectively. The 100(1-  ) percent lower-confidence interval on  2 is

Simple Linear Regression Definition The least squares estimates of the intercept and slope in the simple linear regression model are  0 = y -  1 x whereand where

Definition In simple linear regression the estimated standard error of the slope is and the estimated standard error of the intercept is where  2 is computed from equation

Hypothesis Tests in Simple Linear Regression

Parameter Estimation The regression line is fitted to the data points by finding the line which is the “closest” to the data points in some sense. Consider the vertical deviations between the line and the data points Minimizes the sum of the squares of these vertical deviations and this is referred to as the least squares fit.

Parameter Estimation (Cont.) The parameter estimates and are easily found by taking partial derivatives of Q with respect to and and setting the resulting expressions equal to zero. Since and

Parameter Estimation (Cont.) The parameter estimates and are thus the solutions to the normal equations and The normal equations can be solved to give and

Inference Procedures For a particular value of the explanatory variable, the true regression line specifies the expected value of the dependent variable or, in other words, the expected response. Thus, if the random variable measures the value of the dependent variable when the explanatory variable is equal to, then It is useful to be able to construct confidence intervals for this expected value.

Inference Procedures (Cont.) The point estimate of the average response at is This is an observation from the random variable where

Inference Procedures (Cont.) Since it is a linear combination of normal random variables, this random variable is also normally distributed, and it can be shown that it has an expectation of and a variance of

Inference on the Expected Value of the Dependent Variable A confidence level two-sided confidence interval for, the expected value of the dependent variable for a particular value of the explanatory variable, is where