STATISTICS People sometimes use statistics to describe the results of an experiment or an investigation. This process is referred to as data analysis or.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Chap 8-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 8 Estimation: Single Population Statistics for Business and Economics.
Chapter 6 Introduction to Sampling Distributions
Correlation 2 Computations, and the best fitting line.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Topic 2: Statistical Concepts and Market Returns
Evaluating Hypotheses
SIMPLE LINEAR REGRESSION
Chapter 8 Estimation: Single Population
Lecture 7 1 Statistics Statistics: 1. Model 2. Estimation 3. Hypothesis test.
Inferences About Process Quality
SIMPLE LINEAR REGRESSION
1 Inference About a Population Variance Sometimes we are interested in making inference about the variability of processes. Examples: –Investors use variance.
Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.
Hypothesis Testing Using The One-Sample t-Test
Business Statistics: Communicating with Numbers
SIMPLE LINEAR REGRESSION
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Statistical inference: confidence intervals and hypothesis testing.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
Today’s lesson Confidence intervals for the expected value of a random variable. Determining the sample size needed to have a specified probability of.
Albert Morlan Caitrin Carroll Savannah Andrews Richard Saney.
Topic 5 Statistical inference: point and interval estimate
Lecture 14 Sections 7.1 – 7.2 Objectives:
1 Introduction to Estimation Chapter Concepts of Estimation The objective of estimation is to determine the value of a population parameter on the.
Estimates and Sample Sizes Lecture – 7.4
Inferences in Regression and Correlation Analysis Ayona Chatterjee Spring 2008 Math 4803/5803.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 11 Inferences About Population Variances n Inference about a Population Variance n.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 12 Inference About A Population.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Lecture 7 Dustin Lueker. 2  Point Estimate ◦ A single number that is the best guess for the parameter  Sample mean is usually at good guess for the.
Tests of Hypotheses Involving Two Populations Tests for the Differences of Means Comparison of two means: and The method of comparison depends on.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Confidence Intervals for Variance and Standard Deviation.
Chapter 7 Estimates, Confidence Intervals, and Sample Sizes
CHAPTER 9 Inference: Estimation The essential nature of inferential statistics, as verses descriptive statistics is one of knowledge. In descriptive statistics,
BASIC STATISTICAL CONCEPTS Statistical Moments & Probability Density Functions Ocean is not “stationary” “Stationary” - statistical properties remain constant.
1 Estimation of Population Mean Dr. T. T. Kachwala.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
Section 6.4 Inferences for Variances. Chi-square probability densities.
Chapter Eleven Performing the One-Sample t-Test and Testing Correlation.
Week 21 Order Statistics The order statistics of a set of random variables X 1, X 2,…, X n are the same random variables arranged in increasing order.
Ex St 801 Statistical Methods Inference about a Single Population Mean (CI)
Chapter 12 Inference About One Population. We shall develop techniques to estimate and test three population parameters.  Population mean   Population.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Confidence Intervals. Point Estimate u A specific numerical value estimate of a parameter. u The best point estimate for the population mean is the sample.
Lecture 13 Dustin Lueker. 2  Inferential statistical methods provide predictions about characteristics of a population, based on information in a sample.
Probability & Statistics Review I 1. Normal Distribution 2. Sampling Distribution 3. Inference - Confidence Interval.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Dr.Theingi Community Medicine
TESTING STATISTICAL HYPOTHESES
Confidence Intervals and Sample Size
Point and interval estimations of parameters of the normally up-diffused sign. Concept of statistical evaluation.
Chapter 4. Inference about Process Quality
Chapter 9 Hypothesis Testing.
Chapter 11 Inferences About Population Variances
CONCEPTS OF ESTIMATION
SIMPLE LINEAR REGRESSION
Presentation transcript:

STATISTICS People sometimes use statistics to describe the results of an experiment or an investigation. This process is referred to as data analysis or descriptive statistics. Statistics is also used in another way: if the entire population of interest is not accessible for some reason, often only a portion of the population (a sample) is observed and statistics is then used to answer questions about the whole population. This process is called inferential statistics. Statistical inference will be the main focus of this lesson.

Example 1 We choose ten people from a population and for each individual measure his or her height. We obtain the following results in cm: 178, 180, 158, 166, 190, 180, 177, 178, 182, 160 It is known that the the random variable describing the height of an individual from that particular population has a normal distribution What is an unbiased estimate of the average height of the population?

Considering that the arithmetic mean of the the sample arithmetic mean is 174.9, can we maintain that, with a probability of, the hypothesis that the population average mean equals 176 cm is true?

The answer to thefirst question is related to point estimation First we will introduce the notion of a random sample. We view the raw data as an implementation of a random vector with the random variables being independent and each random variable having the same probability distribution F(x). This random vector is called a random sample or a sample. In a similar way, we also define a multidimensional random sample, for example

A function that does not explicitly depend on any parameter of the distribution F(x) is called a statistic of the sample The following are examples of the most frequently used statistics: Various statistics are then used as point estimators of parameters of the distribution F(x). The point estimation is obtained by implementing the point estimator statistic, that is, by substituting the raw data for the random variables in the formula.

Sample arithmetic mean

Sample variance

Sample standard deviation

Sample correlation coefficient

It is desirable for a point estimate to be: (1) Consistent. The larger the sample size, the more accurate the estimate. (2) Unbiased. For each value of the parameter of the population distribution F(x) the sample expectancy E(Y) of the point estimator Y equals. (3) Most efficient or best unbiased: of all consistent, unbiased estimates, the one possessing the smallest variance.

It can be shown that the following statistics are consistent, unbiased, and best unbiased estimators.  sample arithmetic mean for E(X)  sample variance for D(X)  sample standard deviation for  sample correlation coefficient for Returning to Example 1, we can now say that, based on the data given, is a consistent, best unbiased estimation of the average height of the entire population.

Example 2 We choose ten people from a population and for each individual measure his or her height. We obtain the following results in cm: 178, 180, 158, 166, 190, 180, 177, 178, 182, 160 It is known that the the random variable describing the height of an individual from that particular population has a normal distribution For a given probability, is there an interval I such that the average height of the population lies in I with a probality ?

Let us take the sample arithmetic mean If each has a normal distribution, it can be proved that has a distribution

We will show this for n = 2: Let X 0 has a distribution and Y 0 Put denote by F(z) the distribution of Z 0 and calculate : Since X 0 and Y 0 are independent, we can write

A

To calculate we will use the transformation where J = 1.

The second equation from the left is due to the fact that Thus we can conclude that Z 0 has a distribution

To conclude the proof, we will show that if, then

Let us calculate what expectancy the population must have for the probability that the value of the sample arithmetic mean is less than is less than and, what expectancy the population must have for the probability that the value of the sample arithme- tic mean is greater than is less than.

Now we can solve Example 2: We have If we choose we get and so we can write Thus, we can conclude that, with probability 0.95, the average height ofte population in question lies between and 181.1

We can think of the formulas and as of implementations of the statistics and is called a confidence interval and its valuefor a particular sample is called an intercal estimate at confidence level

The conficence interval has been derived on the assumption that the population distribution is normal. With such an assumption, other confidence intervals can also be derived: is a confidence interval for the expectancy if the population variance and expectancy are not known. Heredenotes the quantile of the Student's t-distribution with degrees of freedom.

is a confidence interval for the variance if the population variance and expectancy are not known. Here and are quantiles of Pearson’s chi- squared distribution with k = n-1 degrees of freedom.