We start Part II (Quantifying Uncertainty) today Lab 2 - Equations Tomorrow - Tue 3-5 or 7-9 PM - SN 4117 Assignment 2 – Data Equations Due Wednesday.

Slides:



Advertisements
Similar presentations
Chemistry: Matter and Change
Advertisements

Functions and Their Representations
Ecole Nationale Vétérinaire de Toulouse Linear Regression
STATISTICS Linear Statistical Models
STATISTICS HYPOTHESES TEST (I)
STATISTICS INTERVAL ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
STATISTICS POINT ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Chapter 7 Sampling and Sampling Distributions
Simple Linear Regression 1. review of least squares procedure 2
EC220 - Introduction to econometrics (chapter 1)
The basics for simulations
Chapter 4: Basic Estimation Techniques
3/2003 Rev 1 I – slide 1 of 33 Session I Part I Review of Fundamentals Module 2Basic Physics and Mathematics Used in Radiation Protection.
Independent Measures T-Test Quantitative Methods in HPELS 440:210.
Chi-Square and Analysis of Variance (ANOVA)
2009 Foster School of Business Cost Accounting L.DuCharme 1 Determining How Costs Behave Chapter 10.
1..
Statistical Analysis SC504/HS927 Spring Term 2008
Multiple Regression. Introduction In this chapter, we extend the simple linear regression model. Any number of independent variables is now allowed. We.
Introduction The fit of a linear function to a set of data can be assessed by analyzing residuals. A residual is the vertical distance between an observed.
AP Statistics Section 15 B. Estimating the Parameters To estimate the true regression line, we will use the calculated least-squares regression line.
Evaluation of precision and accuracy of a measurement
Chapter Thirteen The One-Way Analysis of Variance.
Ch 14 實習(2).
Chapter 8 Estimation Understandable Statistics Ninth Edition
Exponents and Radicals
Copyright © 2013 Pearson Education, Inc. All rights reserved Chapter 11 Simple Linear Regression.
Simple Linear Regression Analysis
Multiple Regression and Model Building
STAT E-150 Statistical Methods
Spearman’s Rank For relationship data. Non-parametric i.e. no assumptions are made about data fitting a normal distribution You must have more than 5.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Sampling: Final and Initial Sample Size Determination
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
The Simple Linear Regression Model: Specification and Estimation
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Statistics 350 Lecture 17. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Simple Linear Regression Least squares line Interpreting coefficients Prediction Cautions The formal model Section 2.6, 9.1, 9.2 Professor Kari Lock Morgan.
Linear Regression/Correlation
Regression and Correlation Methods Judy Zhong Ph.D.
Today: Quizz 8 Friday: GLM review Monday: Exam 2.
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
Thomas Knotts. Engineers often: Regress data  Analysis  Fit to theory  Data reduction Use the regression of others  Antoine Equation  DIPPR.
2014. Engineers often: Regress data  Analysis  Fit to theory  Data reduction Use the regression of others  Antoine Equation  DIPPR We need to be.
Chapter 14 Inference for Regression AP Statistics 14.1 – Inference about the Model 14.2 – Predictions and Conditions.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Today: Lab 3 & A3 due Mon Oct 1: Exam I  this room, 12 pm Please, no computers or smartphones Mon Oct 1: No grad seminar Next grad seminar: Wednesday,
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Class 4 Ordinary Least Squares CERAM February-March-April 2008 Lionel Nesta Observatoire Français des Conjonctures Economiques
Curve Fitting Pertemuan 10 Matakuliah: S0262-Analisis Numerik Tahun: 2010.
ReCap Part II (Chapters 5,6,7) Data equations summarize pattern in data as a series of parameters (means, slopes). Frequency distributions, a key concept.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
LECTURE 02: EVALUATING MODELS January 27, 2016 SDS 293 Machine Learning.
1 AAEC 4302 ADVANCED STATISTICAL METHODS IN AGRICULTURAL RESEARCH Part II: Theory and Estimation of Regression Models Chapter 5: Simple Regression Theory.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Chem. 31 – 6/13 Lecture. Announcements I Pipet and Buret Calibration Lab Report Due Quiz and Homework Returned in Lab Exam 1 on Thursday –Will cover material.
CHAPTER 3 Describing Relationships
Part 5 - Chapter
CHAPTER 3 Describing Relationships
AP Statistics Chapter 14 Section 1.
Understanding Standards Event Higher Statistics Award
Linear Regression/Correlation
CHAPTER 3 Describing Relationships
Correlation and Regression
Regression Statistics
Chapter 14 Inference for Regression
Ch 4.1 & 4.2 Two dimensions concept
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. Find the correlation coefficient & interpret.
Presentation transcript:

We start Part II (Quantifying Uncertainty) today Lab 2 - Equations Tomorrow - Tue 3-5 or 7-9 PM - SN 4117 Assignment 2 – Data Equations Due Wednesday

Data = Model + Residual Chapter 5 Data Equations Data = Model + Residual

Data Equations Central concept in the course We’ll teach a general approach that will allow you to set up an appropriate analysis of data You don’t have to worry about whether you have selected the ‘right’ test We are going to use data equations to compare models to data Data = Model + Residual

Flexible approach Data = Model + Residual

Symbolic expression  Analyse data Increase confidence using model triangle – Link symbols with graphical and verbal model Data Verbal GraphicalFormal

Assessing fit We’ll use data equations to measure: – How well the model fits the data (goodness of fit) – Error rate Model Plant height Time in sunlight Data We do not expect a perfect fit But model and observed values should be close

Definition Three terms: Y=Ŷ+εY=Ŷ+ε DataModelResidual ObservedExpectedError Fitted Residuals: ε= Y -Ŷ

e.g. Dobzhansky’s Fruit Flies Dobzhansky pioneered work on fruit fly evolutionary genetics in the lab and field One research question he addressed was: – Does genetic variability decrease at higher altitude, due to stronger selection in extreme environments? Nothing in Biology Makes Sense Except in the Light of Evolution

e.g. Dobzhansky’s Fruit Flies Nothing in Biology Makes Sense Except in the Light of Evolution Heterozygosity H (%) Elevation E (km) Nothing in Biology Makes Sense Except in the Light of Statistics ?

Heterozygosity H (%) Elevation E (km) Model? Heterozygosity in Drosophila = 40% Data = Model + Residual H =Ĥ + ε H = 40% + ε Many options First we’ll check deviance from a single value

==== = = = = = = = ∑ With this simple model, we can form 7 data equations Summed residuals is a measure of bias Summed residuals 2 is a measure of goodness of fit

What if the parameter is unknown?

==== = = = = = = = ∑ Form 7 data equations

Single value vs. Mean model Mean model: unbiased and better fit But biological criteria have been replaced by statistical criteria

e.g. Dobzhansky’s Fruit Flies Nothing in Biology Makes Sense Except in the Light of Evolution Heterozygosity H (%) Elevation E (km) What about elevation?...don't you remember my question? Does genetic variability decrease at higher altitude, due to stronger selection in extreme environments?

%/kmkm%%

Heterozygosity H (%) Elevation E (km)

==== = = = = = = = ∑ With this equation, we can calculate fitted values ? ? ? ? ? ? ? ? ? ? ? ? ? ?

There’s a better way to estimate slope

Model comparison ∑ res = ∑ res = 0 ∑ res = ∑ res = 0 Two unbiased models ∑ res 2 = ∑ res 2 = Reduction in squared deviance ∑ res 2 =