Dynamic Programming. A Simple Example Capital as a State Variable.

Slides:



Advertisements
Similar presentations
Formal Computational Skills
Advertisements

Boyce/DiPrima 9th ed, Ch 2.4: Differences Between Linear and Nonlinear Equations Elementary Differential Equations and Boundary Value Problems, 9th edition,
Boyce/DiPrima 9th ed, Ch 2.8: The Existence and Uniqueness Theorem Elementary Differential Equations and Boundary Value Problems, 9th edition, by William.
Mathematics1 Mathematics 1 Applied Informatics Štefan BEREŽNÝ.
Boyce/DiPrima 9th ed, Ch 2.7: Numerical Approximations: Euler’s Method Elementary Differential Equations and Boundary Value Problems, 9th edition, by.
Lecture 6 Classifications of Interest Rate Models.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 4 Entity Relationship (ER) Modeling.
Chapter 2: Lasso for linear models
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Ch 2.4: Differences Between Linear and Nonlinear Equations
Sample size computations Petter Mostad
Chapter 9 Simultaneous Equations Models. What is in this Chapter? In Chapter 4 we mentioned that one of the assumptions in the basic regression model.
CE 1501 CE 150 Fluid Mechanics G.A. Kallio Dept. of Mechanical Engineering, Mechatronic Engineering & Manufacturing Technology California State University,
The Islamic University of Gaza Faculty of Engineering Numerical Analysis ECIV 3306 Introduction.
Reinforcement Learning (2) Bob Durrant School of Computer Science University of Birmingham (Slides: Dr Ata Kabán)
Mathematical Modeling in Biology:
Algorithms For Inverse Reinforcement Learning Presented by Alp Sardağ.
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
The Final Bugbox Model Let L t be the number of larvae at time t. Let P t be the number of pupae at time t. Let A t be the number of adults at time t.
Numerical solution of Differential and Integral Equations PSCi702 October 19, 2005.
Chapter 5: z-scores.
Chapter 5: z-scores STAT 252 Spring 2013 Gerald D. Nunn, Ph.D., NCSP.
Data Modeling Using the Entity-Relationship Model
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
Computational Stochastic Optimization: Bridging communities October 25, 2012 Warren Powell CASTLE Laboratory Princeton University
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 9 Section 1 – Slide 1 of 39 Chapter 9 Section 1 The Logic in Constructing Confidence Intervals.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Review: Two Main Uses of Statistics 1)Descriptive : To describe or summarize a collection of data points The data set in hand = all the data points of.
Warm-up 1.Find f(12). 2.f(x)=140. Find x.. Answer 1.f(12) = 1 2. f(9) = 140.
DSGE Models and Optimal Monetary Policy Andrew P. Blake.
Algorithms and their Applications CS2004 ( ) Dr Stephen Swift 3.1 Mathematical Foundation.
1 ECE-517 Reinforcement Learning in Artificial Intelligence Lecture 7: Finite Horizon MDPs, Dynamic Programming Dr. Itamar Arel College of Engineering.
1 Lesson 8: Basic Monte Carlo integration We begin the 2 nd phase of our course: Study of general mathematics of MC We begin the 2 nd phase of our course:
Mathematics of Measurable Parameters P M V Subbarao Professor Mechanical Engineering Department A Pro-active Philosophy of Inventions….
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Curve-Fitting Regression
Parametric Equations: another process to describe a plane curve on the Cartesian coordinate system – Parametrizations of Plane Curves The coordinates.
Chapter 4 Entity Relationship (ER) Modeling.  ER model forms the basis of an ER diagram  ERD represents conceptual database as viewed by end user 
Learning Theory Reza Shadmehr & Jörn Diedrichsen Reinforcement Learning 1: Generalized policy iteration.
Data Encryption Standard CS-471/971. Category of Standard: Computer Security. Explanation: The Data Encryption Standard (DES) specifies a FIPS approved.
Methodological Problems in Cognitive Psychology David Danks Institute for Human & Machine Cognition January 10, 2003.
Algebraic Plane Solution Let’s solve the system of equations: 2x – 5y + 2z = 15 x + 3y - z = -4 2x - y - z = 2 First we make our assumption that they all.
SD modeling process One drawback of using a computer to simulate systems is that the computer will always do exactly what you tell it to do. (Garbage in.
MA354 An Introduction to Math Models (more or less corresponding to 1.0 in your book)
April 29th, Chapter 6 Decision Making 6.1 The Nature of Management decisions 6.1 The Nature of Management decisions 6.2The decision making process.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Systems of Linear Equations in Two Variables. 1. Determine whether the given ordered pair is a solution of the system.
Unit 7: Sampling Distributions
Reinforcement Learning Dynamic Programming I Subramanian Ramamoorthy School of Informatics 31 January, 2012.
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 13 Collecting Statistical Data 13.1The Population 13.2Sampling.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 0.Game Theory – Brief Introduction Lecture
General Analysis Procedure and Calculator Policy Calculator Policy.
Ch. Eick: Num. Optimization with GAs Numerical Optimization General Framework: objective function f(x 1,...,x n ) to be minimized or maximized constraints:
Solving a System of 3 Equations with 3 Unknowns. Breakdown Step 1 Labeling Step 2 Reduce to a 2 by 2 Step 3 Substitute Back In Step 4 Check Solution.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 8.3.
Team  Spatially distributed deterministic models  Many hydrological phenomena vary spatially and temporally in accordance with the conservation.
Principles of Management
PIVOTING The pivot or pivot element is the element of a matrix, or an array, which is selected first by an algorithm (e.g. Gaussian elimination, simplex.
Copyright © Cengage Learning. All rights reserved.
Fundamentals of Functional Programming Languages
Making Statistical Inferences
Estimating
Systems of Linear Equations in Two Variables
§1-2 State-Space Description
Choose the differential equation corresponding to this direction field
CS 416 Artificial Intelligence
Multiple features Linear Regression with multiple variables
Multiple features Linear Regression with multiple variables
Presentation transcript:

Dynamic Programming

A Simple Example

Capital as a State Variable

The Value Function

The Functional Equation

Computing the Value Function

The standard approach for computing value functions is known as “iteration on the value function” 1.To proceed, first specify functional forms and parameter values for the primitives of the problem: utility functions, production functions, the discount factor, etc. 2.Then set up a grid of possible values of the state variable: in cases where the state variable is continuous (such as the capital accumulation problem) this results in an approximate solution rather than an exact one, but this is usually good enough 3.The next step is to choose an initial value function; this sounds difficult but in practice it usually doesn’t matter too much what you start with: setting V(k) = 0 for every k on the grid is a typical choice

Iteration on the Value Function

Mathematical Foundations We would like our model to yield a unique solution for V(.) (a unique function, not a point) This means no matter which initial value function we choose at the start of the iterations, the final value function is always the same Similarly, we would like g(.) to be single-valued (a function instead of a correspondence) Then, given an initial state, the model would yield a unique prediction for the entire path of capital accumulation Whether the model yields unique solutions for the value function and policy function depends on the primitives of the problem: utility functions, production functions, etc. Stokey and Lucas describe general sufficient conditions for a model to yield a unique value function and a unique policy function (these conditions are sufficient, not necessary)

General Notation

Assumptions

Key Results

The Content of the Assumptions

The Assumptions