Eric Nævdal eric.navdal@econ.uio.no Non-linearities, catastrophic risk and thresholds in resource economics Eric Nævdal eric.navdal@econ.uio.no.

Slides:



Advertisements
Similar presentations
Chapter 9 Maintenance and Replacement The problem of determining the lifetime of an asset or an activity simultaneously with its management during that.
Advertisements

The Basics of Physics with Calculus – Part II
Chapter 12: Basic option theory
Linear Regression.
The securities market economy -- theory Abstracting again to the two- period analysis - - but to different states of payoff.
Mean-variance portfolio theory
Lecture #11: Introduction to the New Empirical Industrial Organization (NEIO) - What is the old empirical IO? The old empirical IO refers to studies that.
SEQUENCES and INFINITE SERIES
Quantum One: Lecture 3. Implications of Schrödinger's Wave Mechanics for Conservative Systems.
Planning under Uncertainty
PHYS2020 NUMERICAL ALGORITHM NOTES ROOTS OF EQUATIONS.
1 Decision making under large uncertainty * Marie-Laure Guillerminet * * ZMK, University of Hamburg Atlantis Meeting January 24 th, 2003.
8 TECHNIQUES OF INTEGRATION. In defining a definite integral, we dealt with a function f defined on a finite interval [a, b] and we assumed that f does.
Human Growth: From data to functions. Challenges to measuring growth We need repeated and regular access to subjects for up to 20 years. We need repeated.
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Copyright © Cengage Learning. All rights reserved.
Chapter 13 Stochastic Optimal Control The state of the system is represented by a controlled stochastic process. Section 13.2 formulates a stochastic optimal.
APPLICATIONS OF DIFFERENTIATION 4. In Sections 2.2 and 2.4, we investigated infinite limits and vertical asymptotes.  There, we let x approach a number.
THE MATHEMATICS OF OPTIMIZATION
Copyright © Cengage Learning. All rights reserved. 4 Continuous Random Variables and Probability Distributions.
Numerical Methods for Partial Differential Equations CAAM 452 Spring 2005 Lecture 9 Instructor: Tim Warburton.
Chapter 3 Root Finding.
Algebra Problems… Solutions
SOLUTION FOR THE BOUNDARY LAYER ON A FLAT PLATE
The Game of Algebra or The Other Side of Arithmetic The Game of Algebra or The Other Side of Arithmetic © 2007 Herbert I. Gross by Herbert I. Gross & Richard.
Managerial Economics Managerial Economics = economic theory + mathematical eco + statistical analysis.
Copyright © Cengage Learning. All rights reserved. CHAPTER 11 ANALYSIS OF ALGORITHM EFFICIENCY ANALYSIS OF ALGORITHM EFFICIENCY.
Ch 8.1 Numerical Methods: The Euler or Tangent Line Method
Measuring Returns Converting Dollar Returns to Percentage Returns
Conditional & Joint Probability A brief digression back to joint probability: i.e. both events O and H occur Again, we can express joint probability in.
5.5 The Substitution Rule In this section, we will learn: To substitute a new variable in place of an existing expression in a function, making integration.
Section 2: Finite Element Analysis Theory
Investment Analysis and Portfolio management Lecture: 24 Course Code: MBF702.
More on Stock Pollutants followed by the Environment and Asymmetric Information Lecture ECON 4910.
 We noticed in Section 2.3 that the limit of a function as x approaches a can often be found simply by calculating the value of the function at a.  Functions.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
The importance of sequences and infinite series in calculus stems from Newton’s idea of representing functions as sums of infinite series.  For instance,
Example 5.8 Non-logistics Network Models | 5.2 | 5.3 | 5.4 | 5.5 | 5.6 | 5.7 | 5.9 | 5.10 | 5.10a a Background Information.
Dr. Hatim Dirar Department of Physics, College of Science Imam Mohamad Ibn Saud Islamic University.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
LIMITS AND DERIVATIVES 2. In Sections 2.2 and 2.4, we investigated infinite limits and vertical asymptotes.  There, we let x approach a number.  The.
Given the marginal cost, find the original cost equation. C ' ( x ) = 9 x 2 – 10 x + 7 ; fixed cost is $ 20. In algebra, we were told that what ever was.
DSGE Models and Optimal Monetary Policy Andrew P. Blake.
1 ECE-517 Reinforcement Learning in Artificial Intelligence Lecture 7: Finite Horizon MDPs, Dynamic Programming Dr. Itamar Arel College of Engineering.
1 Lesson 8: Basic Monte Carlo integration We begin the 2 nd phase of our course: Study of general mathematics of MC We begin the 2 nd phase of our course:
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
SUPA Advanced Data Analysis Course, Jan 6th – 7th 2009 Advanced Data Analysis for the Physical Sciences Dr Martin Hendry Dept of Physics and Astronomy.
Uniform discretizations: the continuum limit of consistent discretizations Jorge Pullin Horace Hearne Institute for Theoretical Physics Louisiana State.
Lecture 7: What is Regression Analysis? BUEC 333 Summer 2009 Simon Woodcock.
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
In section 11.9, we were able to find power series representations for a certain restricted class of functions. Here, we investigate more general problems.
Lecture 7 and 8 The efficient and optimal use of natural resources.
Risk Analysis & Modelling
Math 104 Calculus I Part 6 INFINITE SERIES. Series of Constants We’ve looked at limits and sequences. Now, we look at a specific kind of sequential limit,
March 23 & 28, Csci 2111: Data and File Structures Week 10, Lectures 1 & 2 Hashing.
March 23 & 28, Hashing. 2 What is Hashing? A Hash function is a function h(K) which transforms a key K into an address. Hashing is like indexing.
Investment Performance Measurement, Risk Tolerance and Optimal Portfolio Choice Marek Musiela, BNP Paribas, London.
Advanced Engineering Mathematics, 7 th Edition Peter V. O’Neil © 2012 Cengage Learning Engineering. All Rights Reserved. CHAPTER 4 Series Solutions.
Copyright © Cengage Learning. All rights reserved. 4 Continuous Random Variables and Probability Distributions.
Copyright © Cengage Learning. All rights reserved.
INTEGRALS We saw in Section 5.1 that a limit of the form arises when we compute an area. We also saw that it arises when we try to find the distance traveled.
5 INTEGRALS.
FUNCTIONS AND MODELS Exponential Functions FUNCTIONS AND MODELS In this section, we will learn about: Exponential functions and their applications.
Lecture 8: Measurement Errors 1. Objectives List some sources of measurement errors. Classify measurement errors into systematic and random errors. Study.
Chapter 1 Limits and Their Properties Unit Outcomes – At the end of this unit you will be able to: Understand what calculus is and how it differs from.
Ch 9.6: Liapunov’s Second Method In Section 9.3 we showed how the stability of a critical point of an almost linear system can usually be determined from.
 This will explain how consumers allocate their income over many goods.  This looks at individual’s decision making when faced with limited income and.
Fundamentals of Data Analysis Lecture 11 Methods of parametric estimation.
Copyright © Cengage Learning. All rights reserved. 4 Continuous Random Variables and Probability Distributions.
Presentation transcript:

Eric Nævdal eric.navdal@econ.uio.no Non-linearities, catastrophic risk and thresholds in resource economics Eric Nævdal eric.navdal@econ.uio.no

Purpose of class Teach some advanced methods in optimal control theory Familiarize students with some applications of these methods to natural resource management. Enable students to solve simple problems numerically. A very applied course. No theorems; just methods!

Prerequisites A decent understanding of ordinary differential equations. A decent understanding of deterministic optimal control theory. E.g. Maxc 0∫T U(c,x)e-rtdt subject to x(0) = x0 and dx/dt=f(x,c). Here 0 < T ≤ ∞.

Cook book solution in easy steps Define the Hamiltonian: H(c,x) = U(c,x) + λf(c,x) 2) Find optimality conditions c = argmax H → c = c(x,λ) dλ/dt = rλ – ∂H/∂x =rλ – ∂U/∂x – λ∂f/∂x Insert c from into dx/dt and = dλ/dt 3) You now have two differential equations.

What to do with the differential equations Compute steady states x* and λ*. f(c(x,λ),x) = 0 and rλ – U’x(c(x*,λ*),x*)– λf’x(c(x*,λ*),x*) = 0 Draw a phase diagram. λ dλ/dt = 0 λ* dx/dt = 0 x* x Paths converging to steady states candidates for optimal solution

Optimal Control cont’d The co-state λ(t) has an interesting economic interpretation. If somebody at time t gave you a present of 1 unit of x so that x(t) jumps to x(t) +1, then λ(t) is (roughly) the value of that present at time t. Entirely analogous to shadow prices in static theory. If T < ∞, then we have the transversality condition λ(t) = 0 if x(T) is free.

Transversality conditions for infinite Horizon problems The economic literature is full of flawed transversality conditions. The reason is that it is hard to find general conditions without using stuff like lim sup. For the purposes of this class we are satisfied if we can find at least one path where both the (current value) shadow price and the state variable converges to finite numbers. If T = ∞, then we really have a hard time with pinning down good transversality conditions. See Seierstad and Sydsæter (1987) for details.

Alternative approach that will (once) be used later Take the c(x,λ) function and differentiate it. Gives dc/dt = c’xdx/dt + c’λdλ/dt = = c’xdx/dt + c’λ(rλ – ∂U/∂x – λ∂f/∂x) The solve c(x,λ) with respect to λ. Gives λ(c,x). Use dc/dt and dx/dt after inserting λ(c,x) as alternative differential equations

Example A firm has benefits from pollution given by: (u0 – u)2 The pollution accumulates in nature according to dx/dt = u – δx Damages from accumulated pollution given by –x2. The problem: maxu∫∞ (-x2 - (u0 – u)2)e-rtdt subject to: dx/dt = u – δx , x(0) given Solution Define the Hamiltonian: H= -x2 - (u0 – u)2 + λ(u – δx) The value of u that maximizes the Hamiltonian is given by u = u0 + ½λ for u0 + ½λ > 0. Else u = 0. dλ/dt = rλ + 2x + δλ dx/dt = u0 + ½λ – δx

Computing Steady States gives x* = u0(r+δ)/(1+δ(r+δ)) λ* = –2u0/(1+δ(r+δ)) From these expressions we see (for example) dx*/dr > 0 dx*/dδ < 0

Phase diagram

Phase diagram with paths

Phase diagram with paths and optimal paths for infinite horizon

The same solution seen as a function of time

Phase diagrams with finite vs inifinite time horizon

Optimal Solution for various time horizons

Crucial insight If T is chosen sufficiently large, there will be some value t* such that the optimal solution for the infinite horizon problem and the optimal solution for the finite horizon problem will be numerically indistinguishable over the interval [0,t*]. This allows us to solve infinite horizon problems on the computer

Getting a deeper understanding of Optimal Control Some mathematical background: Boundary value problems. A general class of differential equations. Consider the problem dL/dt =λL, L(0) = L0. You should all know that the solution is: L0eλt. But what about this problem?: dL/dt =λL, L(T) = LT

Solving a boundary value problem dL/dt =λL implies that L(t) = Ceλt for some constant C. This constant is found by using the boundary value condition: CeλT = LT Gives that C = LTe-λT. Therefore L(t) = LTeλ(t-T) . Important. We can not independtly specify both L(0) and L(T). There is only one constant!

More boundary value problems dL/dt =λL, L(0) = 1. dN/dt = N – L, N(1) = 1. Solution: L(t) = Ceλt and N(t) = (λ – 1)-1 (et – eλt)C +etK. We have two constants C and K. Determined by: Ceλ0 = 1 and (λ – 1)-1 (e1 – eλ1)C +e1K = 1 C = 1 and K = (e(λ – 1))-1 (1 + e – eλ – λ) Solution is L(t) = eλt and N(t) = (λ – 1)-1 ((et – eλt) +et-1 (1 + e – eλ – λ)

Why boundary value problems? The solution to an optimal control problem may be written as a boundary value problem. Best seen in finite time problems: max 0∫T U(c,x)e-rtdt subject to x(0) = x0 and dx/dt=f(x,c). Here 0 < T< ∞. x(T) free. The maxmimum principle we know, but look at the transversality condition. λ(T)=0.

An even simpler example Define the Hamiltonian: H= -ax - ½(u0 – u)2 + λ(u – δx) The value of u that maximizes the Hamiltonian is given by u = u0 + ½λ for u0 + ½λ > 0. Else u = 0. dλ/dt = rλ + a + δλ dx/dt = u0 + ½λ – δx

A Philosophical digression The difference between human ecology (AKA economics) and ecology. An ecosystem and its inhabitants are unemcumbered by precognition. Humans are not. In order to understand an ecosystem we need differential equations and initial values. In order to understand human behaviour we need transversality conditions. Humans operate by backwards induction

Numerical methods Standard Optimal Control Problems in the literature do the following: Find explicit solutions. Works for very few problems. Phase diagram. Only works for problems with one state variable. Steady state analysis. May be hard to do for some problems. Some times steay states are uninteresting Alternative: Numerical analysis

Numerical Methods in finite time - Shooting The basic problem; The Maximum Principle gives us a set of differential equations. An optimal solution must start with the known and correct initial value of x(0). It must also start with an unknown correct value of λ(0) such that λ(T) = 0. Alternatively if x(T) is given, λ(0) must start from a value so that those constraint holds. The fundamental problem: Solve a rather complicated equation to find λ(0).

A solution in two steps First write computer code that solves the differential equations for arbitrary values of x(0) and λ(0). (As if we are solving an initial value problem.) Then write a routine that finds the value of λ(0) that sets λ(T) = 0, (or x(T) to the required value). Luckily, there are ways of doing this without writing much code. Using solver functions in Excel BVP4C routine in Matlab Both methods work well, but may have to be tweaked.

Step 1. The 4th order Runge – Kutta Method General formulation. For an OC problem the vector y = [x, λ]. Let dy/dt = f(t, y). Let h be a small number. Then y(t) is usually well approximated by the following sequence: y(t+h) = y(t) +(h/6)×(k1 + 2k2 + 2k3 + k4) k1= f(t, y(t)), k2=f(t + h/2, y(t) +hk1/2) k3=f(t + h/2, y(t) +hk2/2), k4= f(t + h, y(t) + hk3)

Starting Example Let dy/dt =y (1- y), y(0) = ½. The solution to this differential equation : y(t) = Exp(t)/(1 + Exp(t)) No difference whatsoever!

Setting up the differential equations for a control problem We will return to our previous example. The differential equations are: dλ/dt = rλ + 2x + δλ dx/dt = u0 + ½λ – δx Start Excel. Click on Sheet Tab to View Code Open a module (Not class module! Write code

May look like this:

Implemenent Runge Kutta in Spreadsheet. Time to load spread sheet Small Optimal Control Example

Step 2 – Finding λ(0) May in principle be done by programming some suitable search algorithm. We are going to let Excel take care of it. Two ways of doing this Goal Seek Function. Slow, robust, only handles problems with one state variable The Solver. Fast, will stop if the algorithm encounters errors, Handles a large number state variables. Let’s do it.

Alternative Use Matlab BVC4P function. Not really better or more robust. Good for when a large number of problems must be solved. Also, if the initial guess is far off, all solvers crash. BVC4P is good to generate a sequence of solutions that converges to the problem that one actually wants to solve.

Using Numerical Analysis Optimal Vaccination of non-persistent epidemics Very policy relevant Economists have made very limited contributions Shows the power of numerical analysis when out standard tool kit breaks down Solutions programmed in Matlab

Typical trajectory after outbreak – No vaccination McKendrick-Kermac model Suceptibles x, infected y and recovered/dead z dx/dt = –βxy, x(0) = N – ε dy/dt = βxy – γy y(0) = ε = ininital infected population dz/dt = γy, z(0) = 0 Essensial paramter γ/β.

Trajectory without vaccination Note the effect of reducing suceptibles before outbreak. - More suceptibles at the end of an epidemic episode. - Fewer infected

Model with vaccination Individuals may be vaccinated u. dx/dt = –βxy – u, x(0) = N – ε dy/dt = βxy – γy y(0) = ε = ininital infected dz/dt = γy + u, z(0) = 0 Objective function: ∫(-wy - ½cu2)e-rtdt K is the cost of disease . ½cu2 is the cost of vaccination Must be solved numerically. Standard tools of optimal control useless.

Optimal vaccination - Low cost of disease (w)

Optimal vaccination - High cost of disease (w)

The value of reducing the stock of suceptibles The shadow price of x multiplied by -1 is the value of vaccinating one ”population unit.” The graph show the shadow price on x for different values of w What does it mean that some of the curves are not monotonic? Succeptibles at outbreak

Explaining “increasing returns” ”Brush fire” effekt. At high levels of x, the disease spreads so rapidly that the return on vaccination prior to outbreak is reduced. Flow with the punch (relatively speaking) becomes optimal strategy. High disease costs reduces brush fire effect

New Section Multiple Equilibria Many systems exhibit non-linear dynamics. May or may represent a technical challenge Here we look at convexo-concave differential equations. Important to note that systems that naturally exhibit multiple equilibria may not do so when controlled optimally

Example – Eutrophication Let x be the nutrient (phosphoros and nitrogen) loading in a lake. Let u be the deposition of nutrients. The ecologists claim that the dynamics of the lake may be reasonably modeled by: Analysis taken from W.A. Brock and D. Starrett and K-G Mäler, A. Xepapadeas, A.de Zeeuw

Dynamics with low loading (u)

The effect of increased (constant) loading

The flip is irreversible even if u is set to zero!

Management For economic analysis we need some evaluation of consequences. Instantaneous benefits from nutrient use given by ½ln(u) Instantaneous damages from eutrophication given by –cx2.

The optimal management problem

Optimality Conditions

Transforming into equations in (u,x) space

Draw differential equations We proceed to draw a phase diagram in (x,u) space.

Phase diagram – Low loading optimal

Phase diagram – High loading optimal

Phase diagram –Three steady states

Words of warning The differential equation for dλ/dt was wrong on the slide (but correct in the calculations). The right one is: When doing numerical analysis, you should still use your brain! I didn’t and it took me ages to figure out why numeric methods didn’t work on the eutrophication example. Can you see why it doesn’t work?

Modify the Eutrophication model

High cost of eutrophication – Only oligotrophic Equilibrium

Intermediate cost – The Skiba story

High cost of eutrophication B

Oh dear! For low values of x we have two solution candidates! Which one to choose. I havn’t proved this formally (although it may be worth a try) that as long as we choose a finite horizon, then the we stay close to A manifold in order to satisfy transversality conditions. I therefore believe this is the relevant manifold

Genetic Management Many people are concerned with biodiversity. I am, for various reasons, not. But I am concerned about outcomes. Harvesting living resources leads to evolutionary genetic selection. How to regulate this? Also, an example of non-standard analysis.

Objective Construct a bioeconomic model where we analyze the effect of selective harvesting on genetic frequency for one specific gene in terms of the socially optimal long-term management of the resource. This object is determined solely through the profits generated by harvesting

Genetic Dynamics Standard model of Mendelian genetics Two alleles, A and a of the same gene. The homozygotes AA and the heterozygote Aa are of phenotype G (for good) The homozygote aa are of phenotype B (for bad) The frequency of a is q. (q = 1 is bad. q = 0 is good.) Only individuals of type G are of commercial interest, and harvesting is totally selective

The model Determination of phenotype frequency is determined by gene frequency. Population dynamics. In this model B will lose in the absence of harvesting.

Comparing Phenotype Productivity Let hG = xGE and hB = 0. One can calculate the difference in productivity between B and G. The difference is:

Genetic Dynamics Changes in x = xB + xG is determined by Because of evolutionary pressure imposed by selection, the Hardy-Weinberg law may not apply and q will be a non-constant function of time. q is determined by Where s is measure of the extent to which natural selection is acting to reduce the relative contribution of a given genotype to the next generation. Note that it is also the productivity difference!

Optimal Management Focus on how selection affects the dynamics of q. Assume a sole owner that wants to maximize the discounted profits from harvesting the resource. Harvesting is assumed costless and valued at an exogenous price Gives objective function

Optimization Problem s.t

Solution We had to invent some stability analysis. From the expression for , s = 0 in steady state. From the expression for : Inserting these expressions into yields a key expression (q), which happens to be a 4th order polynomial.

Results No explicit solution, but the structure of the problem enables us to determine steady state values of q. Also possible to find the stability properties of the steady states. Boils down to the initial magnitude of q and the relationship between:

Results, key expressions the intrinsic productivity of phenotype B, measures how fast phenotype B regenerates gives the selection in absence of harvesting or the rate at which nature selects for phenotype G. Can also be interpreted as the internal rate of return on preserving phenotype G the discount rate measuring the opportunity cost of capital

Case 1 q* is a stable steady state.

Case 2 Optimal to let the gene A become extinct

Case 3a Optimal to let the gene a become extinct

Case 3b q1 is a Skibapoint, i.e., unstable steady state. q2* is a stable steady state

Case 4 Optimal to conserve A at low levels of q because the low productivity of phenotype B, but at some point the low internal rate of return makes the extinction of phenotype G optimal

The value function We have our original OC problem: maxc 0∫T U(c,x)e-rtdt subject to x(0) = x0 and dx/dt=f(x,c). The value that we get depends on x0. Thus we have J(0,x0,T) = maxc 0∫T U(c,x)e-rtdt . The function J(0,x0,T) is called the value function. It is what we get when our system is run as optimized.

The embedded problem Now consider the problem: maxc t∫T U(c,y)e-rsds subject to y(t) = x and dy/dt=f(y,c). What have we done? Renamed the state variable from x to y. A new arbitrary starting point t rather than zero. (t> 0) Have a new initial condition y(t) = x. We now have a new value function J(t,x) =J(0,x) e-rt.

What is J(t,x)? It is the same problem as the original, except that we have an arbitrary starting point. This function has some economically important properties. Shadow price: Interpretation: If somebody gives you a present of one unit of x at time t, what is the value of that present.

Another property The value of living. This has an important interpretation. At every point in time you eat the Hamiltonian. This has given rise to the interpretation of the Hamiltonian as the Net National Product. See recent book by Weitzman. HΔt = U(c,x) Δt + λf(c,x) Δt The value of Consumption today The marginal value of “capital” The amount of investment

Some useful extensions of deterministic control theory (1) Variable final time. Maxc,T 0∫T U(c,x)e-rtdt subject to x(0) = x0 and dx/dt=f(x,c). T is a choice variable. Extra condition required to fix T. It turns out that Necessary condition is then H(T,c(T), x(T)) = 0

Very simple example Maxu,T ∫T(–9 – ¼u2)dt subject to dx/dt=u and x(0) = 0 and x(T) = 16. Hamiltonian H=(–9 – ¼u2)+λu. Maximising H w.r.t. implies u = 2λ. dλ/dt = -dH/dx = 0 → λ = constant K. dx/dt=u = 2λ = 2K → x(t) = 2Kt + C. We now have two constants, K and C, that we need to determine. We cant use λ(T) = 0 as we have constrained x(T).

Therefore… x(t) = 2Kt + C. → x(0) = 0 → C = 0. x(T) =16 = 2KT → K = 8/T. We then have that λ =K = 8/T and u = 2×K = 16/T Now find optimal T. The Hamiltonian is now H*= -9 - ¼(16/T)2+ 2×(8/T)2 = 0 Solving this for T gives T=8/3

Some useful extensions of deterministic control theory (2) Scrap value problems - a complicated problem finite horizon problem. E.g. Maxc,T 0∫T U(c,x)e-rtdt +S(x(T)) e-rT. subject to x(0) = x0 and dx/dt=f(x,c). Conditions: The same conditions on c and λ. Different transversality conditions: λ(T) = S’(x(T)) e-rT. U(c,x)e-rt + λf(x,c) = r S(x(T)) e-rT .

A closer look at these conditions Why are these conditions not surprising? λ(T) = S’(x(T)). Says that the shadow price of x should be continuous. U(c,x) + λf(x,c) = r S(x(T)) . Says that the change in utility from continuing one more unit of time should be the same as the loss from not cashing in on the scrap value. Remember that U(c,x)e-rT + λe-rTf(x,c) = J’(T,x).

Load Excel file

Deterministic Threshold Problems Irreversible case. A threshold is here defined to be some curve in state space such that crossing this curve leads to a discrete jump in state-variables. Preliminary observation; if the location of the threshold is known one can choose to cross the threshold or not cross the threshold. An irreversible threshold effect can be modeled as a scrap value problem with endogenous time horizon

Mathematical description From Seierstad and Sydsæter Theorem 3 and Note 2, pp 182. Let x be of dimension n Let the threshold be given by R(x) = 0 Let τ be the first point in time when the threshold is reached. We assume that is R(x(τ)) = 0. Let the threshold effect be x(τ+)-x(τ) = g(x(τ)) x(τ+) =x(τ-) = limt→τx(t) from below x(τ+) = limt→τx(t) from above

Formulating the Optimization Problem(s) - 1 Going over the threshold Maxc,τ 0∫τ U(c,x)e-rtdt + S(x(τ)+ g(x(τ))) e-rτ subject to x(0) = x0, dx/dt=f(x,c) and R(x(τ)) = 0. Here we should think of S S(x(τ)+ g(x(τ))) as a value function. In fact S(x) is given by Maxc,τ τ∫∞ U(c,y)e-rtdt subject to y(0) = x and dy/dt=f(y,c).

Solve the problem recursively From Maxc τ∫∞ U(c,y)e-rtdt subject to y(0) = x and dy/dt=f(y,c) we get the shadow price λ(x|τ).

Optimality conditions (Present value) u maximizes the Hamiltonian dλ/dt = –∂H/∂x λ(τ) = λ(x(τ)|τ)(1+g’(x)) + γ∂R/∂x Note that all these may be vectors If τ lies in (0, ∞), then H + ∂(S(x(τ)+ g(x(τ))) e-rτ)/∂t = 0

Optimality conditions (Present value) u maximizes the Hamiltonian dλ/dt = –∂H/∂x λ(τ) = λ(x(τ)|τ)(1+g’(x)) + γ∂R/∂x Note that all these may be vectors If τ lies in (0, ∞), then H + ∂(S(x(τ)+ g(x(τ))) e-rτ)/∂t = H – r(S(x(τ)+ g(x(τ))) e-rτ) = 0.

Formulating the Optimization Problem(s) - 2 Going over the threshold Maxc,τ 0∫τ U(c,x)e-rtdt + S(x(τ)))e-rτ subject to x(0) = x0, dx/dt=f(x,c) and R(x(τ)) = 0. Again we should think of S(x(τ)) as a value function. In fact S(x) is given by Maxc,τ τ∫∞ U(c,y)e-rtdt subject to y(0) = x, dy/dt=f(y,c) and R(x(t)) = 0.

Solve the problem recursively From Maxc τ∫∞ U(c,y)e-rtdt subject to y(0) = x, dy/dt=f(y,c), and R(x(t)) = 0. We get the shadow price λ*(x|τ). Note: Here I have excluded the possibility that R(x(t)) ≠ 0 for some t > τ. It is however perfectly possible that we may “turn away” from the threshold at some point in time. In particular if we are studying a finite horizon problem

Optimality conditions (Present value) u maximizes the Hamiltonian dλ/dt = –∂H/∂x λ(τ) = λ*(x(τ)|τ) + γ∂R/∂x Note that all these may be vectors If τ lies in (0, ∞), then H + ∂(S(x(τ))) e-rτ)/∂t = 0

A simple threshold model (Nævdal 2003) Let dx/dt = u – δx. Let the threshold be x’. Let B(u) be maximised at B(u*) for u* > 0. Let the let the utility function be A + B(u) if the threshold is not crossed and let the benefit function be B(u) if it is crossed.

Avoid Crossing the threshold? Let Te be the time at which the threshold is crossed. If we do not cross the threshold, then after getting to x(Te) = x’ we must freeze u. Thereafter there is no need to reduce u so u = δx’ for all t > Te. Then Se(x(Te))e-rTe = e-rTeTe∫∞(A+B(u*))e-rtdt

Crossing the threshold? Let Ta be the time at which the threshold is crossed. If we cross the threshold, then we accept the damage. Thereafter there is no need to reduce u so u = u* for all t > Ta. Then Sa(x(Ta))e-rTa = e-rTaTa∫∞B(u*)e-rtdt

Optimality conditions prior to hitting the threshold Hamiltonian is the same for both problems H = (A + B(u))e-rt + p(u –δu) Gives that B’(ui)e-rt + pi = 0 dpi/dt = δpi → pi(t) = Cie-δt x(Ti) = x’ i = a,e The shadow prices after hitting or crossing the threshold are both zero. u is a decreasing function of time. Pollute early and then reduce

Note Both are problems with variable final time and the requirement that xi(T) = x’. We need condition to find Ti.

Going over the threshold Condition for optimal Ta. We can prove that ua is discontinuous at Ta.

Optimal paths of ua and xa

Staying on the edge Condition for optimal Te. This condition can be used to prove that ue(t) is continuous at Te.

Optimal paths of ue and xe

Important to note There are kinks (ue) or jumps (ua) in the optimal paths of the control. The previous literature on deterministic thresholds had overlooked: That a threshold could either be crossed or observed That optimal controls may be discontinuous at the time the threshold is crossed. (In some models this can happen when the threshold is reached even if it is not crossed.)

Comparing Scenarios I said no proofs, but this one is rather instructive and a good example of how simple proofs can give interesting results

Incorporating uncertainty into optimal control Most processes are subject to uncertainty One class is Brownian motion. I will not talk about that at all. Catastrophic events. Tsunamis Floods Car breakdown

Categories of catastrophic uncertainty Endogenous vs. exogenous risk Some processes we can (partially) control. This is endogenous risk. Some processes are truly beyond our abilities. Exogenous risk. State-space distributed risk vs time-distributed risk. State space distributed risk requires movement through state space. (thresholds) Time distributed risk depends on location in state- space. For a fixed location in state-space, uncertainty is invariant with respect to time. This will perhaps become clearer. (I hardly understand it myself)

Preliminaries – Poisson processes in continuous time We are here concerned with events that occur at random points in time. In order to deal with these problems they must have some kind of distribution that we actually know. We shall refer to the point in time when the event occurs as τ (tau) and it is distributed over a subset of the positive real numbers [0, β) where 0 < β ≤ ∞. The pdf is g(t) and the cdf is G(t) = 0∫tg(s)ds.

Poisson processes and conditional updating. Our process starts t = 0. We make it to t* >0. What is the distribution of τ conditional on us having made it to t*? Answer: We need optimality conditions that reflect updating of the distribution as long as τ does not happen.

The hazard rate - A very useful concept We ask the question; What if we have made it to time t? What is the probability that τ occurs in the time interval (t, t + Δt)? If dt is small, then that probability is roughly Pr(τ (t, t + Δt))=g(t|τ>t)×dt. g(t|τ>1) is of course:

Hazard rate μ(t)– Formal definition

Example - the exponential distribution Let μ0 be a positive constant number. g(t) = μ0e-μ0t. → G(t) = 1 - e-μ0t. Then μ(t) = μ0. We have a constant hazard rate. This is only true for the exponential distribution. Interesting fact: E(τ - t) = t∫∞sµe-µsds = 1/µ. The expected conditional waiting time is constant We can in principle calculate hazard functions for any g(t). They are frequently messy.

We have the hazard rate – What is the pdf? A very useful property. Let μ(t) be any function that is: continuous and positive for all t has the property that 0∫βμ(t)dt=∞. Then we can construct a cdf by solving the following differential equation. μ(t) = y’(t)/(1 – y(t)), y(β) = 1

Finding a pdf μ(t) = y’(t)/(1 – y(t)), y(β) = 1 implies that y(t) = 1 – C×Exp(-∫tμ(s)ds) Here C is a constant determined by the boundary condition. For many choices of hazard rate this constant will be equal to 1. Counter example: μ(t) = Exp(t) Having found y(t), the cdf, we can find the pdf y’(t) = C×μ(t)×Exp(-∫tμ(s)ds)

Example – linear hazard rate Assume that μ(t) = μ0t defined for t € [0, ∞). Then Exp(-∫tμ(s)ds) = ½μ0t2. Therefore y(t) = 1 – C× Exp(-½μ0t2) . y(∞) = 1 implies that C = 1, so the cdf is 1 – Exp(-½μ0t2) and the pdf is y’(t) = μ0t×Exp(-½μ0t2) .

Hazard rate continued The previous results also hold if the hazard rate can be written μ(t) = φ(x(t)) (or φ(t,x(t)) for that matter) where x(t) is any function of t. This holds as long as: φ(x(t)) only takes positive values The integral ∫φ(x(t))dt does not converge Thus we can work with very general density pdfs over time over time: φ(x(t))Exp(- ∫tφ(x(s))ds)

Exogenous vs endogenous uncertainty Slightly confusing literature. Here the difference is as follows. If φ(x(t))is the hazard rate and x(t) is determined by a controllable differential equation, then the stochastic process is endogenous. If not, the process is exogenous his is not clear cut. Hurricanes may be (in part) endogenous to US policymakers but exogenous New Orleans

Controlling exogenous catastrophic uncertainty Basic problem: Nature or (somebody we can’t affect) triggers a catastrophic event. We can not control the probability of the event occurring, but we can control: preparedness (what to do before the event) consequences (how to act after the event) No strict boundary between preparedness and consequence management.

The mathematical formulation In infinite time: Max E{∫∞(U(x,u)e-rt)dt + h(x(τ-)) subject to dx/dt=f(x,u) for all t ≠ τ x(τ+)-x(τ-) = g(x(τ-)) τ distributed μe-μt over [0, ∞) g(x(τ-)) is the physical description of the event h(x(τ-)) is the instantaneous value of the shock. x(τ-) = limt→τx(t) from below x(τ+) = limt→τx(t) from above

Extreme example A tsunami in a community We have utility before the disaster. Depends on consumption and stock of capital. The instantaneous cost of the disaster, depends on stochasticity and the stock of preparedness capital. Utility after the disaster, depends on consumption and the stock of capital that survived the disaster

Solving the problem a recursive algorithm First find the optimal policy after the disaster. Find optimal policy before the disaster. Sounds pretty simple…

Post-disaster control There is no stochasticity. Simply solve the following problem: J(t,x|τ = t)ert = maxu ertt∫∞U(u,y)e-rsds subject to y(0) = x and dy/dt=f(y,u). This looks like our old friend the embedded problem. Here t and x are arbitrary. The indicate the possible state(s) of the world after a disaster. The notation J(t,x|τ = t) indicates that J is evaluated conditional on τ happening at time t.

Pre-disaster control From the post disaster control we will need J’x(t,x|τ = t)ert = λ(x|τ). This is the shadow price of x at the instant after the catastrophic event occurs. In this problem, that is really all we need from the post-disaster program.

Pre-disaster program- Necessary conditions Hazard rate depends on t. Define the Hamiltonian: H = U(u,x) + λf(u,x) + µ(t)(J(x+g(x)|τ) –J(x) + h(x)) Utility today Hazard rate Marginal value of X × increase in x Change in utility if Catastrophic event occurs

Optimality Conditions Apply the maximum principle to the Hamiltonian. u =argmax H dλ/dt = rλ - ∂H/∂x dx/dt=f(x,u) Transversality condition as in deterministic models.

The differential equation for λ dλ/dt = rλ - ∂H/∂x = rλ - ∂U/∂x - λ∂f/∂x - µ((∂J(x+g(x)|τ)/∂x –∂J(x)/∂x)+h’(x)) But ∂J(x+g(x)|τ)/∂x = λ(x|τ)(1 + g’(x)) and ∂J(x)/∂x = λ, so: dλ/dt = rλ - ∂U/∂x - λ∂f/∂x + µ(λ - λ(x|τ)(In + g’(x)) – h’(x)) Note: In is the identiy matrix where n is the number of elements in x(t)

Example Let U(x,u) = -ax - ½(u0 – u)2 and f(x,u) + u – δx Let τ be exponentially distributed with hazard rate µ. Let x(τ+)-x(τ-) = βx(τ-). h(x) = 0. Could be a model of pollution where there is a possibility that a shock increases the stock of pollutants.

Optimality conditions after event H= -ax - ½(u0 – u)2 + λ(u – δx) The value of u that maximizes the Hamiltonian is given by u = u0 + λ for u0 + λ > 0. Else u = 0. dλ/dt = rλ + a + δλ dx/dt = u0 + λ – δx dλ/dt = rλ + a + δλ → λ(t|τ) = -a/(r + δ) for all t. We make note of λ(t|τ) and proceed to the pre-event control

Pre event conditions The value of u that maximizes the Hamiltonian is given by u = u0 + ½λ for u0 + ½λ > 0. Else u = 0. Same as before! dx/dt = u0 + ½λ – δx. Same as before! dλ/dt = rλ + a + δλ +µ(λ – (–a(1+β)/(r + δ))) Here is the difference in the conditions. Obviously this implies that u and x will be affected by the risk

We can solve for λ Solution is: Here K is a constant that must be determine by transversality conditions. We just not that K ≠ 0 implies no convergence to steady state so we just set K = 0. Note that if β = 0, then pre-event solution is same as post-event

The we have that…. The optimal value of u(t) is found by inserting λ(t) into u0 + ½λ. The optimal value of x(t) is fund by integrating dx/dt = u0 + ½λ – δx I will not do this as the resulting expression is a bugger. Numeric solutions are straightforward to find

Entering Equations in Excel

Result

The effect of uncertainty – Let us increase µ

Weird hazard rates The setup is good enough to include time dependent hazard rates. That is, the hazard rate depends on time. Let us consider a hazard rate a×(Sin(t) + 1). This implies a cdf given by 1 – exp(-1-t+Cos(t)). Cdf looks like this:

Pre-disaster program- Necessary conditions Hazard rate depends on t. Define the Hamiltonian: H = U(u,x) + λf(u,x) + µ(t)(J(x+g(x)|τ) –J(x) + h(x)) Utility today Hazard rate Marginal value of X × increase in x Change in utility if Catastrophic event occurs

Optimality Conditions Apply the maximum principle to the Hamiltonian. u =argmax H dλ/dt = rλ - ∂H/∂x dx/dt=f(x,u) Transversality condition as in deterministic models. NO CHANGE!

The differential equation for λ dλ/dt = rλ - ∂H/∂x = rλ - ∂U/∂x - λ∂f/∂x - µ(t)((∂J(x+g(x)|τ)/∂x –∂J(x)/∂x)+h’(x)) But ∂J(x+g(x)|τ)/∂x = λ(x|τ)(1 + g’(x)) and ∂J(x)/∂x = λ, so: dλ/dt = rλ - ∂U/∂x - λ∂f/∂x + µ(t)(λ - λ(x|τ)(In + g’(x)) – h’(x)) Note: In is the identiy matrix where n is the number of elements in x(t)

Modifying the code (Hazard rate is actually Sin(2*t – 1)+1) Old code Tiny change

Results

Endogenous risk – Time distributed Recall that we know face a controllable hazard rate. The hazard rate depends on a state variable

Endogenous time distributed risk Here the hazard rate will depend on the state variable. µ = µ(x). Example µ(x) = µ0x Remember that x is a function of t because it is determined by an differential equation This is the pdf

Endogenous risk continued Say that x(t) is driven by dx/dt = - ax + b, x(0) = X. Then x(t) = e-at(X – b/a) + b/a. The hazard rate is then transformed from a function of x to a function of time. µ = µ0×(e-at(X – b/a) + b/a) Illustration for different starting points of x(0)

Controlling Endogenous Uncertainty The general problem

Recursivity - Working yourself backwards As before. First solve the post disaster problem. Get the shadow prices

Post-disaster control (Copy of previous slide) There is no stochasticity. Simply solve the following problem: J(t,x|τ = t)ert = maxu ertt∫∞U(u,y)e-rsds subject to y(0) = x and dy/dt=f(y,u). This looks like our old friend the embedded problem. Here t and x are arbitrary. The indicate the possible state(s) of the world after a disaster. The notation J(t,x|τ = t) indicates that J is evaluated conditional on τ happening at time t.

Pre Disaster Solution Define the risk Augmented Hamiltonian Then apply the maximum principle Note: λ is here the hazard rate and µ is the shadow price

Applying the Maximum Principle Find the control that maximizes the Hamiltonian and use d Compared to the case with exogenous risk we have an additional term. If risk is exogenous, we have that λ’(x) = 0

A closer look at dµ/dt As in the exogenous risk case, we have that We have J(x+g(x)|τ) from the post-disaster problem. But what about J(x)?

I turns out that: J(x) = z is given by: So we can rewrite dµ/dt as: Additional transversality condition z(T) = 0 (Why?)

A pedagogic problem After inserting for the optimal u, we have three differential equations in x, µ and z. Tough to do. In fact, I know of no application where one can find a closed form solution or even a reasonably manageable steady state. Numerical methods are all that is left. Oh, and general statements.

The Problem we are going to solve

The post event solution Really simple. No more damage from x so post event shadow price is zero. This implies that u = u0 for all t > τ. J(x|t) = -K/r

The pre-event problem Hamiltonian given by H= α – ½(u0-u)2 + λ(u – δx) + µ×x×(– K/r - z) Apply maximum principle to get: u=argmax H = u0 + λ dλ/dt = (r+δ)λ + µxλ – µ(–K/r – z) dz = rz + ½(u0-u)2 – µ(-K/r – z) Transversality conditions λ(T) = z(T) = 0

The code

The results

Play around to deal with instability Load Excel file

Endogenous, state-space distributed risk – AKA stochastic threshold We move a state variable through time. If we move in the “right” direction we may trigger a threshold effect. x(t) Possible threshold locations t Risky time segment

Preliminary math. We all know the chain rule. We should also know that

Let f(x) be a distribution If the event x = x’ is distributed f(x) over [a, b] and… x(t) is such that x’(t)  0 for all t and Range[x(t)] = [a, b] Then the cdf for the event x = x’ and the event x(τ) = x are interchangeable. The pdf for the event is distributed over time is f(x(t))x’(t). Regardless of whether Range[x(t)] = [a, b], the hazard rate is

The hazard rate continued Note how the hazard rate depends on the rate of increase in x. dx/dt = 0 implies that the hazard rate is zero.

Example. The hazard rate in state-space is linear The hazard rate is linear in state space. This is the pdf: Assume that x’(t) = -ax(t) + b

However, the hazard rate in time behaves differently x(t) Hazard rate as a funtion of time t

Properties of the hazard rate If we move away from the threshold, the hazard rate is zero If we let x take values that have happened before, then the hazard rate is also zero.

Optimal control Use exactly the same conditions as with endogenous time distributed problems, but with modified hazard rate. Technical note: Previously the hazard rate depended only on state variables. Now it will in general also depend on control variables. (Why?) This does not matter as long as we can restrict ourselves to look at continuous controls,

Example Same as before. However, this time we will consider a threshold problem.

The Problem we are going to solve

The post event solution Same as before. No more damage from x so post event shadow price is zero. This implies that u = u0 for all t > τ. J(x|t) = -K/r

The pre-event problem Hamiltonian given by H= α – ½(u0-u)2 + λ(u – δx) + µ× (u-δx)(– K/r - z) Apply maximum principle to get: u=argmax H = u0 + λ - µ(K/r + z) dλ/dt = (r+δ)λ + µ(u – δx)λ +µδ(– K/r - z) dz = rz + ½(u0-u)2 – µ(u – δx)(-K/r – z) Note the difference Hazard rate now directly affects control! Derivative of hazard rate Optimal u from maximization

Analytics One can actually compute steady states here. In steady state x’(t) is zero so lots of stuff disappears. To whit: u= u0 + λ - µ(K/r + z) dλ/dt = (r+δ)λ - µδ( K/r + z)=0 dz/dt = rz + ½(u0-u)2 = 0 dx/dt = u – δx = 0

Steady state solution Note the following: K = 0 implies x and u equal to unregulated levels Otherwise K > 0 implies small x and u. K large enough implies x and u less than zero in steady state.

A caveat This solution assumed that x(0) ≤ xss. Otherwise we get negative hazard rate. What happens if we have x(0) > xss Answer: Freeze x at x(0) indefinitely.

Application – Disintegration of the Western Antarctic Ice Sheet Oppenheimer (1998) estimates that a WAIS disintegration could increase sea levels by as much as 4-6 meters. Oppenheimer evaluates that there is a threshold temperature increase above pre-industrial levels where this event could occur. This threshold lies in the range 2.5° to 8° degrees Celsius. What does this imply for human welfare?

Consequences of WAIS disintegration Considerable impact on real state markets. For some areas such as Bangladesh, the picture is even grimmer

Some relevant geoscience There are several GHGs that contribute to global warming. Methane (1/5) and Carbon (3/5) being the most important. Because of different retention periods in the atmosphere they must be treated separately. Let c be carbon and m be methane. Then there is a threshold defined by: ac(t) + bm(t) = A + .   The functions c and m are determined by and A +  is the temperature threshold,  is a random variable. Emissions are given by ui

Possible paths of the system The Main Technical Problem.   How to translate the event that the threshold is crossed from a probability distribution in State-space to a probability distribution over time? Answer: Change of variable techniques from integration theory.

Steady states are calculated in the paper The Economy The instantanous cost of emission reduction The cost of crossing Wais disintegration: g = G if threshold is crossed, g = 0 otherwise We can now state society’s optimization problem as follows: Steady states are calculated in the paper

Optimal Paths

Optimal Stopping in Exogenous Risk Problems

Some numerical comparative dynamics

The Economics of the Thermohaline Circulation – A Problem with two Thresholds

Some Unpleasant Facts and Possibilities Historical Geophysical data suggests that the Thermohaline Circulation has been disrupted in the past. Scientific results suggests that this may be triggered by Global Warming. The consequences, although uncertain, will be a bugger. They include but are not limited to: Regional disruptions in weather patterns Permanent regional climate change May trigger other global catastrophic events.

Some Science Global Warming will affect ocean temperatures and salinity which are the main drivers of the thermohaline circulation. THC disruption depends on temperature levels and the rate of change in temperature. If temperature levels or rates of temperature change exceed certain thresholds a shutdown may occur. The location of these thresholds are unknown.

A Simple Model of Global Warming c – Atmospheric carbon concentration F – Increase in global temperature average u – CO2 emissions If F or increase above certain thresholds, the THC collapses.

Illustration of the Risk Structure

The Economic Problem(s) What is the optimal CO2 emission pattern in the presence of this risk? How to optimize such a system? Piecewise Deterministic Optimal Control! Some technical issues must be resolved before optimization can be done. The events distributed in state space must be transformed so that they are distributed over time. A distribution for the minimum of these events must be derived.

The Optimization Problem Climate cost. 0 if THC is OK, G if THC collapses Cost of reducing emissions below unregulated emission level u0.

The Optimization Problem contnd. Subject to the differential equations and two concurrent stochastic processes. As long as both F and dF/dt are increasing the hazard rate is the sum of two hazard rates. When only F is increasing, only this hazard rate is active

The Hamiltonian for the present problem First terms are the standard Hamiltonian This term is the expected loss/gain from disaster in the interval (t, t + dt)

The Optimal Path

Optimal Time path for Temperature

Optimal stopping We have a risky process and we wonder when to stop. In deterministic problems, we use H = 0. Here too. In Finite time: Max E{∫T(U(x,u)e-rt)dt + h(x(τ-))e-rτ + G(x(T))e-rT subject to dx/dt=f(x,u) for all t ≠ τ x(τ+)-x(τ-) = g(x(τ-)) τ distributed μe-μt over [0, ∞)

A quick word about the value function If we have a scrap value, then we have that J(T,x(T)) = Scrapvalue. Intuitively obvious, but needs to be pointed out

Optimal Stopping (Eksogenous Risk) Apply the maximum principle to the Hamiltonian: H = U(u,x) + λf(u,x) + µ(t)(h(x)e-rt +J(x+g(x)|τ) –J(x)) u =argmax H dλ/dt = rλ - ∂H/∂x λ(T) = ∂G(x(T))e-rT/∂x dx/dt=f(x,u)

Optimality Conditions The Stopping Condition U(u,x) + λf(u,x) +∂ G(x(τ-))e-rτ/∂t – µ(t)(h(x)e-rt +J(x+g(x)|τ) –G(x(T))e-rT)

Nice ting to know about hazard rates If we have two stochastic events τ1 and τ2, with hazard rates λ1 and λ2, then…. The event τ = (τ1 , τ2) has a hazard rate λ = λ1 + λ2. Practical as it enables us to treat two stochastic processes as one!

Optimal harvesting of illegal drugs A drug producer grows marihuana. At optimally chosen time T he sells the stuff at a price q, unless… He gets raided by the cops, in which case he receives a punishment –c per weight unit of weed The local college boys finds the farm and nicks a fraction 1 – b of the drugs. The farmer then sells the remaining.

Mathematical formulation Equations of motion Maximization problem max E((λ1(-cx1e-rτ)+λ2bqx1e-rτ)/λ) +qx1(T)e-rT

The Jump x2 is a just a way of saying that if τ occurs, the dope stops growing

The Maximum Principle We only need p(T) = qe-rT.

The Stopping condition Which can be solved and yield: