
Lecture 3:

Hamilton-Jacobi-Bellman Equations

ECO 521: Advanced Macroeconomics I

Benjamin Moll

Princeton University, Fall 2016


Outline

1. Hamilton-Jacobi-Bellman equations in deterministic settings (with derivation)

2. Numerical solution: finite difference method

Hamilton-Jacobi-Bellman Equation: Some History

(a) William Hamilton (b) Carl Jacobi (c) Richard Bellman


Aside: why called dynamic programming?
Bellman: "Try thinking of some combination that will possibly give it a pejorative meaning. It's impossible. Thus, I thought dynamic programming was a good name. It was something not even a Congressman could object to. So I used it as an umbrella for my activities."
http://en.wikipedia.org/wiki/Dynamic_programming#History
Hamilton-Jacobi-Bellman Equations

Recall the generic deterministic optimal control problem from Lecture 1:

$$v(x_0) = \max_{\{\alpha(t)\}_{t\ge 0}} \int_0^\infty e^{-\rho t}\, r(x(t),\alpha(t))\, dt$$

subject to the law of motion for the state

$$\dot x(t) = f(x(t),\alpha(t)) \quad \text{and} \quad \alpha(t) \in A$$

for $t \ge 0$, $x(0) = x_0$ given.


$\rho \ge 0$: discount rate
$x \in X \subseteq \mathbb{R}^N$: state vector
$\alpha \in A \subseteq \mathbb{R}^M$: control vector
$r : X \times A \to \mathbb{R}$: instantaneous return function
Example: Neoclassical Growth Model


$$v(k_0) = \max_{\{c(t)\}_{t\ge 0}} \int_0^\infty e^{-\rho t}\, u(c(t))\, dt$$

subject to

$$\dot k(t) = F(k(t)) - \delta k(t) - c(t)$$

for $t \ge 0$, $k(0) = k_0$ given.

Here the state is $x = k$ and the control $\alpha = c$

$$r(x, \alpha) = u(\alpha)$$

$$f(x, \alpha) = F(x) - \delta x - \alpha$$
Generic HJB Equation

How to analyze these optimal control problems? Here: "cookbook approach"

Result: the value function of the generic optimal control problem satisfies the Hamilton-Jacobi-Bellman equation

$$\rho v(x) = \max_{\alpha\in A}\; r(x,\alpha) + v'(x)\cdot f(x,\alpha)$$

In the case with more than one state variable ($N > 1$), $v'(x) \in \mathbb{R}^N$ is the gradient of the value function.
Example: Neoclassical Growth Model

"Cookbook" implies:

$$\rho v(k) = \max_c\; u(c) + v'(k)\big(F(k) - \delta k - c\big)$$

Proceed by taking first-order conditions etc.

$$u'(c) = v'(k)$$
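For completeness, combining this FOC with the envelope condition of the HJB gives the continuous-time Euler equation (a standard manipulation, sketched here rather than spelled out on the slide):

$$\rho v'(k) = v''(k)\big(F(k) - \delta k - c\big) + v'(k)\big(F'(k) - \delta\big) \qquad \text{(envelope condition)}$$

Along the optimal path $\frac{d}{dt}\, v'(k(t)) = v''(k)\dot k = v''(k)\big(F(k) - \delta k - c\big)$, so using $u'(c) = v'(k)$,

$$\frac{d}{dt}\, u'(c(t)) = \big(\rho - F'(k) + \delta\big)\, u'(c),$$

which for $u(c) = c^{1-\sigma}/(1-\sigma)$ gives $\dot c/c = \frac{1}{\sigma}\big(F'(k) - \delta - \rho\big)$.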

Derivation from Discrete-time Bellman

Here: derivation for neoclassical growth model

Extra class notes: generic derivation

Time periods of length $\Delta$

Discount factor
$$\beta(\Delta) = e^{-\rho\Delta}$$
Note that $\lim_{\Delta\to 0}\beta(\Delta) = 1$ and $\lim_{\Delta\to\infty}\beta(\Delta) = 0$

Discrete-time Bellman equation:

$$v(k_t) = \max_{c_t}\; u(c_t)\Delta + e^{-\rho\Delta}\, v(k_{t+\Delta}) \quad \text{s.t.}$$

$$k_{t+\Delta} = \Delta\big(F(k_t) - \delta k_t - c_t\big) + k_t$$
Derivation from Discrete-time Bellman

For small $\Delta$ (will take $\Delta \to 0$), $e^{-\rho\Delta} \approx 1 - \rho\Delta$

$$v(k_t) = \max_{c_t}\; u(c_t)\Delta + (1 - \rho\Delta)\, v(k_{t+\Delta})$$

Subtract $(1-\rho\Delta)\, v(k_t)$ from both sides

$$\rho\Delta\, v(k_t) = \max_{c_t}\; u(c_t)\Delta + (1-\rho\Delta)\big(v(k_{t+\Delta}) - v(k_t)\big)$$

Divide by $\Delta$ and manipulate the last term

$$\rho v(k_t) = \max_{c_t}\; u(c_t) + (1-\rho\Delta)\,\frac{v(k_{t+\Delta}) - v(k_t)}{k_{t+\Delta} - k_t}\,\frac{k_{t+\Delta} - k_t}{\Delta}$$

Take $\Delta \to 0$:

$$\rho v(k_t) = \max_{c_t}\; u(c_t) + v'(k_t)\,\dot k_t$$
Connection Between HJB Equation and Hamiltonian

Hamiltonian
$$H(x, \alpha, \lambda) = r(x, \alpha) + \lambda f(x, \alpha)$$
HJB equation
$$\rho v(x) = \max_{\alpha\in A}\; r(x, \alpha) + v'(x) f(x, \alpha)$$
Connection: $\lambda(t) = v'(x(t))$, i.e. co-state = shadow value

Bellman can be written as $\rho v(x) = \max_{\alpha\in A} H(x, \alpha, v'(x))$ ...

... hence the "Hamilton" in Hamilton-Jacobi-Bellman

Can show: playing around with FOC and envelope condition gives conditions for optimum from Lecture 1 (sketched below)

Mathematicians' notation: in terms of the maximized Hamiltonian $H$
$$\rho v(x) = H(x, v'(x)), \qquad H(x, p) := \max_{\alpha\in A}\; r(x, \alpha) + p\, f(x, \alpha)$$
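Sketch of that argument (standard, included here for completeness): take the FOC and the envelope condition of the HJB and set $\lambda = v'(x)$.

$$r_\alpha(x,\alpha) + v'(x)\, f_\alpha(x,\alpha) = 0 \qquad \text{(FOC = Hamiltonian maximum condition)}$$
$$\rho v'(x) = r_x(x,\alpha) + v''(x) f(x,\alpha) + v'(x) f_x(x,\alpha) \qquad \text{(envelope condition)}$$

With $\lambda(t) = v'(x(t))$ we have $\dot\lambda = v''(x)\dot x = v''(x) f(x,\alpha)$, so the envelope condition becomes

$$\dot\lambda = \rho\lambda - r_x(x,\alpha) - \lambda f_x(x,\alpha) = \rho\lambda - H_x(x,\alpha,\lambda),$$

the co-state equation from Lecture 1.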
Some general, somewhat philosophical thoughts

MAT 101 way (first-order ODE needs one boundary condition) is not the right way to think about HJB equations

these equations have very special structure which one should exploit when analyzing and solving them

Particularly true for computations

Important: all results/algorithms apply to problems with more than one state variable, i.e. it doesn't matter whether you solve ODEs or PDEs

Existence and Uniqueness of Solutions to (HJB)
Recall Hamilton-Jacobi-Bellman equation:
$$\rho v(x) = \max_{\alpha\in A}\; \big\{ r(x,\alpha) + v'(x)\cdot f(x,\alpha) \big\} \qquad \text{(HJB)}$$
Two key results, analogous to discrete time:
Theorem 1: (HJB) has a unique "nice" solution
Theorem 2: "nice" solution equals value function, i.e. solution to sequence problem
Here: "nice" solution = viscosity solution
See supplement "Viscosity Solutions for Dummies"
http://www.princeton.edu/~moll/viscosity_slides.pdf

Theorems 1 and 2 hold for both ODE and PDE cases, i.e. also with
multiple state variables...
... also hold if value function has kinks (e.g. from non-convexities)
Remark re Thm 1: in typical application, only very weak boundary conditions needed for uniqueness (boundedness assumption)
Numerical Solution of HJB Equations

Finite Difference Methods

See http://www.princeton.edu/~moll/HACTproject.htm

Explain using neoclassical growth model, easily generalized to other applications

$$\rho v(k) = \max_c\; u(c) + v'(k)\big(F(k) - \delta k - c\big)$$

Functional forms
$$u(c) = \frac{c^{1-\sigma}}{1-\sigma}, \qquad F(k) = k^\alpha$$
Use finite difference method

Two MATLAB codes


http://www.princeton.edu/~moll/HACTproject/HJB_NGM.m
http://www.princeton.edu/~moll/HACTproject/HJB_NGM_implicit.m
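For concreteness, a minimal parameter block consistent with these functional forms; the values and variable names below are illustrative, not necessarily those used in the posted codes:

sig = 2;                          % CRRA coefficient sigma
al  = 0.3;                        % capital share alpha
del = 0.05;                       % depreciation rate delta
rho = 0.05;                       % discount rate
u   = @(c) c.^(1-sig)/(1-sig);    % utility
F   = @(k) k.^al;                 % production function
kss = (al/(rho+del))^(1/(1-al));  % steady-state capital: F'(kss) = rho + del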

Barles-Souganidis

There is a well-developed theory for numerical solution of HJB equations using finite difference methods
Key paper: Barles and Souganidis (1991), "Convergence of approximation schemes for fully nonlinear second order equations"
https://www.dropbox.com/s/vhw5qqrczw3dvw3/barles-souganidis.pdf?dl=0

Result: finite difference scheme converges to unique viscosity solution under three conditions
1. monotonicity
2. consistency
3. stability
Good reference: Tourin (2013), An Introduction to Finite Difference
Methods for PDEs in Finance.
Finite Difference Approximations to v'(k_i)

Approximate $v(k)$ at $I$ discrete points in the state space, $k_i$, $i = 1, ..., I$. Denote the distance between grid points by $\Delta k$.

Shorthand notation
$$v_i = v(k_i)$$
Need to approximate $v'(k_i)$.
Three different possibilities:

$$v'(k_i) \approx \frac{v_i - v_{i-1}}{\Delta k} = v'_{i,B} \quad \text{backward difference}$$
$$v'(k_i) \approx \frac{v_{i+1} - v_i}{\Delta k} = v'_{i,F} \quad \text{forward difference}$$
$$v'(k_i) \approx \frac{v_{i+1} - v_{i-1}}{2\Delta k} = v'_{i,C} \quad \text{central difference}$$
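To fix ideas, a minimal MATLAB sketch of the three approximations applied to an arbitrary smooth test function (illustrative grid and function, not taken from the posted codes):

I   = 50;  k = linspace(0.5, 5, I)';  dk = k(2) - k(1);
v   = log(k);                                   % any smooth test function
dvB = [NaN; (v(2:I) - v(1:I-1))/dk];            % backward difference (undefined at i = 1)
dvF = [(v(2:I) - v(1:I-1))/dk; NaN];            % forward difference (undefined at i = I)
dvC = [NaN; (v(3:I) - v(1:I-2))/(2*dk); NaN];   % central difference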

Finite Difference Approximations to v'(k_i)

[Figure: forward, backward, and central difference approximations to v'(k_i) illustrated on the grid]
Finite Difference Approximation

FD approximation to the HJB is

$$\rho v_i = u(c_i) + v'_i\,\big[F(k_i) - \delta k_i - c_i\big] \qquad (*)$$

where $c_i = (u')^{-1}(v'_i)$, and $v'_i$ is one of the backward, forward, central FD approximations.
Two complications:
1. which FD approximation to use? Upwind scheme
2. (*) is extremely non-linear, need to solve iteratively: explicit vs. implicit method

My strategy for next few slides:


what works
at end of lecture: why it works (Barles-Souganidis)
Which FD Approximation?
Which of these you use is extremely important
Best solution: use so-called upwind scheme. Rough idea:
forward difference whenever drift of state variable positive
backward difference whenever drift of state variable negative
In our example: define
$$s_{i,F} = F(k_i) - \delta k_i - (u')^{-1}(v'_{i,F}), \qquad s_{i,B} = F(k_i) - \delta k_i - (u')^{-1}(v'_{i,B})$$
Approximate derivative as follows
$$v'_i = v'_{i,F}\,\mathbf{1}_{\{s_{i,F}>0\}} + v'_{i,B}\,\mathbf{1}_{\{s_{i,B}<0\}} + \bar v'_i\,\mathbf{1}_{\{s_{i,F}<0<s_{i,B}\}}$$
where $\mathbf{1}_{\{\cdot\}}$ is the indicator function, and $\bar v'_i = u'(F(k_i) - \delta k_i)$.
Where does the $\bar v'_i$ term come from? Answer:
since $v$ is concave, $v'_{i,F} < v'_{i,B}$ (see figure) $\Rightarrow$ $s_{i,F} < s_{i,B}$
if $s_{i,F} < 0 < s_{i,B}$, set $s_i = 0$ $\Rightarrow$ $v'(k_i) = u'(F(k_i) - \delta k_i)$, i.e. we're at a steady state (one way to code this is sketched below)
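A minimal MATLAB sketch of this upwind choice, assuming CRRA utility with the illustrative parameters from earlier; the treatment of the first and last grid point is just one simple possibility and not necessarily what the posted codes do:

sig = 2; al = 0.3; del = 0.05;
I = 50; k = linspace(0.5, 5, I)'; dk = k(2) - k(1);
v   = -1./k;                                                   % some concave guess, for illustration only
dvF = [(v(2:I) - v(1:I-1))/dk; (k(I)^al - del*k(I))^(-sig)];   % forward diff; upper boundary: c = F(k) - del*k
dvB = [(k(1)^al - del*k(1))^(-sig); (v(2:I) - v(1:I-1))/dk];   % backward diff; same idea at the lower end
sF  = k.^al - del*k - dvF.^(-1/sig);                           % drift s_{i,F} if forward difference is used
sB  = k.^al - del*k - dvB.^(-1/sig);                           % drift s_{i,B} if backward difference is used
dv  = dvF.*(sF > 0) + dvB.*(sB < 0) ...
      + (k.^al - del*k).^(-sig).*(sF <= 0 & sB >= 0);          % v_bar term when s_{i,F} <= 0 <= s_{i,B}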
Sparsity

Discretized HJB equation is

$$\rho v_i = u(c_i) + \frac{v_{i+1} - v_i}{\Delta k}\, s_{i,F}^+ + \frac{v_i - v_{i-1}}{\Delta k}\, s_{i,B}^-$$

Notation: for any $x$, $x^+ = \max\{x, 0\}$ and $x^- = \min\{x, 0\}$

Can write this in matrix notation

$$\rho v = u + A v$$

where $A$ is $I \times I$ ($I$ = number of grid points) and looks like...

Visualization of A (output of spy(A) in Matlab)

[Figure: spy(A) showing the tridiagonal sparsity pattern of A; nz = 136]
The matrix A

FD method approximates the process for $k$ with a discrete Poisson process; $A$ summarizes the Poisson intensities

entries in row $i$:

$$\underbrace{-\frac{s_{i,B}^-}{\Delta k}}_{\text{inflow}_{i-1}\,\ge\,0} v_{i-1} \qquad \underbrace{\frac{s_{i,B}^-}{\Delta k} - \frac{s_{i,F}^+}{\Delta k}}_{\text{outflow}_i\,\le\,0} v_i \qquad \underbrace{\frac{s_{i,F}^+}{\Delta k}}_{\text{inflow}_{i+1}\,\ge\,0} v_{i+1}$$

negative diagonals, positive off-diagonals, rows sum to zero: tridiagonal matrix, very sparse

$A$ (and $u$) depend on $v$ (nonlinear problem)

$$\rho v = u(v) + A(v)\, v$$
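As a toy illustration (hypothetical numbers, three grid points): if the policy, and hence A and u, were held fixed, the discretized HJB rho*v = u + A*v would be linear and could be solved directly:

rho = 0.05;
u   = [1; 2; 3];
A   = [-0.2   0.2   0  ;
        0.1  -0.3   0.2;
        0     0.3  -0.3];        % negative diagonal, positive off-diagonals, rows sum to zero
v   = (rho*eye(3) - A) \ u;      % solves rho*v = u + A*v for this fixed Poisson process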
Next: iterative method...
Iterative Method

Idea: Solve FOC for given $v^n$, update $v^{n+1}$ according to

$$\frac{v_i^{n+1} - v_i^n}{\Delta} + \rho v_i^n = u(c_i^n) + (v^n)'(k_i)\big(F(k_i) - \delta k_i - c_i^n\big) \qquad (**)$$

Algorithm: Guess $v_i^0$, $i = 1, ..., I$ and for $n = 0, 1, 2, ...$ follow
1. Compute $(v^n)'(k_i)$ using the FD approximation on the previous slide.
2. Compute $c^n$ from $c_i^n = (u')^{-1}\big[(v^n)'(k_i)\big]$
3. Find $v^{n+1}$ from $(**)$.
4. If $v^{n+1}$ is close enough to $v^n$: stop. Otherwise, go to step 1.
See http://www.princeton.edu/~moll/HACTproject/HJB_NGM.m
Important parameter: $\Delta$ = step size, cannot be too large (CFL condition).
Pretty inefficient: I need 5,990 iterations (though quite fast); a minimal sketch of the scheme follows
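For reference, a compact self-contained sketch of this explicit scheme (this is not the posted HJB_NGM.m; parameter values, boundary treatment and the step size Delta are illustrative and may need tuning):

sig = 2; al = 0.3; del = 0.05; rho = 0.05;
kss = (al/(rho+del))^(1/(1-al));
I   = 100;  k = linspace(0.1*kss, 2*kss, I)';  dk = k(2) - k(1);
Delta = 0.5*dk;                                % step size; too large violates the CFL condition
v = (k.^al).^(1-sig)/((1-sig)*rho);            % initial guess: u(F(k))/rho
for n = 1:200000
    dvF = [(v(2:I) - v(1:I-1))/dk; (k(I)^al - del*k(I))^(-sig)];  % forward diff + boundary
    dvB = [(k(1)^al - del*k(1))^(-sig); (v(2:I) - v(1:I-1))/dk];  % backward diff + boundary
    dvF = max(dvF, 1e-10);  dvB = max(dvB, 1e-10);                % guard against non-monotone iterates
    sF  = k.^al - del*k - dvF.^(-1/sig);
    sB  = k.^al - del*k - dvB.^(-1/sig);
    dv  = dvF.*(sF > 0) + dvB.*(sB < 0) ...
          + (k.^al - del*k).^(-sig).*(sF <= 0 & sB >= 0);         % upwind choice of v'
    c   = dv.^(-1/sig);
    vnew = v + Delta*(c.^(1-sig)/(1-sig) + dv.*(k.^al - del*k - c) - rho*v);
    if max(abs(vnew - v)) < 1e-8, break; end
    v = vnew;
end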
Efciency: Implicit Method

Efficiency can be improved by using an implicit method

$$\frac{v_i^{n+1} - v_i^n}{\Delta} + \rho v_i^{n+1} = u(c_i^n) + (v^{n+1})'(k_i)\,\big[F(k_i) - \delta k_i - c_i^n\big]$$

Each step $n$ involves solving a linear system of the form

$$\frac{1}{\Delta}(v^{n+1} - v^n) + \rho v^{n+1} = u + A^n v^{n+1}$$
$$\Big(\big(\rho + \tfrac{1}{\Delta}\big) I - A^n\Big) v^{n+1} = u + \tfrac{1}{\Delta} v^n$$

but $A^n$ is super sparse $\Rightarrow$ super fast
See http://www.princeton.edu/~moll/HACTproject/HJB_NGM_implicit.m
In general: implicit method preferable over explicit method (a sketch of one update step follows below)
1. stable regardless of step size $\Delta$
2. needs far fewer iterations
3. can handle many more grid points
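A minimal sketch of one such implicit update, using a hypothetical 3-point A, u and current guess v_n (in practice A and u are rebuilt each iteration from the upwind scheme):

rho = 0.05;  Delta = 100;  I = 3;
u   = [1; 2; 3];
A   = sparse([-0.2 0.2 0; 0.1 -0.3 0.2; 0 0.3 -0.3]);
v_n = zeros(I,1);
B   = (rho + 1/Delta)*speye(I) - A;   % sparse system matrix
v_np1 = B \ (u + v_n/Delta);          % solves ((rho + 1/Delta)I - A^n) v^{n+1} = u + v^n/Delta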
Implicit Method: Practical Consideration

In Matlab, need to explicitly construct A as sparse to take advantage of speed gains
Code has part that looks as follows
X = -min(mub,0)/dk;                  % lower-diagonal entries: -s_{i,B}^-/dk (mub = backward drift)
Y = -max(muf,0)/dk + min(mub,0)/dk;  % diagonal entries: s_{i,B}^-/dk - s_{i,F}^+/dk (muf = forward drift)
Z = max(muf,0)/dk;                   % upper-diagonal entries: s_{i,F}^+/dk

Constructing full matrix: slow


for i=2:I-1
A(i,i-1) = X(i);
A(i,i) = Y(i);
A(i,i+1) = Z(i);
end
A(1,1)=Y(1); A(1,2) = Z(1);
A(I,I)=Y(I); A(I,I-1) = X(I);

Constructing sparse matrix: fast


A =spdiags(Y,0,I,I)+spdiags(X(2:I),-1,I,I)+spdiags([0;Z(1:I-1)],1,I,I);

Non-Convexities

Non-Convexities
Consider growth model
$$\rho v(k) = \max_c\; u(c) + v'(k)\big(F(k) - \delta k - c\big).$$
But drop assumption that $F$ is strictly concave. Instead: "butterfly"
$$F(k) = \max\{F_L(k), F_H(k)\},$$
$$F_L(k) = A_L k^\alpha, \qquad F_H(k) = A_H\big((k - \kappa)^+\big)^\alpha, \quad \kappa > 0,\; A_H > A_L$$
[Figure: the "butterfly" production function f(k) plotted against k]
Standard Methods

Discrete time: first-order conditions

$$u'\big(F(k) - \delta k - k'\big) = \beta v'(k')$$

no longer sufficient, typically multiple solutions

some applications: sidestep with lotteries (Prescott-Townsend)

Continuous time: Skiba (1978)

[Scanned excerpt discussing candidate optimal trajectories, cf. Skiba (1978)]
Instead: Using Finite-Difference Scheme

Nothing changes, use same exact algorithm as for growth model with
concave production function
http://www.princeton.edu/~moll/HACTproject/HJB_NGM_skiba.m

[Figure: (a) saving policy function s(k) and (b) value function v(k), computed with the finite difference scheme]
Visualization of A (output of spy(A) in Matlab)

[Figure: spy(A) for the non-convex model; nz = 154]
Appendix

Why this works? Barles-Souganidis

Here: version with one state variable, but generalizes

Can write any HJB equation with one state variable as

$$0 = G\big(k, v(k), v'(k), v''(k)\big) \qquad \text{(G)}$$


Corresponding FD scheme

$$0 = S\big(\Delta k, k_i, v_i;\, v_{i-1}, v_{i+1}\big) \qquad \text{(S)}$$

Growth model

$$G\big(k, v(k), v'(k), v''(k)\big) = \rho v(k) - \max_c\;\big\{u(c) + v'(k)\big(F(k) - \delta k - c\big)\big\}$$

$$S\big(\Delta k, k_i, v_i;\, v_{i-1}, v_{i+1}\big) = \rho v_i - u(c_i) - \frac{v_{i+1} - v_i}{\Delta k}\big(F(k_i) - \delta k_i - c_i\big)^+ - \frac{v_i - v_{i-1}}{\Delta k}\big(F(k_i) - \delta k_i - c_i\big)^-$$
Why this works? Barles-Souganidis

1. Monotonicity: the numerical scheme is monotone, that is, $S$ is non-increasing in both $v_{i-1}$ and $v_{i+1}$

2. Consistency: the numerical scheme is consistent, that is, for every smooth function $v$ with bounded derivatives

$$S\big(\Delta k, k_i, v(k_i);\, v(k_{i-1}), v(k_{i+1})\big) \to G\big(k, v(k), v'(k), v''(k)\big)$$

as $\Delta k \to 0$ and $k_i \to k$.

3. Stability: the numerical scheme is stable, that is, for every $\Delta k > 0$, it has a solution $v_i$, $i = 1, ..., I$ which is uniformly bounded independently of $\Delta k$.
Why this works? Barles-Souganidis

Theorem (Barles-Souganidis)
If the scheme satisfies the monotonicity, consistency and stability conditions 1 to 3, then as $\Delta k \to 0$ its solution $v_i$, $i = 1, ..., I$ converges locally uniformly to the unique viscosity solution of (G)

Note: "convergence" here has nothing to do with the iterative algorithm converging to a fixed point

Instead: convergence of $v_i$ as $\Delta k \to 0$. More momentarily.
Intuition for Monotonicity

Write (S) as
$$\rho v_i = \tilde S\big(\Delta k, k_i, v_i;\, v_{i-1}, v_{i+1}\big)$$
For example, in growth model

$$\tilde S\big(\Delta k, k_i, v_i;\, v_{i-1}, v_{i+1}\big) = u(c_i) + \frac{v_{i+1} - v_i}{\Delta k}\big(F(k_i) - \delta k_i - c_i\big)^+ + \frac{v_i - v_{i-1}}{\Delta k}\big(F(k_i) - \delta k_i - c_i\big)^-$$

Monotonicity: $\tilde S$ increasing in $v_{i-1}$, $v_{i+1}$ ($\Leftrightarrow$ $S$ non-increasing in $v_{i-1}$, $v_{i+1}$)

Intuition: if my continuation value at $i-1$ or $i+1$ is larger, I must be at least as well off (i.e. $v_i$ on the LHS must be at least as high)
Checking the Monotonicity Condition in Growth Model

Recall upwind scheme:

$$S\big(\Delta k, k_i, v_i;\, v_{i-1}, v_{i+1}\big) = \rho v_i - u(c_i) - \frac{v_{i+1} - v_i}{\Delta k}\big(F(k_i) - \delta k_i - c_i\big)^+ - \frac{v_i - v_{i-1}}{\Delta k}\big(F(k_i) - \delta k_i - c_i\big)^-$$

Can check: satisfies monotonicity, i.e. $S$ is indeed non-increasing in both $v_{i-1}$ and $v_{i+1}$ (spelled out below)

$c_i$ depends on the $v_i$'s but doesn't affect monotonicity due to envelope condition
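Spelling out the check (holding $c_i$ fixed, which the envelope argument in the previous bullet justifies):

$$\frac{\partial S}{\partial v_{i+1}} = -\frac{\big(F(k_i) - \delta k_i - c_i\big)^+}{\Delta k} \le 0, \qquad \frac{\partial S}{\partial v_{i-1}} = \frac{\big(F(k_i) - \delta k_i - c_i\big)^-}{\Delta k} \le 0,$$

since $x^+ \ge 0$ and $x^- \le 0$ for any $x$.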

Meaning of Convergence

Convergence is about $\Delta k \to 0$. What, then, is the content of the theorem?

have a system of $I$ non-linear equations $S(\Delta k, k_i, v_i;\, v_{i-1}, v_{i+1}) = 0$
need to solve it somehow
Theorem guarantees that the solution (for given $\Delta k$) converges to the solution of the HJB equation (G) as $\Delta k \to 0$.
Why does the iterative scheme work? Two interpretations:
1. Newton method for solving system of non-linear equations (S)
2. Iterative scheme solves (HJB) backward in time

$$\frac{v_i^{n+1} - v_i^n}{\Delta} + \rho v_i^n = u(c_i^n) + (v^n)'(k_i)\big(F(k_i) - \delta k_i - c_i^n\big)$$

in effect sets $v(k, T)$ = initial guess and solves

$$\rho v(k, t) = \max_c\; u(c) + \partial_k v(k, t)\big(F(k) - \delta k - c\big) + \partial_t v(k, t)$$

backwards in time. $v(k) = \lim_{t\to-\infty} v(k, t)$.
Relation to Kushner-Dupuis Markov-Chain Approx

There's another common method for solving HJB equations: "Markov Chain Approximation Method"
Kushner and Dupuis (2001), "Numerical Methods for Stochastic Control Problems in Continuous Time"
effectively: convert to discrete time, use value function iteration
FD method not so different: also converts things to a Markov Chain
$$\rho v = u + A v$$
Connection between FD and MCAC:
see Bonnans and Zidani (2003), "Consistency of Generalized Finite Difference Schemes for the Stochastic HJB Equation"
also shows how to exploit insights from MCAC to find FD scheme satisfying Barles-Souganidis conditions
Another source of useful notes/codes: Frédéric Bonnans' website
http://www.cmap.polytechnique.fr/~bonnans/notes/edpfin/edpfin.html