next up previous contents index
Next: 7.2.2 Canonical equations as Up: 7.2 HJB equation, canonical Previous: 7.2 HJB equation, canonical   Contents   Index

7.2.1 Method of characteristics

Suppose that we are looking for a function $ v:\mathbb{R}^n\to
\mathbb{R}$ that solves the PDE

$\displaystyle F(x,v(x),\nabla v(x))=0$ (7.9)

where $ F:\mathbb{R}^n\times \mathbb{R}\times \mathbb{R}^n\to \mathbb{R}$ is a given continuous function. This is the same equation as (5.34) but here we assume that $ v$ is differentiable in the classical sense. We are especially interested in the situation where $ F$ depends linearly on $ \nabla v$ ; such PDEs are calledpartial differential equation (PDE)!quasi-linear quasi-linear. In addition, for simplicity and better geometric intuition, we start with the case when $ n=2$ . Then (7.9) takes the form

$\displaystyle a(x_1,x_2,v){v}_{x_1}+b(x_1,x_2,v)v_{x_2}=c(x_1,x_2,v)$ (7.10)

for some functions $ a$ , $ b$ , and $ c$ . Here and below, we omit some or all arguments of functions wherever convenient.

The graph of a solution $ v=v(x_1,x_2)$ to (7.10) is a surface in the $ (x_1,x_2,v)$ -space defined by the equation $ h(x_1,x_2,v):=v(x_1,x_2)-v=0$ . The gradient vector $ \nabla h=({v}_{x_1},{v}_{x_2},-1)^T$ is normal to this surface at each point. Noting that the PDE (7.10) can be equivalently written as

$\displaystyle \left\langle \left(\begin{matrix}
a \vspace*{-.25ex}\\
b \vspa...
...{x_2} \vspace*{-.ex}\\
-1 \vspace*{-.35ex}\\
\end{pmatrix}\right\rangle =0
$

we conclude that the vector $ (a,b,c)^T$ is everywhere orthogonal to the normal $ \nabla h$ , hence it lies in the tangent space to the solution surface. This is the geometric interpretation of the quasi-linear PDE (7.10): the vector field $ (a,b,c)^T$ is tangent to the solution surface $ v=v(x_1,x_2)$ at each point. The system of ODEs associated with this vector field can be written as

$\displaystyle \frac{dx_1}{ds}=a(x_1,x_2,v),\qquad \frac{dx_2}{ds}=b(x_1,x_2,v),\qquad \frac{dv}{ds}=c(x_1,x_2,v).$ (7.11)

These are called the characteristic ODEs of the PDE (7.10), and their solution curves are called the characteristics. With a slight abuse of terminology, we sometimes use the term ``characteristics" not only for the solution curves of the equations (7.11) but also for these equations themselves.

The characteristic curves ``fill" our solution surface $ v=v(x_1,x_2)$ ; skipping ahead to Figure 7.2 may help the reader visualize this situation. As an example, it is useful to think about the case when $ c\equiv 0$ . Then (7.10) says that the derivative of $ v$ in the direction of the vector $ (a,b)^T$ is 0, implying that $ v$ stays constant along solutions of the equations $ \dot x_1=a$ , $ \dot x_2=b$ . Every function $ v$ whose level sets are solution curves of these two equations in the $ (x_1,x_2)$ -plane is a solution of the PDE, and the characteristics are horizontal slices of the corresponding solution surface. Of course, in general (for nonzero $ c$ ) characteristics are not horizontal.

Now, in principle we should be able to solve the characteristic ODEs and obtain the characteristics. How does this help us solve the original PDE (7.10)? Without additional specifications, there are ``too many" characteristics to identify a desired solution surface $ v=v(x_1,x_2)$ ; indeed, there is a characteristic passing through every point in the $ (x_1,x_2,v)$ -space, so they fill the entire space. We know that for ODEs, to fix a particular solution we must specify a point through which it passes (an initial condition). The counterpart of this for the PDE (7.10) is that we must specify a suitable curve in the $ (x_1,x_2,v)$ -space through which the solution surface passes; we then obtain what is called a Cauchy (initial value) problem. Geometrically, the idea is that characteristics that pass through points on this initial curve will determine the solution surface of the PDE, as illustrated in Figure 7.2 (with the thick curve representing an initial curve).

Figure: Characteristics for an initial value problem
\includegraphics{figures/characteristics.eps}

An initial curve can be defined in parametric form as

$\displaystyle x_1=x_1(r),\qquad x_2=x_2(r),\qquad v=v(r)
$

where $ r\in\mathbb{R}$ . Then, the desired characteristics--i.e., the solutions of the characteristic equations (7.11) whose initial conditions lie on this initial curve--are given by

$\displaystyle x_1=x_1(r,s),\qquad x_2=x_2(r,s),\qquad v=v(r,s)$ (7.12)

which yields a description of the solution surface of our PDE (7.10). However, we are looking for a different representation of this surface, namely, $ v=v(x_1,x_2)$ . To bring (7.12) to the desired form, we need to express $ r,s$ as functions of $ x_1,x_2$ and plug them into $ v(r,s)$ . For this to be possible, the map $ (r,s)\mapsto (x_1(r,s),x_2(r,s))
$ must be invertible. It follows from the Inverse Function Theorem that invertibility of this map, at least in some neighborhood of the initial curve, is in turn guaranteed if the corresponding Jacobian matrix

$\displaystyle \begin{pmatrix}
{(x_1)}_{r} & {(x_1)}_{s} \\
{(x_2)}_{r} & {(x_2)}_{s} \\
\end{pmatrix}$

is nonsingular along the initial curve. What is the geometric significance of this latter condition? The columns of the above Jacobian matrix are the $ (x_1,x_2)$ -components of the tangent vectors to the initial curve and to a characteristic, respectively, and they must be linearly independent. In other words, when projected onto the $ (x_1,x_2)$ -plane, the characteristics and the initial curve must be transversal to each other. We conclude that under this transversality condition, the Cauchy problem for (7.10) is (locally) uniquely solvable, at least in principle, via the method of characteristics. To calculate the actual value of $ v(x_1,x_2)$ for some specific $ x_1$ and $ x_2$ , we need to find a characteristic that connects the initial curve to a point with these $ (x_1,x_2)$ -coordinates and then integrate the last equation in (7.11) along that characteristic segment. We will discuss this procedure in more detail later for the case of the HJB equation.

Characteristics can also be defined for the general PDE (7.9) which for $ n=2$ takes the form

$\displaystyle F(x_1,x_2,v,v_{x_1},v_{x_2})=0.$ (7.13)

In the case of the quasi-linear PDE (7.10), we saw that the tangent vector $ (a,b,c)^T$ to the solution surface does not depend on the partial derivatives $ {v}_{x_1}$ and $ {v}_{x_2}$ (which give the first two components of the normal vector $ ({v}_{x_1},{v}_{x_2},-1)^T$ to this surface); consequently, the characteristic equations (7.11) involve $ x_1$ , $ x_2$ , and $ v$ only. For the more general PDE (7.13), this is no longer guaranteed and so five differential equations--describing the joint evolution of the variables $ x_1,x_2,v,v_{x_1},v_{x_2}$ --are needed instead of three. The solution curves of these equations are called the characteristic strips. To simplify the notation, let us introduce the symbols

$\displaystyle \xi_1:=v_{x_1},\qquad \xi_2:=v_{x_2}.
$

Then the characteristic strips are defined by the following equations:

$\displaystyle \frac{dx_1}{ds}=F_{\xi_1},\quad \frac{dx_2}{ds}=F_{\xi_2},\quad \...
... \frac{d\xi_1}{ds}=-F_{x_1}-\xi_1F_v,\quad \frac{d\xi_2}{ds}=-F_{x_2}-\xi_2F_v.$ (7.14)

We do not give a complete derivation of (7.14) but note that the first three equations in (7.14) reduce to the earlier characteristic equations (7.11) in the case of the quasi-linear PDE (7.10), whereas to arrive at the fourth equation in (7.14) it is enough to write $ \frac{d\xi_1}{ds}={(\xi_1)}_{x_1}\frac{dx_1}{ds}+
{(\xi_1)}_{x_2}\frac{dx_2}{ds}$ , use the first two characteristic equations to rewrite this result as $ {(\xi_1)}_{x_1}{F}_{\xi_1}+
{(\xi_1)}_{x_2}{F}_{\xi_2}$ , and then apply the identity $ {F}_{x_1}+{F}_{v}\xi_1+{F}_{\xi_1}{(\xi_1)}_{x_1}+{F}_{\xi_2}{(\xi_2)}_{x_1}$ which follows by differentiating the PDE (7.13) with respect to $ x_1$ ; the last equation in (7.14) is derived in the same manner. The last two equations in (7.14), which describe the evolution of the normal vector to the solution surface of the PDE, will play an important role in the next subsection where we turn to the HJB equation.

In the above discussion we assumed that the independent variable $ x$ lives in the plane (i.e., $ n=2$ ). However, it is straightforward to write down the equations of the characteristic strips for a general dimension $ n\ge 2$ : we must simply replace the indices $ 1,2$ in (7.14) by $ i\in\{1,\dots,n\}$ . In the next subsection we will work in this more general setting.


next up previous contents index
Next: 7.2.2 Canonical equations as Up: 7.2 HJB equation, canonical Previous: 7.2 HJB equation, canonical   Contents   Index
Daniel 2010-12-20