

4.2.1 From Lagrange to Mayer form

As in Section 3.3.2, we define an additional state variable, $ x^0\in\mathbb{R}$ , to be the solution of

$\displaystyle \dot x^0=L(x,u),\qquad x^0(t_0)=0$

and arrive at the augmented system

\begin{displaymath}\begin{split}\dot x^0&=L(x,u)\\ \dot x&=f(x,u) \end{split}\end{displaymath} (4.5)

with the initial condition $ \Big({\textstyle{0}\atop \textstyle{x_0}}\Big)$ . The cost can then be rewritten as

$\displaystyle J(u)=\int_{t_0}^{t_f}\dot x^0(t)\,dt=x^0(t_f)$ (4.6)

which means that in the new coordinates the problem is in the Mayer form (there is a terminal cost and no running cost). Also, the target set becomes $ [t_0,\infty)\times\mathbb{R}\times\{x_1\}=:[t_0,\infty)\times S'$ ; here $ S'$ is the line in $ \mathbb{R}^{n+1}$ that passes through $ \Big({\textstyle{0}\atop \textstyle{x_1}}\Big)$ and is parallel to the $ x^0$ -axis. To simplify the notation, we define

$\displaystyle y:=\begin{pmatrix}x^0\\ x\end{pmatrix}\in\mathbb{R}^{n+1}$

and write the system (4.5) more compactly as

$\displaystyle \dot y=\begin{pmatrix}L(x,u) \\ f(x,u) \end{pmatrix}=:g(y,u)$ (4.7)

(the right-hand side actually does not depend on $ x^0$ ). Note that this system is well posed because we assumed that $ L$ has the same regularity properties as $ f$ .
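For concreteness, consider the simple example (not part of the general development, included only as an illustration) with $ n=1$ , $ f(x,u)=u$ , and $ L(x,u)=x^2+u^2$ . The augmented system (4.7) then reads

$\displaystyle \dot y=\begin{pmatrix}x^2+u^2\\ u\end{pmatrix}=g(y,u),\qquad y(t_0)=\begin{pmatrix}0\\ x_0\end{pmatrix},$

and its right-hand side indeed does not involve $ x^0$ .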

An optimal trajectory $ x^*(\cdot)$ of the original system in $ \mathbb{R}^n$ corresponds in an obvious way to an optimal trajectory $ y^*(\cdot)$ of the augmented system in $ \mathbb{R}^{n+1}$ . The first component $ x^{0,*}$ of $ y^*$ describes the evolution of the cost in the original problem, and $ x^*$ is recovered from $ y^*$ by projection onto $ \mathbb{R}^n$ parallel to the $ x^0$ -axis. This situation is depicted in Figure 4.1 (note that $ L$ is not necessarily positive, so $ x^0$ need not actually be increasing along $ y^*$ ). In this and all subsequent figures, the $ x^0$ -axis will be vertical.
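In formulas, which follow directly from the definitions above,

$\displaystyle y^*(t)=\begin{pmatrix}x^{0,*}(t)\\ x^*(t)\end{pmatrix},\qquad x^{0,*}(t)=\int_{t_0}^{t}L(x^*(s),u^*(s))\,ds,$

and by (4.6) the value of $ x^{0,*}$ at the final time is the optimal cost $ J(u^*)$ .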

Figure: The optimal trajectory of the augmented system
\includegraphics{figures/projection.eps}

From now on, we let $ t^*$ denote the terminal time of the optimal trajectory $ x^*$ (or, what is the same, of $ y^*$ ). The next exercise offers a geometric interpretation of optimality; it will not be directly used in the current proof, but we will see a related idea in Section 4.2.6.


\begin{Exercise}
Let $t_1$\ and $t_2$\ be arbitrary time instants satisfying $t_0\le t_1<t_2\le t^*$. Show that no other trajectory of the augmented system (4.7) starting at $y^*(t_1)$\ can reach the line parallel to the $x^0$-axis and passing through $\Big({\textstyle{0}\atop
\textstyle{x^{*}(t_2)}}\Big)$\ at a point lower than $y^*(t_2)$, at any time $t_3$,
even different from~$t_2$.
\end{Exercise}

In the particular case when $ t_2=t^*$ , the claim in the exercise should be obvious: no other trajectory starting from some point $ y^*(t_1)$ on the optimal trajectory can hit the line $ S'$ at a point lower than $ y^*(t^*)$ . In other words, a final portion of the optimal trajectory must itself be optimal with respect to its starting point as the initial condition. This idea, known as the principle of optimality, is illustrated in Figure 4.2. (The reader will notice that we are using different axes in different figures.)

Figure: Principle of optimality
\includegraphics{figures/nocone-optimality.eps}
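The cost comparison behind the principle of optimality is easy to sketch. Writing

$\displaystyle J(u^*)=\int_{t_0}^{t_1}L(x^*(t),u^*(t))\,dt+\int_{t_1}^{t^*}L(x^*(t),u^*(t))\,dt,$

suppose some other control steered the system from $ x^*(t_1)$ to the target point $ x_1$ at a cost strictly smaller than the second integral. Then the control equal to $ u^*$ on $ [t_0,t_1]$ and to that other control afterwards would produce a total cost smaller than $ J(u^*)$ , contradicting the optimality of $ u^*$ .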

Another simple observation, which will be useful later, is that the Hamiltonian (4.2) can be equivalently represented as the following inner product in $ \mathbb{R}^{n+1}$ :

$\displaystyle H(x,u,p,p_0)=\left\langle\begin{pmatrix}p_0 \\ p \end{pmatrix},\begin{pmatrix}L(x,u) \\ f(x,u) \end{pmatrix}\right\rangle.$ (4.8)
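For the simple example given after (4.7) (again purely for illustration), formula (4.8) becomes

$\displaystyle H(x,u,p,p_0)=\left\langle\begin{pmatrix}p_0\\ p\end{pmatrix},\begin{pmatrix}x^2+u^2\\ u\end{pmatrix}\right\rangle=p_0(x^2+u^2)+pu.$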

