We now collect the results obtained in this section so far, as well as a few additional claims not yet proved, into a single theorem summarizing the solution of the infinite-horizon LQR problem and its main properties.
PROOF (remaining claims). Statement 1 of the theorem was proved in Section 6.2.1 with the exception of two claims: the strict positive definiteness of (we only showed that ) and the uniqueness property. Statements 2 and 3 were proved in Section 6.2.2, except for the uniqueness of the optimal control. Statement 4 was proved in Section 6.2.3. With these results in place, we now establish the remaining claims.
Let us first prove that . We already know that . Suppose that for some , which in view of statement 2 means that for this initial condition the optimal cost equals 0. This is possible only if and (since ). By observability we must then have , as it is well known that the output of an observable linear time-invariant system (with the zero input) can be identically 0 only along the zero trajectory. Thus which proves that is indeed positive definite.
Let us now prove that is a unique solution of the ARE in the class of positive definite matrices. In fact, we will show that it is unique even in the class of positive semidefinite matrices. Suppose that the ARE has another positive semidefinite solution . Consider the new cost
and its infinite-horizon counterpart
By statements 4 and 2 of this theorem, the same control as in statement 3 gives
and this is the optimal cost with respect to because for every other control we have
Finally, to show that the optimal control is unique, recall the equations (5.13) and (5.14) in Section 5.1.3 which say that an optimal control must satisfy
or, what is the same,
For the infinite-horizon LQR problem and the value function this condition takes the form
and it is easy to check (cf. Section 6.1.3) that this uniquely identifies the feedback law given in statement 3.
We can see from the proof that the observability assumption was only used for establishing exponential stability of the optimal closed-loop system (statement 4) and the positive definiteness and uniqueness of in statement 1 (the uniqueness proof relied on observability indirectly because it invoked statement 4). The controllability assumption, on the other hand, was used for showing the existence of which is crucial for all the other claims. Nevertheless, there remains the possibility that some or all claims could be proved without relying on these assumptions.
In view of statement 1 of Theorem 6.1 and its proof, we know that the ARE has exactly one positive semidefinite solution , and this solution is in fact positive definite and gives the optimal cost and optimal control via statements 2 and 3. For example, consider the infinite-horizon version of Example 6.1, with the system and the cost . The ARE has two solutions, , of which is the one we want. The optimal cost is and the optimal control is . The result that the reader obtained in Exercise 6.4 should also immediately yield the optimal cost for that problem and the exponentially stable optimal closed-loop system.
Linearity, time-invariance, and exponential stability are very desirable features of the optimal closed-loop system, indicating that the infinite-horizon LQR problem provides good control design guidelines. The choice of the matrices and is often part of the design process, which helps shape the behavior of the state and the control signal.