最佳投影方程

最佳投影方程（optimal projection equations）^[1]^[2]^[3]是控制理论中，建构局部最佳降阶LQG控制器的充分必要条件^[4]。

LQG控制（线性二次高斯控制）问题是最优控制领域中最基础的问题之一，这问题包括了存在不确定性的线性系统，受到加性高斯白噪声的影响，没有完整的状态资讯（无法量测到所有的状态变数，也无法透过回授得知），对应二次的成本泛函。不过存在唯一解，而且可以建构线性动态回授的控制律，易于计算以及实现。而LQG控制器也是非线性系统中最佳扰动控制的基础^[5]。

LQG控制器的架构会类似要控制的系统，两者会有相同的维度。因此若系统本身就是高维度，要实现（全阶）LQG控制器会很困难。降阶LQG问题（固定阶LQG问题）事先固定LQG控制器的阶数，因此克服了这个困难。不过在全阶LQG控制器中适用的分离原理，在降阶LQG问题中已无法适用，因此这方面会更困难，而且其解也不唯一。不过可以找到数值分析的算法^[4]^[6]^[7]^[8]来求解对应的最佳投影方程。

问题的数学表示以及其解

连续时间

降阶的LQG控制问题几乎和全阶的LQG控制问题相同。令 ${\hat {\mathbf {x} }}_{r}(t)$ 表示降阶LQG控制器的状态，唯一的差异是LQG控制器的状态维度 $n_{r}=dim({\hat {\mathbf {x} }}_{r}(t))$ 是事先定义好的值，比受控系统的状态维度 $n=dim({\mathbf {x} }(t))$ 要少。

降阶LQG控制器可以表示为下式：

{\dot {\hat {\mathbf {x} }}}_{r}(t)=A_{r}(t){\hat {\mathbf {x} }}_{r}(t)+B_{r}(t){\mathbf {u} }(t)+K_{r}(t)\left({\mathbf {y} }(t)-C_{r}(t){\hat {\mathbf {x} }}_{r}(t)\right),{\hat {\mathbf {x} }}_{r}(0)={\mathbf {x} }_{r}(0),

{\mathbf {u} }(t)=-L_{r}(t){\hat {\mathbf {x} }}_{r}(t).

上述公式刻意写的类似传统全阶LQG控制器的形式，降阶的LQG控制问题也可以改写为下式：

{\dot {\hat {\mathbf {x} }}}_{r}(t)=F_{r}(t){\hat {\mathbf {x} }}_{r}(t)+K_{r}(t){\mathbf {y} }(t),{\hat {\mathbf {x} }}_{r}(0)={\mathbf {x} }_{r}(0),

{\mathbf {u} }(t)=-L_{r}(t){\hat {\mathbf {x} }}_{r}(t),

其中

F_{r}(t)=A_{r}(t)-B_{r}(t)L_{r}(t)-K_{r}(t)C_{r}(t).

降阶LQG控制器的矩阵 $F_{r}(t),K_{r}(t),L_{r}(t)$ 和 ${\mathbf {x} }_{r}(0)$ 是由所谓的最佳投影方程（optimal projection equations、OPE）来决定^[3]。

$n$ 维的最佳投影方阵 $\tau (t)$ 是OPE的核心。此矩阵的秩在所有状态下几乎都等于 $n_{r}$ 。相关投影为斜投影（oblique projection）： $\tau ^{2}(t)=\tau (t)$ 。最佳投影方程包括四个矩阵微分方程。前二个是LQG控制器对应的矩阵Riccati微分方程的扩展。在方程式中 $\tau _{\perp }(t)$ 表示 $I_{n}-\tau (t)$ ，而 $I_{n}$ 为 $n$ 维的单位矩阵

{\begin{aligned}{\dot {P}}(t)={}&A(t)P(t)+P(t)A'(t)-P(t)C'(t)W^{-1}(t)C(t)P(t)+V(t)\\[6pt]&{}+\tau _{\perp }(t)P(t)C'(t)W^{-1}(t)C(t)P(t)\tau '_{\perp }(t),\\[6pt]P(0)={}&E\left({\mathbf {x} }(0){\mathbf {x} }'(0)\right),\\[6pt]&{}-{\dot {S}}(t)=A'(t)S(t)+S(t)A(t)-S(t)B(t)R^{-1}(t)B'(t)S(t)+Q(t)\\[6pt]&{}+\tau '_{\perp }(t)S(t)B(t)R^{-1}(t)B'(t)S(t)\tau _{\perp }(t),\end{aligned}}

S(T)=F.

若LQG的维度没有减少，也就是 $n=n_{r}$ ，则 $\tau (t)=I_{n},\tau _{\perp }(t)=0$ ，上述二个方程就是二个没有耦合的矩阵Riccati微分方程，对应全阶的LQG控制器。若 $n_{r}<n$ ，则两个方程会有斜投影项 $\tau (t).$ 。这也是为何降阶的LQG控制器无法分离的原因，斜投影 $\tau (t)$ 是由另外二个矩阵微分方程所决定，其中也和秩的条件（rank conditions）有关。这四个矩阵微分方程组成了最佳投影方程。为了要列出另外二个矩阵微分方程，先定义以下二个矩阵：

\Psi _{1}(t)=(A(t)-B(t)R^{-1}(t)B'(t)S(t)){\hat {P}}(t)+{\hat {P}}(t)(A(t)-B(t)R^{-1}(t)B'(t)S(t))'

{}+P(t)C'(t)W^{-1}(t)C(t)P(t),

\Psi _{2}(t)=(A(t)-P(t)C'(t)W^{-1}(t)C(t))'{\hat {S}}(t)+{\hat {S}}(t)(A(t)-P(t)C'(t)W^{-1}(t)C(t))

{}+S(t)B(t)R^{-1}(t)B'(t)S(t).

则最后二个矩阵微分方程如下：

{\dot {\hat {P}}}(t)=1/2\left(\tau (t)\Psi _{1}(t)+\Psi _{1}(t)\tau '(t)\right),{\hat {P}}(0)=E({\mathbf {x} }(0))E({\mathbf {x} }(0))',\operatorname {rank} ({\hat {P}}(t))=n_{r}

almost everywhere,

-{\dot {\hat {S}}}(t)=1/2\left(\tau '(t)\Psi _{2}(t)+\Psi _{2}(t)\tau (t)\right),{\hat {S}}(T)=0,\operatorname {rank} ({\hat {S}}(t))=n_{r}

almost everywhere,

其中

\tau (t)={\hat {P}}(t){\hat {S}}(t)\left({\hat {P}}(t){\hat {S}}(t)\right)^{*}.

此处的 * 表示群广义逆矩阵（group generalized inverse）或Drazin逆矩阵（英语：Drazin inverse），是唯一的，定义如下

A^{*}=A(A^{3})^{+}A.

其中 + 是摩尔－彭若斯广义逆.

矩阵 $P(t),S(t),{\hat {P}}(t),{\hat {S}}(t)$ 都需要是非负对称矩阵。可以建构最佳投影方程的解，而此解可以决定降阶LQG控制器矩阵 $F_{r}(t),K_{r}(t),L_{r}(t)$ 和 ${\mathbf {x} }_{r}(0)$ ：

F_{r}(t)=H(t)\left(A(t)-P(t)C'(t)W^{-1}(t)C(t)-B(t)R^{-1}(t)B'(t)S(t)\right)G(t)+{\dot {H}}(t)G'(t),

K_{r}(t)=H(t)P(t)C'(t)W^{-1}(t),

L_{r}(t)=R^{-1}(t)B'(t)S(t)G'(t),

{\mathbf {x} }_{r}(0)=H(0)E({\mathbf {x} }(0)).

上式中的矩阵 $G(t),H(t)$ 是符合以下性质的矩阵：

G'(t)H(t)=\tau (t),G(t)H'(t)=I_{n_{r}}

几乎在所有状态下。

可以由 ${\hat {P}}(t){\hat {S}}(t)$ 的投影分解中得到^[4]：

若降阶LQG问题中的所有矩阵都是非时变的，且最终时间（horizon） $T$ 趋近无限大，则最佳降阶LQG控制器和最佳投影方程也都会是非时变的^[1]。此情形下，最佳投影方程左侧的微分项会为零。

离散时间

离散时间的情形类似连续时间的例子，要处理的是将 $n$ 阶传统离散时间全阶LQG问题转换为事先已知固定阶数的 $n_{r}<n$ 阶降阶LQG控制器。为了要表示离散时间的OPE，先引入以下二个矩阵：

\Psi _{i}^{1}=\left(A_{i}-B_{i}(B'_{i}S_{i+1}B_{i}+R_{i})^{-1}B'_{i}S_{i+1}A_{i})\right){\hat {P}}_{i}\left(A_{i}-B_{i}(B'_{i}S_{i+1}B_{i}+R_{i})^{-1}B'_{i}S_{i+1}A_{i})\right)'

{}+A_{i}P_{i}C'_{i}(C_{i}P_{i}C'_{i}+W_{i})^{-1}C_{i}P_{i}A'_{i}

\Psi _{i+1}^{2}=\left(A_{i}-A_{i}P_{i}C'_{i}(C_{i}P_{i}C'_{i}+W_{i})^{-1}C_{i}\right)'{\hat {S}}_{i+1}\left(A_{i}-A_{i}P_{i}C'_{i}(C_{i}P_{i}C'_{i}+W_{i})^{-1}C_{i}\right)

{}+A'_{i}S_{i+1}B_{i}(B'_{i}S_{i+1}B_{i}+R_{i})^{-1}B'_{i}S_{i+1}A_{i}

则离散时间OPE为

P_{i+1}=A_{i}\left(P_{i}-P_{i}C'_{i}\left(C_{i}P_{i}C'_{i}+W_{i}\right)^{-1}C_{i}P_{i}\right)A'_{i}+V_{i}+\tau _{\perp i+1}\Psi _{i}^{1}\tau '_{\perp i+1},P_{0}=E\left({\mathbf {x} }_{0}{\mathbf {x'} }_{0}\right)

.

S_{i}=A'_{i}\left(S_{i+1}-S_{i+1}B_{i}\left(B'_{i}S_{i+1}B_{i}+R_{i}\right)^{-1}B'_{i}S_{i+1}\right)A_{i}+Q_{i}+\tau '_{\perp i}\Psi _{i+1}^{2}\tau _{\perp i},S_{N}=F

.

{\hat {P}}_{i+1}=1/2(\tau _{i+1}\Psi _{i}^{1}+\Psi _{i}^{1}\tau '_{i+1}),{\hat {P}}_{0}=E({\mathbf {x} }(0))E({\mathbf {x} }(0))',\operatorname {rank} ({\hat {P}}_{i})=n_{r}

almost everywhere,

{\hat {S}}_{i}=1/2(\tau '_{i}\Psi _{i+1}^{2}+\Psi _{i+1}^{2}\tau _{i}),{\hat {S}}_{N}=0,\operatorname {rank} ({\hat {S}}_{i})=n_{r}

almost everywhere.

斜投影（oblique projection）矩阵为

\tau _{i}={\hat {P}}_{i}{\hat {S}}_{i}\left({\hat {P}}_{i}{\hat {S}}_{i}\right)^{*}.

非负对称矩阵 $P_{i},S_{i},{\hat {P}}_{i},{\hat {S}}_{i}$ 是离散时间OPE的解，也决定了降阶LQG控制器的矩阵 $F_{i}^{r},K_{i}^{r},L_{i}^{r}$ and ${\mathbf {x} }_{0}^{r}$ ：

F_{i}^{r}=H_{i+1}\left(A_{i}-P_{i}C'_{i}\left(C_{i}P_{i}C'_{i}+W_{i}\right)^{-1}C_{i}-B_{i}\left(B'_{i}S_{i+1}B_{i}+R_{i}\right)^{-1}B'_{i}S_{i+1}\right)G'_{i},

K_{i}^{r}=H_{i+1}P_{i}C'_{i}\left(C_{i}P_{i}C'_{i}+W_{i}\right)^{-1},

L_{i}^{r}=\left(B'_{i}S_{i+1}B_{i}+R_{i}\right)^{-1}B'_{i}S_{i+1}G'_{i},

{\mathbf {x} }_{0}^{r}=H_{0}E({\mathbf {x} }_{0}).

在上述的方程中，矩阵 $G_{i},H_{i}$ 是有以下性质的矩阵：

G'_{i}H_{i}=\tau _{i},G_{i}H'_{i}=I_{n_{r}}

几乎在所有状态下。

这些矩阵可以从 ${\hat {P}}_{i}{\hat {S}}_{i}$ 的投影因式分解中求得^[4]。

如同在连续时间中的例子一样，若问题中所有的矩阵都是非时变，且且最终时间（horizon） $T$ 趋近无限大，降阶LQG控制器就会是非时变的。因此离散时间OPE会收敛到稳态解，决定非时变的降阶LOG控制器^[2]。

离散时间OPE也可以应用在状态维度，输入维度或是输出维度可变的离散时间系统（具有时变维度的离散时间系统）^[6]。若在数位控制器中的取样是不同步的，就可能会出现这类的系统。

参考资料

^ ^1.0 ^1.1 Hyland D.C; Bernstein D.S. The optimal projection equations for fixed order dynamic compensation. IEEE Transactions on Automatic Control. 1984, AC–29 (11): 1034–1037. doi:10.1109/TAC.1984.1103418. hdl:2027.42/57875.
^ ^2.0 ^2.1 Bernstein D.S.; Davis L.D.; Hyland D.C. The optimal projection equations for reduced-order discrete-time modeling estimation and control (PDF). Journal of Guidance Control and Dynamics. 1986, 9 (3): 288–293 [2020-02-04]. Bibcode:1986JGCD....9..288B. doi:10.2514/3.20105. hdl:2027.42/57880. （原始内容 (PDF)存档于2022-01-09）.
^ ^3.0 ^3.1 Haddad W.M.; Tadmor G. Reduced-order LQG controllers for linear time-varying plants. Systems & Control Letters. 1993, 20 (2): 87–97. doi:10.1016/0167-6911(93)90020-7.
^ ^4.0 ^4.1 ^4.2 ^4.3 Van Willigenburg L.G.; De Koning W.L. Numerical algorithms and issues concerning the discrete-time optimal projection equations. European Journal of Control. 2000, 6 (1): 93–100. doi:10.1016/s0947-3580(00)70917-4. Associated software download from Matlab Central （页面存档备份，存于互联网档案馆）.
^ Athans M. The role and use of the stochastic linear-quadratic-Gaussian problem in control system design. IEEE Transactions on Automatic Control. 1971, AC–16 (6): 529–552. doi:10.1109/TAC.1971.1099818.
^ ^6.0 ^6.1 Van Willigenburg L.G.; De Koning W.L. Optimal reduced-order compensators for time-varying discrete-time systems with deterministic and white parameters. Automatica. 1999, 35: 129–138. doi:10.1016/S0005-1098(98)00138-1. Associated software download from Matlab Central （页面存档备份，存于互联网档案馆）.
^ Zigic D.; Watson L.T.; Collins E.G.; Haddad W.M.; Ying S. Homotopy methods for solving the optimal projection equations for the H2 reduced order model problem. International Journal of Control. 1996, 56 (1): 173–191. doi:10.1080/00207179208934308.
^ Collins Jr. E.G; Haddad W.M.; Ying S. A homotopy algorithm for reduced-order dynamic compensation using the Hyland–Bernstein optimal projection equations. Journal of Guidance Control & Dynamics. 1996, 19 (2): 407–417. doi:10.2514/3.21633.

[Bern1-1] 1.0 ^1.1 Hyland D.C; Bernstein D.S. The optimal projection equations for fixed order dynamic compensation. IEEE Transactions on Automatic Control. 1984, AC–29 (11): 1034–1037. doi:10.1109/TAC.1984.1103418. hdl:2027.42/57875.

[Bern2-2] 2.0 ^2.1 Bernstein D.S.; Davis L.D.; Hyland D.C. The optimal projection equations for reduced-order discrete-time modeling estimation and control (PDF). Journal of Guidance Control and Dynamics. 1986, 9 (3): 288–293 [2020-02-04]. Bibcode:1986JGCD....9..288B. doi:10.2514/3.20105. hdl:2027.42/57880. （原始内容 (PDF)存档于2022-01-09）.

[Haddad1-3] 3.0 ^3.1 Haddad W.M.; Tadmor G. Reduced-order LQG controllers for linear time-varying plants. Systems & Control Letters. 1993, 20 (2): 87–97. doi:10.1016/0167-6911(93)90020-7.

[Wil1-4] 4.0 ^4.1 ^4.2 ^4.3 Van Willigenburg L.G.; De Koning W.L. Numerical algorithms and issues concerning the discrete-time optimal projection equations. European Journal of Control. 2000, 6 (1): 93–100. doi:10.1016/s0947-3580(00)70917-4. Associated software download from Matlab Central （页面存档备份，存于互联网档案馆）.

[Athans-5] Athans M. The role and use of the stochastic linear-quadratic-Gaussian problem in control system design. IEEE Transactions on Automatic Control. 1971, AC–16 (6): 529–552. doi:10.1109/TAC.1971.1099818.

[Wil2-6] 6.0 ^6.1 Van Willigenburg L.G.; De Koning W.L. Optimal reduced-order compensators for time-varying discrete-time systems with deterministic and white parameters. Automatica. 1999, 35: 129–138. doi:10.1016/S0005-1098(98)00138-1. Associated software download from Matlab Central （页面存档备份，存于互联网档案馆）.

[Bern3-7] Zigic D.; Watson L.T.; Collins E.G.; Haddad W.M.; Ying S. Homotopy methods for solving the optimal projection equations for the H2 reduced order model problem. International Journal of Control. 1996, 56 (1): 173–191. doi:10.1080/00207179208934308.

[Had1-8] Collins Jr. E.G; Haddad W.M.; Ying S. A homotopy algorithm for reduced-order dynamic compensation using the Hyland–Bernstein optimal projection equations. Journal of Guidance Control & Dynamics. 1996, 19 (2): 407–417. doi:10.2514/3.21633.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]