Stationarity of P-RHPG Stage Gains (PDC Riccati Analogue)
Establish that the time-varying optimal stage gains produced by the Polytopic Receding-Horizon Policy Gradient (P-RHPG) backward recursion become approximately stationary as the horizon N tends to infinity. Such a result would be the Parallel Distributed Compensation (PDC) analogue of Riccati convergence, and it would imply that the optimal finite-horizon integrated cost J_N^*(Q_N) converges to the integrated infinite-horizon optimum for any terminal cost Q_N ⪰ 0.
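For intuition, the classical single-model LQR case shows what "stationary stage gains" means. The sketch below is not the P-RHPG recursion and says nothing about the polytopic setting; it only runs the standard backward Riccati recursion. The double-integrator matrices, the weights, the horizon of 200, and the helper name `riccati_gains` are all illustrative assumptions.

```python
import numpy as np

def riccati_gains(A, B, Q, R, QN, N):
    """Standard LQR backward Riccati recursion (classical case, not P-RHPG).

    Returns the stage gains [K_0, ..., K_{N-1}] for the control law u_t = -K_t x_t.
    """
    P = QN.copy()
    gains = []
    for _ in range(N):
        # Gain computed from the cost-to-go matrix of the following stage.
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        # Riccati update, written as Q + A^T P (A - B K).
        P = Q + A.T @ P @ (A - B @ K)
        gains.append(K)
    gains.reverse()  # recursion runs backward, so reverse to index by time
    return gains

# Hypothetical example system: discretized double integrator.
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.0], [0.1]])
Q = np.eye(2)
R = np.array([[1.0]])

# Two different terminal costs Q_N, same horizon.
K_zero = riccati_gains(A, B, Q, R, np.zeros((2, 2)), 200)
K_big = riccati_gains(A, B, Q, R, 10.0 * np.eye(2), 200)

# Far from the terminal time, consecutive gains barely change (stationarity) ...
drift = np.linalg.norm(K_zero[0] - K_zero[1])
# ... and the early gains no longer depend on the choice of Q_N.
gap = np.linalg.norm(K_zero[0] - K_big[0])
print(f"drift between K_0 and K_1: {drift:.2e}")
print(f"gap between terminal costs: {gap:.2e}")
```

In this classical case both quantities shrink toward zero as the horizon grows, which is the behavior the open problem asks to establish for the P-RHPG stage gains.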
References
This limit holds if and only if the time-varying optimal stage gains become approximately stationary as N\to\infty, the PDC analogue of Riccati convergence, which is the key open problem in the polytopic setting.
— Receding-Horizon Policy Gradient for Polytopic Controller Synthesis
(2603.29283 - Shakeri et al., 31 Mar 2026) in Remark (Convergence for general Q_N), Section 4.2