Near Optimal Strategy for Nonlinear Stochastic Differential Games Based on the Technique of Statistical Linearization
-
摘要: 针对二人零和随机非线性微分对策问题, 利用统计线性化技术并提出一种新的逼近策略控制方法. 通过求解具有统计线性化参数的Riccati微分方程得到逼近控制策略, 该Riccati微分方程与一般线性系统的Riccati微分方程具有明显的区别; 同时对控制量受约束情形的控制策略也进行了求解; 最后通过仿真验证了所得结论的正确性.Abstract: A novel solution for a class of nonlinear zero-sum stochastic differential games is given based on the technique of statistical linearization. The near optimal feedback strategies are derived by solving the statistical state dependent Riccati equation, which is significantly different from the Riccati equation of linear systems. The case of strategy with bound limitation is also investigated. An example is given to illustrate the application of the theory.
-
[1] Bardi M, Raghavan T E S, Parthasarathy T. Stochastic and Differential Games: Theory and Numerical Methods. Berlin: Birkhuser, 1999. 69-177[2] Lewis F L, Syrmos V L. Optimal Control (Second edition). New Jersey: Wiley, 1995. 30-52[3] Crandall M G, Lions P L. Viscosity solutions of Hamilton-Jacobi equations. Transactions on Mathematical Software, 1983, 277(1): 1-42[4] Subbotin A I. Existence and uniqueness results for Hamilton-Jacobi equations. Nonlinear Analysis: Theory, Methods and Applications, 1991, 16(7-8): 683-689[5] Sinar J, Farber N. Nonlinear zero-sum differential game analysis by singular perturbation methods, Technical Memorandum 84271, NASA, USA, 1982. 1-14[6] Bardi M, Falcone M, Soravia P. Fully discrete schemes for the value function of pursuit-evasion games. Annals of Dynamic Games. Boston: Birkhuser, 1993. 66-74[7] Plumer E S. Optimal control of terminal processes using neural networks. IEEE Transactions on Neural Networks, 1996, 7(2): 408-418[8] Zhou X Y. Deterministic near optimal control, part 1: necessary and sufficient conditions for near-optimality. Journal of Optimization Theory and Applications, 1995, 85(2): 473-488[9] Huang J H, Li X, Wang G C. Near-optimal control problems for linear forward-backward stochastic systems. Automatica, 2010, 46(2): 397-404[10] Cheng D Z, Hu X M, Shen T L. Analysis and Design of Nonlinear Control Systems. Beijing: Science Press, 2010. 1-20[11] Huber M F. Adaptive Gaussian mixture filter based on statistical linearization. In: Proceedings of the 14th International Conference on Information Fusion (FUSION). Chicago, IL: IEEE, 2011. 1-8[12] Geist M, Pietquin O. Statistically linearized least-squares temporal differences. In: Proceedings of the 2010 IEEE International Conference on Ultra Modern Telecommunications and Control Systems (ICUMT 2010). Moscow, Russia: IEEE, 2010. 450-457[13] Geist M, Pietquin O. Statistically linearized recursive least squares. In: Proceedings of the 2010 IEEE International Workshop on Machine Learning for Signal Processing. Kittila, Finland: IEEE, 2010. 272-276[14] Fang Yang-Wang. Optimal Control for Stochastic Systems. Beijing: Tsinghua University and Springer, 2005. 27-33(方洋旺. 随机系统最优控制. 北京: 清华大学出版社, 2005. 27-33)[15] Chemyshov K R. Statistical linearization based on the maximal correlation. In: Siberian Conference on Control and Communications (SIBCON). Moscow, Russia: IEEE, 2007. 29-36[16] Wang X S, Xie H. Simulation of covariance analysis describing equation technique (CADET) in missile hit probability calculation. In: Proceedings of the 2010 6th International Conference on Natural Computation (ICNC 2010). Yantai, China: IEEE, 2010. 4282-4285
点击查看大图
计量
- 文章访问数: 1680
- HTML全文浏览量: 70
- PDF下载量: 1043
- 被引次数: 0