Optimal Tracking Control for a Class of Unknown Discrete-time Systems with Actuator Saturation via Data-based ADP Algorithm
-
摘要: 针对一类带有执行器饱和的未知动态离散时间非线性系统, 提出了一种新的最优跟踪控制方案. 该方案基于迭代自适应动态规划算法, 为了实现最优控制, 首先建立了未知系统动态的数据辨识器. 通过引入M网络, 获得了稳态控制的精确表达式. 为了消除执行器饱和的影响, 提出了一个非二次的性能指标函数. 然后提出了一种迭代自适应动态规划算法获得最优跟踪控制的解, 并给出了收敛性分析. 为了实现最优控制方案, 神经网络被用来构建数据辨识器、计算性能指标函数、近似最优控制策略和求解稳态控制. 仿真结果验证了本文所提出的最优跟踪控制方法的有效性.Abstract: A novel optimal tracking control method for a class of discrete-time systems with actuator saturation and unknown dynamics is proposed in this paper. The scheme is based on the iterative adaptive dynamic programming (ADP) algorithm. In order to implement the control scheme, a data-based identifier is first constructed for the unknown system dynamics. By introducing the M network, the explicit formula of the steady control is achieved. In order to eliminate the effect of the actuator saturation, a nonquadratic performance functional is presented, and then an iterative ADP algorithm is established to achieve the optimal tracking control solution with convergence analysis. For implementing the optimal control method, neural networks are used to establish the data-based identifier, compute the performance index functional, approximate the optimal control policy and solve the steady control, respectively. Simulation example is provided to verify the effectiveness of the presented optimal tracking control scheme.
-
[1] Zhang X, Zhang H G, Sun Q Y, Luo Y H. Adaptive dynamic programming-based optimal control of unknown nonaffine nonlinear discrete-time systems with proof of convergence. Neurocomputing, 2012, 91: 48-55 [2] Hou Z S, Wang Z. From model-based control to data-driven control: survey, classification and perspective. Information Sciences, 2013, 235: 3-35 [3] Shen D, Hou Z S. Iterative learning control with unknown control direction: a novel data-based approach. IEEE Transactions on Neural Networks, 2011, 22(12): 2237-2249 [4] Lai C Y, Xiang C, Lee T H. Data-based identification and control of nonlinear systems via piecewise affine approximation. IEEE Transactions on Neural Networks, 2011, 22(12): 2189-2200 [5] Zhang H G, Luo Y H, Liu D R. Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints. IEEE Transactions on Neural Networks, 2009, 20(9): 1490-1503 [6] Saberi A, Lin Z L, Teel A R. Control of linear systems with saturating actuators. IEEE Transactions on Automatic Control, 1996, 41(3): 368-378 [7] Bernstein D S. Optimal nonlinear, but continuous, feedback control of systems with saturating actuators. International Journal of Control, 1995, 62(5): 1209-1216 [8] Sussmann H J, Sontag E D, Yang Y. A general result on the stabilization of linear systems using bounded controls. IEEE Transactions on Automatic Control, 1994, 39(12): 2411-2425 [9] Bellman R E. Dynamic Programming. Princeton, NJ: Princeton University Press, 1957 [10] Bryson A E, Ho Y C. Applied Optimal Control: Optimization, Estimation, and Control. New York, NY: Hemisphere-Wiley, 1975 [11] Liu D R, Xiong X X, Zhang Y. Action-dependent adaptive critic designs. In: Proceedings of the 2001 International Joint Conference on Neural Networks. Washington, D.C.: IEEE, 2001, 2: 990-995 [12] Zhang H G, Song R Z, Wei Q L, Zhang T Y. Optimal tracking control for a class of nonlinear discrete-time systems with time delays based on heuristic dynamic programming. IEEE Transactions on Neural Networks, 2011, 22(12): 1851-1862 [13] Wei Q L, Liu D R. An iterative ε-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state. Neural Networks, 2012, 32: 236-244 [14] Song R Z, Zhang H G, Luo Y H, Wei Q L. Optimal control laws for time-delay systems with saturating actuators based on heuristic dynamic programming. Neurocomputing, 2010, 73(16-18): 3020-3027 [15] Song R Z, Zhang H G. The finite-horizon optimal control for a class of time-delay affine nonlinear system. Neural Computing and Applications, 2013, 22(2): 229-235 [16] Liu Z W, Zhang H G. Delay-dependent stability for systems with fast-varying neutral-type delays via a PTVD compensation. Acta Automatica Sinica, 2010, 36(1): 147-152 [17] Wei Q L, Zhang H G, Cui L L. Data-based optimal control for discrete-time zero-sum games of 2-D systems using adaptive critic designs. Acta Automatica Sinica, 2009, 35(6): 682-692 [18] Zhang H G, Wei Q L, Liu D R. An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games. Automatica, 2011, 47(1): 207-214 [19] Fu J, He H B, Zhou X M. Adaptive learning and control for MIMO system based on adaptive dynamic programming. IEEE Transactions on Neural Networks, 2011, 22(7): 1133-1148 [20] He H B, Ni Z, Fu J. A three-network architecture for on-line learning and optimization based on adaptive dynamic programming. Neurocomputing, 2012, 78(1): 3-13 [21] Wei Q L, Zhang H G, Liu D R, Zhao Y. An optimal control scheme for a class of discrete-time nonlinear systems with time delays using adaptive dynamic programming. Acta Automatica Sinica, 2010, 36(1): 121-129 [22] Zhang H G, Wei Q L, Luo Y H. A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 2008, 38(4): 937-942 [23] Wei Q L, Zhang H G, Dai J. Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions. Neurocomputing, 2009, 72(7-9): 1839-1848 [24] Wang D, Liu D R, Wei Q L, Zhao D B, Jin N. Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming. Automatica, 2012, 48(8): 1825-1832 [25] Zhang H G, Cui L L, Zhang X, Luo Y H. Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method. IEEE Transactions on Neural Networks, 2011, 22(12): 2226-2236 [26] Liu D R, Wang D, Zhao D B, Wei Q L, Jin N. Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming. IEEE Transactions on Automation Science and Engineering, 2012, 9(3): 628-634
点击查看大图
计量
- 文章访问数: 2057
- HTML全文浏览量: 69
- PDF下载量: 1944
- 被引次数: 0