异构非线性多智能体系统无模型输出一致性控制

孙一仆; 陈鑫; 贺文朋; 佘锦华; 吴敏

doi:10.16383/j.aas.c240459

cstr: 32138.14.j.aas.c240459

孙一仆^{1, 2, 3}, 陈鑫^{1, 2, 3}, 贺文朋^{1, 2, 3}, 佘锦华^{4}, 吴敏^{1, 2, 3}

1. 中国地质大学(武汉)自动化学院, 武汉 430074, 中国
2. 复杂系统先进控制与智能自动化湖北省重点实验室, 武汉 430074, 中国
3. 地球探测智能化技术教育部工程研究中心, 武汉 430074, 中国
4. 东京工科大学, 东京 192-0982, 日本

基金项目: 高等学校学科创新引智计划(B17040), 湖北省科技创新重大专项(2020AEA010), 国家自然科学基金(61873248), 湖北省自然科学基金(2020CFA031), 国家电网公司科技专项(52153216000R)资助

作者简介:
孙一仆：中国地质大学(武汉)自动化学院博士研究生. 主要研究方向为多智能体系统, 强化学习. E-mail: 20141000976@cug.edu.cn

陈鑫：中国地质大学(武汉)自动化学院教授. 主要研究方向为智能控制, 过程控制, 机器人运动控制. 本文通信作者. E-mail: chenxin@cug.edu.cn

贺文朋：中国地质大学(武汉)自动化学院博士研究生. 主要研究方向为多智能体系统分布式控制. E-mail: wenpenghe@cug.edu.cn

佘锦华：日本东京工科大学教授. 主要研究方向为重复控制, 机电系统的高精度控制, 康复机器人, 计算智能的工业应用. E-mail: she@stf.teu.ac.jp

吴敏：中国地质大学(武汉)自动化学院教授. 主要研究方向为过程控制, 鲁棒控制和智能系统. E-mail: wumin@cug.edu.cn

Publication history
- Received: 2024-07-01
- Accepted: 2024-11-11
- Published online: 2024-12-18
- Issue date: 2025-03-18

Model-free Output Consensus Control for Heterogeneous Nonlinear Multi-agent Systems

SUN Yi-Pu^{1, 2, 3}, CHEN Xin^{1, 2, 3}, HE Wen-Peng^{1, 2, 3}, SHE Jin-Hua^{4}, WU Min^{1, 2, 3}

1. School of Automation, China University of Geosciences, Wuhan 430074, China
2. Hubei Key Laboratory of Advanced Control and Intelligent Automation for Complex Systems, Wuhan 430074, China
3. Engineering Research Center of Intelligent Technology for Geo-exploration, Ministry of Education, Wuhan 430074, China
4. Tokyo University of Technology, Tokyo 192-0982, Japan

Funds: Supported by the 111 Project (B17040), Technical Innovation Major Project of Hubei Province (2020AEA010), National Natural Science Foundation of China (61873248), Natural Science Foundation of Hubei Province (2020CFA031), and Science and Technology Project of State Grid Corporation of China (52153216000R)

Author Bio:
SUN Yi-Pu　Ph.D. candidate at the School of Automation, China University of Geosciences. His research interests include multi-agent systems and reinforcement learning.

CHEN Xin　Professor at the School of Automation, China University of Geosciences. His research interests include intelligent control, process control, and robot motion control. Corresponding author of this paper.

HE Wen-Peng　Ph.D. candidate at the School of Automation, China University of Geosciences. His main research interest is distributed control of multi-agent systems.

SHE Jin-Hua　Professor at the Tokyo University of Technology, Japan. His research interests include repetitive control, high-precision control of mechatronic systems, rehabilitation robots, and industrial applications of computational intelligence.

WU Min　Professor at the School of Automation, China University of Geosciences. His research interests include process control, robust control, and intelligent systems.

摘要

摘要: 针对异构非线性多智能体系统(Multi-agent system, MAS)的输出一致性控制难题, 设计了一种基于同胚分布式控制协议的无模型方法. 通过将输出反馈线性化理论与自适应动态规划相结合, 可以在不需要精确系统模型的情况下实现非线性智能体的线性化, 简化分布式控制器的设计复杂性. 具体而言, 设计一种双层分布式控制结构, 在物理空间层通过无模型反馈线性化方法实现未知系统线性化, 在微分同构空间层利用线性控制技术进行分布式共识控制. 通过两个实验验证了所提方法在处理未知异构非线性多智能体系统中的有效性, 将传统的线性分布式控制方法扩展到未知非线性多智能体系统的控制器设计.
关键词: 非线性多智能体系统, 无模型输出共识控制, 微分同胚, 输入输出反馈线性化, 自适应动态规划

Abstract: A model-free method based on a homeomorphic distributed control protocol is proposed to address the output consensus control problem of heterogeneous nonlinear multi-agent systems (MASs). By integrating output feedback linearization theory with adaptive dynamic programming, the approach linearizes nonlinear agents without requiring precise system models, which simplifies the design of distributed controllers. Specifically, a two-layer distributed control structure is designed: in the physical space layer, model-free feedback linearization is applied to linearize the unknown systems, while in the diffeomorphic space layer, linear control techniques are used for distributed consensus control. Two experiments validate the effectiveness of the proposed method for unknown heterogeneous nonlinear multi-agent systems, extending traditional linear distributed control methods to controller design for unknown nonlinear multi-agent systems.
Key words: Nonlinear multi-agent system, model-free output consensus control, diffeomorphism, input-output feedback linearization, adaptive dynamic programming
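
The two-layer structure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the paper learns the linearizing controller from data with adaptive dynamic programming, whereas this sketch assumes the agent models are known so that only the layering is visible. The agent dynamics in `AGENTS`, the ring topology in `NEIGHBORS`, and the `coupling` gain are all hypothetical choices made for illustration. The lower layer cancels each agent's nonlinearity so that its output behaves like a single integrator, and the upper layer runs a standard linear consensus law on those outputs.

```python
import math

# Hypothetical heterogeneous first-order agents: dx/dt = f_i(x) + g_i(x) * u, y = x.
# Each g_i is bounded away from zero so the linearizing input is well defined.
AGENTS = [
    (lambda x: -0.5 * x**3, lambda x: 1.0 + math.cos(x) ** 2),
    (lambda x: math.sin(x), lambda x: 2.0),
    (lambda x: -x + 0.3 * x**2, lambda x: 1.5 + 0.5 * math.sin(x)),
    (lambda x: 0.2 * math.tanh(x), lambda x: 1.0 + x**2),
]

# Hypothetical undirected ring topology over four agents.
NEIGHBORS = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}


def simulate(y0, coupling=1.0, dt=0.01, steps=2000):
    """Simulate output consensus with forward Euler integration."""
    y = list(y0)
    n = len(y)
    for _ in range(steps):
        # Upper layer: linear consensus law on the linearized outputs.
        v = [-coupling * sum(y[i] - y[j] for j in NEIGHBORS[i]) for i in range(n)]
        # Lower layer: feedback linearization (model assumed known here).
        u = [(v[i] - f(y[i])) / g(y[i]) for i, (f, g) in enumerate(AGENTS)]
        # Integrate the true nonlinear dynamics under the input u.
        y = [y[i] + dt * (f(y[i]) + g(y[i]) * u[i]) for i, (f, g) in enumerate(AGENTS)]
    return y


if __name__ == "__main__":
    print(simulate([2.0, -1.0, 0.5, 3.0]))
```

Because the assumed graph is undirected and connected, the outputs settle near the average of the initial outputs (1.125 for the values above). Replacing the known `f` and `g` with a learned linearizing controller is the part the paper addresses and is outside the scope of this sketch.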

图 1 同胚分布式控制协议结构图

Fig. 1 Structure diagram of homeomorphic distributed control protocol

图 2 无模型反馈线性化学习模块

Fig. 2 Model-free feedback linearization learning module

图 3 通讯拓扑

Fig. 3 Communication topology

图 4 学习前后输出和一致性误差轨迹对比

Fig. 4 Comparison of output and consensus error trajectories before and after learning

图 5 智能体双评价网络权值更新轨迹

Fig. 5 Agent dual-critic network weight update trajectory

图 6 智能体奖励网络权值更新轨迹

Fig. 6 Agent reward network weight update trajectory

图 7 网络更新损失演化轨迹

Fig. 7 Evolution trajectory of network update loss

图 8 学习收敛后输出一致性轨迹切换实验

Fig. 8 Output consensus trajectory switching experiment after learning convergence

表 1 异构多智能体系统参数

Table 1 Heterogeneous multi-agent system parameters

Variable	Value (m)	Variable	Value (m)	Variable	Value (m)
$m_1$	0.04	$m_2$	0.04	$m_3$	0.06
$h_1$	0.06	$h_2$	0.04	$h_3$	0.06
$m_4$	0.06	$m_5$	0.08	$m_6$	0.08
$h_4$	0.04	$h_5$	0.06	$h_6$	0.04

表 2 学习参数

Table 2 Learning parameters

Parameter	Value	Parameter	Value	Parameter	Value
$\eta_r$	0.05	$\eta_c$	0.02	$\eta_a$	0.01
$\gamma$	0.9	$\mu_j$	0.01	$\mu_\lambda$	0.01
$\varepsilon_i$	0.08	$H$	$[1, 0.2]$
