-
摘要: 作为多智能体对抗博弈问题的重要分支, 追逃博弈(Pursuit-evasion, PE)问题在控制和机器人领域得到了广泛的应用, 受到众多研究者的密切关注. 追逃博弈问题主要聚焦于追逐者和逃跑者双方为实现各自目标而展开的动态博弈: 追逐者试图在最短时间内抓到逃跑者, 逃跑者的目标则是避免被捕获. 本文概述追逃博弈问题的相关研究进展, 从空间环境、信息获取等五个方面介绍追逃博弈问题的各类设定; 简述理论求解、数值求解等四种当下主流的追逃博弈问题求解方法. 通过对现有研究的总结和分析, 给出几点研究建议, 对未来追逃博弈问题的发展具有一定指导意义.Abstract: As an important branch of multi-agent adversarial games, Pursuit-evasion (PE) games have found widespread applications in the fields of control and robotics, attracting considerable attention from researchers. PE games primarily focus on the dynamic games between pursuer and evader, each striving to achieve their respective objectives: The pursuer aims to capture the evader as quickly as possible, while the evader's goal is to avoid capture. This article provides an overview of the research progress in PE games, and introduces various settings of PE games across five key dimensions, including spatial environment, information acquisition, and so on. It briefly describes four mainstream methods for solving PE games, including theoretical approaches, numerical approaches, and so on. By summarizing and analyzing existing researches, this article offers several research suggestions, which are expected to provide significant guidance for future developments in PE games.
-
Key words:
- Pursuit-evasion (PE) games /
- multi-agent /
- adversarial games /
- differential games
-
A1 代表性追逃博弈文献分类
A1 Classification of representative literature on PE games
求解方法 文献 动态模型 追逃双方数量 维度 信息完整程度 状态空间 一阶积分器 二阶积分器 独轮车 其他 一对一 多对一 多对多 二维平面 三维空间 完全信息 不完全信息 连续 离散 理论求解法 [55] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [52] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [169] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [92] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ 数值求解法 [68] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [6] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [81] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [44] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [39] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ 积分强化学习法 [147] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [140] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [41] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [15] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [35] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [146] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ 几何法 [157] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [91] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [155] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [161] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [102] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [18] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [26] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ [34] $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ $\checkmark $ -
[1] Domenici P, Blagburn J M, Bacon J P. Animal escapology Ⅱ: Escape trajectory case studies. Journal of Experimental Biology, 2011, 214(15): 2474−2494 doi: 10.1242/jeb.053801 [2] Tay N E, Warburton N M, Moseby K E, Fleming P A. Predator escape behaviour in threatened marsupials. Animal Conservation, 2023, 26(4): 587−601 doi: 10.1111/acv.12847 [3] FitzGibbon C D. The costs and benefits of predator inspection behaviour in Thomson's gazelles. Behavioral Ecology and Sociobiology, 1994, 34: 139−148 doi: 10.1007/BF00164184 [4] Scheel D, Packer C. Group hunting behaviour of lions: A search for cooperation. Animal Behaviour, 1991, 41(4): 697−709 doi: 10.1016/S0003-3472(05)80907-8 [5] Wang J, Li G, Liang L, Wang C, Deng F. Pursuit-evasion games of multiple cooperative pursuers and an evader: A biological-inspired perspective. Communications in Nonlinear Science and Numerical Simulation, 2022, 110: Article No. 106386 doi: 10.1016/j.cnsns.2022.106386 [6] Li W. A dynamics perspective of pursuit-evasion: Capturing and escaping when the pursuer runs faster than the agile evader. IEEE Transactions on Automatic Control, 2016, 62(1): 451−457 [7] Li W. The confinement-escape problem of a defender against an evader escaping from a circular region. IEEE Transactions on Cybernetics, 2016, 46(4): 1028−1039 doi: 10.1109/TCYB.2015.2503285 [8] Weintraub I E, Pachter M, Garcia E. An introduction to pursuit-evasion differential games. In: Proceedings of the American Control Conference (ACC). Denver, USA: IEEE, 2020. 1049-1066 [9] Mu Z, Pan J, Zhou Z, Yu J & Cao L. A survey of the pursuit-evasion problem in swarm intelligence. Frontiers of Information Technology & Electronic Engineering, 2023, 24(8): 1093−1116 [10] Isaacs R. Differential Games: A Mathematical Theory With Applications to Warfare and Pursuit, Control and Optimization. New York: John Wiley & Sons, Inc. 1965. [11] Starr A W, Ho Y C. Further properties of nonzero-sum differential games. Journal of Optimization Theory and Applications, 1969, 3: 207−219 doi: 10.1007/BF00926523 [12] Ho Y C. Differential games, dynamic optimization, and generalized control theory. Journal of Optimization Theory and Applications, 1970, 6(3): 179−209 doi: 10.1007/BF00926600 [13] 张嗣瀛. 微分对策. 北京: 科学出版社, 1987.Zhang Si-Ying. Differential Games. Beijing: Science Press, 1987. [14] 刘坤, 郑晓帅, 林业茗, 韩乐, 夏元清. 基于微分博弈的追逃问题最优策略设计. 自动化学报, 2021, 47(8): 1840−1854Liu Kun, Zheng Xiao-Shuai, Lin Ye-Ming, Han Le, Xia Yuan-Qing. Optimal strategy design for pursuit and evasion problem based on differential game. Acta Automatica Sinica, 2021, 47(8): 1840−1854 [15] 耿远卓, 袁利, 黄煌, 汤亮. 基于终端诱导强化学习的航天器轨道追逃博弈. 自动化学报, 2023, 49(5): 974−984Geng Yuan-Zhuo, Yuan Li, Huang Huang, Tang Liang. Terminal induced reinforcement learning for orbital pursuit and evasion game of spacecraft. Acta Automatica Sinica, 2023, 49(5): 974−984 [16] Flynn J. Lion and man: The general case. SIAM Journal on Control, 1974, 12(4): 581−597 doi: 10.1137/0312043 [17] Oyler D W, Kabamba P T, Girard A R. Pursuit-evasion games in the presence of a line segment obstacle. In: Proceedings of the IEEE Conference on Decision and Control(CDC). Los Angeles, USA: IEEE, 2014. 1149-1154 [18] Garcia E, Casbeer D W, Pachter M. Optimal strategies for a class of multi-player reach-avoid differential games in 3d space. IEEE Robotics and Automation Letters, 2020, 5(3): 4257−4264 doi: 10.1109/LRA.2020.2994023 [19] Nath S, Ghose D. A two-phase evasive strategy for a pursuit-evasion problem involving two non-holonomic agents with incomplete information. European Journal of Control, 2022, 68: Article No. 100677 doi: 10.1016/j.ejcon.2022.100677 [20] Oyler D W, Girard A R. Dominance regions in the homicidal chauffeur problem. In: Proceedings of the American Control Conference (ACC). Boston, USA: IEEE, 2016. 2494-2499 [21] Pachter M, Moll A V, Garcia E, Casbeer D, Milutinovi$\acute{c}$. Cooperative pursuit by multiple pursuers of a single evader. Journal of Aerospace Information Systems, 2020, 17(8): 371−389 doi: 10.2514/1.I010739 [22] Bakolas E, Tsiotras P. Optimal pursuit of moving targets using dynamic Voronoi diagrams. In: Proceedings of the IEEE Conference on Decision and Control (CDC). Atlanta, USA: IEEE, 2010. 7431-7436 [23] Ramana M V, Kothari M. Pursuit-evasion games of high speed evader. Journal of Intelligent & Robotic Systems, 2017, 85: 293−306 [24] Shishika D, Kumar V. Local-game decomposition for multiplayer perimeter-defense problem. In: Proceedings of the IEEE Conference on Decision and Control (CDC). Miami, USA: IEEE, 2018. 2093-2100 [25] Liang L, Deng F, Lu M, Chen J. Analysis of role switch for cooperative target defense differential game. IEEE Transactions on Automatic Control, 2020, 66(2): 902−909 [26] Yan R, Shi Z, Zhong Y. Task assignment for multiplayer reach-avoid games in convex domains via analytical barriers. IEEE Transactions on Robotics, 2019, 36(1): 107−124 [27] Zhao Y, Tao Q, Xian C, Li Z, Duan Z. Prescribed-time distributed Nash equilibrium seeking for noncooperation games. Automatica, 2023, 151: Article No. 110933 doi: 10.1016/j.automatica.2023.110933 [28] Xue L, Ye J, Wu Y, Liu J, Wunsch D C. Prescribed-Time Nash Equilibrium Seeking for Pursuit-Evasion Game. IEEE/CAA Journal of Automatica Sinica, 2024, 11(6): 1518−1520 doi: 10.1109/JAS.2023.124077 [29] Bakolas E. Evasion from a group of pursuers with double integrator kinematics. In: Proceedings of the IEEE Conference on Decision and Control. Firenze, Italy: IEEE, 2013. 1472-1477 [30] Selvakumar J, Bakolas E. Evasion from a group of pursuers with a prescribed target set for the evader. In: Proceedings of the American Control Conference (ACC). Boston, USA: IEEE, 2016. 155-160 [31] Coon M, Panagou D. Control strategies for multiplayer target-attacker-defender differential games with double integrator dynamics. In: Proceedings of the IEEE Annual Conference on Decision and Control (CDC). Melbourne, Australia: IEEE, 2017. 1496-1502 [32] Chipade V S, Panagou D. IDCAIS: Inter-defender collision-aware interception strategy against multiple attackers. arXiv preprint arXiv: 2112. 12098, 2021. [33] Chipade V S, Panagou D. Multiagent planning and control for swarm herding in 2-d obstacle environments under bounded inputs. IEEE Transactions on Robotics, 2021, 37(6): 1956−1972 doi: 10.1109/TRO.2021.3072026 [34] Li S, Wang C, Xie G. Optimal strategies for pursuit-evasion differential games of players with damped double integrator dynamics. IEEE Transactions on Automatic Control, 2024, 69(8): 5278−5293 doi: 10.1109/TAC.2023.3346815 [35] Kokolakis N M T, Vamvoudakis K G. Bounded rational Dubins vehicle coordination for target tracking using reinforcement learning. Automatica, 2023, 149: Article No. 110732 doi: 10.1016/j.automatica.2022.110732 [36] Patsko V S, Turova V L. Homicidal chauffeur game: History and modern studies. Advances in Dynamic Games: Theory, Applications and Numerical Methods for Differential and Stochastic Games, 2011227−251 [37] Pachter M, Coates S. The classical homicidal chauffeur game. Dynamic Games and Applications, 2019, 9: 800−850 doi: 10.1007/s13235-018-0264-8 [38] Exarchos I, Tsiotras P, Pachter M. On the suicidal pedestrian differential game. Dynamic Games and Applications, 2015, 5: 297−317 doi: 10.1007/s13235-014-0130-2 [39] Nath S, Ghose D. Worst-case scenario evasive strategies in a two-on-one engagement between Dubins' vehicles with partial information. IEEE Control Systems Letters, 2022, 7: 25−30 [40] Sani M, Robu B, Hably A. Pursuit-evasion game for nonholonomic mobile robots with obstacle avoidance using NMPC. In: Proceedings of the Mediterranean Conference on Control and Automation (MED). Saint-Rapha, France: IEEE, 2020. 978-983 [41] Manoharan A, Thakur P, Singh A K. Multi-agent target defense game with learned defender to attacker assignment. In: Proceedings of the International Conference on Unmanned Aircraft Systems (ICUAS). Warsaw, Poland: IEEE, 2023: 297-304 [42] 祝海. 基于微分对策的航天器轨道追逃最优控制策略[硕士学位论文], 国防科技大学, 中国, 2017.Zhu Hai. Optimal Control of Spacecraft Orbital Pursuit-evasion Based on Differential Game, [Master thesis], National University of Defense Technology, China, 2017. [43] Clohessy W H, Wiltshire R S. Terminal guidance system for satellite rendezvous. Journal of the Aerospace Sciences, 1960, 27(9): 653−658 doi: 10.2514/8.8704 [44] Zhang C, Zhu Y, Yang L, Zeng X. An optimal guidance method for free-time orbital pursuit-evasion game. Journal of Systems Engineering and Electronics, 2022, 33(6): 1294−1308 [45] Venigalla C, Scheeres D. Spacecraft rendezvous and pursuit/evasion analysis using reachable sets. In: Proceedings of the Space Flight Mechanics Meeting. Kissimmee, USA: 2018. 0219 [46] Li Z, Zhu H, Yang Z, Luo Y. A dimension-reduction solution of free-time differential games for spacecraft pursuit-evasion. Acta Astronautica, 2019, 163: 201−210 doi: 10.1016/j.actaastro.2019.01.011 [47] Lin B, Qiao L, Jia Z, Sun Z, Sun M, Zhang W. Control strategies for target-attacker-defender games of USVs. In: Proceedings of the International Conference on Automation, Control and Robotics Engineering (CACRE). Dalian, China: IEEE, 2021. 191-198 [48] Ho Y, Bryson A, Baron S. Differential games and optimal pursuit-evasion strategies. IEEE Transactions on Automatic Control, 1965, 10(4): 385−389 doi: 10.1109/TAC.1965.1098197 [49] Kothari M, Manathara J G, Postlethwaite I. Cooperative multiple pursuers against a single evader. Journal of Intelligent & Robotic Systems, 2017, 86: 551−567 [50] Yufereva O. Lion and man game in compact spaces. Dynamic Games and Applications, 2019, 9(1): 281−292 doi: 10.1007/s13235-018-0239-9 [51] Oyler D W, Kabamba P T, Girard A R. Pursuit-evasion games in the presence of obstacles. Automatica, 2016, 65: 1−11 doi: 10.1016/j.automatica.2015.11.018 [52] Exarchos I, Tsiotras P. An asymmetric version of the two car pursuit-evasion game. In: Proceedings of the IEEE Conference on Decision and Control(CDC). Los Angeles, USA: IEEE, 2014. 4272-4277 [53] Das G, Dorothy M, Bell Z I, Shishika D. Guarding a non-maneuverable translating line with an attached defender. arXiv preprint arXiv: 2209. 09318, 2022 [54] Liang L, Deng F, Wang J, Lu M, Chen J. A reconnaissance penetration game with territorial-constrained defender. IEEE Transactions on Automatic Control, 2022, 67(11): 6295−6302 doi: 10.1109/TAC.2022.3183034 [55] Levchenkov A Y, Pashkov A G. Differential game of optimal approach of two inertial pursuers to a noninertial evader. Journal of Optimization Theory and Applications, 1990, 65: 501−518 doi: 10.1007/BF00939563 [56] Chen J, Zha W, Peng Z, Gu D. Multi-player pursuit-evasion games with one superior evader. Automatica, 2016, 71: 24−32 doi: 10.1016/j.automatica.2016.04.012 [57] Yan R, Shi Z, Zhong Y. Cooperative strategies for two-evader-one-pursuer reach-avoid differential games. International Journal of Systems Science, 2021, 52(9): 1894−1912 doi: 10.1080/00207721.2021.1872116 [58] Liu S Y, Zhou Z, Tomlin C, Hedrick K. Evasion as a team against a faster pursuer. In: Proceedings of the American Control Conference(ACC). Washington, USA: IEEE, 2013. 5368-5373 [59] Wang D, Peng Z. Pursuit-evasion games of multi-players with a single faster player. In: Proceedings of the Chinese Control Conference (CCC). Chengdu, China: IEEE, 2016. 2583-2588 [60] Scott W L, Leonard N E. Optimal evasive strategies for multiple interacting agents with motion constraints. Automatica, 2018, 94: 26−34 doi: 10.1016/j.automatica.2018.04.008 [61] Garcia E, Casbeer D W, Von Moll A, Pachter M. Multiple pursuer multiple evader differential games. IEEE Transactions on Automatic Control, 2020, 66(5): 2345−2350 [62] Wei M, Chen G, Cruz J B, Haynes L S, Pham K, Blasch E. Multi-pursuer multi-evader pursuit-evasion games with jamming confrontation. Journal of Aerospace Computing, Information and Communication, 2007, 4(3): 693−706 doi: 10.2514/1.25329 [63] Wei M, Chen G, Cruz J B, Haynes L S, Chang M H, Blasch E. A decentralized approach to pursuer-evader games with multiple superior evaders in noisy environments. In: Proceedings of the IEEE Aerospace Conference. Big Sky, USA: IEEE, 2007. 1-10 [64] Xu L, Hu B, Guan Z, Cheng X, Li T, Xiao J. Multi-agent deep reinforcement learning for pursuit-evasion game scalability. In: Proceedings of the Chinese Intelligent Systems Conference: Volume I 15th. Singapore: Springer, 2020. 658-669 [65] Li D, Cruz J B, Chen G, Kwan C, Chang M H. A hierarchical approach to multi-player pursuit-evasion differential games. In: Proceedings of the IEEE Conference on Decision and Control(CDC). Seville, Spain: IEEE, 2005. 5674-5679 [66] Li D, Cruz Jr J B, Schumacher C J. Stochastic multi-player pursuit-evasion differential games. International Journal of Robust and Nonlinear Control: IFAC-Affiliated Journal, 2008, 18(2): 218−247 [67] Yan R, Deng R, Lai H, Zhang W, Shi Z, Zhong Y. Multiplayer homicidal chauffeur reach-avoid games via guaranteed winning strategies. arXiv preprint arXiv: 2107. 04709, 2021 [68] LaValle S M, Lin D, Guibas L J, Latombe J C, Motwani. Finding an unpredictable target in a workspace with obstacles. In: Proceedings of the International Conference on Robotics and Automation(ICRA). Albuquerque, USA: IEEE, 1997. 737-742 [69] Razali S, Meng Q, Yang S H. A refined immune systems inspired model for multi-robot shepherding. In: Proceedings of the Second World Congress on Nature and Biologically Inspired Computing (NaBIC). Kitakyushu, Japan: IEEE, 2010. 473-478 [70] Bhadauria D, Gosse S, Pipp J. Capturing an evader in a polygonal environment with obstacles[Online], available: https://www. conservancy. umn. edu/items/256a22f6-ba8c-4b12-a80d-300b7cba947c, June 15, 2024 [71] De Souza C, Newbury R, Cosgun A, Castillo Pedro, Vidolov B, Kuli$acute{c} D$. Decentralized multi-agent pursuit using deep reinforcement learning. IEEE Robotics and Automation Letters, 2021, 6(3): 4552−4559 doi: 10.1109/LRA.2021.3068952 [72] Zhang R, Zong Q, Zhang X, Dou L, Tian B. Game of drones: Multi-UAV pursuit-evasion game with online motion planning by deep reinforcement learning. IEEE Transactions on Neural Networks and Learning Systems, 2022, 34(10): 7900−7909 [73] Liang X, Zhou B, Jiang L, Meng G, Xiu Y. Collaborative pursuit-evasion game of multi-UAVs based on Apollonius circle in the environment with obstacle. Connection Science, 2023, 35(1): Article No. 2168253 doi: 10.1080/09540091.2023.2168253 [74] Garcia E, Casbeer D W, Pachter M. Optimal strategies of the differential game in a circular region. IEEE Control Systems Letters, 2019, 4(2): 492−497 [75] Casini M, Garulli A. A novel family of pursuit strategies for the lion and man problem. In: Proceedings of the IEEE Annual Conference on Decision and Control (CDC). Melbourne, Australia: IEEE, 2017. 6436-6441 [76] Yan R, Mi S, Duan X, Chen J, Ji X. Pursuit winning strategies for Reach-Avoid games with polygonal obstacles. arXiv preprint arXiv: 2403. 06202, 2024 [77] Flynn J O. Lion and man: The boundary constraint. SIAM Journal on Control, 1973, 11(3): 397−411 doi: 10.1137/0311032 [78] Yan R, Shi Z, Zhong Y. Defense game in a circular region. In: Proceedings of the IEEE Annual Conference on Decision and Control (CDC). Melbourne, Australia: IEEE, 2017. 5590-5595 [79] Ruiz U, Isler V. Capturing an omnidirectional evader in convex environments using a differential drive robot. IEEE Robotics and Automation Letters, 2016, 1(2): 1007−1013 doi: 10.1109/LRA.2016.2530854 [80] Okabe A, Boots B, Sugihara K, Chiu S N, Kendall D G. Concepts and Applications of Voronoi Diagrams. New York: John Wiley, 2000. [81] Pierson A, Wang Z, Schwager M. Intercepting rogue robots: An algorithm for capturing multiple evaders with multiple pursuers. IEEE Robotics and Automation Letters, 2016, 2(2): 530−537 [82] Li S, Wang C, Xie G. Pursuit-evasion differential games of players with different speeds in spaces of different dimensions. In: Proceedings of the American Control Conference (ACC). Atlanta, USA: IEEE, 2022. 1299-1304 [83] Zhang R, Li S, Wang C, Xie G. Optimal strategies for the game with two faster 3D pursuers and one slower 2D evader. In: Proceedings of the Chinese Control Conference (CCC). Hefei, China: IEEE, 2022. 1767-1772 [84] Zhi J, Hao Y, Vo C, Morales M, Lien J M. Computing 3-d from-region visibility using visibility integrity. IEEE Robotics and Automation Letters, 2019, 4(4): 4286−4291 doi: 10.1109/LRA.2019.2931280 [85] Chen N, Li L, Mao W. Equilibrium strategy of the Pursuit-Evasion game in three-dimensional space. IEEE/CAA Journal of Automatica Sinica, 2024, 11(2): 446−458 doi: 10.1109/JAS.2023.123996 [86] Shen H X, Casalino L. Revisit of the three-dimensional orbital pursuit-evasion game. Journal of Guidance, Control and Dynamics, 2018, 41(8): 1823−1831 doi: 10.2514/1.G003127 [87] Yan R, Duan X, Shi Z, Zhong Y, Bullo F. Matching-based capture strategies for 3D heterogeneous multiplayer reach-avoid differential games. Automatica, 2022, 140: Article No. 110207 doi: 10.1016/j.automatica.2022.110207 [88] Dobbie J M. Solution of some surveillance-evasion problems by the methods of differential games. In: Proceedings of the International Conference on Operational Research. New York, USA: John Wiley & Sons, 1966 [89] Lewin J, Breakwell J V. The surveillance-evasion game of degree. Journal of Optimization Theory and Applications, 1975, 16: 339−353 doi: 10.1007/BF01262940 [90] Greenfeld I. A differential game of surveillance evasion of two identical cars. Journal of Optimization Theory and Applications, 1987, 52: 53−79 doi: 10.1007/BF00938464 [91] Bopardikar S D, Bullo F, Hespanha J P. Sensing limitations in the lion and man problem. In: Proceedings of the American Control Conference(ACC). New York, USA: IEEE, 2007. 5958-5963 [92] Lopez V G, Lewis F L, Wan Y, Sanchez E N, Fan L. Solutions for multiagent pursuit-evasion games on communication graphs: Finite-time capture and asymptotic behaviors. IEEE Transactions on Automatic Control, 2019, 65(5): 1911−1923 [93] Zemskov K A, Pashkow A G. Construction of optimal position strategies in a differential pursuit-evasion game with one pursuer and two evaders. Journal of Applied Mathematics and Mechanics, 1997, 61(3): 391−399 doi: 10.1016/S0021-8928(97)00050-6 [94] Liu S Y, Zhou Z, Tomlin C, Hedrick J K. Evasion of a team of dubins vehicles from a hidden pursuer. In: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA). Hong Kong, China: IEEE, 2014. 6771-6776 [95] Von Moll A, Garcia E, Casbeer D, Suresh M, Swar S C. Multiple-pursuer, single-evader border defense differential game. Journal of Aerospace Information Systems, 2020, 17(8): 407−416 doi: 10.2514/1.I010740 [96] Kartal Y, Subbarao K, Dogan A, Lewis F. Optimal game theoretic solution of the pursuit-evasion intercept problem using on-policy reinforcement learning. International Journal of Robust and Nonlinear Control, 2021, 31(16): 7886−7903 doi: 10.1002/rnc.5719 [97] J. E. Littlewood, A Mathematician 's Miscellany. Cambridge: Cambridge University Press, 1953. [98] Guy R K. Unsolved Problems in Combinatorial Games. Boston: Combinatorics Advances, 1995. 161-179 [99] Kohlenbach U, López-Acedo G, Nicolae A. A uniform betweenness property in metric spaces and its role in the quantitative analysis of the "lion-man" game. Pacific Journal of Mathematics, 2021, 310(1): 181−212 doi: 10.2140/pjm.2021.310.181 [100] Sgall J. Solution of David Gale's lion and man problem. Theoretical Computer Science, 2001, 259(1-2): 663−670 doi: 10.1016/S0304-3975(00)00411-4 [101] Casini M, Garulli A. An improved lion strategy for the lion and man problem. IEEE Control Systems Letters, 2017, 1(1): 38−43 doi: 10.1109/LCSYS.2017.2702652 [102] Casini M, Garulli A. A new class of pursuer strategies for the discrete-time lion and man problem. Automatica, 2019, 100: 162−170 doi: 10.1016/j.automatica.2018.11.015 [103] Casini M, Criscuoli M, Garulli A. A discrete-time pursuit-evasion game in convex polygonal environments. Systems & Control Letters, 2019, 125: 22−28 [104] Bopardikar S D, Bullo F, Hespanha J P. Cooperative pursuit with sensing limitations. In: Proceedings of the American Control Conference(ACC). New York, USA: IEEE, 2007. 5394-5399 [105] Li D, Cruz J B. Graph-based strategies for multi-player pursuit evasion games. In: Proceedings of the IEEE Conference on Decision and Control(CDC). New Orleans, USA: IEEE, 2007. 4063-4068 [106] Chen H, Kalyanam K, Zhang W, Casbeer D. Intruder isolation on a general road network under partial information. IEEE Transactions on Control Systems Technology, 2016, 25(1): 222−234 [107] Kalyanam K, Casbeer D, Pachter M. Graph search of a moving ground target by a UAV aided by ground sensors with local information. Autonomous Robots, 2020, 44: 831−843 doi: 10.1007/s10514-019-09900-0 [108] Sundaram S, Kalyanam K, Casbeer D W. Pursuit on a graph under partial information from sensors. In: Proceedings of the American Control Conference (ACC). Seattle, USA: IEEE, 2017. 4279-4284 [109] Dong X, Zhang H, Ming Z. Adaptive optimal control via Q-learning for multi-agent Pursuit-Evasion games. IEEE Transactions on Circuits and Systems Ⅱ: Express Briefs, 2024 [110] Sugihara K, Suzuki I. Optimal algorithms for a pursuit-evasion problem in grids. SIAM Journal on Discrete Mathematics, 1989, 2(1): 126−143 doi: 10.1137/0402013 [111] Dawes R W. Some pursuit-evasion problems on grids. Information Processing Letters, 1992, 43(5): 241−247 doi: 10.1016/0020-0190(92)90218-K [112] Bhattacharya S, Banerjee A, Bandyopadhyay S. CORBA-based analysis of multi agent behavior. Journal of Computer Science and Technology, 2005, 20: 118−124 doi: 10.1007/s11390-005-0013-5 [113] Bhattacharya S, Paul G, Sanyal S. A cops and robber game in multidimensional grids. Discrete Applied Mathematics, 2010, 158(16): 1745−1751 doi: 10.1016/j.dam.2010.06.014 [114] Das S, Gahlawat H. Variations of cops and robbers game on grids. Discrete Applied Mathematics, 2021, 305: 340−349 doi: 10.1016/j.dam.2020.02.004 [115] Lewin J, Olsder G J. The isotropic rocket—a surveillance evasion game. Computers & Mathematics with Applications, 1989, 18(1-3): 15−34 [116] Altaher M, Elmougy S, Nomir O. Intercepting a superior missile: A reachability analysis of an Apollonius circle-based multiplayer differential game. Int. J. Innov. Comput. Inf. Control, 2019, 15(1): 369−381 [117] Jang J S, Tomlin C. Control strategies in multi-player pursuit and evasion game. In: Proceedings of the AIAA Guidance, Navigation and Control Conference and Exhibit. Stanford, USA: 2005. 6239 [118] M. J. Osborne and A. Rubinstein. A Course in Game Theory. Cambridge: MIT Press Books, 1994. [119] Başar T, Olsder G J. Dynamic Noncooperative Game Theory. New York: Academic Press Inc, 1998. [120] Krasovskij N N, Subbotin A I, Kotz S. Game-theoretical Control Problems. New York: Springer, 1988. [121] Mizukami K, Eguchi K. A geometrical approach to problems of pursuit-evasion games. Journal of the Franklin Institute, 1977, 303(4): 371−384 doi: 10.1016/0016-0032(77)90118-1 [122] Samatov B T, Soyibboev U B. Differential game with a lifeline for the inertial movements of players. Ural Mathematical Journal, 2021, 7(2): 94−109 doi: 10.15826/umj.2021.2.007 [123] Pontryagin L S. On the theory of differential games. Russian Mathematical Surveys, 1966, 21(4): 193 doi: 10.1070/RM1966v021n04ABEH004171 [124] Petrov N N. "Soft" capture in pontryagin's example with many participants. Journal of Applied Mathematics and Mechanics, 2003, 67(5): 671−680 doi: 10.1016/S0021-8928(03)90040-2 [125] Blagodatskikh A I. On group pursuit problem in Pontryagin's nonstationary example. Vestn. Udmurt. Gos. Univ. Ser. Mat, 2007, 1: 17−24 [126] Petrov N N, Solov'eva N A. Multiple capture in Pontryagin's recurrent example. Automation and Remote Control, 2016, 77: 855−861 doi: 10.1134/S0005117916050088 [127] Lewin J. Differential Games: Theory and Methods for Solving Game Problems with Singular Surfaces. London: Springer Science & Business Media, 2012. [128] Wise K A, Sedwick J L. Successive approximation solution of the HJI equation. In: Proceedings of the IEEE Conference on Decision and Control(CDC). Lake Buena Vista, USA: IEEE, 1994, 2. 1387-1391 [129] Yang X, Liu D, Ma H, Xu Y. Online approximate solution of HJI equation for unknown constrained-input nonlinear continuous-time systems. Information Sciences, 2016, 328: 435−454 doi: 10.1016/j.ins.2015.09.001 [130] Earl M G, D'Andrea R. Modeling and control of a multi-agent system using mixed integer linear programming. In: Proceedings of the IEEE Conference on Decision and Control(CDC). Las Vegas, USA: IEEE, 2002, 1 . 107-111 [131] Ni Y, Gao S, Huang S, Xiang C, Ren Q, Lee T H. Multi-agent cooperative pursuit-evasion control using gene expression programming. In: Proceedings of the IECON Annual Conference of the IEEE Industrial Electronics Society. Toronto, Canada: IEEE, 2021. 1-6 [132] Asgharnia A, Schwartz H M, Atia M. Multi-invader multi-defender differential game using reinforcement learning. In: Proceedings of the International Conference on Fuzzy Systems (FUZZ-IEEE). Padua, Italy: IEEE, 2022: 1-8 [133] Kachroo P, Shedied S A, Bay J S, Vanlandingham H. Dynamic programming solution for a class of pursuit evasion problems: The herding problem. IEEE Transactions on Systems, Man and Cybernetics, 2001, 31(1): 35−41 doi: 10.1109/5326.923266 [134] Hespanha J P, Pappas G J, Prandini M. Greedy control for hybrid pursuit-evasion games. In: Proceedings of the European Control Conference(ECC). Los Angeles, USA: 2001. 2621-2626 [135] Cristiani E, Falcone M. Fully-discrete schemes for the value function of pursuit-evasion games with state constraints. Advances in Dynamic Games and Their Applications: Analytical and Numerical Developments, 20091−30 [136] Andeson G. Feedback control for pursuit-evasion problems between two spacecraftbased on differential dynamic programming. In: Proceedings of the Aerospace Sciences Meeting. Los Angeles, USA: 1977. 34 [137] Alexopoulos A, Schmidt T, Badreddin E. A pursuit-evasion game between unmanned aerial vehicles. In: Proceedings of the International Conference on Informatics in Control, Automation and Robotics (ICINCO). Vienna, Austria: IEEE, 2014, 2. 74-81 [138] Tan M. Multi-agent reinforcement learning: independent vs. cooperative agents. In: Proceedings of the International Conference on Machine Learning. Amherst, USA: 1993. 330-337 [139] Xing J, Zeng X. A deep reinforcement learning method for lion and man problem. In: Proceedings of the Chinese Control Conference (CCC). Shanghai, China: IEEE, 2021. 8366-8371 [140] Qu X, Gan W, Song D, Zhou L. Pursuit-evasion game strategy of USV based on deep reinforcement learning in complex multi-obstacle environment. Ocean Engineering, 2023, 273: Article No. 114016 doi: 10.1016/j.oceaneng.2023.114016 [141] Wan K, Wu D, Zhai Y, Li B, Gao X, Hu Z. An improved approach towards multi-agent pursuit-evasion game decision-making using deep reinforcement learning. Entropy, 2021, 23(11): Article No. 1433 doi: 10.3390/e23111433 [142] Li B, Wang J, Song C, Yang Z, Wan K, Zhang Q. Multi-UAV roundup strategy method based on deep reinforcement learning CEL-MADDPG algorithm. Expert Systems with Applications, 2024, 245: Article No. 123018 doi: 10.1016/j.eswa.2023.123018 [143] Li X, Li Z, Zheng X, Yang X, Yu X. The study of crash-tolerant, multi-agent offensive and defensive games using deep reinforcement learning. Electronics, 2023, 12(2): Article No. 327 doi: 10.3390/electronics12020327 [144] Vamvoudakis K G, Lewis F L. Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations. Automatica, 2011, 47(8): 1556−1569 doi: 10.1016/j.automatica.2011.03.005 [145] Kokolakis N M T, Kanellopoulos A, Vamvoudakis K G. Bounded rational unmanned aerial vehicle coordination for adversarial target tracking. In: Proceedings of the American Control Conference (ACC). Denver, USA: IEEE, 2020. 2508-2513 [146] Xiong H, Zhang Y. Reinforcement learning-based formation-surrounding control for multiple quadrotor UAVs pursuit-evasion games. ISA Transactions, 2024, 145: 205−224 doi: 10.1016/j.isatra.2023.12.006 [147] Zhou Z, Xu H. Decentralized optimal large scale multi-player pursuit-evasion strategies: A mean field game approach with reinforcement learning. Neurocomputing, 2022, 484: 46−58 doi: 10.1016/j.neucom.2021.01.141 [148] Vrabie D. Online adaptive optimal control for continuous-time systems[Ph. D. dissertation], The University of Texas at Arlington, USA, 2010 [149] Gong Z, He B, Liu G, Zhang X. Solution for Pursuit-Evasion game of agents by adaptive dynamic programming. Electronics, 2023, 12(12): Article No. 2595 doi: 10.3390/electronics12122595 [150] Gong Z, He B, Hu C, Zhang X, Kang W. Online adaptive dynamic programming-based solution of networked multiple-pursuer and single-evader game. Electronics, 2022, 11(21): Article No. 3583 doi: 10.3390/electronics11213583 [151] Sun J, Liu C. Finite-horizon differential games for missile-target interception system using adaptive dynamic programming with input constraints. International Journal of Systems Science, 2018, 49(2): 264−283 doi: 10.1080/00207721.2017.1401153 [152] Zhang Z X, Zhang K, Xie X P, Sun J Y. Fixed-time zero-sum Pursuit-Evasion game control of multi-satellite via adaptive dynamic programming. IEEE Transactions on Aerospace and Electronic Systems, 2024, 60(2): 2224−2235 doi: 10.1109/TAES.2024.3351810 [153] Vamvoudakis K G, Lewis F L, Hudas G R. Multi-agent differential graphical games: Online adaptive learning solution for synchronization with optimality. Automatica, 2012, 48(8): 1598−1611 doi: 10.1016/j.automatica.2012.05.074 [154] Peng C, Liu X, Ma J. Design of safe optimal guidance with obstacle avoidance using control barrier function-based actor-critic reinforcement learning. IEEE Transactions on Systems, Man and Cybernetics, 2023, 53(11): 6861−6873 doi: 10.1109/TSMC.2023.3288826 [155] Oyler D W, Kabamba P T, Girard A R. Dominance in pursuit-evasion games with uncertainty. In: Proceedings of the IEEE Conference on Decision and Control (CDC). Osaka, Japan: IEEE, 2015. 5859-5864 [156] Agasti A, Reddy P V, Bhikkaji B. Optimal role assignment for multiplayer Reach-Avoid differential games in 3D space. arXiv preprint arXiv: 2303. 07885, 2023 [157] Meier L. A new technique for solving pursuit-evasion differential games. IEEE Transactions on Automatic Control, 1969, 14(4): 352−359 doi: 10.1109/TAC.1969.1099226 [158] Getz W M, Pachter M. Capturability in a two-target'game of two cars'. Journal of Guidance and Control, 1981, 4(1): 15−21 doi: 10.2514/3.19715 [159] Scott W, Leonard N E. Dynamics of pursuit and evasion in a heterogeneous herd. In: Proceedings of the IEEE Conference on Decision and Control(CDC). Los Angeles, USA: IEEE, 2014. 2920-2925 [160] Garcia E, Fuchs Z E, Milutinovic D, Casbeer D W, Pachter M. A geometric approach for the cooperative two-pursuer one-evader differential game. IFAC-PapersOnLine, 2017, 50(1): 15209−15214 doi: 10.1016/j.ifacol.2017.08.2366 [161] Zhou Z, Zhang W, Ding J, Huang H, Stipanovi$\acute{c}$ D M, Tomlin C J. Cooperative pursuit with Voronoi partitions. Automatica, 2016, 72: 64−72 doi: 10.1016/j.automatica.2016.05.007 [162] Zhou S, Li H, Chen Z. Optimal containment strategies on high-speed evader using multiple pursuers with point-capture. In: Proceedings of the Chinese Control Conference(CCC). Tianjin, China: IEEE, 2023. 01-06 [163] Zhang Z, Zhang D, Zhang Q, Pan W, Hu T. DACOOP-A: decentralized adaptive cooperative pursuit via attention. IEEE Robotics and Automation Letters, 2023, 9(6): 5504−5511 [164] Sun Z, Sun H, Li P, Zou J. Cooperative strategy for pursuit-evasion problem with collision avoidance. Ocean Engineering, 2022, 266: Artice No. 112742 doi: 10.1016/j.oceaneng.2022.112742 [165] Isler V, Sun D, Sastry S. Roadmap based pursuit-evasion and collision avoidance. In: Proceedings of the Robotics: Science and Systems. Berkeley, USA: 2005. 1: 257-264 [166] Selvakumar J, Bakolas E. Feedback strategies for a reach-avoid game with a single evader and multiple pursuers. IEEE Transactions on Cybernetics, 2019, 51(2): 696−707 [167] Shishika D, Paulos J, Kumar V. Cooperative team strategies for multi-player perimeter-defense games. IEEE Robotics and Automation Letters, 2020, 5(2): 2738−2745 doi: 10.1109/LRA.2020.2972818 [168] Garcia E, Casbeer D W, Pachter M. Active target defense using first order missile models. Automatica, 2017, 78: 139−143 doi: 10.1016/j.automatica.2016.12.032 [169] Bera R, Makkapati V R, Kothari M. A comprehensive differential game theoretic solution to a game of two cars. Journal of Optimization Theory and Applications, 2017, 174: 818−836 doi: 10.1007/s10957-017-1134-z
计量
- 文章访问数: 38
- HTML全文浏览量: 24
- 被引次数: 0