2.845

2023影响因子

(CJCR)

  • 中文核心
  • EI
  • 中国科技核心
  • Scopus
  • CSCD
  • 英国科学文摘

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

基于改进动态系统稳定估计器的机器人技能学习方法

金聪聪 刘安东 LIU Steven 张文安

金聪聪, 刘安东, LIU Steven, 张文安. 基于改进动态系统稳定估计器的机器人技能学习方法. 自动化学报, 2022, 48(7): 1771−1781 doi: 10.16383/j.aas.c200341
引用本文: 金聪聪, 刘安东, LIU Steven, 张文安. 基于改进动态系统稳定估计器的机器人技能学习方法. 自动化学报, 2022, 48(7): 1771−1781 doi: 10.16383/j.aas.c200341
Jin Cong-Cong, Liu An-Dong, Liu Steven, Zhang Wen-An. A robot skill learning method based on improved stable estimator of dynamical systems. Acta Automatica Sinica, 2022, 48(7): 1771−1781 doi: 10.16383/j.aas.c200341
Citation: Jin Cong-Cong, Liu An-Dong, Liu Steven, Zhang Wen-An. A robot skill learning method based on improved stable estimator of dynamical systems. Acta Automatica Sinica, 2022, 48(7): 1771−1781 doi: 10.16383/j.aas.c200341

基于改进动态系统稳定估计器的机器人技能学习方法

doi: 10.16383/j.aas.c200341
基金项目: 浙江省自然科学基金重大项目(LD21F030002), 国家自然科学基金(61822311, 61973275)资助
详细信息
    作者简介:

    金聪聪:浙江工业大学信息工程学院硕士生. 主要研究方向为人体动作识别和机器人技能学习. E-mail: jcc19960602@gmail.com

    刘安东:浙江工业大学信息工程学院副教授. 主要研究方向为模型预测控制, 网络化控制系统和移动机器人. E-mail: lad@zjut.edu.cn

    LIU Steven:凯泽斯劳滕工业大学电气与计算机工程系教授. 主要研究方向为机电和电力系统控制, 机器人技术, 网络控制和基于模型的故障诊断. E-mail: sliu@eit.uni-kl.de

    张文安:浙江工业大学信息工程学院教授. 主要研究方向为多源信息融合和机器人技能学习. 本文通信作者. E-mail: wazhang@zjut.edu.cn

A Robot Skill Learning Method Based on Improved Stable Estimator of Dynamical Systems

Funds: Supported by Zhejiang Provincial Natural Science Foundation of China (LD21F030002) and National Natural Science Foundation of China (61822311, 61973275)
More Information
    Author Bio:

    JIN Cong-Cong Master student at the College of Information Engineering, Zhejiang University of Technology. His research interest covers human motion recognition and robot skills learning

    LIU An-Dong Associate professor at the College of Information Engineering, Zhejiang University of Technology. His research interest covers model predictive control, networked control systems and mobile robots

    LIU Steven Professor in the Department of Electrical and Computer Engineering, University of Kaiserslautern. His research interest covers control of mechatronic and power systems, robotics, networked control and model-based fault diagnosis

    ZHANG Wen-An Professor at the College of Information Engineering, Zhejiang University of Technology. His research interest covers information fusion estimation and robot skill learning. Corresponding author of this paper

  • 摘要: 提出一种基于改进动态系统稳定估计器的机器人技能学习方法. 现有的动态系统稳定估计器方法可以通过非线性优化来确保学习系统的全局稳定性, 但是存在确定高斯混合分量个数困难以及稳定性和精度无法兼顾的问题. 因此, 根据贝叶斯非参数模型可以自动确定合适分量个数的特性, 采用狄利克雷过程高斯混合模型对演示进行初始拟合. 随后利用参数化二次李雅普诺夫函数重新推导新的稳定性约束, 有效地解决了动态系统稳定估计器方法中稳定性和精度难以兼顾的问题. 最后, 在LASA数据库和Franka-panda机器人上的实验验证了新方法的有效性和优越性.
  • 图  1  基于SEDS的机器人示教学习流程图

    Fig.  1  Flow chart of LfD based on SEDS

    图  2  DPGMM概率结构图

    Fig.  2  DPGMM probability structure diagram

    图  3  DPGMM对数似然值关于超参数的变化趋势

    Fig.  3  The change trend of DPGMM log-likelihood value on hyperparameters

    图  4  i-SEDS方法在LASA数据集上的复现效果

    Fig.  4  The reproductions of i-SEDS method on LASA dataset

    图  5  SEDS和i-SEDS在不收缩轨迹上的复现结果

    Fig.  5  i-SEDS and SEDS reproductions on the non-contractive demonstrations

    图  6  扫描误差区域(SEA)示意图

    Fig.  6  Schematic diagram of Swept error area (SEA)

    图  7  4种方法在收缩和不收缩轨迹上的SEA分析

    Fig.  7  Performance analysis of four methods on contraction and non-contraction demonstrations with SEA

    图  8  物品搬运任务示教流程图

    Fig.  8  The process of object transport task teaching

    图  9  预处理后的物品搬运任务的演示轨迹

    Fig.  9  Preprocessed demonstrations of object transport task

    图  10  物品搬运任务复现位置和速度轨迹

    Fig.  10  Position and velocity trajectories of reproduction in object transport task

    图  11  机械臂运行时改变目标位置后的轨迹变化

    Fig.  11  Trajectory changes after moving the target container when the robot arm is running

    图  12  对机械臂施加外界扰动时的轨迹变化

    Fig.  12  Trajectory changes when external disturbance is applied to the robot arm

    表  1  4种GMR算法在数据库LASA上的性能比较

    Table  1  Performance comparison of four GMR algorithmson database LASA

    方法总RMSE总训练时间 (${\rm s}$)
    BIC-GMM (EM)269.4375.26
    DdGMM (VI)206.1549.39
    DPGMM (Gibbs)118.58157.48
    DPGMM (VI)130.7939.79
    下载: 导出CSV
  • [1] 曾超, 杨辰光, 李强, 戴诗陆. 人-机器人技能传递研究进展. 自动化学报, 2019, 45(10): 1813-1828.

    Zeng Chao, Yang Chen-Guang, Li Qiang, Dai Shi-Lu. Research progress on human-robot skill transfer. Acta Automatica Sinica, 2019, 45(10): 1813–1828.
    [2] 刘乃军, 鲁涛, 蔡莹皓, 王硕. 机器人操作技能学习方法综述. 自动化学报, 2019, 45(3): 458-470.

    Liu Nai-Jun1, Lu Tao, Cai Ying-Hao, Wang Shuo. A review of robot manipulation skills learning methods. Acta Automatica Sinica, 2019, 45(3): 458-470.
    [3] 秦方博, 徐德. 机器人操作技能模型综述. 自动化学报, 2019; 45(8): 1401-1418.

    Qing Fang-Bo; Xu De. Review of robot manipulation skill models. Acta Automatica Sinica, 2019, 45(8): 1401–1418.
    [4] Ravichandar H, Polydoros A S, Chernova S, Billard A. Recent advances in robot learning from demonstration. Annual Review of Control, Robotics, and Autonomous Systems, 2020, 3(1): 297-330. doi: 10.1146/annurev-control-100819-063206
    [5] Brock O, Khatib O. Elastic Strips: A Framework for Integrated Planning and Execution. London: Experimental Robotics VI, 2000. 329−338
    [6] Billard A, Hayes G. DRAMA, A connectionist architecture for control and learning in autonomous robots. Adaptive Behavior, 1999, 7(1): 35-63. doi: 10.1177/105971239900700103
    [7] Ijspeert A J, Nakanishi J, Hoffmann H, Pastor P, Schaal Stefan. Dynamical movement primitives: learning attractor models for motor behaviors. Neural Computation, 2012, 25(2): 328-373.
    [8] Gribovskaya E, Khansari-Zadeh S M, Billard A. Learning non-linear multivariate dynamics of motion in robotic manipulators. The International Journal of Robotics Research, 2011, 30(1): 80-117. doi: 10.1177/0278364910376251
    [9] Khansari-Zadeh S M, Billard A. Learning stable nonlinear dynamical systems with Gaussian mixture models. IEEE Transactions on Robotics, 2011, 27(5): 943-957. doi: 10.1109/TRO.2011.2159412
    [10] Yang C G, Chen C Z, He W, Cui R X, Li Z J. Robot learning system based on adaptive neural control and dynamic movement primitives. IEEE Transactions on Neural Networks and Learning Systems 2019, 30(3): 777-787. doi: 10.1109/TNNLS.2018.2852711
    [11] Neumann K, Steil J J. Learning robot motions with stable dynamical systems under diffeomorphic transformations. Robotics and Autonomous Systems, 2015, 70: 1-15. doi: 10.1016/j.robot.2015.04.006
    [12] Duan J H, Ou Y S, Hu J B, Wang Z Y, Jin S K, Xu C. Fast and stable learning of dynamical systems based on extreme learning machine. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2019, 49(6): 1175-1185. doi: 10.1109/TSMC.2017.2705279
    [13] Jin S K, Wang Z Y, Ou Y S, Feng W. Learning accurate and stable dynamical system under manifold immersion and submersion. IEEE Transactions on Neural Networks and Learning Systems, 2019, 30(12): 3598-3610. doi: 10.1109/TNNLS.2019.2892207
    [14] Schwarz G. Estimating the Dimension of a model. The annals of statistics, 1978, 6(2): 461–464. doi: 10.1214/aos/1176344136
    [15] Chatzis S P., Korkinof D, Demiris Y. A nonparametric Bayesian approach toward robot learning by demonstration. Robotics and Autonomous Systems, 2012, 60(6): 789-802. doi: 10.1016/j.robot.2012.02.005
    [16] Blei D M, Jordan M I. Variational methods for the Dirichlet process. In: Proceedings of the twenty-first international conference on Machine learning. New York, USA: Association for Computing Machinery, 2004. 12
    [17] Müller P, Quintana F A. Nonparametric Bayesian data analysis. Statistical Science, 2004, 19(1): 95-110. doi: 10.1214/088342304000000017
    [18] Blei D M., Kucukelbir A, McAuliffe J D. Variational inference: A review for statisticians. Journal of the American Statistical Association, 2017, 112(518): 859-877. doi: 10.1080/01621459.2017.1285773
    [19] Figueroa N, Billard A. A physically-consistent Bayesian non-parametric mixture model for dynamical system learning. In: Proceedings of the Machine Learning Research. Zurich, Switzerland: CoRL, 2018. 927−946
    [20] Deisenroth M P, Rasmussen C E, Peters J. Gaussian process dynamic programming. Neurocomputing, 2009, 72(7): 1508-1524.
    [21] Vijayakumar S, D’Souza A, Schaal S. Incremental online learning in high dimensions. Neural Computation, 2005, 17(12): 2602-2634. doi: 10.1162/089976605774320557
    [22] Ferguson T S. A Bayesian Analysis of Some Nonparametric Problems. The Annals of Statistics, 1973, 1(2): 209-230. doi: 10.1214/aos/1176342360
    [23] Ahmed A, Xing E P. Dynamic non-parametric mixture models and the recurrent Chinese restaurant process: With applications to evolutionary clustering. In: Proceedings of the 2008 SIAM International Conference on Data Mining. Atlanta, USA: 2008. 219−230
    [24] Blei D M., Jordan M I. Variational inference for Dirichlet process mixtures. Bayesian Anal, 2006, 1(1): 121-143. doi: 10.1214/06-BA104
    [25] Chatzis S P., Kosmopoulos D I., Varvarigou T A. signal modeling and classification using a robust latent space model based on t distributions. IEEE Transactions on Signal Processing, 2008, 56(3): 949-963. doi: 10.1109/TSP.2007.907912
    [26] Bishop C M. Pattern Recognition and Machine Learning. New York: Springer, 2006. 461−522
    [27] Zhu J L, Ge Z Q, Song Z H. Variational Bayesian Gaussian mixture regression for soft sensing key variables in non-Gaussian industrial processes. IEEE Transactions on Control Systems Technology, 2017, 25(3): 1092-1099. doi: 10.1109/TCST.2016.2576999
    [28] Khansari-Zadeh S M, Billard A. Learning control Lyapunov function to ensure stability of dynamical system-based robot reaching motions. Robotics and Autonomous Systems, 2014, 62(6): 752-765. doi: 10.1016/j.robot.2014.03.001
  • 加载中
图(12) / 表(1)
计量
  • 文章访问数:  1382
  • HTML全文浏览量:  359
  • PDF下载量:  223
  • 被引次数: 0
出版历程
  • 收稿日期:  2020-05-22
  • 录用日期:  2020-09-30
  • 网络出版日期:  2022-06-08
  • 刊出日期:  2022-07-20

目录

    /

    返回文章
    返回