Robust Long-term Object Tracking With Adaptive Scale and Rotation Estimation

XIONG Dan, LU Hui-Min, XIAO Jun-Hao, ZHENG Zhi-Qiang

Citation: XIONG Dan, LU Hui-Min, XIAO Jun-Hao, ZHENG Zhi-Qiang. Robust Long-term Object Tracking With Adaptive Scale and Rotation Estimation. ACTA AUTOMATICA SINICA, 2019, 45(2): 289-304. doi: 10.16383/j.aas.2018.c170359

doi: 10.16383/j.aas.2018.c170359

Funds: 

National Natural Science Foundation of China 61503401

National Natural Science Foundation of China 61403409

China Postdoctoral Science Foundation 2014M562648

More Information
    Author Bio:

     XIONG Dan  Ph. D. candidate at the College of Intelligence Science and Technology, National University of Defense Technology. He received his master degree from National University of Defense Technology in 2012. His research interest covers robot vision and visual SLAM. E-mail: xiongdan@nudt.edu.cn

     XIAO Jun-Hao  Lecturer at the College of Intelligence Science and Technology, National University of Defense Technology. He received his bachelor degree from National University of Defense Technology in 2007, and Ph. D. degree from University of Hamburg, Germany in 2013. His research interest covers 3D perception for mobile robots and multi-robot coordination. E-mail: junhao.xiao@ieee.org

     ZHENG Zhi-Qiang  Professor at the College of Intelligence Science and Technology, National University of Defense Technology. He received his Ph. D. degree from University of Liege, Belgium in 1994. His research interest covers multi-robot coordination control and flight control. E-mail: zqzheng@nudt.edu.cn

    Corresponding author: LU Hui-Min  Associate professor at the College of Intelligence Science and Technology, National University of Defense Technology. He received his Ph. D. degree from National University of Defense Technology in 2010. His research interest covers robot vision, visual SLAM and robot soccer. Corresponding author of this paper. E-mail: lhmnew@nudt.edu.cn
  • Abstract: Scale variation and rotation of the target pose great challenges to long-term object tracking. To address this problem, this paper proposes a robust object tracking algorithm with adaptive scale and rotation estimation. First, to handle the scale changes and rotational motion of the target during tracking, a method for estimating the target's scale and rotation parameters based on the Fourier-Mellin transform and kernelized correlation filtering is proposed. The method estimates the scale and rotation parameters in a continuous space, and the kernelized correlation filtering improves the robustness and accuracy of the estimation. Second, because tracking failures are sometimes unavoidable in long-term tracking (for example, due to long-lasting partial or full occlusion), a target search method based on histograms and variance weighting is proposed. When the target is lost, the proposed search method can quickly locate the image regions where the target is likely to appear, giving the tracker the ability to recover from failures. In addition, two kernelized correlation filters are trained to estimate the confidence of the tracking result and the target translation, respectively; the dedicated kernelized correlation filter makes the estimated tracking confidence more accurate and robust. The estimated confidence is used to activate the histogram- and variance-weighted target search module and to decide whether a search window contains the target. Comparative experiments with state-of-the-art trackers on the online object tracking benchmark (OTB) verify the effectiveness and superiority of the proposed algorithm.
    1)  Recommended by Associate Editor ZUO Wang-Meng
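
To make the scale and rotation estimation described above concrete, the following is a minimal, self-contained Python sketch of the Fourier-Mellin idea the abstract refers to: resampling a target patch on a log-polar grid turns changes of scale and rotation into plain translations, which correlation in the Fourier domain can then recover. It is an illustration under stated assumptions rather than the authors' implementation; in particular, the kernelized correlation filter used in the paper is replaced here by simple phase correlation, the patch is assumed to be single-channel and already centred on the target by the translation filter, and the function names (log_polar, estimate_scale_rotation) are illustrative.

```python
# Minimal sketch of Fourier-Mellin style scale/rotation recovery (not the
# authors' code): log-polar resampling maps scale to a shift along the
# log-radius axis and rotation to a shift along the angle axis; phase
# correlation then reads off both shifts at once.
import numpy as np


def log_polar(patch, n_r=64, n_theta=64):
    """Resample a square grayscale patch onto a log-polar grid (nearest neighbour)."""
    h, w = patch.shape
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    r_max = min(cy, cx)
    log_step = np.log(r_max) / n_r
    rho = np.exp(np.arange(n_r) * log_step)             # radii from 1 px to ~r_max
    theta = np.arange(n_theta) * (2.0 * np.pi / n_theta)
    rr, tt = np.meshgrid(rho, theta, indexing="ij")     # shape (n_r, n_theta)
    ys = np.clip(np.round(cy + rr * np.sin(tt)).astype(int), 0, h - 1)
    xs = np.clip(np.round(cx + rr * np.cos(tt)).astype(int), 0, w - 1)
    return patch[ys, xs]


def estimate_scale_rotation(prev_patch, cur_patch, n_r=64, n_theta=64):
    """Estimate (scale, rotation in degrees) of cur_patch relative to prev_patch.

    Both patches must have the same shape and be centred on the target
    (the tracker's translation filter is assumed to have done that).
    """
    a = log_polar(prev_patch, n_r, n_theta)
    b = log_polar(cur_patch, n_r, n_theta)
    # Phase correlation: the peak of the normalized cross-power spectrum
    # gives the cyclic shift (log-radius bins, angle bins) between a and b.
    A, B = np.fft.fft2(a), np.fft.fft2(b)
    cross = np.conj(A) * B
    corr = np.real(np.fft.ifft2(cross / (np.abs(cross) + 1e-8)))
    dr, dt = np.unravel_index(np.argmax(corr), corr.shape)
    if dr > n_r // 2:                                    # wrap to signed shifts
        dr -= n_r
    if dt > n_theta // 2:
        dt -= n_theta
    r_max = (min(prev_patch.shape) - 1) / 2.0
    log_step = np.log(r_max) / n_r
    scale = float(np.exp(dr * log_step))
    rotation_deg = dt * 360.0 / n_theta
    return scale, rotation_deg


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    patch = rng.random((64, 64))
    print(estimate_scale_rotation(patch, patch))         # -> (1.0, 0.0)
```

With identical input patches the sketch returns (1.0, 0.0); in the paper, the log-polar representation is instead fed to a dedicated kernelized correlation filter, which is what makes the scale and rotation estimate continuous and more robust than plain phase correlation.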
  • Fig.  1  The kernelized correlation filtering model learning based on the Fourier-Mellin transform

    Fig.  2  The kernelized correlation filtering models for the estimation of object translation and the confidence of the tracking result

    Fig.  3  The architecture of robust long-term object tracking with adaptive scale and rotation estimation

    Fig.  4  Illustration of the ground-truth target annotations, the nonstandard rectangles produced by our trackers, and the enclosing rectangles of those nonstandard rectangles

    Fig.  5  Precision plots and success rate plots of tracking algorithms evaluated by OPE, SRE and TRE standards

    Fig.  6  Precision plots and success rate plots of tracking algorithms evaluated by OPE standard under different visual tracking challenges

    Fig.  7  Tracking results using RLOT, ROT, SRDCF[19], LCT[23], TLD[14] and Struck[28] on 11 OTB image sequences

    Table  1  The parameters of the three kernelized correlation filters

    Kernelized correlation filter | Translation estimation | Confidence estimation | Scale-rotation estimation
    Gaussian kernel width $\sigma$ | 0.6 | 0.6 | 0.4
    Learning rate $\beta$ | 0.012 | 0.012 | 0.075
    Gaussian label width $s$ | 0.125$\sqrt{mn}$ | 0.125$\sqrt{mn}$ | 0.075($d$, $a$)
    Regularization parameter $\alpha$ | $10^{-4}$ | $10^{-4}$ | $5\times10^{-5}$
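
The values in Table 1 are the usual hyper-parameters of kernelized correlation filter (KCF) training (cf. Henriques et al. [16]). As a reading aid, the sketch below shows where each parameter enters a generic single-channel KCF: the Gaussian label width $s$ shapes the desired correlation output, $\sigma$ is the Gaussian kernel width, $\alpha$ is the ridge-regression regularizer, and $\beta$ linearly interpolates the model over time. This is a hedged, generic sketch, not the authors' code; the scale-rotation filter of the paper additionally operates on the log-polar (Fourier-Mellin) representation, which is omitted here, and the helper names (gaussian_labels, train_kcf, detect, update_model) are illustrative.

```python
# Generic single-channel KCF training/detection sketch (illustrative, not the
# authors' code), showing where the Table 1 hyper-parameters enter:
#   sigma -> Gaussian kernel width, s -> Gaussian label width,
#   alpha -> ridge regularizer, beta -> model learning rate.
import numpy as np


def gaussian_labels(m, n, s):
    """Desired correlation output: a 2D Gaussian of width s with its peak at (0, 0)."""
    ys = np.arange(m) - m // 2
    xs = np.arange(n) - n // 2
    Y, X = np.meshgrid(ys, xs, indexing="ij")
    g = np.exp(-0.5 * (Y ** 2 + X ** 2) / (s ** 2))
    return np.roll(np.roll(g, -(m // 2), axis=0), -(n // 2), axis=1)


def gaussian_correlation(x, z, sigma):
    """Gaussian kernel correlation k^{xz} over all cyclic shifts (single channel)."""
    cross = np.real(np.fft.ifft2(np.fft.fft2(x) * np.conj(np.fft.fft2(z))))
    d2 = np.maximum(np.sum(x ** 2) + np.sum(z ** 2) - 2.0 * cross, 0.0)
    return np.exp(-d2 / (sigma ** 2 * x.size))


def train_kcf(patch, sigma=0.6, reg_alpha=1e-4):
    """Closed-form ridge regression in the Fourier domain: dual_hat = y_hat / (k_hat + alpha)."""
    m, n = patch.shape
    y = gaussian_labels(m, n, 0.125 * np.sqrt(m * n))    # label width s from Table 1
    k = gaussian_correlation(patch, patch, sigma)        # kernel auto-correlation k^{xx}
    return np.fft.fft2(y) / (np.fft.fft2(k) + reg_alpha)


def detect(dual_hat, model_patch, new_patch, sigma=0.6):
    """Correlation response on a new patch: the peak location gives the translation,
    and the peak value can serve as a confidence score (cf. the confidence filter)."""
    k = gaussian_correlation(new_patch, model_patch, sigma)
    response = np.real(np.fft.ifft2(dual_hat * np.fft.fft2(k)))
    dy, dx = np.unravel_index(np.argmax(response), response.shape)
    return (int(dy), int(dx)), float(response.max())


def update_model(old, new, beta=0.012):
    """Linear model interpolation with learning rate beta (Table 1)."""
    return (1.0 - beta) * old + beta * new


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.random((64, 64))
    dual_hat = train_kcf(x)                  # translation / confidence filter settings
    shift, confidence = detect(dual_hat, x, x)
    print(shift, round(confidence, 3))       # peak at (0, 0) for the training patch itself
```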

    Table  2  The visual tracking challenges included in the 11 image sequences selected from the OTB datasets

    Image sequence | Illumination variation | Out-of-plane rotation | Scale variation | Partial occlusion | Deformation | Motion blur | Fast motion | In-plane rotation | Out of view | Background clutter | Low resolution
    David | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 0
    CarScale | 0 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0
    Dog1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0
    FaceOcc2 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0
    Jogging-2 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0
    Lemming | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 0
    MotorRolling | 1 | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 1
    Shaking | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0
    Singer2 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 0
    Tiger1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0
    Soccer | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 0

    Table  3  The average frame rates of different object tracking algorithms

    Tracking algorithm | Frame rate (fps)
    RLOT | 36
    SRDCF | 4
    LCT | 27.4
    KCF | 167
    DSST | 27
    TLD | 20
    Struck | 28
  • [1] Yilmaz A, Javed O, Shah M. Object tracking:a survey. ACM Computing Surveys (CSUR), 2006, 38(4):Article No. 13.1-13.45 http://d.old.wanfangdata.com.cn/Periodical/jsjyyyj200912002
    [2] Hu W M, Tan T N, Wang L, Maybank S. A survey on visual surveillance of object motion and behaviors. IEEE Transactions on Systems, Man, and Cybernetics, Part C:Applications and Reviews, 2004, 34(3):334-352 doi: 10.1109/TSMCC.2004.829274
    [3] Li X, Hu W M, Shen C H, Zhang Z F, Dick A, van den Hengel A. A survey of appearance models in visual object tracking. ACM Transactions on Intelligent Systems and Technology (TIST), 2013, 4(4):Article No. 58.1-58.48 http://d.old.wanfangdata.com.cn/OAPaper/oai_arXiv.org_1303.4803
    [4] Yin Hong-Peng, Chen Bo, Chai Yi, Liu Zhao-Dong. Vision-based object detection and tracking:a review. Acta Automatica Sinica, 2016, 42(10):1466-1489 (in Chinese) http://www.aas.net.cn/CN/abstract/abstract18935.shtml
    [5] Guan Hao, Xue Xiang-Yang, An Zhi-Yong. Advances on application of deep learning for video object tracking. Acta Automatica Sinica, 2016, 42(6):834-847 (in Chinese) http://www.aas.net.cn/CN/abstract/abstract18874.shtml
    [6] Wu Y, Lim J, Yang M H. Online object tracking: a benchmark. In: Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Portland, OR, USA: IEEE, 2013. 2411-2418
    [7] Wu Y, Lim J, Yang M H. Object tracking benchmark. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9):1834-1848 doi: 10.1109/TPAMI.2014.2388226
    [8] Kristan M, Matas J, Leonardis A, Felsberg M, Cehovin L, Fernandez G, Vojir T, Hager G, Nebehay G, Pflugfelder R, Gupta A, Bibi A, Lukezic A, Garcia-Martin A, Saffari A, Petrosino A, Montero A S et al. The visual object tracking VOT2015 challenge results. In: Proceedings of the 2015 IEEE International Conference on Computer Vision Workshops (ICCVW). Santiago, Chile: IEEE, 2015. 564-586
    [9] Wang H Z, Suter D, Schindler K, Shen C H. Adaptive object tracking based on an effective appearance filter. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007, 29(9):1661-1667 doi: 10.1109/TPAMI.2007.1112
    [10] Black M J, Jepson A D. Eigen tracking: robust matching and tracking of articulated objects using a view-based representation. In: Proceedings of the 4th European Conference on Computer Vision. Cambridge, UK: Springer Berlin Heidelberg, 1996. 329-342
    [11] Chin T J, Suter D. Incremental kernel principal component analysis. IEEE Transactions on Image Processing, 2007, 16(6):1662-1674 doi: 10.1109/TIP.2007.896668
    [12] Grabner H, Grabner M, Bischof H. Real-time tracking via on-line boosting. In: Proceedings of the 2006 British Machine Vision Conference. Edinburgh, England: BMVA Press, 2006. 1: 47-56
    [13] Zhang K H, Zhang L, Yang M H. Fast compressive tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014, 36(10):2002-2015 doi: 10.1109/TPAMI.2014.2315808
    [14] Kalal Z, Mikolajczyk K, Matas J. Tracking-learning-detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(7):1409-1422 doi: 10.1109/TPAMI.2011.239
    [15] Bolme D S, Beveridge J R, Draper B A, Lui Y M. Visual object tracking using adaptive correlation filters. In: Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR). San Francisco, CA, USA: IEEE, 2010. 2544-2550
    [16] Henriques J F, Caseiro R, Martins P, Batista J. High-speed tracking with kernelized correlation filters. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(3):583-596 doi: 10.1109/TPAMI.2014.2345390
    [17] Li Y, Zhu J K. A scale adaptive kernel correlation filter tracker with feature integration. In: Proceedings of the 2014 European Conference on Computer Vision Workshops. Zurich, Switzerland: Springer Berlin Heidelberg, 2014. 254-265
    [18] Danelljan M, Häger G, Shahbaz K F, Felsberg M. Accurate scale estimation for robust visual tracking. In: Proceedings of the 2014 British Machine Vision Conference. Nottingham, UK: BMVA Press, 2014. 65.1-65.11
    [19] Danelljan M, Häger G, Shahbaz K F, Felsberg M. Learning spatially regularized correlation filters for visual tracking. In: Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV). Santiago, Chile: IEEE, 2015. 4310-4318
    [20] Bertinetto L, Valmadre J, Golodetz S, Miksik O, Torr P H S. Staple: complementary learners for real-time tracking. In: Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas, USA: IEEE, 2016. 1401-1409
    [21] Zuo W M, Wu X H, Lin L, Zhang L, Yang M-H. Learning support correlation filters for visual tracking. arXiv preprint arXiv: 1601.06032, 2016.
    [22] Xu Jian-Qiang, Lu Yao. Robust visual tracking via weighted spatio-temporal context learning. Acta Automatica Sinica, 2015, 41(11):1901-1912 (in Chinese) http://www.aas.net.cn/CN/abstract/abstract18765.shtml
    [23] Ma C, Yang X K, Zhang C Y, Yang M H. Long-term correlation tracking. In: Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Boston, MA, USA: IEEE, 2015. 5388-5396
    [24] Zokai S, Wolberg G. Image registration using log-polar mappings for recovery of large-scale similarity and projective transformations. IEEE Transactions on Image Processing, 2005, 14(10):1422-1434 doi: 10.1109/TIP.2005.854501
    [25] Reddy B S, Chatterji B N. An FFT-based technique for translation, rotation, and scale-invariant image registration. IEEE Transactions on Image Processing, 1996, 5(8):1266-1271 doi: 10.1109/83.506761
    [26] Sarvaiya J N, Patnaik S, Kothari K. Image registration using log polar transform and phase correlation to recover higher scale. Journal of Pattern Recognition Research, 2012, 7(1):90-105
    [27] Adam A, Rivlin E, Shimshoni I. Robust fragments-based tracking using the integral histogram. In: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. New York, NY, USA: IEEE, 2006, 1: 798-805
    [28] Hare S, Golodetz S, Saffari A, Vineet V, Cheng M M, Hicks S L, Torr P H S et al. Struck:structured output tracking with kernels. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38(10):2096-2109 doi: 10.1109/TPAMI.2015.2509974
    [29] Zhong W, Lu H C, Yang M H. Robust object tracking via sparsity-based collaborative model. In: Proceedings of the 2012 IEEE International Conference on Computer Vision and Pattern Recognition (CVPR). Providence, RI, USA: IEEE, 2012. 1838-1845
    [30] Jia X, Lu H C, Yang M H. Visual tracking via adaptive structural local sparse appearance model. In: Proceedings of the 2012 IEEE International Conference on Computer Vision and Pattern Recognition (CVPR). Providence, RI, USA: IEEE, 2012. 1822-1829
    [31] Dinh T B, Vo N, Medioni G. Context tracker: exploring supporters and distracters in unconstrained environments. In: Proceedings of the 2011 IEEE International Conference on Computer Vision and Pattern Recognition (CVPR). Colorado Springs, CO, USA: IEEE, 2011. 1177-1184
Publication history
  • Received:  2017-06-29
  • Accepted:  2017-11-17
  • Published:  2019-02-20
