基于深度匹配的由稀疏到稠密大位移运动光流估计

陈震; 张道文; 张聪炫; 汪洋

doi:10.16383/j.aas.c190716

基于深度匹配的由稀疏到稠密大位移运动光流估计

doi: 10.16383/j.aas.c190716

陈震^1,,
张道文^1,,
张聪炫^1,,
汪洋^1,

1.
南昌航空大学无损检测技术教育部重点实验室南昌 330063

基金项目: 国家自然科学基金(61866026, 61772255), 江西省优势科技创新团队(20165BCB19007), 江西省杰出青年人才计划(20192BCB23011), 航空科学基金(2018ZC56008)资助

详细信息

作者简介:
陈震：南昌航空大学测试与光电工程学院教授. 主要研究方向为图像检测与智能识别, 计算机视觉.E-mail: dr_chenzhen@163.com

张道文：南昌航空大学测试与光电工程学院硕士研究生. 主要研究方向为图像处理与模式识别.E-mail: daowenzhang@163.com

张聪炫：南昌航空大学测试与光电工程学院教授. 主要研究方向为图像检测与智能识别. 本文通信作者. E-mail: zcxdsg@163.com

汪洋：南昌航空大学测试与光电工程学院助教. 主要研究方向为数字图像处理. E-mail: 70876@nchu.edu.cn

计量
- 文章访问数: 661
- HTML全文浏览量: 1006
- PDF下载量: 160
- 被引次数: 0
出版历程
- 收稿日期: 2019-10-16
- 录用日期: 2020-04-06
- 网络出版日期: 2022-08-10
- 刊出日期: 2022-09-16

Sparse-to-dense Large Displacement Motion Optical Flow Estimation Based on Deep Matching

1.
Key Laboratory of Nondestructive Testing, Ministry of Education, Nanchang Hangkong University, Nanchang 330063

Funds: Supported by National Natural Science Foundation of China (61866026, 61772255), Advantage Subject Team of Jiangxi Province (20165BCB19007), Outstanding Young Scientist Project of Jiangxi Province (20192BCB23011), and Aeronautical Science Foundation of China (2018ZC56008)

More Information

Author Bio:
CHEN Zhen　Professor at the School of Measuring and Optical Engineering, Nanchang Hangkong University. His research interest covers image detection and intelligent recognition, and computer vision

ZHANG Dao-Wen　Master student at the School of Measuring and Optical Engineering, Nanchang Hangkong University. His research interest covers image processing and pattern recognition

ZHANG Cong-Xuan　Professor at the School of Measuring and Optical Engineering, Nanchang Hangkong University. His research interest covers image detection and intelligent recognition. Corresponding author of this paper

WANG Yang　Assistant at the School of Measuring and Optical Engineering, Nanchang Hangkong University. His main research interest is digital image processing

摘要

摘要: 针对非刚性大位移运动场景的光流计算准确性与鲁棒性问题, 提出一种基于深度匹配的由稀疏到稠密大位移运动光流估计方法. 首先利用深度匹配模型计算图像序列相邻帧的初始稀疏运动场; 其次采用网格化邻域支持优化模型筛选具有较高置信度的图像网格和匹配像素点, 获得鲁棒的稀疏运动场; 然后对稀疏运动场进行边缘保护稠密插值, 并设计全局能量泛函优化求解稠密光流场. 最后分别利用MPI-Sintel和KITTI数据库提供的测试图像集对本文方法和Classic + NL, DeepFlow, EpicFlow以及FlowNetS等变分模型、匹配策略和深度学习光流计算方法进行综合对比与分析, 实验结果表明本文方法相对于其他方法具有更高的光流计算精度, 尤其在非刚性大位移和运动遮挡区域具有更好的鲁棒性与可靠性.
- 稠密光流 /
- 深度匹配 /
- 邻域支持 /
- 图像网格 /
- 全局优化
Abstract: In order to improve the accuracy and robustness of optical flow estimation under the non-rigid large displacement motion, we propose a sparse-to-dense large displacement motion optical flow estimation method based on deep matching. First, we utilize the deep matching model to compute an initial sparse motion field from the consecutive two frames of the image sequence. Second, we adopt the gridded neighborhood support optimization scheme to extract the image grids and matching pixels which have the high confidence, and acquire the robust sparse motion field. Third, we interpolate the sparse motion field to obtain the dense motion field and plan a global energy function to estimate the optimized dense optical flow. Finally, we respectively employ the MPI-Sintel and KITTI datasets to compare the performance of the proposed method with several variational, region matching and deep-learning based optical flow approaches including Classic + NL, DeepFlow, EpicFlow and FlowNetS models. The experimental results indicate that the proposed method has the higher computational accuracy compared with those of the other state-of-the-art approaches, especially owns the better robustness and reliability in the areas of non-rigid large displacements and motion occlusions.
- Dense optical flow /
- deep matching /
- neighborhood support /
- image grid /
- global optimization

HTML全文

图 1 基于区域划分的深度匹配采样窗口示意图 ((a)参考帧采样窗口; (b)传统匹配方法采样窗口;(c)深度匹配算法采样窗口)

Fig. 1 Illustration of the deep matching sample window based on the regional division ((a) Sample window of the reference frame; (b) Sample window of the traditional matching method; (c) Sample window of the deep matching method)

下载: 全尺寸图片幻灯片

图 2 深度匹配金字塔采样示意图 ((a)第1帧子区域聚合; (b)第2帧子区域聚合)

Fig. 2 Illustration of the pyramid sampling based deep matching ((a) Subregion polymerization of the first frame; (b) Subregion polymerization of the second frame)

下载: 全尺寸图片幻灯片

图 3 本文邻域支持模型运动场优化效果 (蓝色标记符表示匹配正确像素点, 红色标记符表示匹配错误像素点)

Fig. 3 Optimization effect of the motion field by using the proposed neighborhood supporting model (The blue mark indicates the correct matching pixels, the red mark denotes the false matching pixels)

下载: 全尺寸图片幻灯片

图 4 不同参数设置对本文光流估计精度的影响

Fig. 4 Variation of optical flow estimation results respect to different parameters

下载: 全尺寸图片幻灯片

图 5 非刚性大位移与运动遮挡图像序列光流估计结果

Fig. 5 Optical flow results of the image sequences including non-rigidly large displacements and motion occlusions

下载: 全尺寸图片幻灯片

图 6 KITTI数据库测试图像序列光流误差图

Fig. 6 Optical flow error maps of KITTI dataset

下载: 全尺寸图片幻灯片

图 7 MPI-Sintel数据库消融实验光流图

Fig. 7 Optical flow results of the ablation experiment tested on MPI-Sintel database

下载: 全尺寸图片幻灯片

表 1 MPI-Sintel数据库光流估计误差对比

Table 1 Comparison results of optical flow errors on MPI-Sintel database

对比方法	AAE	AEE
Classic+NL^[7]	10.12	5.75
DeepFlow^[18]	7.88	4.12
EpicFlow^[21]	8.32	5.10
FlowNetS^[12]	7.55	4.07
本文方法	7.08	3.74

下载: 导出CSV

表 2 非刚性大位移与运动遮挡图像序列光流估计误差对比

Table 2 Comparison results of optical flow errors on the image sequences including non-rigidly large displacements and motion occlusions

对比方法	平均误差	Ambush_5	Cave_2	Market_2	Market_5	Temple_2
对比方法	AAE/AEE	AAE/AEE	AAE/AEE	AAE/AEE	AAE/AEE	AAE/AEE
Classic+NL^[7]	14.71/9.28	22.53/11.06	15.78/14.03	7.64/0.98	18.93/16.59	8.39/3.72
DeepFlow^[18]	10.89/6.66	18.86/8.75	9.23/9.30	8.00/0.85	12.19/11.89	6.15/2.50
EpicFlow^[21]	10.64/6.47	19.19/8.48	7.45/7.81	7.91/0.89	12.15/12.47	6.48/2.72
FlowNetS^[12]	15.63/9.77	25.37/12.43	17.24./15.66	8.56/1.26	16.56/15.24	10.45/4.24
本文方法	9.77/6.12	18.43/8.43	6.98/7.49	7.05/0.78	10.58/11.35	5.83/2.56

下载: 导出CSV

表 3 KITTI数据库光流估计误差对比

Table 3 Comparison results of optical flow errors on KITTI database

对比方法	AEE_noc	AEE_all	out_noc (%)	out_all (%)
Classic+NL^[7]	11.98	12.59	13.78	20.37
DeepFlow^[18]	5.43	6.13	9.51	14.72
EpicFlow^[21]	6.38	7.56	10.92	14.18
FlowNetS^[12]	239.8	285.8	86.64	86.68
本文方法	4.72	5.05	8.01	9.52

下载: 导出CSV

表 4 本文方法消融实验结果对比

Table 4 Comparison results of the ablation experiment

消融模型	Alley_2	Cave_4	Market_6
本文方法	0.07	1.16	3.72
无匹配优化	0.09	1.28	5.07
无稠密插值	0.14	1.31	5.85
无全局优化	0.09	1.21	3.84

下载: 导出CSV

表 5 本文方法与其他方法时间消耗对比(s)

Table 5 Comparison of time consumption between the proposed method and the other approaches (s)

对比方法	MPI-Sintel	KITTI
Classic+NL^[7]	565	211
DeepFlow^[18]	19.0	21.2
EpicFlow^[21]	16.4	16.0
FlowNetS^[12]	0.10	1.05
本文方法	88.2	104

下载: 导出CSV

参考文献(24)

[1]	潘超, 刘建国, 李峻林. 昆虫视觉启发的光流复合导航方法. 自动化学报, 2015, 41(6): 1102-1112. PAN Chao, LIU Jian-Guo, LI Jun-Lin. An optical flow-based composite navigation method inspired by insect vision. Acta Automatica Sinica, 2015, 41(6): 1102-1112 (in Chinese).
[2]	Colque R, Caetano C, Andrade M, Schwartz W. Histograms of optical flow orientation and magnitude and entropy to detect anomalous events in Videos. IEEE Transactions on Circuits and Systems for Video Technology, 2017, 27(3): 673-682. doi: 10.1109/TCSVT.2016.2637778
[3]	王飞, 崔金强, 陈本美, 李崇兴. 一套完整的基于视觉光流和激光扫描测距的室内无人机导航系统. 自动化学报, 2013, 39(11): 1889-1900. WANG Fei, CUI Jin-Qiang, CHEN Ben-Mei, LEE Tong H. A comprehensive uav indoor navigation system based on vision optical flow and laser fastSLAM. Acta Automatica Sinica, 2013, 39(11): 1889-1900 (in Chinese).
[4]	张桂梅, 孙晓旭, 刘建新, 储珺. 基于分数阶微分的TV-L1光流模型的图像配准方法研究. 自动化学报, 2017, 43(12): 2213-2224. ZHANG Gui-Mei, SUN Xiao-Xu, LIU Jian-Xin, CHU Jun. Research on TV-L1 optical flow model for image registration based on fractional-order differentiation. Acta Automatica Sinica, 2017, 43(12): 2213-2224 (in Chinese).
[5]	Horn B K P, Schunck B G. Determining optical flow. Artificial Intelligence, 1980, 17(1): 185- 203.
[6]	Brox T, Bruhn A, Papenberg N, Weickert J. High accuracy optical flow estimation based on a theory for warping. In: Proceedings of the 2004 European Conference on Computer Vision. Prague, Czech Republic: Springer, 2004. 25−36
[7]	Sun D, Roth S, Black M J. A quantitative analysis of current practices in optical flow estimation and the principles behind them. International Journal of Computer Vision, 2014, 106(2): 115-137. doi: 10.1007/s11263-013-0644-x
[8]	Drulea M, Nedevschi S. Total variation regularization of local-global optical flow. In: Proceedings of the 2011 International Conference on Intelligent Transportation Systems. Washington DC, USA: IEEE, 2011. 318−323
[9]	Perona P, Malik J. Scale-space and edge detection using anisotropic diffusion. IEEE Transactions on pattern analysis and machine intelligence, 1990, 12(7): 629-639. doi: 10.1109/34.56205
[10]	Weickert J, Schnörr C. A theoretical framework for convex regularizers in PDE-based computation of image motion. International Journal of Computer Vision, 2001, 45(3): 245- 264. doi: 10.1023/A:1013614317973
[11]	Zimmer H, Bruhn A, Weickert J. Optic flow in harmony. International Journal of Computer Vision, 2011, 93(3): 368-388. doi: 10.1007/s11263-011-0422-6
[12]	Dosovitskiy A, Fischer P, Ilg E, Häusser P, Hazirbas C, Golkov V, et al. Flownet: Learning optical flow with convolutional networks. In: Proceedings of the 2015 International Conference on Computer Vision and Pattern Recognition. Santiago, Chile: IEEE, 2015. 2758−2766
[13]	Ilg E, Mayer N, Saikia T, Keuper M, Dosovitskiy A, Brox T. FlowNet 2.0: Evolution of optical flow estimation with deep networks. In: Proceedings of the 2017 International Conference on Computer Vision and Pattern Recognition. Honolulu, USA: IEEE, 2017. 1647−1655
[14]	Ranjan A, Black M J. Optical flow estimation using a spatial pyramid network. In: Proceedings of the 2017 International Conference on Computer Vision and Pattern Recognition. Honolulu, USA: IEEE, 2017. 4161−4170
[15]	Hui T W, Tang X, Loy C. LiteFlowNet: A lightweight convolutional neural network for optical flow estimation. In: Proceedings of the 2018 International Conference on Computer Vision and Pattern Recognition. Salt Lake City, USA: IEEE, 2018. 8981−8989
[16]	Ilg E, Saikia T, Keuper M, Brox T. Occlusions, motion and depth boundaries with a generic network for disparity, optical flow or scene flow estimation. In: Proceedings of the 2018 European Conference on Computer Vision. Munich, Germany: Springer, 2018. 614−630
[17]	Brox T, Malik J. Large displacement optical flow: descriptor matching in variational motion estimation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011, 33(3): 500-513. doi: 10.1109/TPAMI.2010.143
[18]	Weinzaepfel P, Revaud J, Harchaoui Z, Schmid C. DeepFlow: Large displacement optical flow with deep matching. In: Proceedings of the 2013 International Conference on Computer Vision. Sydney, Australia: IEEE, 2013. 1385−1392
[19]	张聪炫, 陈震, 熊帆, 黎明, 葛利跃, 陈昊. 非刚性稠密匹配大位移运动光流估计. 电子学报, 2019, 47(6): 1316-1323. doi: 10.3969/j.issn.0372-2112.2019.06.019 ZHANG Cong-xuan, CHEN Zhen, XIONG Fan, LI Ming, GE Li-yue, CHEN Hao. Large displacement motion optical flow estimation with non-rigid dense patch matching. Acta Electronica Sinica, 2019, 47(6): 1316-1323 (in Chinese). doi: 10.3969/j.issn.0372-2112.2019.06.019
[20]	Hu Y L, Song R, Li Y S. Efficient coarse-to-fine patch match for large displacement optical flow. In: Proceedings of the 2016 International Conference on Computer Vision and Pattern Recognition, Las Vegas, USA: IEEE, 2016. 5704−5712
[21]	Revaud J, Weinzaepfel P, Harchaoui Z, Schmid C. Epicflow: Edge-preserving interpolation of correspondences for optical flow. In: Proceedings of the 2015 International Conference on Computer Vision and Pattern Recognition. Boston, USA: IEEE, 2015. 1164−1172
[22]	Hu Y L, Li Y S, Song R. Robust interpolation of correspondences for large displacement optical flow. In: Proceedings of the 2017 International Conference on Computer Vision and Pattern Recognition, Honolulu, USA: IEEE, 2017. 481−489
[23]	Revaud J, Weinzaepfel P, Harchaoui Z, Schmid C. DeepMatching: Hierarchical deformable dense matching. International Journal of Computer Vision, 2016, 120(3): 300-323. doi: 10.1007/s11263-016-0908-3
[24]	Bian J W, Lin W Y, Matsushita Y, Yeung S K, Nguyen T D, Cheng M M. GMS: Grid-based motion statistics for fast, ultra- robust feature correspondence. In: Proceedings of the 2017 International Conference on Computer Vision and Pattern Recognition. Honolulu, USA: IEEE, 2017. 4181−4190