基于模糊核估计的图像盲超分辨率神经网络

李公平; 陆耀; 王子建; 吴紫薇; 汪顺舟

doi:10.16383/j.aas.c200987

基于模糊核估计的图像盲超分辨率神经网络

doi: 10.16383/j.aas.c200987 cstr: 32138.14.j.aas.c200987

1.
北京理工大学计算机学院北京 100081

基金项目: 国家自然科学基金(61273273), 国家重点研究发展计划(2017YFC0112001), 中央电视台基金(JG2018-0247)资助

详细信息

作者简介:
李公平：北京理工大学计算机学院硕士研究生. 主要研究方向为计算机视觉, 深度学习. E-mail: gongping_li@bit.edu.cn

陆耀：北京理工大学计算机学院教授. 主要研究方向为视觉神经计算, 图像图形处理与视频分析, 模式识别和机器学习. 本文通信作者. E-mail: vis_yl@bit.edu.cn

王子建：北京理工大学计算机学院博士研究生. 主要研究方向为计算机视觉, 深度学习. E-mail: wangzijian@bit.edu.cn

吴紫薇：北京理工大学计算机学院硕士研究生. 主要研究方向为计算机视觉, 深度学习. E-mail: wzw_cs@bit.edu.cn

汪顺舟：北京理工大学计算机学院博士研究生. 主要研究方向为计算机视觉, 深度学习. E-mail: shunzhouwang@bit.edu.cn

计量
- 文章访问数: 2088
- HTML全文浏览量: 1233
- PDF下载量: 244
- 被引次数: 0
出版历程
- 收稿日期: 2020-11-26
- 录用日期: 2021-04-16
- 网络出版日期: 2021-05-26
- 刊出日期: 2023-10-24

Blurred Image Blind Super-resolution Network via Kernel Estimation

1.
School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081

Funds: Supported by National Natural Science Foundation of China (61273273), National Key Research and Development Program of China (2017YFC0112001), and Funds by China Central Television (JG2018-0247)

More Information

Author Bio:
LI Gong-Ping　Master student at the School of Computer Science and Technology, Beijing Institute of Technology. His research interest covers computer vision and deep learning

LU Yao　Professor at the School of Computer Science and Technology, Beijing Institute of Technology. His research interest covers vision neural computing, image processing, video analysis, pattern recognition, and machine learning. Corresponding author of this paper

WANG Zi-Jian　Ph.D. candidate at the School of Computer Science and Technology, Beijing Institute of Technology. His research interest covers computer vision and deep learning

WU Zi-Wei　Master student at the School of Computer Science and Technology, Beijing Institute of Technology. Her research interest covers computer vision and deep learning

WANG Shun-Zhou　Ph.D. candidate at the School of Computer Science and Technology, Beijing Institute of Technology. His research interest covers computer vision and deep learning

摘要

摘要: 模糊图像的超分辨率重建具有挑战性并且有重要的实用价值. 为此, 提出一种基于模糊核估计的图像盲超分辨率神经网络(Blurred image blind super-resolution network via kernel estimation, BESRNet). 该网络主要包括两个部分: 模糊核估计网络 (Blur kernel estimation network, BKENet)和模糊核自适应的图像重建网络(Kernel adaptive super-resolution network, SRNet). 给定任意低分辨率图像(Low-resolution image, LR), 首先利用模糊核估计子网络从输入图像估计出实际的模糊核, 然后根据估计到的模糊核, 利用模糊核自适应的图像重建子网络完成输入图像的超分辨率重建. 与其他图像盲超分辨率方法不同, 所提出的模糊核估计网络能够显式地从输入低分辨率图像中估计出完整的模糊核, 然后模糊核自适应的图像重建网络根据估计到的模糊核, 动态地调整网络各层的图像特征, 从而适应不同输入图像的模糊. 在多个基准数据集上进行了有效性实验, 定性和定量的结果都表明该网络优于同类的图像盲超分辨率神经网络.
- 模糊图像 /
- 模糊核估计 /
- 卷积神经网络 /
- 盲超分辨率
Abstract: Blind blurred image super-resolution is challenging and has important application values. This paper proposes a blurred image blind super-resolution network via kernel estimation (BESRNet), which mainly includes two parts: Blur kernel estimation network (BKENet) and kernel adaptive super-resolution network (SRNet). Given a low-resolution image (LR), the network uses the blur kernel estimation subnetwork to estimate the blur kernel from the input image, and then it uses the kernel adaptive super-resolution subnetwork to super-resolve the input low-resolution image. Different from other blind super-resolution methods, the proposed blur kernel estimation subnetwork gives the whole blur kernel, then the kernel adaptive super-resolution subnetwork dynamically adjusts the image features of different network layers according to the estimated blur kernel to adapt to different image degradations. In this paper, extensive experiments are carried out on multiple benchmark datasets. The qualitative and quantitative results show that proposed method is superior to other blind super-resolution methods.
- Blurred image /
- blur kernel estimation /
- convolutional neural network /
- blind super-resolution

HTML全文

图 1 BESRNet 结构示意图

Fig. 1 Overview of the BESRNet

下载: 全尺寸图片幻灯片

图 2 BKENet 结构示意图

Fig. 2 Architecture of the BKENet

下载: 全尺寸图片幻灯片

图 3 模糊核自适应的特征选择模块示意图

Fig. 3 Architecture of the proposed KAFS module

下载: 全尺寸图片幻灯片

图 4 动态特征选择器结构示意图

Fig. 4 Architecture of the proposed DFS

下载: 全尺寸图片幻灯片

图 5 训练所用的高斯模糊核

Fig. 5 Visualization of Gaussian kernels used for training

下载: 全尺寸图片幻灯片

图 6 (×4) 各个超分方法的视觉效果对比

Fig. 6 (×4) Visual comparison of different methods

下载: 全尺寸图片幻灯片

图 7 (×4) 真实图像“chip”上的视觉对比结果

Fig. 7 (×4) Visual comparison on real-world image “chip”

下载: 全尺寸图片幻灯片

图 8 不同方法在Set5^[39]上估计出的模糊核的视觉效果对比

Fig. 8 Visual comparison of blur kernels estimated by different methods on Set5^[39]

下载: 全尺寸图片幻灯片

图 9 不同基准数据集上模糊核估计结果的视觉效果对比

Fig. 9 Visual comparison of blur kernels estimated by different methods on different benchmark datasets

下载: 全尺寸图片幻灯片

图 10 (×4) 使用真值模糊核作为先验情况下, 各个超分辨率方法的视觉效果对比, 放大观看效果更佳

Fig. 10 (×4) Visual comparison of different methods with real blur kernels as prior, zoom in for best view

下载: 全尺寸图片幻灯片

表 1 各个超分方法在基准数据集上的性能对比(PSNR (dB)/SSIM)

Table 1 Performance comparison of different super-resolution methods on benchmark datasets (PSNR (dB)/SSIM)

方法	放大倍数	数据集
方法	放大倍数	Set5^[39]	Set14^[40]	BSD100^[41]	Urban100^[42]	DIV2K_val^[37]
Bicubic	× 2	25.76/0.800	23.73/0.699	24.15/0.681	21.51/0.670	25.73/0.776
RDN^[14]	× 2	28.03/0.840	25.20/0.713	25.44/0.697	23.04/0.699	27.93/0.807
RCAN^[17]	× 2	24.53/0.751	23.05/0.668	23.49/0.653	21.04/0.633	24.70/0.733
DRN^[8]	× 2	—	—	—	—	—
HAN^[19]	× 2	24.45/0.714	22.90/0.650	23.29/0.634	20.91/0.615	24.54/0.708
RDNMD	× 2	29.00/0.879	25.89/0.803	25.97/0.798	24.16/0.818	28.23/0.863
ZSSR^[30]	× 2	26.06/0.804	24.02/0.707	24.43/0.688	21.90/0.685	25.99/0.785
IKC^[22]	× 2	—	—	—	—	—
BESRNet (本文)	× 2	30.96/0.903	27.73/0.834	27.20/0.827	25.38/0.845	29.96 /0.886
Bicubic	× 4	24.72/0.755	22.83/0.647	23.34/0.628	20.65/0.613	24.79/0.733
RDN^[14]	× 4	27.46/0.808	24.72/0.694	25.03/0.671	22.53/0.690	27.24/0.775
RCAN^[17]	× 4	22.83/0.619	21.62/0.548	22.16/0.541	19.77/0.521	23.25/0.619
DRN^[8]	× 4	23.07/0.679	21.92/0.596	22.50/0.580	20.07/0.562	23.96/0.683
HAN^[19]	× 4	22.65/0.603	20.81/0.524	22.09/0.536	19.33/0.497	22.83/0.605
RDNMD	× 4	28.63/0.834	25.33/0.716	25.51/0.690	23.29/0.718	27.68/0.793
ZSSR^[30]	× 4	25.09/0.710	23.75/0.640	24.15/0.620	21.52/0.622	26.72/0.752
IKC^[22]	× 4	28.93/0.844	25.94/0.719	25.73/0.696	23.49/0.729	28.15/0.800
BESRNet (本文)	× 4	29.18/0.860	26.10/0.742	25.74/0.714	23.81/0.751	28.23/0.813
Bicubic	× 8	21.90/0.622	20.68/0.535	21.58/0.530	18.73/0.493	22.66/0.640
RDN^[14]	× 8	—	—	—	—	—
RCAN^[17]	× 8	20.91/0.518	20.15/0.468	21.10/0.463	18.51/0.434	22.26/0.567
DRN^[8]	× 8	21.09/0.536	20.76/0.499	21.31/0.493	18.81/0.471	22.67/0.594
HAN^[19]	× 8	20.30/0.492	19.88/0.486	19.53/0.467	18.17/0.401	21.47/0.529
RDNMD	× 8	23.86/0.710	21.79/0.560	22.70/0.569	20.29/0.586	24.18/0.686
ZSSR^[30]	× 8	—	—	—	—	—
IKC^[22]	× 8	—	—	—	—	—
BESRNet (本文)	× 8	24.15/0.722	22.64/0.600	22.87/0.571	20.54/0.599	24.75/0.691

下载: 导出CSV

表 2 各个模糊核预测方法在基准数据集上的定量结果对比 (MSE × 10⁻⁵/MAE × 10⁻³)

Table 2 Quantitative comparison of kernel estimation methods on the benchmark datasets (MSE × 10⁻⁵/MAE × 10⁻³)

方法	数据集
方法	Set5^[39]	Set14^[40]	BSD100^[41]	Urban100^[42]	DIV2K_val^[37]
Pan 等^[33]	3.83/3.85	2.56/3.87	3.23/3.58	2.55/3.32	2.13/2.89
BKENet ${\rm{w/o}}$ R	1.91/2.69	2.12/2.66	1.83/2.73	2.15/2.90	2.00/2.67
BKENet ${{\rm{w}}/}$ R	1.76/2.61	1.80/2.53	1.78/2.70	2.13/2.88	1.89/2.59

下载: 导出CSV

表 3 (×4) 使用真值模糊核作为先验的不同方法的量化指标对比(PSNR (dB)/SSIM)

Table 3 (×4) Quantitative comparison of different methods with real blur kernels as prior (PSNR (dB)/SSIM)

方法	数据集
方法	Set5^[39]	Set14^[40]	DIV2K_val^[37]
KZNet	26.45/0.818	22.59/0.702	22.75/0.752
ZSSR ${\rm{w}}/\ k$	24.38/0.734	23.17/0.672	25.50/0.771
SRNet ${\rm{w}}/{\rm{o}}\ k$	25.14/0.796	23.09/0.688	24.72/0.762
SRNet ${\rm{w}}/\ k$	29.65/0.864	26.39/0.747	28.45/0.814

下载: 导出CSV

表 4 (×4) 不同DFS分支数的KAFS 模块在Set5^[39]数据集上的定量结果对比

Table 4 (×4) Quantitative comparison of KAFS module with different numbers of DFS on Set5^[39]

DFS	PSNR (dB)/SSIM	Params (M)	Multi-adds (G)
1	29.50/0.861	12.92	151.04
2	29.61/0.863	12.98	151.05
4	29.54/0.862	13.12	151.06

下载: 导出CSV

表 5 (×4) 不同失活通道数的KAFS模块在Set5^[39]数据集上的定量结果对比

Table 5 (×4) Quantitative comparison of KAFS module with different numbers of inactive channel on Set5^[39]

失活通道数	PSNR (dB)/SSIM
4	29.60/0.860
8	29.65/0.864
16	29.61/0.863
24	29.60/0.860

下载: 导出CSV

表 6 (×4) 使用不同$\delta$值训练的模型在DIV2K^[37] 数据集的验证集上的性能对比

Table 6 (×4) Performance comparison of BESRNet with different $\delta$ on the validation set of DIV2K^[37]

δ	PSNR (dB)/SSIM
0.01	28.01/0.809
0.05	28.09/0.811
0.1	28.23/0.813
0.5	28.12/0.811
1	27.99/0.810

下载: 导出CSV

参考文献(43)

[1]	Luo Y M, Zhou L G, Wang S, Wang Z Y. Video satellite imagery super resolution via convolutional neural networks. IEEE Geoscience and Remote Sensing Letters, 2017, 14(12): 2398-2402 doi: 10.1109/LGRS.2017.2766204
[2]	Shi W Z, Caballero J, Ledig C, Zhuang X H, Bai W J, Bhatia K, et al. Cardiac image super-resolution with global correspondence using multi-atlas PatchMatch. In: Proceedings of the 16th International Conference on Medical Image Computing and Computer-Assisted Intervention. Nagoya, Japan: Springer, 2013. 9−16
[3]	Zou W W W, Yuen P C. Very low resolution face recognition problem. IEEE Transactions on Image Processing, 2011, 21(1): 327-340
[4]	Dong C, Loy C C, He K M, Tang X O. Image super-resolution using deep convolutional networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38(2): 295-307 doi: 10.1109/TPAMI.2015.2439281
[5]	Kim J, Lee J K, Lee K M. Accurate image super-resolution using very deep convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas, USA: IEEE, 2016. 1646−1654
[6]	Ledig C, Theis L, Huszár F, Caballreo J, Cunningham A, Acosta A, et al. Photo-realistic single image super-resolution using a generative adversarial network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Hawaii, USA: IEEE, 2017. 4681−4690
[7]	Lim B, Son S, Kim H, Nah S, Lee K M. Enhanced deep residual networks for single image super-resolution. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Honolulu, USA: IEEE, 2017. 136−144
[8]	Guo Y, Chen J, Wang J D, Chen Q, Cao J Z, Deng Z S, et al. Closed-loop matters: Dual regression networks for single image super-resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle, USA: IEEE, 2020. 5407−5416
[9]	Kim J, Lee J K, Lee K M. Deeply-recursive convolutional network for image super-resolution. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas, USA: IEEE, 2016. 1637−1645
[10]	Tai Y, Yang J, Liu X M. Image super-resolution via deep recursive residual network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Honolulu, USA: IEEE, 2017. 3147−3155
[11]	周登文, 赵丽娟, 段然, 柴晓亮. 基于递归残差网络的图像超分辨率重建. 自动化学报, 2019, 45(6): 1157-1165 doi: 10.16383/j.aas.c180334 Zhou Deng-Wen, Zhao Li-Juan, Duan Ran, Chai Xiao-Liang. Image super-resolution based on recursive residual networks. Acta Automatica Sinica, 2019, 45(6): 1157-1165 doi: 10.16383/j.aas.c180334
[12]	Han W, Chang S Y, Liu D, Yu M, Witbrock M, Huang T S. Image super-resolution via dual-state recurrent networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, USA: IEEE, 2018. 1654−1663
[13]	Tong T, Li G, Liu X J, Gao Q Q. Image super-resolution using dense skip connections. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV). Venice, Italy: IEEE, 2017. 4799−4807
[14]	Zhang Y L, Tian Y P, Kong Y, Zhong B N, Fu Y. Residual dense network for image super-resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, USA: IEEE, 2018. 2472−2481
[15]	Liu J, Zhang W J, Tang Y T, Tang J, Wu G S. Residual feature aggregation network for image super-resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle, USA: IEEE, 2020. 2359−2368
[16]	李金新, 黄志勇, 李文斌, 周登文. 基于多层次特征融合的图像超分辨率重建. 自动化学报, 2023, 49(1): 161-171 doi: 10.16383/j.aas.c200585 Li Jin-Xin, Huang Zhi-Yong, Li Wen-Bin, Zhou Deng-Wen. Image super-resolution based on multi-hierarchical features fusion network. Acta Automatica Sinica, 2023, 49(1): 161-171 doi: 10.16383/j.aas.c200585
[17]	Zhang Y L, Li K P, Li K, Wang L C, Zhong B N, Fu Y. Image super-resolution using very deep residual channel attention networks. In: Proceedings of the 15th European Conference on Computer Vision. Munich, Germany: Springer, 2018. 286−301
[18]	Dai T, Cai J R, Zhang Y B, Xia S T, Zhang L. Second-order attention network for single image super-resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Long Beach, USA: IEEE, 2019. 11065−11074
[19]	Niu B, Wen W L, Ren W Q, Zhang X D, Yang L P, Wang S Z, et al. Single image super-resolution via a holistic attention network. In: Proceedings of the 16th European Conference on Computer Vision. Glasgow, UK: Springer, 2020. 191−207
[20]	Bulat A, Yang J, Tzimiropoulos G. To learn image super-resolution, use a GAN to learn how to do image degradation first. In: Proceedings of the 15th European Conference on Computer Vision. Munich, Germany: Springer, 2018. 185−200
[21]	Zhang K, Zuo W M, Zhang L. Learning a single convolutional super-resolution network for multiple degradations. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, USA: IEEE, 2018. 3262−3271
[22]	Gu J J, Lu H N, Zuo W M, Dong C. Blind super-resolution with iterative kernel correction. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Long Beach, USA: IEEE, 2019. 1604−1613
[23]	Wang X T, Yu K, Dong C, Loy C C. Recovering realistic texture in image super-resolution by deep spatial feature transform. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, USA: IEEE, 2018. 606−615
[24]	Luo Z X, Huang Y, Li S, Wang L, Tan T N. Unfolding the alternating optimization for blind super resolution. In: Proceedings of the 34th International Conference on Neural Information Processing Systems. Vancouver, Canada: Curran Associates Inc., 2020. Article No. 473
[25]	常振春, 禹晶, 肖创柏, 孙卫东. 基于稀疏表示和结构自相似性的单幅图像盲解卷积算法. 自动化学报, 2017, 43(11): 1908-1919 doi: 10.16383/j.aas.2017.c160357 Chang Zhen-Chun, Yu Jing, Xiao Chuang-Bai, Sun Wei-Dong. Single image blind deconvolution using sparse representation and structural self-similarity. Acta Automatica Sinica, 2017, 43(11): 1908-1919 doi: 10.16383/j.aas.2017.c160357
[26]	Pan J S, Lin Z C, Su Z X, Yang M H. Robust kernel estimation with outliers handling for image deblurring. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas, USA: IEEE, 2016. 2800−2808
[27]	Yan R M, Shao L. Blind image blur estimation via deep learning. IEEE Transactions on Image Processing, 2016, 25(4): 1910-1921
[28]	Yuan Y, Liu S Y, Zhang J W, Zhang Y B, Dong C, Lin L. Unsupervised image super-resolution using cycle-in-cycle generative adversarial networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Salt Lake City, USA: IEEE, 2018. 701−710
[29]	Zhang Y B, Liu S Y, Dong C, Zhang X F, Yuan Y. Multiple cycle-in-cycle generative adversarial networks for unsupervised image super-resolution. IEEE Transactions on Image Processing, 2020, 29: 1101-1112 doi: 10.1109/TIP.2019.2938347
[30]	Shocher A, Cohen N, Irani M. Zero-shot super-resolution using deep internal learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, USA: IEEE, 2018. 3118−3126
[31]	Soh J W, Cho S, Cho N I. Meta-transfer learning for zero-shot super-resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle, USA: IEEE, 2020. 3516−3525
[32]	Efrat N, Glasner D, Apartsin A, Nadler B, Levin A. Accurate blur models vs. image priors in single image super-resolution. In: Proceedings of the IEEE International Conference on Computer Vision. Sydney, Australia: IEEE, 2013. 2832−2839
[33]	Pan J S, Sun D Q, Pfister H, Yang M H. Blind image deblurring using dark channel prior. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas, USA: IEEE, 2016. 1628−1636
[34]	He K M, Zhang X Y, Ren S Q, Sun J. Identity mappings in deep residual networks. In: Proceedings of the 14th European Conference on Computer Vision. Amsterdam, The Netherlands: Springer, 2016. 630−645
[35]	Su Z, Fang L P, Kang W X, Hu D W, Pietikäinen M, Liu L. Dynamic group convolution for accelerating convolutional neural networks. In: Proceedings of the 16th European Conference on Computer Vision. Glasgow, UK: Springer, 2020. 138−155
[36]	Ronneberger O, Fischer P, Brox T. U-Net: Convolutional networks for biomedical image segmentation. In: Proceedings of the 18th International Conference on Medical Image Computing and Computer-Assisted Intervention. Munich, Germany: Springer, 2015. 234−241
[37]	Agustsson E, Timofte R. NTIRE 2017 challenge on single image super-resolution: Dataset and study. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Honolulu, USA: IEEE, 2017. 126−135
[38]	Timofte R, Agustsson E, Van Gool L, Yang M H, Zhang L, Lim B, et al. NTIRE 2017 challenge on single image super-resolution: Methods and results. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Honolulu, USA: IEEE, 2017. 114−125
[39]	Bevilacqua M, Roumy A, Guillemot C, Alberi-Morel M L. Low-complexity single-image super-resolution based on nonnegative neighbor embedding. In: Proceedings of the 23rd British Machine Vision Conference BMVC 2012. Surrey, UK: BMVA Press, 2012. 135.1−135.10
[40]	Zeyde R, Elad M, Protter M. On single image scale-up using sparse-representations. In: Proceedings of the 7th International Conference on Curves and Surfaces. Avignon, France: Springer, 2010. 711−730
[41]	Martin D, Fowlkes C, Tal D, Malik J. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In: Proceedings of the 8th IEEE International Conference on Computer Vision. ICCV 2001. Vancouver, Canada: IEEE, 2001. 416−423
[42]	Huang J B, Singh A, Ahuja N. Single image super-resolution from transformed self-exemplars. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Boston, USA: IEEE, 2015. 5197−5206
[43]	Kingma D P, Ba J. Adam: A method for stochastic optimization. arXiv preprint arXiv: 1412.6980v8, 2015.