Detecting Object and Direction for Polar Electronic Components via Deep Learning

Chen Wen-Shuai, Ren Zhi-Gang, Wu Zong-Ze, Fu Min-Yue

Citation: Chen Wen-Shuai, Ren Zhi-Gang, Wu Zong-Ze, Fu Min-Yue. Detecting object and direction for polar electronic components via deep learning. Acta Automatica Sinica, 2021, 47(7): 1701-1709. doi: 10.16383/j.aas.c190037

doi: 10.16383/j.aas.c190037

Funds: 

National Natural Science Foundation of China 61673126

National Natural Science Foundation of China 61703114

National Natural Science Foundation of China U1701261

National Natural Science Foundation of China U1701264

More Information
    Author Bio:

    CHEN Wen-Shuai  Master student at the School of Automation, Guangdong University of Technology. His research interest covers deep learning and computer vision. E-mail: 2111704068@mail2.gdut.edu.cn

    REN Zhi-Gang  Associate professor at the School of Automation, Guangdong University of Technology. His research interest covers optimal control, knowledge automation, and artificial intelligence. E-mail: renzhigang@gdut.edu.cn

    FU Min-Yue  Professor at the School of Automation, Guangdong University of Technology, and tenured professor at the University of Newcastle, Australia. His research interest covers automatic control, signal processing, and communications. E-mail: minyue.fu@newcastle.edu.au

    Corresponding author: WU Zong-Ze  Professor at the School of Automation, Guangdong University of Technology. His research interest covers automatic control, signal processing, big data, knowledge automation, and artificial intelligence. Corresponding author of this paper. E-mail: zzwu@gdut.edu.cn

  • Abstract:

    The classification, direction recognition, and localization of polar electronic components play a vital role in industrial production, soldering, and inspection. This paper first converts the direction recognition problem of polar electronic components into a classification problem, and then applies Faster RCNN (Region convolutional neural network) and YOLOv3 to achieve accurate classification, direction recognition, and precise localization of polar electronic components. The experiments achieve good results, with the two algorithms reaching mean average precisions (mAP) of 97.05 % and 99.22 %, respectively. In addition, based on the width-height distribution of the labelled target boxes in the datasets, we use the K-means algorithm to redesign the anchor boxes of Faster RCNN and YOLOv3, which improves the accuracy by 1.16 % and 0.1 %, respectively. We further propose a network structure for large-object detection, YOLOv3-BigObject, which improves the accuracy while cutting the detection time per image to roughly half of the original. Finally, the methods are tested on circuit boards with soldered components and achieve good experimental results.
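
    The abstract's anchor-box redesign relies on K-means over the width-height distribution of the labelled boxes. The sketch below illustrates that kind of clustering under the common 1 − IoU distance (the YOLOv2/v3 convention) and also reports a quantity analogous to the "average IoU between anchor boxes and the training set" column in Tables 3 and 4; the function names, the distance choice, and the synthetic data are assumptions, not the paper's exact procedure.

```python
# Minimal sketch of K-means anchor-box clustering on (width, height) pairs,
# using 1 - IoU as the distance (the YOLOv2/v3 convention).
# Names and data here are illustrative; the paper's exact settings are not reproduced.
import numpy as np

def iou_wh(boxes, anchors):
    """IoU between boxes and anchors given as (w, h) pairs aligned at the origin.
    boxes: (N, 2), anchors: (K, 2) -> (N, K)."""
    inter_w = np.minimum(boxes[:, None, 0], anchors[None, :, 0])
    inter_h = np.minimum(boxes[:, None, 1], anchors[None, :, 1])
    inter = inter_w * inter_h
    union = (boxes[:, 0] * boxes[:, 1])[:, None] \
        + (anchors[:, 0] * anchors[:, 1])[None, :] - inter
    return inter / union

def kmeans_anchors(boxes, k=3, iters=100, seed=0):
    """Cluster the (w, h) pairs of ground-truth boxes into k anchor boxes."""
    rng = np.random.default_rng(seed)
    anchors = boxes[rng.choice(len(boxes), k, replace=False)].astype(float)
    for _ in range(iters):
        # Assign each box to the nearest anchor under the 1 - IoU distance.
        assign = np.argmax(iou_wh(boxes, anchors), axis=1)
        new_anchors = np.array([
            boxes[assign == i].mean(axis=0) if np.any(assign == i) else anchors[i]
            for i in range(k)
        ])
        if np.allclose(new_anchors, anchors):
            break
        anchors = new_anchors
    # Average best-match IoU between the training boxes and the final anchors.
    avg_iou = iou_wh(boxes, anchors).max(axis=1).mean()
    return anchors, avg_iou

if __name__ == "__main__":
    # Synthetic (w, h) pairs standing in for a labelled training set.
    wh = np.abs(np.random.default_rng(1).normal(loc=[120, 60], scale=[20, 10], size=(500, 2)))
    anchors, avg_iou = kmeans_anchors(wh, k=3)
    print("anchors:\n", anchors)
    print("average IoU:", round(float(avg_iou), 4))
```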

  • Fig.  1  Models of traditional and deep-learning-based image classification methods

    Fig.  2  Image examples from dataset 1

    Fig.  3  Image examples from dataset 2

    Fig.  4  Converting direction recognition into classification (see the sketch after this figure list)

    Fig.  5  Distribution of the training set and the anchor boxes for Faster RCNN

    Fig.  6  Distribution of the training set and the anchor boxes for YOLOv3

    Fig.  7  YOLOv3 network structure

    Fig.  8  YOLOv3-BigObject network structure

    Fig.  9  Internal network parameter settings of YOLOv3-BigObject

    Fig.  10  Test set results

    Fig.  11  Loss curves of Faster RCNN (ResNet101)

    Fig.  12  Loss curves of YOLOv3 and YOLOv3-BigObject

    Fig.  13  Video detection results
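
    Figure 4 recasts direction recognition as classification: each (component model, direction) pair can be treated as its own detection class, so the three diode models and four directions of dataset 1 would give twelve classes. A minimal sketch of such a label scheme follows; the class-name format is illustrative, not the paper's exact naming.

```python
# Minimal sketch of casting direction recognition as classification:
# each (diode model, direction) pair becomes one detection class.
# The class-name format is illustrative, not the paper's exact naming.
from itertools import product

MODELS = ["model1", "model2", "model3"]        # dataset 1 has three diode models
DIRECTIONS = ["up", "right", "down", "left"]   # the four polarity directions

CLASSES = [f"{m}_{d}" for m, d in product(MODELS, DIRECTIONS)]
CLASS_TO_ID = {name: i for i, name in enumerate(CLASSES)}

print(len(CLASSES))                 # 12 classes for dataset 1
print(CLASS_TO_ID["model1_right"])  # integer label used by the detector
```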

    Table  1  Number of images (with the number of targets in parentheses) in each category of dataset 1

    Diode model | Direction 1 (up) | Direction 2 (right) | Direction 3 (down) | Direction 4 (left) | Total
    Model 1 | 2 775 (2 934) | 3 129 (3 300) | 2 739 (2 899) | 3 264 (3 460) | 9 793 (12 593)
    Model 2 | 2 833 (2 970) | 3 123 (3 315) | 2 813 (2 946) | 3 180 (3 369) | 9 800 (12 600)
    Model 3 | 2 840 (2 993) | 3 090 (3 309) | 2 827 (2 977) | 3 108 (3 321) | 9 800 (12 600)
    Total | 7 032 (8 897) | 7 570 (9 924) | 6 975 (8 822) | 7 676 (10 150) | 14 000 (37 793)

    Table  2  Number of images (with the number of targets in parentheses) in each category of dataset 2

    Diode model | Direction 1 (up) | Direction 2 (right) | Direction 3 (down) | Direction 4 (left) | Total
    Model 1 | 1 278 (1 365) | 1 349 (1 448) | 1 322 (1 461) | 1 329 (1 441) | 4 949 (5 714)
    Model 2 | 1 282 (1 380) | 1 380 (1 467) | 1 307 (1 400) | 1 344 (1 430) | 4 970 (5 677)
    Total | 2 155 (2 745) | 2 289 (2 915) | 2 214 (2 861) | 2 673 (2 871) | 5 691 (11 391)

    Table  3  Experimental results on dataset 1

    Method | Backbone | Anchor boxes | Average IoU between anchor boxes and training set | mAP (%) | Test speed (s)
    Faster RCNN | VGG-16 | 9 | 0.7037 | 93.77 | -
    Faster RCNN | VGG-16 | 6 | 0.8577 | 94.95 | -
    Faster RCNN | ResNet101 | 9 | 0.7037 | 97.05 | -
    Faster RCNN | ResNet101 | 6 | 0.8577 | 97.36 | -
    YOLOv3 | Darknet-53 | 9 | 0.5728 | 99.22 | -
    YOLOv3 | Darknet-53 | 3 | 0.7362 | 99.31 | 0.0217
    YOLOv3-BigObject | - | 3 | 0.7362 | 99.44 | 0.0118

    Table  4  Experimental results on datasets 1 and 2

    Method | Backbone | Anchor boxes | Average IoU between anchor boxes and training set | mAP (%) | Test speed (s)
    Faster RCNN | ResNet101 | 9 | 0.6294 | 97.28 | -
    Faster RCNN | ResNet101 | 6 | 0.8000 | 97.53 | -
    FPN | ResNet101 | 9+ | - | 98.70 | -
    SSD | ResNet101 | 9+ | - | 76.25 | -
    YOLOv3 | Darknet-53 | 9 | 0.4476 | 99.26 | -
    YOLOv3-BigObject | - | 3 | 0.6645 | 99.20 | 0.0118
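
    The last column of Tables 3 and 4 reports a per-image test time. Below is a minimal sketch of how such a figure could be measured; `detect` and `average_inference_time` are hypothetical stand-ins rather than APIs from the paper, and the warm-up convention is an assumption.

```python
# Hypothetical timing harness for a per-image "test speed (s)" figure.
# `detect` stands in for a trained detector's inference call (Faster RCNN,
# YOLOv3, or YOLOv3-BigObject); only the averaging logic is shown.
import time

def average_inference_time(detect, images, warmup=5):
    """Average wall-clock seconds per image for a detector callable."""
    for img in images[:warmup]:      # warm-up runs, excluded from timing
        detect(img)
    start = time.perf_counter()
    for img in images:
        detect(img)                  # expected to return boxes, classes, scores
    return (time.perf_counter() - start) / len(images)
```
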
Publication history
  • Received:  2019-01-15
  • Accepted:  2019-04-30
  • Published:  2021-07-27
