一种基于深度学习的青铜器铭文识别方法

李文英; 曹斌; 曹春水; 黄永祯

doi:10.16383/j.aas.2018.c180152

一种基于深度学习的青铜器铭文识别方法

doi: 10.16383/j.aas.2018.c180152

李文英^1,,
曹斌^1, ,,
曹春水^2,3,,
黄永祯^2,3,

1.
中国人民大学历史学院北京 100872
2.
中国科学院自动化研究所模式识别国家重点实验室北京 100190
3.
银河水滴科技(北京)有限公司北京 100190

基金项目:

国家重点基础研究发展计划973计划 2016YFB1001000

国家自然科学基金 61420106015

国家自然科学基金 61525306

教育部人文社会科学研究青年基金项目 18YJC780001

国家自然科学基金 61633021

详细信息

作者简介:
李文英  华中科技大学控制科学与工程系硕士研究生.中国人民大学历史学院考古文博系硕士研究生.主要研究方向为基于模式识别方法的古文字识别, 基于计算机视觉的考古学研究.E-mail:freemin77@126.com

曹春水  中国科学技术大学自动化系与中国科学院自动化研究所模式识别国家重点实验室联合培养的博士研究生.主要研究方向是深度学习与计算机视觉.E-mail:ccs@mail.ustc.edu.cn

黄永祯  中科院自动化所模式识别国家重点实验室副研究员.主要研究方向为模式识别, 计算机视觉.E-mail:yzhuang@nlpr.ia.ac.cn

通讯作者:
曹斌中国人民大学历史学院考古文博系副教授.主要研究方向为商周考古、青铜器与金文研究.本文通信作者.E-mail:caobin@ruc.edu.cn

计量
- 文章访问数: 3206
- HTML全文浏览量: 1100
- PDF下载量: 1344
- 被引次数: 0
出版历程
- 收稿日期: 2018-03-19
- 录用日期: 2018-04-04
- 刊出日期: 2018-11-20

A Deep Learning Based Method for Bronze Inscription Recognition

LI Wen-Ying^1
,,
CAO Bin^{1
, ,},
CAO Chun-Shui^{2,3
,},
HUANG Yong-Zhen^{2,3
,}

1.
School of History, Renmin University of China, Beijing 100872
2.
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190
3.
Watrix Technology Co. Ltd., Beijing 100190

Funds:

National Basic Research Program of China (973 Program) 2016YFB1001000

National Natural Science Foundation of China 61420106015

National Natural Science Foundation of China 61525306

Humanities and Social Sciences Research Youth Fund Project, Ministry of Education 18YJC780001

National Natural Science Foundation of China 61633021

More Information

Author Bio:
Master student in Department of Control Science and Engineering, HuaZhong University of Science and Technology. Master student in the Archaeology and Museology Department, School of History, Renmin University of China. Her research interest covers ancient character recognition based on pattern recognition, and archaeological research based on computer vision

Ph. D. candidate in the Department of Automation, University of Science and Technology of China and now studying in the National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences. His research interest covers artificial intelligence, machine learning, and computer vision

Associate professor at National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences. His research interest covers computer vision and pattern recognition

Corresponding author: CAO Bin Associate professor at Archaeology and Museology Department, School of History, Renmin University of China. His research interest covers Shang and Zhou archaeology (archaeology on the Chinese bronze age), bronze and inscriptions research. Corresponding author of this paper

摘要

摘要: 考古出土的青铜器铭文是非常宝贵的文字材料，准确、快速地了解其释义和字形演变源流对考古学、历史学和语言学研究均有重要意义.青铜器铭文的辨识需要综合文字的形、音、义进行研究，其中第一步也是最重要的一步就是分析文字的形体特征.本文提出一种基于两阶段特征映射的神经网络模型来提取每个文字的形体特征，最后对比目前已知的文字研究成果，如《古文字类编》、《说文解字》，得出识别的结果.通过定性和定量的实验分析，我们发现本文提出的方法可达到较高的识别精度.特别地，在前10个预测类别中（Top-10）准确率达到了94.2%，大幅缩小了考古研究者的搜索推测空间，提高了青铜铭文识别的效率和准确性.
- 模式识别 /
- 青铜器铭文 /
- 文字识别 /
- 深度学习 /
- 深度卷积神经网络
Abstract: Bronze inscriptions from archaeology are very valuable text materials. Accurate and rapid understanding of their meaning and shape evolution is important for archeology, history and linguistics. It is necessary to combine characters shape, phonology and meaning for recognition of bronze inscription, wherein the first and also the most important step is to analyze shapes of bronze inscriptions. In this paper, we present a bronze inscription analysis method based on convolutional neural network (CNN) with two-phase feature mapping. We first extract the bronze inscriptions by image acquisition, and then, by comparing with the currently known character research results, e.g., "Ancient Chinese Character Type Series" and "Shuo Wen Jie Zi", we obtain the recognition results. Through qualitative and quantitative experimental analyses, we find that the proposed method achieves high recognition accuracy. Specifically, we achieve 94.2% accuracy for the Top-10, greatly reducing the space of archaeological search and improving the efficiency and accuracy of bronze inscription recognition.
- Pattern recognition /
- bronze inscription /
- character recognition /
- deep learning /
- convolutional neural network (CNN)
注释:

1) 本文责任编委刘成林

HTML全文

图 1 "保"字的各种演化变体(包括甲骨文、青铜器铭文、篆书等)

Fig. 1 Various evolutionary shapes of character "保" (including oracle-bone, bronze inscription, seal character, etc.)

下载: 全尺寸图片幻灯片

图 2 单人旁的不同形态

Fig. 2 Different shapes of character component "人"

下载: 全尺寸图片幻灯片

图 3 "女"字的不同形态

Fig. 3 Different shapes of character "女"

下载: 全尺寸图片幻灯片

图 4 "妇"字和"好"字的不同形态

Fig. 4 Different shapes of character "妇" and "好"

下载: 全尺寸图片幻灯片

图 5 字库图片示例

Fig. 5 Example images of the character database

下载: 全尺寸图片幻灯片

图 6 77个古文字库

Fig. 6 Ancient character database with 77 characters

下载: 全尺寸图片幻灯片

图 7 基于18层ResNet的古文字识别模型示意

Fig. 7 Pipeline of ancient character recognition based on 18-level ResNet

下载: 全尺寸图片幻灯片

图 8 两阶段映射示意(第一个Loss有能力把杂乱的原始数据聚类得比较好; 第二个Loss进一步聚类数据)

Fig. 8 Demonstration of two-stage mapping (The first loss has the ability to originally cluster the messy raw data and the second further clusters the data.)

下载: 全尺寸图片幻灯片

图 9 "母"字的网络学习与预测过程示意图

Fig. 9 Illustration of learning and prediction of character "母"

下载: 全尺寸图片幻灯片

图 10 识别错误的3个"母"字

Fig. 10 Three cases of wrong recognition of character "母"

下载: 全尺寸图片幻灯片

图 11 "子"、"吉"、"名" 3个字的甲骨文、金文和鸟文的对比

Fig. 11 The comparison of oracle-bone, bronze inscriptions and bird-writing for character "子", "吉" and "名"

下载: 全尺寸图片幻灯片

表 1 测试集的识别准确率

Table 1 Recognition accuracy in the testing dataset

分类器结果对比	Top-1	Top-3	Top-5	Top-8	Top-10
基准分类器	57.1%	73.7%	85.8%	89.6%	92.7%
分类器Ⅰ	57.7%	74.9%	86.2%	90.5%	93.6%
分类器Ⅱ	58.3%	76.1%	87.1%	91.4%	94.2%

下载: 导出CSV

参考文献(23)

[1]	马承源.中国古代青铜器.第2版.上海:上海人民出版社, 2016. 9-41 Ma Cheng-Yuan. Ancient Chinese Bronzes (Second Edition). Shanghai:Shanghai Renmin Press, 2016. 9-41
[2]	李学勤.古文字学初阶.第2版.北京: 中华书局, 2006. 9-46 Li Xue-Qin. Primary Chinese Paleography (Second Edition). Beijing: Zhonghua Book Company, 2006. 9-46
[3]	高明.中国古文字学通论.北京:北京大学出版社, 1996. 56-170 Gao Ming. General Theory of Chinese Paleography. Beijing:Beijing University Press, 1996. 56-170
[4]	高明, 涂白奎.古文字类编.上海:上海古籍出版社, 2014. 1-1427 Gao Ming, Tu Bai-Kui. Ancient Chinese Character Type Series. Shanghai:Shanghai Classics Publishing House, 2014. 1-1427
[5]	张亚初.殷周金文集成引得.北京: 中华书局, 2001. 1-225 Zhang Ya-Chu. Shang and Zhou Dynasties Bronze Inscriptions Integration Index. Beijing: Zhonghua Book Company, 2001. 1-225
[6]	周新伦, 李锋, 华星城, 韦剑.甲骨文计算机识别方法研究.复旦学报(自然科学版), 1996, 35(5):481-486 http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=QK199600072768 Zhou Xin-Lun, Li Feng, Hua Xing-Cheng, Wei Jian. A method of Jia Gu Wen recognition based on a two-level classification. Journal of Fudan University (Natural Science), 1996, 35(5):481-486 http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=QK199600072768
[7]	李峰, 周新伦.甲骨文自动识别的图论方法.电子科学学刊, 1996, 18(S1):41-47 http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=QK199600067408 Li Feng, Zhou Xin-Lun. Recohnition of Jia Gu Wen based on graph theory. Journal of Electronics & Information Technology, 1996, 18(S1):41-47 http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=QK199600067408
[8]	顾绍通.基于拓扑配准的甲骨文字形识别方法.计算机与数字工程, 2016, 44(10):2001-2006 doi: 10.3969/j.issn.1672-9722.2016.10.029 Gu Shao-Tong. Identification of oracle-bone script fonts based on topological registration. Computer & Digital Engineering, 2016, 44(10):2001-2006 doi: 10.3969/j.issn.1672-9722.2016.10.029
[9]	吕肖庆, 李沫楠, 蔡凯伟, 王晓, 唐英敏.一种基于图形识别的甲骨文分类方法.北京信息科技大学学报, 2010, 25(S2):92-96 http://d.old.wanfangdata.com.cn/Conference/7452730 Lv Xiao-Qing, Li Mo-Nan, Cai Kai-Wei, Wang Xiao, Tang Ying-Min. A graphic-based method for Chinese oracle-bone classification. Journal of Beijing Information Science & Technology University, 2010, 25(S2):92-96 http://d.old.wanfangdata.com.cn/Conference/7452730
[10]	王嘉梅, 文永华, 李燕青, 高雅莉.基于图像分割的古彝文字识别系统研究.云南民族大学学报(自然科学版), 2008, 17(1):76-79 doi: 10.3969/j.issn.1672-8513.2008.01.019 Wang Jia-Mei, Wen Yong-Hua, Li Yan-Qing, Gao Ya-Li. The recognition system of old-Yi character based on the image segmentation. Journal of Yunnan Nationalities University (Natural Sciences Edition), 2008, 17(1):76-79 doi: 10.3969/j.issn.1672-8513.2008.01.019
[11]	孙华.基于多特征融合SVM的古汉字图像识别研究[硕士学位论文], 中南大学, 中国, 2010 http://www.wanfangdata.com.cn/details/detail.do?_type=degree&id=Y1721380 Sun Hua. Study of Ancient Chinese Character based on Multi-feature SVM Image Recognition Method[Master thesis], Central South University, China, 2010 http://www.wanfangdata.com.cn/details/detail.do?_type=degree&id=Y1721380
[12]	孙莹莹.基于混合核LS-SVM的古汉字图像识别[硕士学位论文], 安徽大学, 中国, 2015 http://www.wanfangdata.com.cn/details/detail.do?_type=degree&id=Y2805808 Sun Ying-Ying. Recognition of Ancient Chinese Characters Based on Hybrid Kernel LS-SVM[Master thesis], Anhui University, China, 2015 http://www.wanfangdata.com.cn/details/detail.do?_type=degree&id=Y2805808
[13]	Krizhevsky K, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks. In: Proceedings of the 25th International Conference on Neural Information Processing Systems. Lake Tahoe, USA: ACM, 2012. 1097-1105
[14]	Deng J, Dong W, Socher R, Li L J, Li K, Li F F. ImageNet: a large-scale hierarchical image database. In: Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami, USA: IEEE, 2009. 248-255
[15]	Cao C S, Liu X M, Yang Y, Yu Y N, Wang J, Wang Z L, et al. Look and think twice: capturing top-down visual attention with feedback convolutional neural networks. In: Proceedings of the 2015 IEEE International Conference on Computer Vision. Santiago, Chile: IEEE, 2015. 2956-2964
[16]	Zhang X Y, Bengio Y, Liu C L. Online and offline handwritten Chinese character recognition:a comprehensive study and new benchmark. Pattern Recognition, 2017, 61:348-360 doi: 10.1016/j.patcog.2016.08.005
[17]	Wu Y C, Yin F, Liu C L. Improving handwritten Chinese text recognition using neural network language models and convolutional neural network shape models. Pattern Recognition, 2017, 65:251-264 doi: 10.1016/j.patcog.2016.12.026
[18]	Zeiler M D, Fergus R. Visualizing and understanding convolutional networks. In: Proceedings of the 13th European Conference on Computer Vision. Zurich, Switzerland: Springer, 2014. 818-833
[19]	Zhou B L, Khosla A, Lapedriza A, Oliva A, Torralba A. Object detectors emerge in deep scene CNNs. In: Proceedings of the 2015 International Conference on Learning Representations. San Diego, USA: ICLR, 2015.
[20]	Simonyan K, Zisserman A. Very deep convolutional networks for large-Scale image recognition. In: Proceedings of the 2015 International Conference on Learning Representations. San Diego, USA: ICLR, 2015.
[21]	Szegedy C, Liu W, Jia Y Q, Sermanet P, Reed S, Anguelov D, et al. Going deeper with convolutions. In: Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Boston, USA: IEEE, 2015. 1-9
[22]	He K M, Zhang X Y, Ren S Q, Sun J. Deep residual learning for image recognition. In: Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, USA: IEEE, 2016. 770-778
[23]	Mnih V, Heess N, Graves A, Kavukcuoglu K. Recurrent models of visual attention. In: Proceedings of the 27th International Conference on Neural Information Processing Systems. Montreal, Canada: ACM, 2014. 2204-2212