2.845

2023影响因子

(CJCR)

  • 中文核心
  • EI
  • 中国科技核心
  • Scopus
  • CSCD
  • 英国科学文摘

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

基于边际Fisher准则和迁移学习的小样本集分类器设计算法

舒醒 于慧敏 郑伟伟 谢奕 胡浩基 唐慧明

舒醒, 于慧敏, 郑伟伟, 谢奕, 胡浩基, 唐慧明. 基于边际Fisher准则和迁移学习的小样本集分类器设计算法. 自动化学报, 2016, 42(9): 1313-1321. doi: 10.16383/j.aas.2016.c150560
引用本文: 舒醒, 于慧敏, 郑伟伟, 谢奕, 胡浩基, 唐慧明. 基于边际Fisher准则和迁移学习的小样本集分类器设计算法. 自动化学报, 2016, 42(9): 1313-1321. doi: 10.16383/j.aas.2016.c150560
SHU Xing, YU Hui-Min, ZHENG Wei-Wei, XIE Yi, HU Hao-Ji, TANG Hui-Ming. Classifier-designing Algorithm on a Small Dataset Based on Margin Fisher Criterion and Transfer Learning. ACTA AUTOMATICA SINICA, 2016, 42(9): 1313-1321. doi: 10.16383/j.aas.2016.c150560
Citation: SHU Xing, YU Hui-Min, ZHENG Wei-Wei, XIE Yi, HU Hao-Ji, TANG Hui-Ming. Classifier-designing Algorithm on a Small Dataset Based on Margin Fisher Criterion and Transfer Learning. ACTA AUTOMATICA SINICA, 2016, 42(9): 1313-1321. doi: 10.16383/j.aas.2016.c150560

基于边际Fisher准则和迁移学习的小样本集分类器设计算法

doi: 10.16383/j.aas.2016.c150560
基金项目: 

教育部--中国移动科研基金 MCM20150503

浙江省自然科学基金 LQ12F02014

国家自然科学基金 61471321

国家自然科学基金 61202400

详细信息
    作者简介:

    舒醒 浙江大学信息与电子工程学院硕士研究生.主要研究方向为计算机视觉,图像分类.E-mail:21331093@zju.edu.cn

    郑伟伟 浙江大学信息与电子工程学院博士研究生.主要研究方向为图像匹配,跟踪技术.E-mail:3090102748@zju.edu.cn

    谢奕 浙江大学信息与电子工程学院博士研究生.主要研究方向为图像匹配,跟踪技术.E-mail:Yixie@zju.edu.cn

    胡浩基 浙江大学信息与电子工程学院副教授.主要研究方向为图像处理,计算机视觉.E-mail:haoji_hu@zju.edu.cn

    唐慧明 浙江大学信息与电子工程学院副教授.主要研究方向为图像识别,计算机视觉.E-mail:tanghm@isee.zju.edu.cn

    通讯作者:

    于慧敏 浙江大学信息与电子工程学院教授.主要研究方向为医学图像处理,计算机视觉.本文通信作者.E-mail:yhm2005@zju.edu.cn

Classifier-designing Algorithm on a Small Dataset Based on Margin Fisher Criterion and Transfer Learning

Funds: 

Ministry of Education -- China Mobile Research Fund MCM20150503

Natural Science Foundation of Zhejiang Province LQ12F02014

National Natural Science Foundation of China 61471321

National Natural Science Foundation of China 61202400

More Information
    Author Bio:

    Master student at the College of Information Science and Electronic Engineering, Zhejiang University. Her research interest covers computer vision and image classification.

    Ph.D. candidate at the College of Information Science and Electronic Engineering, Zhejiang University. His research interest covers object matching and target tracking.

    Ph.D. candidate at the College of Information Science and Electronic Engineering, Zhejiang University. His research interest covers object matching and target tracking.

    Assistant professor at the College of Information Science and Electronic Engineering, Zhejiang University. His research interest covers image processing and computer vision.

    Assistant professor at the College of Information Science and Electronic Engineering, Zhejiang University. His research interest covers image recognition and computer vision.

    Corresponding author: YU Hui-Min Professor at the College of Information Science and Electronic Engineering, Zhejiang University. His research interest covers medical image processing and computer vision. Corresponding author of this paper.
  • 摘要: 如何利用大量已有的同构标记数据(源域)设计小样本训练数据(目标域)的分类器是一个具有很强应用意义的研究问题. 由于不同域的数据特征分布有差异,直接使用源域数据对目标域样本进行分类的效果并不理想. 针对上述问题,本文提出了一种基于迁移学习的分类器设计算法. 首先,本文利用内积度量的边际Fisher准则对源域进行特征映射,提高源域中类内紧凑性和类间区分性. 其次,为了筛选合理的训练样本对,本文提出一种去除边界奇异点的算法来选择源域密集区域样本点,与目标域中的标记样本点组成训练样本对. 在核化空间上,本文学习了目标域特征到源域特征的非线性转换,将目标域映射到源域. 最后,利用邻近算法(k-nearest neighbor,kNN)分类器对映射后的目标域样本进行分类. 本文不仅改进了边际Fisher准则方法,并且将基于自适应样本对 筛选的迁移学习应用到小样本数据的分类器设计中,提高域间适应性. 在通用数据集上的实验结果表明,本文提出的方法能够有效提高小样本训练域的分类器性能.
  • 图  1  源域(上)与目标域(下)是存在差异的(与源域相比, 目标域中的背景更复杂, 分辨率更低, 视角更多样)

    Fig.  1  There exist differences between source domain (top) and target domain (bottom) (Compared with the source domain, the target domain contains more complex backgrounds, the lower solution, and more camera angles.)

    图  2  基于边际Fisher准则和迁移学习的小样本数据分类器设计算法流程图

    Fig.  2  The flow diagram of classifier-designing algorithm on a small dataset based on margin Fisher criterion and transfer learning

    图  3  边际Fisher准则的图结构: 本征图和惩罚图

    Fig.  3  The graph structure of margin Fisher criterion: intinsic graph and penalty graph

    图  4  圆形虚线内是源域的同类样本的密集区, 源域的其他样本点被称为奇异点

    Fig.  4  The concentrated region of samples belonging to the same category in the source domain is surrounded by the round dashed line, while the other samples in the source domain are called singular points

    图  5  域间特征转换示意图(灰色表示源域, 黑色表示目标域)

    Fig.  5  The diagrammatic sketch of features transformation between the target domain and source domain (The gray points represent the target domain, while the balck ones represent the source domain.)

    图  6  webcam, amazon和dslr中椅子类别中的样例图

    Fig.  6  The samples image of chair in three domain: webcam, amazon, dslr

    图  7  在目标域上的分类准确率随目标域中可用标记样本数量的变化曲线(其中源域中选择了20个标记样本, 源域为dslr, 目标域为webcam)

    Fig.  7  The accuracy rate curves in the target domain varying with the number of labeled samples in the target domain (Where 20 labeled samples in source domain is selected, the source domain here is dslr, while the target domain is webcam.

    表  1  本文算法的实验参数设置

    Table  1  The experiment parameters set of our method

    AB k1 k2 kA kB σ
    aw 22 1700 20 3 1
    ad 22 1700 20 3 1
    wa 20 680 15 3 1
    wd 20 680 15 3 1
    da 17 420 15 3 1
    dw 17 420 15 3 1
    下载: 导出CSV

    表  2  在三个数据域上的分类准确率(%)(加粗字体表示最佳性能, 缩写: a: amazom, w: webcam, d: dslr)

    Table  2  Accuracy rates in the three domains (%)(The bold font represents the best performance, abbreviation: a: amazom, w: webcam, d: dslr.

    AB kNN-ab kNN-bb symm[3] ARCt[16] gfk[20] svm-s hfa[28] mmf-Euclid mmf
    aw 9.6±1.0 51.0±0.8 51.0±1.4 55.7±0.9 57.8±1.0 34.5±0.8 61.5±0.9 49.3±0.8 55.5±0.7
    ad 4.9±1.1 47.9±0.9 47.9±1.4 50.2±0.7 50.5±0.8 35.3±0.6 52.4±1.0 50.6±0.9 57.6±0.9
    wa 10.5±0.6 40.1±0.5 43.7±0.7 43.4±0.5 44.1±0.4 34.9±0.4 44.5±0.7 42.9±0.6 44.6±0.8
    wd 23.2±0.8 54.1±0.9 69.8±1.0 71.3±0.8 68.5±0.5 65.8±0.8 52.7±1.1 70.1±0.7 71.5±0.6
    da 11.3±0.5 35.6±0.7 42.7±0.5 42.5±0.5 45.7±0.8 33.8±0.4 45.4±0.9 41.4±0.7 42.6±0.6
    dw 37.6±0.8 54.0±0.7 63.4±0.9 65.3±0.5 66.4±0.5 68.1±0.6 62.4±0.8 68.5±1.0 69.5±0.7
    平均准确率 12.2±0.8 37.8±0.8 55.6±1.0 56.9±0.7 57.7±0.7 45.4±0.6 53.2±0.9 55.4±0.8 58.6±0.8
    下载: 导出CSV
  • [1] Cristianini N, Shawe-Taylor J, Kandola J S. Spectral kernel methods for clustering. In: Proceedings of the 2001 Advances in Neural Information Processing Systems 14 (NIPS01). London: MIT, 2002. 649-655
    [2] Bosch A, Zisserman A, Munoz X. Representing shape with a spatial pyramid kernel. In: Proceedings of the 6th ACM International Conference on Image & Video Retrieval. New York: ACM, 2007. 401-408
    [3] Saenko K, Kulis B, Fritz M, Darrell T. Adapting visual category models to new domains. In: Proceedings of the 11th European Conference on Computer Vision, Lecture Notes in Computer Science. Heraklion, Crete, Greece: Springer, 2010. 213-226
    [4] Patel V M, Gopalan R, Li R, Chellappa R. Visual domain adaptation: a survey of recent advances. IEEE Signal Processing Magazine, 2015, 32(3): 53-69 doi: 10.1109/MSP.2014.2347059
    [5] Hoffman J, Rodner E, Donahue J, Darrell T, Saenko K. Efficient learning of domain-invariant image representations. arXiv: 1301.3224, 2013.
    [6] Shao L, Zhu F, Li X. Transfer learning for visual categorization: a survey. IEEE Transactions on Neural Networks & Learning Systems, 2015, 26(5): 1019-1034 http://cn.bing.com/academic/profile?id=2163345210&encoded=0&v=paper_preview&mkt=zh-cn
    [7] 顾鑫, 王士同, 许敏. 基于多源的跨领域数据分类快速新算法. 自动化学报, 2014, 40(3): 531-547 http://www.aas.net.cn/CN/abstract/abstract18319.shtml

    Gu Xin, Wang Shi-Tong, Xu Min. A new cross-multidomain classification algorithm and its fast version for large datasets. Acta Automatica Sinica, 2014, 40(3): 531-547 http://www.aas.net.cn/CN/abstract/abstract18319.shtml
    [8] 张倩, 李明, 王雪松, 程玉虎, 朱美强. 一种面向多源领域的实例迁移学习. 自动化学报, 2014, 40(6): 1176-1183 http://www.aas.net.cn/CN/abstract/abstract18387.shtml

    Zhang Qian, Li Ming, Wang Xue-Song, Cheng Yu-Hu, Zhu Mei-Qiang. Instance-based transfer learning for multi-source domains. Acta Automatica Sinica, 2014, 40(6): 1176-1183 http://www.aas.net.cn/CN/abstract/abstract18387.shtml
    [9] Yang J, Yan R, Hauptmann A G. Cross-domain video concept detection using adaptive SVMS. In: Proceedings of the 15th ACM International Conference on Multimedia. New York: ACM, 2007. 188-197
    [10] 王雪松, 潘杰, 程玉虎, 曹戈. 基于相似度衡量的决策树自适应迁移. 自动化学报, 2013, 39(12): 2186-2192 http://www.aas.net.cn/CN/abstract/abstract18260.shtml

    Wang Xue-Song, Pan Jie, Cheng Yu-Hu, Cao Ge. Self-adaptive transfer for decision trees based on similarity metric. Acta Automatica Sinica, 2013, 39(12): 2186-2192 http://www.aas.net.cn/CN/abstract/abstract18260.shtml
    [11] Bergamo A, Torresani L. Exploiting weakly-labeled web images to improve object classification: a domain adaptation approach. In: Proceedings of the Advances in Neural Information Processing Systems 23. Vancouver, British Columbia, Canada: Curran Associates, Inc., 2010. 181-189
    [12] Li X. Regularized Adaptation: Theory, Algorithms and Applications [Ph.D. dissertation], University of Washington, USA, 2007
    [13] 董爱美, 王士同. 共享隐空间迁移SVM. 自动化学报, 2014, 40(10): 2276-2287 http://www.aas.net.cn/CN/abstract/abstract18502.shtml

    Dong Ai-Mei, Wang Shi-Tong. A shared latent subspace transfer learning algorithm using SVM. Acta Automatica Sinica, 2014, 40(10): 2276-2287 http://www.aas.net.cn/CN/abstract/abstract18502.shtml
    [14] Oquab M, Bottou L, Laptev I, Sivic J. Learning and transferring mid-level image representations using convolutional neural networks. In: Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Columbus, OH: IEEE, 2014. 1717-1724
    [15] Razavian A S, Azizpour H, Sullivan J, Carlsson S. CNN features off-the-shelf: an astounding baseline for recognition. Computer Vision & Pattern Recognition Workshops, 2014. 512-519 http://cn.bing.com/academic/profile?id=2062118960&encoded=0&v=paper_preview&mkt=zh-cn
    [16] Kulis B, Saenko K, Darrell T. What you saw is not what you get: domain adaptation using asymmetric kernel transforms. In: Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Providence, RI: IEEE, 2011. 1785-1792
    [17] Farhadi A, Tabrizi M K. Learning to recognize activities from the wrong view point. In: Proceedings of the 10th European Conference on Computer Vision——ECCV 2008. Marseille, France: Springer, 2008. 154-166
    [18] Chopra S, Balakrishnan S, Gopalan R. DLID: deep learning for domain adaptation by interpolating between domains. In: Proceedings of the ICML Workshop on Representation Learning. Atlanta, Georgia, USA, 2013.
    [19] Wang H, Nie F P, Huang H, Ding C. Dyadic transfer learning for cross-domain image classification. In: Proceedings of the 2011 IEEE International Conference on Computer Vision. Barcelona: IEEE, 2011. 551-556
    [20] Gong B, Shi Y, Sha F, Grauman, K. Geodesic flow kernel for unsupervised domain adaptation. In: Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition. Providence, RI: IEEE, 2012. 2066-2073
    [21] Long M, Wang J, Sun J, Yu P S. Domain invariant transfer kernel learning. IEEE Transactions on Knowledge & Data Engineering, 2015, 27(6): 1519-1532 http://cn.bing.com/academic/profile?id=2087977130&encoded=0&v=paper_preview&mkt=zh-cn
    [22] Shao L, Liu L, Li X L. Feature learning for image classification via multiobjective genetic programming. IEEE Transactions on Neural Networks and Learning Systems, 2014, 25(7): 1359-1371 doi: 10.1109/TNNLS.2013.2293418
    [23] Davis J V, Kulis B, Jain P, Sra S, Dhillon I S. Information-theoretic metric learning. In: Proceedings of the 24th International Conference on Machine Learning. Corvallis, Oregon, USA: ACM, 2007. 209-216
    [24] Yan S C, Xu D, Zhang B Y, Zhang H J, Yang Q, Lin S. Graph embedding and extensions: a general framework for dimensionality reduction. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007, 29(1): 40-51 doi: 10.1109/TPAMI.2007.250598
    [25] Li Y L, Liu P, Du H S, Li Z, Liu J H, Yu D Y, Li M Q. Marginal fisher analysis-based feature extraction for identification of drug and explosive concealed by body packing. Analytical Methods, 2013, 5(22): 6331-6337 doi: 10.1039/c3ay40998h
    [26] Wang Z Q, Sun X. Optimal kernel marginal fisher analysis for face recognition. Journal of Computers, 2012, 7(9): 2298-2305 http://cn.bing.com/academic/profile?id=2171475570&encoded=0&v=paper_preview&mkt=zh-cn
    [27] Al Censor Y, Zenios S A. Parallel optimization: theory, algorithms and applications. Scalable Computing: Practice & Experience. New York: Oxford University Press, 1997.
    [28] Duan L X, Xu D, Tsang I W. Learning with augmented features for heterogeneous domain adaptation. In: Proceedings of the 29th International Conference on Machine Learning. Edinburgh, Scotland, UK: Omnipress, 2012. 711-718
  • 加载中
图(7) / 表(2)
计量
  • 文章访问数:  2505
  • HTML全文浏览量:  467
  • PDF下载量:  1359
  • 被引次数: 0
出版历程
  • 收稿日期:  2015-09-09
  • 录用日期:  2015-12-11
  • 刊出日期:  2016-09-01

目录

    /

    返回文章
    返回