2.845

2023影响因子

(CJCR)

  • 中文核心
  • EI
  • 中国科技核心
  • Scopus
  • CSCD
  • 英国科学文摘

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

基于自适应时间步脉冲神经网络的高效图像分类

李千鹏 贾顺程 张铁林 陈亮

李千鹏, 贾顺程, 张铁林, 陈亮. 基于自适应时间步脉冲神经网络的高效图像分类. 自动化学报, 2024, 50(9): 1724−1735 doi: 10.16383/j.aas.c230656
引用本文: 李千鹏, 贾顺程, 张铁林, 陈亮. 基于自适应时间步脉冲神经网络的高效图像分类. 自动化学报, 2024, 50(9): 1724−1735 doi: 10.16383/j.aas.c230656
Li Qian-Peng, Jia Shun-Cheng, Zhang Tie-Lin, Chen Liang. Adaptive timestep improved spiking neural network for efficient image classification. Acta Automatica Sinica, 2024, 50(9): 1724−1735 doi: 10.16383/j.aas.c230656
Citation: Li Qian-Peng, Jia Shun-Cheng, Zhang Tie-Lin, Chen Liang. Adaptive timestep improved spiking neural network for efficient image classification. Acta Automatica Sinica, 2024, 50(9): 1724−1735 doi: 10.16383/j.aas.c230656

基于自适应时间步脉冲神经网络的高效图像分类

doi: 10.16383/j.aas.c230656 cstr: 32138.14.j.aas.c230656
基金项目: 国家重点研发计划(2021ZD0200300)资助
详细信息
    作者简介:

    李千鹏:中国科学院自动化研究所硕士研究生. 主要研究方向为类脑智能, 类脑处理器. E-mail: liqianpeng2021@ia.ac.cn

    贾顺程:中国科学院自动化研究所博士研究生. 主要研究方向为类脑脉冲神经网络模型与学习算法. E-mail: jiashuncheng2020@ia.ac.cn

    张铁林:中国科学院脑科学与智能技术卓越创新中心研究员. 主要研究方向为类脑计算. E-mail: zhangtielin@ion.ac.cn

    陈亮:中国科学院自动化研究所副研究员. 主要研究方向为计算机架构与集成电路设计, 类脑计算. 本文通信作者. E-mail: liang.chen@ia.ac.cn

Adaptive Timestep Improved Spiking Neural Network for Efficient Image Classification

Funds: Supported by National Key Research and Development Program of China (2021ZD0200300)
More Information
    Author Bio:

    LI Qian-Peng Master student at the Institute of Automation, Chinese Academy of Sciences. His research interest covers brain-inspired intelligence and brain-inspired processor

    JIA Shun-Cheng Ph.D. candidate at the Institute of Automation, Chinese Academy of Sciences. His research interest covers brain-inspired spiking neural network model and learning algorithm

    ZHANG Tie-Lin Researcher at the Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences. His main research interest is brain-inspired computing

    CHEN Liang Associate researcher at the Institute of Automation, Chinese Academy of Sciences. His research interest covers computer architecture and integrated circuit design and brain-inspired computing. Corresponding author of this paper

  • 摘要: 脉冲神经网络(Spiking neural network, SNN)由于具有相对人工神经网络(Artifcial neural network, ANN)更低的计算能耗而受到广泛关注. 然而, 现有SNN大多基于同步计算模式且往往采用多时间步的方式来模拟动态的信息整合过程, 因此带来了推理延迟增大和计算能耗增高等问题, 使其在边缘智能设备上的高效运行大打折扣. 针对这个问题, 本文提出一种自适应时间步脉冲神经网络(Adaptive timestep improved spiking neural network, ATSNN)算法. 该算法可以根据不同样本特征自适应选择合适的推理时间步, 并通过设计一个时间依赖的新型损失函数来约束不同计算时间步的重要性. 与此同时, 针对上述ATSNN特点设计一款低能耗脉冲神经网络加速器, 支持ATSNN算法在VGG和ResNet等成熟框架上的应用部署. 在CIFAR10、CIFAR100、CIFAR10-DVS等标准数据集上软硬件实验结果显示, 与当前固定时间步的SNN算法相比, ATSNN算法的精度基本不下降, 并且推理延迟减少36.7% ~ 58.7%, 计算复杂度减少33.0% ~ 57.0%. 在硬件模拟器上的运行结果显示, ATSNN的计算能耗仅为GPU RTX 3090Ti的4.43% ~ 7.88%. 显示出脑启发神经形态软硬件的巨大优势.
  • 图  1  ATSNN结构及训练和动态时间推理流程图

    Fig.  1  ATSNN structure and training and dynamic time inference flow chart

    图  2  加速器架构((a)常规加速器; (b) DTSNN的加速器; (c)本文加速器)

    Fig.  2  Accelerator architecture ((a) Conventional accelerator; (b) Accelerator of DTSNN; (c) Our accelerator)

    图  3  不同平均推理时间步时的准确率

    Fig.  3  Accuracy at different average inference timesteps

    图  4  固定时间步时使用三种损失函数的准确率

    Fig.  4  Accuracy by using three loss functions at fixed timestep

    图  5  超参数对准确率的影响

    Fig.  5  The effect of hyperparameters on accuracy

    图  6  超参数对时间步的影响

    Fig.  6  The effect of hyperparameters on timestep

    图  7  $ \tau_{CL}\ne1 $和$ \tau_{CL} =1$时的性能对比

    Fig.  7  Performance comparison between $ \tau_{CL}\ne1 $ and $ \tau_{CL}= 1 $

    表  1  SNN、DTSNN和ATSNN在时间步、准确率和复杂度的对比

    Table  1  Comparison of SNN, DTSNN and ATSNN in timestep, accuracy and complexity

    网络结构 算法 CIFAR10 CIFAR100 CIFAR10-DVS
    时间步 准确率(%) 复杂度 时间步 准确率(%) 复杂度 时间步 准确率(%) 复杂度
    VGG16 SNN 4.00 91.35 1.00 4.00 66.99 1.00 10.00 72.60 1.00
    DTSNN 2.53 (4) 91.13 0.79 3.58 (4) 69.68 0.96 5.70 (10) 73.30 0.56
    ATSNN 2.19 (4) 91.61 0.67 2.49 (4) 70.05 0.63 4.13 (10) 73.00 0.43
    ResNet19 SNN 4.00 91.86 1.00 4.00 67.22 1.00 10.00 70.00 1.00
    DTSNN 2.52 (4) 91.51 0.88 3.54 (4) 67.58 1.02 6.95 (10) 70.50 0.94
    ATSNN 2.00 (4) 91.84 0.67 2.53 (4) 70.64 0.64 5.57 (10) 69.50 0.63
    下载: 导出CSV

    表  2  每个时间步的相对计算复杂度及网络性能

    Table  2  Relative computational complexity per timestep and network performance

    数据集 T1 T2 T3 T4 T5 $\tilde{T}$ (%) $\Delta ACC$ (%)
    ImageNet-100 0.998 1.009 1.009 1.010 1.003 68.49 0.90
    N-Caltech-101 0.959 0.977 0.978 0.959 0.989 68.32 0.45
    下载: 导出CSV

    表  3  加速器与GPU关于FPS、耗能对比

    Table  3  Comparison of FPS and energy consumption between accelerator and GPU

    网络结构数据集GPU本文加速器
    FPS耗能(J)FPS耗能(J)
    VGG16CIFAR102156814248537.6 (7.88%)
    CIFAR1002007499228543.6 (7.24%)
    ResNet19CIFAR1021511088212606.1 (5.46%)
    CIFAR10017213914180617.2 (4.43%)
    下载: 导出CSV

    表  4  默认超参数的性能

    Table  4  Performance with default hyperparameters

    数据集网络结构时间步准确率(%)
    CIFAR10ResNet191.897 (4)91.77
    CIFAR10VGG162.183 (4)91.57
    CIFAR100ResNet192.501 (4)70.46
    CIFAR100VGG162.642 (4)69.58
    下载: 导出CSV

    表  5  ATSNN消融实验

    Table  5  Ablation experiment of ATSNN

    数据集输出层神经元损失函数输出表征T1T2T3T4
    CIFAR1034.4389.3990.8291.86
    47.2689.8590.5291.38
    85.6888.7890.3991.28
    86.9389.9691.6391.79
    86.9390.1491.3991.86
    CIFAR10027.1356.6862.5367.22
    30.3857.1963.3767.70
    52.6963.8067.2969.61
    53.4764.3567.5369.90
    53.4765.2368.1770.25
    下载: 导出CSV

    表  6  ATSNN与低延迟算法的对比

    Table  6  Comparison between ATSNN and low-latency algorithms

    数据集 算法 算法类型 网络结构 时间步 准确率 (%)
    CIFAR10 Conversion[11] ANN转SNN VGG16 8 90.96
    STDB[33] 转换+训练 VGG16 5 91.41
    EfficientLIF-Net[10] 直接训练 VGG16 5 90.30
    DTSNN[19] 直接训练 VGG16 2.53 (4) 91.13
    本文 直接训练 VGG16 2.56 (10) 92.09
    STBP-tdBN[14] 直接训练 ResNet19 6 93.16
    DTSNN[19] 直接训练 ResNet19 2.51 (4) 91.51
    本文 直接训练 ResNet19 2.71 (10) 92.38
    CIFAR100 STDB[33] 转换+训练 VGG16 5 66.46
    Diet-SNN[15] 直接训练 VGG16 5 69.67
    Real Spike[34] 直接训练 VGG16 5 70.62
    RecDis-SNN[35] 直接训练 VGG16 5 69.88
    本文 直接训练 VGG16 3.86 (10) 71.37
    DTSNN[19] 直接训练 ResNet19 3.54 (4) 67.58
    本文 直接训练 ResNet19 2.53 (10) 70.64
    CIFAR10-DVS STBP-tdBN[14] 直接训练 ResNet19 6 67.80
    DTSNN[19] 直接训练 ResNet19 6.95 (10) 70.50
    本文 直接训练 ResNet19 5.57 (10) 69.50
    ImageNet-100 EfficientLIF-Net[10] 直接训练 ResNet19 5 79.44
    LocalZO[36] 直接训练 SEW-ResNet34 4 81.56
    本文 直接训练 ResNet19 1.76 (5) 81.96
    TinyImageNet EfficientLIF-Net[10] 直接训练 ResNet19 5 55.44
    DTSNN[19] 直接训练 ResNet19 3.71 (5) 57.18
    本文 直接训练 ResNet19 2.47 (5) 57.61
    N-Caltech-101 MC-SNN[37] 直接训练 VGG16 20 81.24
    DTSNN[19] 直接训练 VGG16 3.21 (10) 82.26
    本文 直接训练 VGG16 2.19 (10) 82.63
    NDA[30] 直接训练 ResNet19 10 78.60
    本文 直接训练 ResNet19 2.56 (10) 80.56
    下载: 导出CSV
  • [1] Dampfhoffer M, Mesquida T, Valentian A, Anghel L. Are SNNs really more energy-efficient than ANNs? An in-depth hardware-aware study. IEEE Transactions on Emerging Topics in Computational Intelligence, 2023, 7(3): 731−741 doi: 10.1109/TETCI.2022.3214509
    [2] Merolla P A, Arthur J V, Alvarez I R, Cassidy A S, Sawada J, Akopyan F, et al. A million spiking-neuron integrated circuit with a scalable communication network and interface. Science, 2014, 345(6197): 668−673 doi: 10.1126/science.1254642
    [3] Davies M, Srinivasa N, Lin T H, Chinya G, Cao Y Q, Choday S H, et al. Loihi: A neuromorphic manycore processor with on-chip learning. IEEE Micro, 2018, 38(1): 82−99 doi: 10.1109/MM.2018.112130359
    [4] Pei J, Deng L, Song S, Zhao M G, Zhang Y H, Wu S, et al. Towards artificial general intelligence with hybrid Tianjic chip architecture. Nature, 2019, 572(7767): 106−111 doi: 10.1038/s41586-019-1424-8
    [5] Bouvier M, Valentian A, Mesquida T, Rummens F, Reyboz M, Vianello E, et al. Spiking neural networks hardware implementations and challenges: A survey. ACM Journal on Emerging Technologies in Computing Systems, 2019, 15(2): Article No. 22
    [6] Wu Y J, Deng L, Li G Q, Zhu J, Shi L P. Spatio-temporal backpropagation for training high-performance spiking neural networks. Frontiers in Neuroscience, DOI: 10.3389/fnins.2018.00331
    [7] Izhikevich E M. Which model to use for cortical spiking neurons? IEEE Transactions on Neural Networks, 2004, 15(5): 1063−1070 doi: 10.1109/TNN.2004.832719
    [8] Taherkhani A, Belatreche A, Li Y H, Cosma G, Maguire L P, McGinnity T M. A review of learning in biologically plausible spiking neural networks. Neural Networks, 2020, 122: 253−272 doi: 10.1016/j.neunet.2019.09.036
    [9] Liang Z Z, Schwartz D, Ditzler G, Koyluoglu O O. The impact of encoding-decoding schemes and weight normalization in spiking neural networks. Neural Networks, 2018, 108: 365−378 doi: 10.1016/j.neunet.2018.08.024
    [10] Kim Y, Li Y H, Moitra A, Yin R K, Panda P. Sharing leaky-integrate-and-fire neurons for memory-efficient spiking neural networks. Frontiers in Neuroscience, DOI: 10.3389/fnins.2023.1230002
    [11] Bu T, Ding J H, Yu Z F, Huang T J. Optimized potential initialization for low-latency spiking neural networks. arXiv preprint arXiv: 2202.01440, 2022.
    [12] Jiang C M, Zhang Y L. KLIF: An optimized spiking neuron unit for tuning surrogate gradient slope and membrane potential. arXiv preprint arXiv: 2302.09238, 2023.
    [13] Fang W, Yu Z F, Chen Y Q, Huang T J, Masquelier T, Tian Y H. Deep residual learning in spiking neural networks. arXiv preprint arXiv: 2102.04159, 2021.
    [14] Zheng H L, Wu Y J, Deng L, Hu Y F, Li G Q. Going deeper with directly-trained larger spiking neural networks. arXiv preprint arXiv: 2011.05280, 2020.
    [15] Rathi N, Roy K. DIET-SNN: A low-latency spiking neural network with direct input encoding and leakage and threshold optimization. IEEE Transactions on Neural Networks and Learning Systems, 2023, 34(6): 3174−3182 doi: 10.1109/TNNLS.2021.3111897
    [16] Guo Y F, Chen Y P, Zhang L W, Liu X D, Wang Y L, Huang X H, et al. IM-loss: Information maximization loss for spiking neural networks. In: Proceedings of the 36th Conference on Neural Information Processing Systems. New Orleans, USA: Curran Associates Inc., 2022. 156−166
    [17] Teerapittayanon S, McDanel B, Kung H T. BranchyNet: Fast inference via early exiting from deep neural networks. In: Proceedings of the 23rd International Conference on Pattern Recognition (ICPR). Cancun, Mexico: IEEE, 2016. 2464−2469
    [18] Kim Y, Panda P. Revisiting batch normalization for training low-latency deep spiking neural networks from scratch. Frontiers in Neuroscience, DOI: 10.3389/fnins.2021.773954
    [19] Li Y H, Moitra A, Geller T, Panda P. Input-aware dynamic timestep spiking neural networks for efficient in-memory computing. In: Proceedings of the 60th ACM/IEEE Design Automation Conference (DAC). San Francisco, USA: IEEE, 2023. 1−6
    [20] Deng L, Wu Y J, Hu Y F, Liang L, Li G Q, Hu X, et al. Comprehensive SNN compression using ADMM optimization and activity regularization. IEEE Transactions on Neural Networks and Learning Systems, 2023, 34(6): 2791−2805 doi: 10.1109/TNNLS.2021.3109064
    [21] Stöckl C, Maass W. Optimized spiking neurons can classify images with high accuracy through temporal coding with two spikes. Nature Machine Intelligence, 2021, 3(3): 230−238 doi: 10.1038/s42256-021-00311-4
    [22] Chen Y Q, Yu Z F, Fang W, Huang T J, Tian Y H. Pruning of deep spiking neural networks through gradient rewiring. arXiv preprint arXiv: 2105.04916, 2021.
    [23] Eshraghian J K, Lammie C, Azghadi M R, Lu W D. Navigating local minima in quantized spiking neural networks. In: Proceedings of the IEEE 4th International Conference on Artificial Intelligence Circuits and Systems (AICAS). Incheon, South Korea: IEEE, 2022. 352−355
    [24] Wang Y X, Xu Y, Yan R, Tang H J. Deep spiking neural networks with binary weights for object recognition. IEEE Transactions on Cognitive and Developmental Systems, 2021, 13(3): 514−523 doi: 10.1109/TCDS.2020.2971655
    [25] Krizhevsky A. Learning Multiple Layers of Features From Tiny Images [Master thesis], University of Toronto, Canada, 2009.
    [26] Le Y, Yang X. Tiny imagenet visual recognition challenge [Online], available: http://cs231n.stanford.edu/tiny-imagenet-200.zip, June 11, 2023
    [27] Deng J, Dong W, Socher R, Li L J, Li K, LI F F. ImageNet: A large-scale hierarchical image database. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Miami, USA: IEEE, 2009. 248−255
    [28] Li H M, Liu H C, Ji X Y, Li G Q, Shi L P. CIFAR10-DVS: An event-stream dataset for object classification. Frontiers in Neuroscience, DOI: 10.3389/fnins.2017.00309
    [29] Orchard G, Jayawant A, Cohen G K, Thakor N. Converting static image datasets to spiking neuromorphic datasets using saccades. Frontiers in Neuroscience, DOI: 10.3389/fnins.2015.00437
    [30] Li Y H, Kim Y, Park H, Geller T, Panda P. Neuromorphic data augmentation for training spiking neural networks. In: Proceedings of the 17th European Conference on Computer Vision. Tel Aviv, Israel: Springer, 2022. 631−649
    [31] Fang W, Chen Y Q, Ding J H, Yu Z F, Masquelier T, Chen D, et al. SpikingJelly: An open-source machine learning infrastructure platform for spike-based intelligence. Science Advances, 2023, 9(40): Article No. eadi1480 doi: 10.1126/sciadv.adi1480
    [32] Yin R K, Moitra A, Bhattacharjee A, Kim Y, Panda P. SATA: Sparsity-aware training accelerator for spiking neural networks. IEEE Transactions on Computer-aided Design of Integrated Circuits and Systems, 2023, 42(6): 1926−1938 doi: 10.1109/TCAD.2022.3213211
    [33] Datta G, Kundu S, Beerel P A. Training energy-efficient deep spiking neural networks with single-spike hybrid input encoding. In: Proceedings of the International Joint Conference on Neural Networks (IJCNN). Shenzhen, China: IEEE, 2021. 1−8
    [34] Guo Y F, Zhang L W, Chen Y P, Tong X Y, Liu X D, Wang Y L, et al. Real spike: Learning real-valued spikes for spiking neural networks. In: Proceedings of the 17th European Conference on Computer Vision. Tel Aviv, Israel: Springer, 2022. 52−68
    [35] Guo Y F, Tong X Y, Chen Y P, Zhang L W, Liu X D, Ma Z, et al. RecDis-SNNS: Rectifying membrane potential distribution for directly training spiking neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). New Orleans, USA: IEEE, 2022. 326−335
    [36] Mukhoty B, Bojkovic V, Vazelhes W, Zhao X H, Masi G, Xiong H, et al. Direct training of SNN using local zeroth order method. In: Proceedings of the Thirty-seventh Conference on Neural Information Processing Systems. New Orleans, USA: Curran Associates Inc., 2023. 18994−19014
    [37] Li X J, Tang J X, Lai J H. Learning high-performance spiking neural networks with multi-compartment spiking neurons. In: Proceedings of the 12th International Conference on Image and Graphics. Nanjing, China: Springer, 2023. 91−102
  • 加载中
图(7) / 表(6)
计量
  • 文章访问数:  165
  • HTML全文浏览量:  64
  • PDF下载量:  50
  • 被引次数: 0
出版历程
  • 收稿日期:  2023-10-25
  • 录用日期:  2024-03-15
  • 网络出版日期:  2024-07-31
  • 刊出日期:  2024-09-19

目录

    /

    返回文章
    返回