• 中文核心
  • EI
  • 中国科技核心
  • Scopus
  • CSCD
  • 英国科学文摘

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

深度学习软测量模型对抗鲁棒性增强: 一种对抗训练与攻击检测融合的对抗防御方法

郭润元 李安伦 刘涵 张友民 刘丁

郭润元, 李安伦, 刘涵, 张友民, 刘丁. 深度学习软测量模型对抗鲁棒性增强: 一种对抗训练与攻击检测融合的对抗防御方法. 自动化学报, xxxx, xx(x): x−xx doi: 10.16383/j.aas.c250725
引用本文: 郭润元, 李安伦, 刘涵, 张友民, 刘丁. 深度学习软测量模型对抗鲁棒性增强: 一种对抗训练与攻击检测融合的对抗防御方法. 自动化学报, xxxx, xx(x): x−xx doi: 10.16383/j.aas.c250725
Guo Run-Yuan, Li An-Lun, Liu Han, Zhang You-Min, Liu Ding. Enhancing adversarial robustness of deep learning-based soft sensors: a defense method integrating adversarial training and attack detection. Acta Automatica Sinica, xxxx, xx(x): x−xx doi: 10.16383/j.aas.c250725
Citation: Guo Run-Yuan, Li An-Lun, Liu Han, Zhang You-Min, Liu Ding. Enhancing adversarial robustness of deep learning-based soft sensors: a defense method integrating adversarial training and attack detection. Acta Automatica Sinica, xxxx, xx(x): x−xx doi: 10.16383/j.aas.c250725

深度学习软测量模型对抗鲁棒性增强: 一种对抗训练与攻击检测融合的对抗防御方法

doi: 10.16383/j.aas.c250725 cstr: 32138.14.j.aas.c250725
基金项目: 国家自然科学基金(62127809, 62403377, 92270117), 陕西省自然科学基础研究计划青年项目(2024JC-YBQN-0697), 西安自动化学会青年人才托举计划项目(XAAA-QNRC-2025-10)资助
详细信息
    作者简介:

    郭润元:西安理工大学自动化与信息工程学院讲师. 2023年获得西安理工大学博士学位. 主要研究方向为软测量与工业过程控制, 可信人工智能. E-mail: guorunyuan@xaut.edu.cn

    李安伦:西安理工大学自动化与信息工程学院硕士研究生. 主要研究方向为深度学习软测量, 软测量对抗攻击与防御. E-mail: 2230320161@stu.xaut.edu.cn

    刘涵:西安理工大学自动化与信息工程学院教授. 2003年获得西安理工大学博士学位. 主要研究方向为工业人工智能, 工业过程智能监测与诊断. 本文通信作者. E-mail: liuhan@xaut.edu.cn

    张友民:加拿大康考迪亚大学机械、工业与航空工程系教授, IEEE Fellow, CSME Fellow. 1995年获得西北工业大学博士学位. 主要研究方向为故障检测与诊断, 多智能体容错协同控制. E-mail: ymzhang@encs.concordia.ca

    刘丁:西安理工大学自动化与信息工程学院教授. 1997年获得西安交通大学博士学位. 主要研究方向为复杂工业过程信息测量、建模和智能优化控制, 智能机器人及控制. E-mail: liud@xaut.edu.cn

  • 中图分类号: Y

Enhancing Adversarial Robustness of Deep Learning-Based Soft Sensors: A Defense Method Integrating Adversarial Training and Attack Detection

Funds: Supported by National Natural Science Foundation of China (62127809, 62403377, 92270117), Youth Program of the Natural Science Basic Research Program of Shaanxi Province (2024JC-YBQN-0697), and Xi'an Association of Automation Young Talent Nurturing Program (XAAA-QNRC-2025-10)
More Information
    Author Bio:

    GUO Run-Yuan Lecturer at the School of Automation and Information Engineering, Xi'an University of Technology. He received his Ph.D. degree from Xi'an University of Technology in 2023. His research interests include soft sensor, industrial process control, and trustworthy artificial intelligence

    LI An-Lun Master student at the School of Automation and Information Engineering, Xi'an University of Technology. His research interests include deep learning-based soft sensors and adversarial attack and defense for soft sensors

    LIU Han Professor at the School of Automation and Information Engineering, Xi'an University of Technology. He received his Ph.D. degree from Xi'an University of Technology in 2003. His research interests include industrial artificial intelligence, intelligent monitoring and diagnosis for industrial processes. Corresponding author of this paper

    ZHANG You-Min Professor in the Department of Mechanical, Industrial and Aerospace Engineering, Concordia University, Canada. He is an IEEE Fellow and a CSME Fellow. He received his Ph.D. degree from Northwestern Polytechnical University in 1995. His research interests include fault detection and diagnosis and fault-tolerant cooperative control of multi-agent systems

    LIU Ding Professor at the School of Automation and Information Engineering, Xi'an University of Technology. He received his Ph.D. degree from Xi'an Jiaotong University in 1997. His research interests include information measurement, modeling and intelligent optimal control of complex industrial processes, and intelligent robots and their control

  • 摘要: 深度学习软测量(DLSS)已成为解决复杂工业过程关键变量测量难题的有效方法, 但其极易遭受肉眼难以察觉的对抗攻击而输出虚假预测结果, 进而危害生产安全. 现有对抗攻击检测方法对微小扰动对抗样本(SPAS)普遍难以检出, 且对抗训练防御方法仍广泛存在对抗鲁棒过拟合现象. 针对上述问题, 提出一种对抗训练与攻击检测融合(ATADI)的对抗防御方法: 首先理论证明了仅采用SPAS进行对抗训练更有助于缓解对抗鲁棒过拟合, 其次在此基础上提出一种基于SPAS的特征锚定式对抗训练(FAAT)方案, 即SPAS-FAAT. ATADI中的攻击检测器在防御过程中仅检测较大扰动对抗样本(LPAS)以规避其难以检出SPAS的不足, SPAS则被输入经SPAS-FAAT后的DLSS以获得最终软测量结果. 最后在转子热变形软测量对抗攻防案例上进行了实验: 结构消融结果显示, 将对抗训练与攻击检测相融合可作为DLSS对抗防御的有效手段, 各类攻击产生的LPAS均能以高于98%的准确率被检出, 且经SPAS-FAAT后的DLSS在SPAS与正常样本上的软测量结果均满足精度要求. ATADI方法显著增强了DLSS的对抗鲁棒性, 为对抗防御方法的研究提供了新思路.
  • 图  1  ATADI对抗防御框架图

    Fig.  1  ATADI adversarial defense framework diagram

    图  2  FAAT过程示意图

    Fig.  2  FAAT process diagram

    图  3  转子热变形DLSS的对抗攻击过程及攻击结果示意图

    Fig.  3  Adversarial attack process and attack results diagram of rotor thermal deformation DLSS

    图  4  ATADI对抗防御方法在不同阈值下的召回率折线图

    Fig.  4  Line plot of recall rate for ATADI adversarial defense method under different thresholds

    图  5  ATADI对抗防御方法在不同阈值下的主导变量预测误差折线图

    Fig.  5  Line plot of prediction error of dominant variable for ATADI defense under various thresholds

    图  6  KGAA攻击下实施对抗防御后的DLSS对抗鲁棒性测试曲线(输入SPAS)

    Fig.  6  Adversarial robustness test curves of the DLSS under KGAA attacks after implementing the adversarial defense (SPAS input)

    图  7  KAGAN攻击下实施对抗防御后的DLSS对抗鲁棒性测试曲线(输入SPAS)

    Fig.  7  Adversarial robustness test curves of the DLSS under KAGAN attacks after implementing the adversarial defense (SPAS input)

    图  8  KGAA攻击下实施对抗防御后的DLSS对抗鲁棒性测试曲线(输入正常样本)

    Fig.  8  Adversarial robustness test curves of the DLSS under KGAA attacks after implementing the adversarial defense (normal sample input)

    图  9  KAGAN攻击下实施对抗防御后的DLSS对抗鲁棒性测试曲线(输入正常样本)

    Fig.  9  Adversarial robustness test curves of the DLSS under KAGAN attacks after implementing the adversarial defense (normal sample input)

    表  1  BCD检测器对四类攻击所产生LPAS的检测结果(%)

    Table  1  Detection results of the BCD detector for LPAS generated by four types of attacks(%)

    攻击方法AccuracyPrecisionRecall
    IDAO99.6398.9599.45
    HGAA98.7697.5598.30
    KGAA99.2198.4098.95
    KAGAN98.4297.1097.95
    下载: 导出CSV

    表  2  四种不同对抗攻击下的DLSS对抗鲁棒性测试指标(输入SPAS)

    Table  2  DLSS adversarial robustness test metrics under four different adversarial attacks (SPAS input)

    测试样本MAE
    ${\rm{DLSS}}_{{\rm{SPAS}}-{\rm{FAAT}}}$${\rm{DLSS}}_{{\rm{Original}}}$${\rm{DLSS}}_{{\rm{SAT}}}$
    $x_{{\rm{SPAS}}}^{{\rm{IDAO}}}$0.02820.04210.0391
    $x_{{\rm{SPAS}}}^{{\rm{KGAA}}}$0.01970.04150.0310
    $x_{{\rm{SPAS}}}^{{\rm{HGAA}}}$0.01360.04460.0267
    $x_{{\rm{SPAS}}}^{{\rm{KAGAN}}}$0.02630.03750.0287
    下载: 导出CSV

    表  3  四种不同对抗攻击下的DLSS对抗鲁棒性测试指标(输入正常样本)

    Table  3  DLSS adversarial robustness test metrics under four different adversarial attacks (normal sample input)

    测试样本MAE
    ${\rm{DLSS}}_{{\rm{SPAS}}-{\rm{FAAT}}}$${\rm{DLSS}}_{{\rm{Original}}}$${\rm{DLSS}}_{{\rm{SAT}}}$
    $x_{{\rm{normal}}}^{{\rm{IDAO}}}$0.03860.03910.0410
    $x_{{\rm{normal}}}^{{\rm{KGAA}}}$0.03490.03410.0361
    $x_{{\rm{normal}}}^{{\rm{HGAA}}}$0.03380.03290.0358
    $x_{{\rm{normal}}}^{{\rm{KAGAN}}}$0.03280.03260.0349
    下载: 导出CSV

    表  4  采用不同对抗训练方法的DLSS对抗鲁棒性测试结果

    Table  4  Adversarial robustness test results of DLSS with different adversarial training methods

    测试样本MAE
    ${\rm{DLSS}}_{{\rm{SPAS}}-{\rm{SAT}}}$${\rm{DLSS}}_{{\rm{SPAS}}-{\rm{DAAT}}}$${\rm{DLSS}}_{{\rm{SPAS}}-{\rm{FAAT}}}$
    SPAS0.02380.02170.0197
    正常样本0.03610.03510.0349
    下载: 导出CSV

    A1  缩写对照表

    A1  List of abbreviations

    缩写全称/ 说明
    DLSS深度学习软测量
    SPAS微小扰动对抗样本
    LPAS较大扰动对抗样本
    BCD双向一致性判别
    FAAT特征锚定式对抗训练
    ATADI对抗训练与攻击检测融合
    SAT经典对抗训练
    DAAT域自适应对抗训练
    KGAA知识引导型对抗攻击
    SPAS-FAAT基于微小扰动对抗样本特征锚定式对抗训练
    DLSSOriginal原始深度学习软测量模型
    DLSSSAT经典对抗训练后得到的深度学习软测量模型
    DLSSDAAT域自适应对抗训练后得到的深度学习软测量模型
    DLSSSPAS-FAAT基于微小扰动对抗样本特征锚定式对抗训练后
    得到的深度学习软测量模型
    下载: 导出CSV
  • [1] Guo R Y, Liu H, Xie G, Zhang Y M, Liu D. A self-interpretable soft sensor based on deep learning and multiple attention mechanism: From data selection to sensor modeling. IEEE Transactions on Industrial Informatics, 2023, 19(5): 6859−6871 doi: 10.1109/TII.2022.3181692
    [2] Shi X D, Li R H, Morales H, Huang W, Xiong W L. Semi-supervised probabilistic learning network for soft sensor modeling with partially labeled data. IEEE Transactions on Automation Science and Engineering, 2025, 22: 16309−16321 doi: 10.1109/TASE.2025.3576122
    [3] 乔景慧, 柴天佑. 数据与模型驱动的水泥生料分解率软测量模型. 自动化学报, 2019, 45(8): 1564−1578

    Qiao Jing-Hui, Chai Tian-You. Data and model-based soft measurement model of cement raw meal decomposition ratio. Acta Automatica Sinica, 2019, 45(8): 1564−1578
    [4] 蒙西, 张寅, 乔俊飞. 基于动态模糊神经网络的出水含氮参数软测量方法. 控制理论与应用, 2024, 41(12): 2383−2392 doi: 10.7641/CTA.2023.20667

    Meng Xi, Zhang Yan, Qiao Jun-Fei. A soft-sensing method for effluent nitrogen-containing parameters based on dynamic fuzzy neural networks. Control Theory & Applications, 2024, 41(12): 2383−2392 doi: 10.7641/CTA.2023.20667
    [5] Zühlke M M, Kudenko D. Adversarial robustness of neural networks from the perspective of lipschitz calculus: A survey. ACM Computing Surveys, 2025, 57(6): 1−41 doi: 10.1145/3648351
    [6] Wu T, Wang X C, Qiao S J, Zhang Y M, Xian X P, Liu Y B, et al. Small perturbations are enough: Adversarial attacks on time series prediction. Information Sciences, 2022, 587: 794−812 doi: 10.1016/j.ins.2021.11.007
    [7] Jiang Q C, Fan S H, Zhu Z Y, Hou Z X, Zhong W M, Tan L, et al. Adversarial attacks on industrial soft sensors: Multi-target attacks based on diffusion models. Information Sciences, 2025, 725: Article No. 122732
    [8] Yin Z Q, Zhuo Y, Ge Z Q. Transfer adversarial attacks across industrial intelligent systems. Reliability Engineering & System Safety, 2023, 237: Article No. 109299 doi: 10.1016/j.ress.2023.109299
    [9] Kong X Y, Ge Z Q. Adversarial attacks on neural-network-based soft sensors: Directly attack output. IEEE Transactions on Industrial Informatics, 2022, 18(4): 2443−2451 doi: 10.1109/TII.2021.3093386
    [10] Chen L, Zhu Q X, He Y L. Adversarial attacks for neural network-based industrial soft sensors: Mirror output attack and translation mirror output attack. IEEE Transactions on Industrial Informatics, 2024, 20(2): 2378−2386 doi: 10.1109/TII.2023.3291717
    [11] Jiang Q C, Fan S H, Zhu Z Y, Hou Z X, Zhong W M, Tan L, et al. Adversarial attacks on industrial soft sensors: Multi-target attacks based on diffusion models. Information Sciences, 2026, 725: Article No. 122732 doi: 10.1016/j.ins.2025.122732
    [12] Zhou X L, Wu O, Yang N. Adversarial training with anti-adversaries. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 46(12): 10210−10227 doi: 10.1109/TPAMI.2024.3432973
    [13] Zhao S J, Wang X Z, Wei X X. Mitigating accuracy-robustness trade-off via balanced multi-teacher adversarial distillation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 46(12): 9338−9352 doi: 10.1109/TPAMI.2024.3416308
    [14] Guo R Y, Liu H, Liu D. When deep learning-based soft sensors encounter reliability challenges: A practical knowledge-guided adversarial attack and its defense. IEEE Transactions on Industrial Informatics, 2024, 20(2): 2702−2714 doi: 10.1109/TII.2023.3297663
    [15] Guo R Y, Chen Q Y, Tong S, Liu H. Knowledge-aided generative adversarial network: A transfer gradient-less adversarial attack for deep learning-based soft sensors. In: Proceedings of the 2024 14th Asian Control Conference (ASCC). Dalian, China: IEEE, 2024. 1254-1259
    [16] Xie Y F, Wang J, Xie S W, Chen X F. Adversarial training-based deep layer-wise probabilistic network for enhancing soft sensor modeling of industrial processes. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2023, 54(2): 972−984 doi: 10.1109/tsmc.2023.3322195
    [17] Guo R Y, Chen Q Y, Liu H, Wang W Q. Adversarial robustness enhancement for deep learning-based soft sensors: An adversarial training strategy using historical gradients and domain adaptation. Sensors, 2024, 24(12): Article No. 3909 doi: 10.3390/s24123909
    [18] Liu C, Huang Z C, Salzmann M, Zhang T, Süsstrunk S. On the impact of hard adversarial instances on overfitting in adversarial training. Journal of Machine Learning Research, 2024, 25(356): 1−46
    [19] Chen Y, Li X D, Hu P, Peng D Z, Wang X. Diffilter: Defending against adversarial perturbations with diffusion filter. IEEE Transactions on Information Forensics and Security, 2024, 19: 6779−6794 doi: 10.1109/TIFS.2024.3422923
    [20] Liu H, Zhao B, Zhang K H, Liu P. Nowhere to hide: A lightweight unsupervised detector against adversarial examples. arXiv preprint arXiv: 2210.08579, 2022.
    [21] Liu H, Zhao B, Guo J B, Zhang K H, Liu P. A lightweight unsupervised adversarial detector based on autoencoder and isolation forest. Pattern Recognition, 2024, 147: Article No. 110127 doi: 10.1016/j.patcog.2023.110127
    [22] Mumcu F, Yilmaz Y. Universal and efficient detection of adversarial data through nonuniform impact on network layers. arXiv preprint arXiv: 2506.20816, 2025.
    [23] Guo R Y, Li A L, Liu H. An adversarial attack detection method based on bidirectional consistency discrimination for deep learning-based soft sensors. In: Proceedings of the 2025 CAA Symposium on Fault Detection, Supervision and Safety for Technical Processes. Urumqi, China: IEEE, 2025. 1-6
    [24] Yin X W, Kolouri S, Rohde G K. GAT: Generative adversarial training for adversarial example detection and robust classification. arXiv preprint arXiv: 1905.11475, 2022.
    [25] Chen E C, Lee C R. Data filtering for efficient adversarial training. Pattern Recognition, 2024, 151: Article No. 110394 doi: 10.1016/j.patcog.2024.110394
    [26] Hong S, An N, Cho H, Lim J, Han I S, Moon I, Kim J. A dynamic soft sensor based on hybrid neural networks to improve early off-spec detection. Engineering with Computers, 2023, 39(4): 3011−3021 doi: 10.1007/s00366-022-01694-7
    [27] Guo R Y, Liu H. Semisupervised dynamic soft sensor based on complementary ensemble empirical mode decomposition and deep learning. Measurement, 2021, 183: Article No. 109788 doi: 10.1016/j.measurement.2021.109788
    [28] 吴英俊, 汝英涛, 刘锦涛, 施展宇, 顾松, 倪明. 基于集员滤波的自动发电控制系统虚假数据注入攻击检测. 电力系统自动化, 2022, 46(1): 33−41 doi: 10.7500/AEPS20210525006

    Wu Ying-Jun, Ru Ying-Tao, Liu Jin-Tao, Shi Zhan-Yu, Gu Song, Ni Ming. False data injection attack detection for automatic generation control system based on set-membership filtering. Automation of Electric Power Systems, 2022, 46(1): 33−41 doi: 10.7500/AEPS20210525006
    [29] Guo W, Liu F, Wang Y, Sidorov D, Wu J. Adaptive event-triggered sliding mode load frequency control for cyber-physical power systems under false data injection attacks. IEEE Transactions on Industrial Informatics, 2024, 21(4): 2947−2956 doi: 10.1109/ccdc52312.2021.9602783
    [30] 刘丁, 刘涵. 电站锅炉空气预热器控制方法及应用. 北京: 科学出版社, 2016.

    Liu Ding, Liu Han. Control Methods and Applications of Air Preheater for Power Plant Boilers. Beijing: Science Press, 2016.
  • 加载中
计量
  • 文章访问数:  5
  • HTML全文浏览量:  4
  • 被引次数: 0
出版历程
  • 收稿日期:  2025-12-16
  • 录用日期:  2026-04-20
  • 网络出版日期:  2026-06-22

目录

    /

    返回文章
    返回