• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于深度学习的电力工业物联网业务数据交互中防止数据泄露的方法。

A Deep Learning-Based Method for Preventing Data Leakage in Electric Power Industrial Internet of Things Business Data Interactions.

作者信息

Miao Weiwei, Zhao Xinjian, Zhang Yinzhao, Chen Shi, Li Xiaochao, Li Qianmu

机构信息

State Grid Jiangsu Electric Power Co., Ltd., Information & Telecommunication Branch, Nanjing 210024, China.

School of Cyber Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China.

出版信息

Sensors (Basel). 2024 Jun 22;24(13):4069. doi: 10.3390/s24134069.

DOI:10.3390/s24134069
PMID:39000847
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11243995/
Abstract

In the development of the Power Industry Internet of Things, the security of data interaction has always been an important challenge. In the power-based blockchain Industrial Internet of Things, node data interaction involves a large amount of sensitive data. In the current anti-leakage strategy for power business data interaction, regular expressions are used to identify sensitive data for matching. This approach is only suitable for simple structured data. For the processing of unstructured data, there is a lack of practical matching strategies. Therefore, this paper proposes a deep learning-based anti-leakage method for power business data interaction, aiming to ensure the security of power business data interaction between the State Grid business platform and third-party platforms. This method combines named entity recognition technologies and comprehensively uses regular expressions and the DeBERTa (Decoding-enhanced BERT with disentangled attention)-BiLSTM (Bidirectional Long Short-Term Memory)-CRF (Conditional Random Field) model. This method is based on the DeBERTa (Decoding-enhanced BERT with disentangled attention) model for pre-training feature extraction. It extracts sequence context semantic features through the BiLSTM, and finally obtains the global optimal through the CRF layer tag sequence. Sensitive data matching is performed on interactive structured and unstructured data to identify privacy-sensitive information in the power business. The experimental results show that the F1 score of the proposed method in this paper for identifying sensitive data entities using the CLUENER 2020 dataset reaches 81.26%, which can effectively prevent the risk of power business data leakage and provide innovative solutions for the power industry to ensure data security.

摘要

在电力行业物联网的发展过程中,数据交互安全一直是一项重大挑战。在基于电力的区块链工业物联网中,节点数据交互涉及大量敏感数据。在当前电力业务数据交互的防泄漏策略中,使用正则表达式来识别敏感数据进行匹配。这种方法仅适用于简单结构化数据。对于非结构化数据的处理,缺乏实用的匹配策略。因此,本文提出一种基于深度学习的电力业务数据交互防泄漏方法,旨在确保国家电网业务平台与第三方平台之间电力业务数据交互的安全性。该方法结合命名实体识别技术,综合运用正则表达式以及DeBERTa(带解缠注意力的解码增强型BERT)-BiLSTM(双向长短期记忆)-CRF(条件随机场)模型。此方法基于DeBERTa(带解缠注意力的解码增强型BERT)模型进行预训练特征提取,通过BiLSTM提取序列上下文语义特征,最后经CRF层获得全局最优标签序列。对交互式结构化和非结构化数据进行敏感数据匹配,以识别电力业务中的隐私敏感信息。实验结果表明,本文所提方法在使用CLUENER 2020数据集识别敏感数据实体时的F1分数达到81.26%,能够有效防范电力业务数据泄漏风险,为电力行业保障数据安全提供创新解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c6/11243995/cbca88b1c2ef/sensors-24-04069-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c6/11243995/59e632b7eb27/sensors-24-04069-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c6/11243995/4934ee130bcf/sensors-24-04069-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c6/11243995/5089100c0630/sensors-24-04069-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c6/11243995/5a3f6d093974/sensors-24-04069-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c6/11243995/cbca88b1c2ef/sensors-24-04069-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c6/11243995/59e632b7eb27/sensors-24-04069-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c6/11243995/4934ee130bcf/sensors-24-04069-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c6/11243995/5089100c0630/sensors-24-04069-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c6/11243995/5a3f6d093974/sensors-24-04069-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c6/11243995/cbca88b1c2ef/sensors-24-04069-g005.jpg

相似文献

1
A Deep Learning-Based Method for Preventing Data Leakage in Electric Power Industrial Internet of Things Business Data Interactions.一种基于深度学习的电力工业物联网业务数据交互中防止数据泄露的方法。
Sensors (Basel). 2024 Jun 22;24(13):4069. doi: 10.3390/s24134069.
2
Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.从中文电子病历中提取垂体腺瘤的临床命名实体。
BMC Med Inform Decis Mak. 2022 Mar 23;22(1):72. doi: 10.1186/s12911-022-01810-z.
3
A Lightweight Hybrid Deep Learning Privacy Preserving Model for FC-Based Industrial Internet of Medical Things.基于 FC 的工业医疗物联网的轻量级混合深度学习隐私保护模型。
Sensors (Basel). 2022 Mar 9;22(6):2112. doi: 10.3390/s22062112.
4
Research on named entity recognition of adverse drug reactions based on NLP and deep learning.基于自然语言处理和深度学习的药物不良反应命名实体识别研究
Front Pharmacol. 2023 Jun 1;14:1121796. doi: 10.3389/fphar.2023.1121796. eCollection 2023.
5
Data governance and Gensini score automatic calculation for coronary angiography with deep-learning-based natural language extraction.基于深度学习的自然语言提取的冠状动脉造影数据治理和 Gensini 评分自动计算。
Math Biosci Eng. 2024 Feb 23;21(3):4085-4103. doi: 10.3934/mbe.2024180.
6
DeBERTa-BiLSTM: A multi-label classification model of Arabic medical questions using pre-trained models and deep learning.基于预训练模型和深度学习的阿拉伯文医学问题多标签分类模型:DeBERTa-BiLSTM
Comput Biol Med. 2024 Mar;170:107921. doi: 10.1016/j.compbiomed.2024.107921. Epub 2024 Jan 4.
7
Clinical named entity recognition for percutaneous coronary intervention surgical information with hybrid neural network.基于混合神经网络的经皮冠状动脉介入手术信息的临床命名实体识别。
Rev Sci Instrum. 2024 Jun 1;95(6). doi: 10.1063/5.0174442.
8
Adversarial active learning for the identification of medical concepts and annotation inconsistency.对抗式主动学习在医学概念识别和标注不一致性中的应用。
J Biomed Inform. 2020 Aug;108:103481. doi: 10.1016/j.jbi.2020.103481. Epub 2020 Jul 18.
9
A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records.基于词性和自匹配注意力的深度学习模型在中文电子病历命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):65. doi: 10.1186/s12911-019-0762-7.
10
Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training.基于词汇特征的 BiLSTM-CRF 和三训练的中药不良事件报告命名实体识别。
J Biomed Inform. 2019 Aug;96:103252. doi: 10.1016/j.jbi.2019.103252. Epub 2019 Jul 16.

引用本文的文献

1
Knowledge-Graph-Driven Fault Diagnosis Methods for Intelligent Production Lines.面向智能生产线的知识图谱驱动故障诊断方法
Sensors (Basel). 2025 Jun 23;25(13):3912. doi: 10.3390/s25133912.

本文引用的文献

1
An attention-based BiLSTM-CRF approach to document-level chemical named entity recognition.基于注意力机制的 BiLSTM-CRF 方法在文档级化学命名实体识别中的应用。
Bioinformatics. 2018 Apr 15;34(8):1381-1388. doi: 10.1093/bioinformatics/btx761.