Suppr超能文献

管理压力钻井领域中的中文少样本命名实体识别与知识图谱构建

Chinese Few-Shot Named Entity Recognition and Knowledge Graph Construction in Managed Pressure Drilling Domain.

作者信息

Wei Siqing, Liang Yanchun, Li Xiaoran, Weng Xiaohui, Fu Jiasheng, Han Xiaosong

机构信息

Key Laboratory for Symbol Computation and Knowledge Engineering of National Education Ministry, College of Computer Science and Technology, Jilin University, Changchun 130012, China.

Zhuhai Laboratory of Key Laboratory for Symbol Computation and Knowledge Engineering of Ministry of Education, Zhuhai College of Science and Technology, Zhuhai 519041, China.

出版信息

Entropy (Basel). 2023 Jul 22;25(7):1097. doi: 10.3390/e25071097.

Abstract

Managed pressure drilling (MPD) is the most effective means to ensure drilling safety, and MPD is able to avoid further deterioration of complex working conditions through precise control of the wellhead back pressure. The key to the success of MPD is the well control strategy, which currently relies heavily on manual experience, hindering the automation and intelligence process of well control. In response to this issue, an MPD knowledge graph is constructed in this paper that extracts knowledge from published papers and drilling reports to guide well control. In order to improve the performance of entity extraction in the knowledge graph, a few-shot Chinese entity recognition model CEntLM-KL is extended from the EntLM model, in which the KL entropy is built to improve the accuracy of entity recognition. Through experiments on benchmark datasets, it has been shown that the proposed model has a significant improvement compared to the state-of-the-art methods. On the few-shot drilling datasets, the F-1 score of entity recognition reaches 33%. Finally, the knowledge graph is stored in Neo4J and applied for knowledge inference.

摘要

控压钻井(MPD)是确保钻井安全的最有效手段,并且控压钻井能够通过精确控制井口回压来避免复杂工况的进一步恶化。控压钻井成功的关键在于井控策略,目前该策略严重依赖人工经验,这阻碍了井控的自动化和智能化进程。针对这一问题,本文构建了一个控压钻井知识图谱,该图谱从已发表的论文和钻井报告中提取知识以指导井控。为了提高知识图谱中实体提取的性能,在EntLM模型的基础上扩展了一个少样本中文实体识别模型CEntLM-KL,其中构建了KL熵以提高实体识别的准确性。通过在基准数据集上的实验表明,与现有方法相比,所提出的模型有显著改进。在少样本钻井数据集上,实体识别的F-1分数达到33%。最后,将知识图谱存储在Neo4J中并应用于知识推理。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f255/10378751/61676fbb59cd/entropy-25-01097-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验