N-GPETS: Neural Attention Graph-Based Pretrained Statistical Model for Extractive Text Summarization.

Affiliations

Department of Computer Science, City University of Science and Information Technology, Peshawar 25000, Pakistan.

Department of Computer Science, Islamia College, Peshawar 25000, Pakistan.

Publication Information

Comput Intell Neurosci. 2022 Nov 22;2022:6241373. doi: 10.1155/2022/6241373. eCollection 2022.

Abstract

The extractive summarization approach builds a summary by selecting the salient sentences of the source document. One of the most important aspects of extractive summarization is learning and modelling cross-sentence associations. Inspired by the success of the Transformer-based Bidirectional Encoder Representations (BERT) pretrained language model and of the graph attention network (GAT), whose structure is well suited to capturing intersentence associations, this work proposes a novel neural model, N-GPETS, which combines a heterogeneous graph attention network with the BERT model and a statistical TF-IDF weighting for the extractive summarization task. Apart from sentence nodes, N-GPETS also works with semantic word nodes of varying granularity that serve as links between sentences, improving intersentence interaction. Furthermore, N-GPETS is made more feature-rich by coupling the graph layer with a BERT encoder at the graph initialization step, rather than employing other neural network encoders such as a CNN or an LSTM. To the best of our knowledge, this work is the first attempt to combine a BERT encoder and document-level TF-IDF values with a heterogeneous attention graph structure for the extractive summarization task. Empirical results on the benchmark CNN/DM news data set show that N-GPETS compares favorably with other heterogeneous graph structures that employ the BERT model and with graph structures without the BERT model.
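The statistical component described above assigns TF-IDF weights to the links between sentence nodes and word nodes at graph initialization. The paper does not reproduce its code here, so the following is only an illustrative sketch under common assumptions (the function name `tfidf_edges`, the whitespace tokenization, and the smoothed-IDF variant are all assumptions, not the authors' implementation):

```python
import math
from collections import Counter

def tfidf_edges(sentences):
    """Sketch of TF-IDF edge weighting for a heterogeneous
    sentence-word graph: each sentence is treated as a document,
    and each (sentence, word) edge is weighted by tf * idf."""
    docs = [s.lower().split() for s in sentences]  # naive tokenization
    n = len(docs)
    # Document frequency: number of sentences containing each word.
    df = Counter()
    for d in docs:
        for w in set(d):
            df[w] += 1
    edges = {}
    for i, d in enumerate(docs):
        tf = Counter(d)
        for w, count in tf.items():
            idf = math.log(n / df[w]) + 1.0  # smoothed IDF variant
            edges[(i, w)] = (count / len(d)) * idf
    return edges

# Words shared across sentences (e.g. "the") get a lower IDF and thus
# a lower edge weight than words unique to one sentence.
edges = tfidf_edges(["the cat sat", "the dog ran"])
```

In a full model, these edge weights would modulate the attention scores between word and sentence nodes, while BERT (rather than a CNN or LSTM) supplies the initial sentence-node features.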


