Suppr超能文献

提高抽取式文档摘要的句子中心度。

Improving extractive document summarization with sentence centrality.

机构信息

School of Information Science and Electrical Engineering, Shandong Jiaotong University, Jinan, Shandong Province, China.

School of Information Science and Electrical Engineering, Northwest University, Xian, Shanxi Province, China.

出版信息

PLoS One. 2022 Jul 22;17(7):e0268278. doi: 10.1371/journal.pone.0268278. eCollection 2022.

Abstract

Extractive document summarization (EDS) is usually seen as a sequence labeling task, which extracts sentences from a document one by one to form a summary. However, extracting sentences separately ignores the relationship between the sentences and documents. One solution is to use sentence position information to enhance sentence representation, but this will cause the sentence-leading bias problem, especially in news datasets. In this paper, we propose a novel sentence centrality for the EDS task to address these two problems. The sentence centrality is based on directed graphs, while reflecting the sentence-document relationship, it also reflects the sentence position information in the document. We implicitly strengthen the relevance of sentences and documents by using sentence centrality to enhance sentence representation. Notably, we replaced the sentence position information with sentence centrality to reduce sentence-leading bias without causing model performance degradation. Experiments on the CNN/Daily Mail dataset showed that EDS models with sentence centrality significantly improved compared with baseline models.

摘要

抽取式文档摘要(EDS)通常被视为序列标记任务,它从文档中逐句提取句子以形成摘要。然而,逐句提取忽略了句子和文档之间的关系。一种解决方案是使用句子位置信息来增强句子表示,但这会导致句子领先偏见问题,尤其是在新闻数据集上。在本文中,我们提出了一种新颖的句子中心度,用于解决这两个问题。句子中心度基于有向图,在反映句子-文档关系的同时,也反映了句子在文档中的位置信息。我们通过使用句子中心度来增强句子表示,隐式地加强了句子和文档的相关性。值得注意的是,我们用句子中心度代替句子位置信息,以减少句子领先偏见,而不会导致模型性能下降。在 CNN/Daily Mail 数据集上的实验表明,使用句子中心度的 EDS 模型与基线模型相比有显著的改进。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15aa/9307201/ee9f4748428a/pone.0268278.g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验