• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

追求抽取式摘要方法的最高ROUGE分数。

Reaching for upper bound ROUGE score of extractive summarization methods.

作者信息

Akhmetov Iskander, Mussabayev Rustam, Gelbukh Alexander

机构信息

Kazakh-British Technical University, Almaty, Almaty, Kazakhstan.

Institute of Information and Computational Technologies, Almaty, Almaty, Kazakhstan.

出版信息

PeerJ Comput Sci. 2022 Sep 26;8:e1103. doi: 10.7717/peerj-cs.1103. eCollection 2022.

DOI:10.7717/peerj-cs.1103
PMID:36262160
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9575858/
Abstract

The extractive text summarization (ETS) method for finding the salient information from a text automatically uses the exact sentences from the source text. In this article, we answer the question of what quality of a summary we can achieve with ETS methods? To maximize the ROUGE-1 score, we used five approaches: (1) adapted reduced variable neighborhood search (RVNS), (2) Greedy algorithm, (3) VNS initialized by Greedy algorithm results, (4) genetic algorithm, and (5) genetic algorithm initialized by the Greedy algorithm results. Furthermore, we ran experiments on articles from the arXive dataset. As a result, we found 0.59 and 0.25 scores for ROUGE-1 and ROUGE-2, respectively achievable by the approach, where the genetic algorithm initialized by the Greedy algorithm results, which happens to yield the best results out of the tested approaches. Moreover, those scores appear to be higher than scores obtained by the current state-of-the-art text summarization models: the best score in the literature for ROUGE-1 on the same data set is 0.46. Therefore, we have room for the development of ETS methods, which are now undeservedly forgotten.

摘要

用于从文本中自动提取显著信息的抽取式文本摘要(ETS)方法会直接使用源文本中的精确句子。在本文中,我们回答了一个问题:使用ETS方法能获得何种质量的摘要?为了使ROUGE - 1得分最大化,我们采用了五种方法:(1)自适应简化可变邻域搜索(RVNS),(2)贪心算法,(3)由贪心算法结果初始化的VNS,(4)遗传算法,以及(5)由贪心算法结果初始化的遗传算法。此外,我们在arXive数据集中的文章上进行了实验。结果发现,由贪心算法结果初始化的遗传算法这种方法分别能实现0.59和0.25的ROUGE - 1和ROUGE - 2得分,该方法在所有测试方法中恰好产生了最佳结果。而且,这些得分似乎高于当前最先进的文本摘要模型所获得的得分:在同一数据集上,文献中ROUGE - 1的最佳得分是0.46。因此,我们有发展ETS方法的空间,而目前这些方法被无端地遗忘了。

相似文献

1
Reaching for upper bound ROUGE score of extractive summarization methods.追求抽取式摘要方法的最高ROUGE分数。
PeerJ Comput Sci. 2022 Sep 26;8:e1103. doi: 10.7717/peerj-cs.1103. eCollection 2022.
2
CERC: an interactive content extraction, recognition, and construction tool for clinical and biomedical text.CERC:一个用于临床和生物医学文本的交互式内容提取、识别和构建工具。
BMC Med Inform Decis Mak. 2020 Dec 15;20(Suppl 14):306. doi: 10.1186/s12911-020-01330-8.
3
Extractive single document summarization using binary differential evolution: Optimization of different sentence quality measures.基于二进制差分进化的抽取式单文档摘要:不同句子质量度量的优化。
PLoS One. 2019 Nov 14;14(11):e0223477. doi: 10.1371/journal.pone.0223477. eCollection 2019.
4
Extractive summarization of clinical trial descriptions.临床试验描述的抽取式总结。
Int J Med Inform. 2019 Sep;129:114-121. doi: 10.1016/j.ijmedinf.2019.05.019. Epub 2019 May 30.
5
Single document text summarization addressed with a cat swarm optimization approach.基于猫群优化算法的单文档文本摘要
Appl Intell (Dordr). 2023;53(10):12268-12287. doi: 10.1007/s10489-022-04149-0. Epub 2022 Sep 24.
6
CovSumm: an unsupervised transformer-cum-graph-based hybrid document summarization model for CORD-19.CovSumm:一种用于CORD-19的基于无监督Transformer和图的混合文档摘要模型。
J Supercomput. 2023 Apr 26:1-23. doi: 10.1007/s11227-023-05291-3.
7
Exploiting Intersentence Information for Better Question-Driven Abstractive Summarization: Algorithm Development and Validation.利用句间信息实现更好的问题驱动摘要生成:算法开发与验证
JMIR Med Inform. 2022 Aug 15;10(8):e38052. doi: 10.2196/38052.
8
Graph-based extractive text summarization method for Hausa text.基于图的豪萨文本抽取式文本摘要方法。
PLoS One. 2023 May 9;18(5):e0285376. doi: 10.1371/journal.pone.0285376. eCollection 2023.
9
A study of extractive summarization of long documents incorporating local topic and hierarchical information.一项结合局部主题和层次信息的长文档抽取式摘要研究。
Sci Rep. 2024 May 2;14(1):10140. doi: 10.1038/s41598-024-60779-z.
10
Extractive text summarization model based on advantage actor-critic and graph matrix methodology.基于优势行动者-评论者和图矩阵方法的抽取式文本摘要模型。
Math Biosci Eng. 2023 Jan;20(1):1488-1504. doi: 10.3934/mbe.2023067. Epub 2022 Oct 31.

引用本文的文献

1
Multi-granularity adaptive extractive document summarization with heterogeneous graph neural networks.基于异构图神经网络的多粒度自适应抽取式文档摘要
PeerJ Comput Sci. 2023 Dec 13;9:e1737. doi: 10.7717/peerj-cs.1737. eCollection 2023.

本文引用的文献

1
Beyond SumBasic: Task-focused summarization with sentence simplification and lexical expansion.超越SumBasic:通过句子简化和词汇扩展实现以任务为重点的摘要。
Inf Process Manag. 2007 Nov;43(6):1606-1618. doi: 10.1016/j.ipm.2007.01.023. Epub 2007 Apr 19.