• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于语义图的生物医学文摘方法。

A semantic graph-based approach to biomedical summarisation.

机构信息

Departamento de Ingeniería del Software e Inteligencia Artificial, Universidad Complutense de Madrid, C/Profesor José García Santesmases, Spain.

出版信息

Artif Intell Med. 2011 Sep;53(1):1-14. doi: 10.1016/j.artmed.2011.06.005. Epub 2011 Jul 12.

DOI:10.1016/j.artmed.2011.06.005
PMID:21752612
Abstract

OBJECTIVE

Access to the vast body of research literature that is available in biomedicine and related fields may be improved by automatic summarisation. This paper presents a method for summarising biomedical scientific literature that takes into consideration the characteristics of the domain and the type of documents.

METHODS

To address the problem of identifying salient sentences in biomedical texts, concepts and relations derived from the Unified Medical Language System (UMLS) are arranged to construct a semantic graph that represents the document. A degree-based clustering algorithm is then used to identify different themes or topics within the text. Different heuristics for sentence selection, intended to generate different types of summaries, are tested. A real document case is drawn up to illustrate how the method works.

RESULTS

A large-scale evaluation is performed using the recall-oriented understudy for gisting-evaluation (ROUGE) metrics. The results are compared with those achieved by three well-known summarisers (two research prototypes and a commercial application) and two baselines. Our method significantly outperforms all summarisers and baselines. The best of our heuristics achieves an improvement in performance of almost 7.7 percentage units in the ROUGE-1 score over the LexRank summariser (0.7862 versus 0.7302). A qualitative analysis of the summaries also shows that our method succeeds in identifying sentences that cover the main topic of the document and also considers other secondary or "satellite" information that might be relevant to the user.

CONCLUSION

The method proposed is proved to be an efficient approach to biomedical literature summarisation, which confirms that the use of concepts rather than terms can be very useful in automatic summarisation, especially when dealing with highly specialised domains.

摘要

目的

通过自动摘要,可以提高对生物医学和相关领域中大量研究文献的访问。本文提出了一种考虑领域特征和文档类型的生物医学科学文献摘要方法。

方法

为了解决在生物医学文本中识别重要句子的问题,从统一医学语言系统(UMLS)中提取概念和关系,构建表示文档的语义图。然后使用基于度的聚类算法来识别文本中的不同主题或话题。测试了不同的句子选择启发式方法,旨在生成不同类型的摘要。通过实际文档案例来说明该方法的工作原理。

结果

使用召回导向的文本摘用评估(ROUGE)指标进行了大规模评估。将结果与三种知名摘要器(两个研究原型和一个商业应用程序)和两个基线进行比较。我们的方法明显优于所有的摘要器和基线。我们的最佳启发式方法在 ROUGE-1 得分上比 LexRank 摘要器提高了近 7.7 个百分点(0.7862 对 0.7302)。对摘要的定性分析也表明,我们的方法成功地识别了涵盖文档主要主题的句子,并考虑了对用户可能相关的其他次要或“卫星”信息。

结论

所提出的方法被证明是一种有效的生物医学文献摘要方法,这证实了在自动摘要中使用概念而不是术语可能非常有用,特别是在处理高度专业化的领域时。

相似文献

1
A semantic graph-based approach to biomedical summarisation.基于语义图的生物医学文摘方法。
Artif Intell Med. 2011 Sep;53(1):1-14. doi: 10.1016/j.artmed.2011.06.005. Epub 2011 Jul 12.
2
Graph-based biomedical text summarization: An itemset mining and sentence clustering approach.基于图的生物医学文本摘要:一种基于项集挖掘和句子聚类的方法。
J Biomed Inform. 2018 Aug;84:42-58. doi: 10.1016/j.jbi.2018.06.005. Epub 2018 Jun 15.
3
The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text.自然语言处理中领域知识与语言结构的相互作用:解读生物医学文本中的上位命题
J Biomed Inform. 2003 Dec;36(6):462-77. doi: 10.1016/j.jbi.2003.11.003.
4
UMLS knowledge for biomedical language processing.用于生物医学语言处理的统一医学语言系统知识。
Bull Med Libr Assoc. 1993 Apr;81(2):184-94.
5
A knowledge-driven approach to biomedical document conceptualization.基于知识的生物医学文献概念化方法。
Artif Intell Med. 2010 Jun;49(2):67-78. doi: 10.1016/j.artmed.2010.02.005. Epub 2010 Apr 3.
6
A hybrid method for relation extraction from biomedical literature.一种从生物医学文献中提取关系的混合方法。
Int J Med Inform. 2006 Jun;75(6):443-55. doi: 10.1016/j.ijmedinf.2005.06.010. Epub 2005 Aug 10.
7
Exploring supervised and unsupervised methods to detect topics in biomedical text.探索用于检测生物医学文本主题的监督和无监督方法。
BMC Bioinformatics. 2006 Mar 16;7:140. doi: 10.1186/1471-2105-7-140.
8
CIBS: A biomedical text summarizer using topic-based sentence clustering.CIBS:一种基于主题的句子聚类的生物医学文本摘要器。
J Biomed Inform. 2018 Dec;88:53-61. doi: 10.1016/j.jbi.2018.11.006. Epub 2018 Nov 13.
9
Consistency across the hierarchies of the UMLS Semantic Network and Metathesaurus.美国国立医学图书馆医学主题词表语义网络和元词表各层次之间的一致性。
J Biomed Inform. 2003 Dec;36(6):450-61. doi: 10.1016/j.jbi.2003.11.001.
10
A concept-driven biomedical knowledge extraction and visualization framework for conceptualization of text corpora.面向文本语料概念化的概念驱动生物医学知识提取和可视化框架。
J Biomed Inform. 2010 Dec;43(6):1020-35. doi: 10.1016/j.jbi.2010.09.008. Epub 2010 Sep 24.

引用本文的文献

1
The power of graphs in medicine: Introducing BioGraphSum for effective text summarization.医学中图表的力量:引入BioGraphSum进行有效的文本摘要。
Heliyon. 2024 May 27;10(11):e31813. doi: 10.1016/j.heliyon.2024.e31813. eCollection 2024 Jun 15.
2
Biomedical semantic text summarizer.生物医学语义文本摘要器。
BMC Bioinformatics. 2024 Apr 16;25(1):152. doi: 10.1186/s12859-024-05712-x.
3
Federated learning-based AI approaches in smart healthcare: concepts, taxonomies, challenges and open issues.智能医疗保健中基于联邦学习的人工智能方法:概念、分类、挑战与开放问题。
Cluster Comput. 2022 Aug 17:1-41. doi: 10.1007/s10586-022-03658-4.
4
Beyond opinion classification: Extracting facts, opinions and experiences from health forums.超越观点分类:从健康论坛中提取事实、观点和经验。
PLoS One. 2019 Jan 9;14(1):e0209961. doi: 10.1371/journal.pone.0209961. eCollection 2019.
5
Classification of clinically useful sentences in clinical evidence resources.临床证据资源中临床有用句子的分类。
J Biomed Inform. 2016 Apr;60:14-22. doi: 10.1016/j.jbi.2016.01.003. Epub 2016 Jan 13.
6
Feature engineering for MEDLINE citation categorization with MeSH.使用医学主题词表(MeSH)进行医学文献数据库(MEDLINE)引文分类的特征工程
BMC Bioinformatics. 2015 Apr 8;16:113. doi: 10.1186/s12859-015-0539-7.
7
Figure-associated text summarization and evaluation.与图相关的文本总结与评估。
PLoS One. 2015 Feb 2;10(2):e0115671. doi: 10.1371/journal.pone.0115671. eCollection 2015.
8
Learning to rank-based gene summary extraction.基于学习排序的基因汇总提取。
BMC Bioinformatics. 2014;15 Suppl 12(Suppl 12):S10. doi: 10.1186/1471-2105-15-S12-S10. Epub 2014 Nov 6.
9
Text summarization in the biomedical domain: a systematic review of recent research.生物医学领域的文本摘要:近期研究的系统综述
J Biomed Inform. 2014 Dec;52:457-67. doi: 10.1016/j.jbi.2014.06.009. Epub 2014 Jul 10.
10
Mapping biological entities using the longest approximately common prefix method.使用最长近似公共前缀方法对生物实体进行映射。
BMC Bioinformatics. 2014 Jun 14;15:187. doi: 10.1186/1471-2105-15-187.