• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从生物医学文献中挖掘有影响力的发现。

Mining impactful discoveries from the biomedical literature.

机构信息

Adapt Centre and School of Computer Science and Statistics, Trinity College Dublin, Dublin, Ireland.

School of Medicine, Trinity College Dublin, Dublin, Ireland.

出版信息

BMC Bioinformatics. 2024 Sep 16;25(1):303. doi: 10.1186/s12859-024-05881-9.

DOI:10.1186/s12859-024-05881-9
PMID:39285337
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11403870/
Abstract

BACKGROUND

Literature-based discovery (LBD) aims to help researchers to identify relations between concepts which are worthy of further investigation by text-mining the biomedical literature. While the LBD literature is rich and the field is considered mature, standard practice in the evaluation of LBD methods is methodologically poor and has not progressed on par with the domain. The lack of properly designed and decent-sized benchmark dataset hinders the progress of the field and its development into applications usable by biomedical experts.

RESULTS

This work presents a method for mining past discoveries from the biomedical literature. It leverages the impact made by a discovery, using descriptive statistics to detect surges in the prevalence of a relation across time. The validity of the method is tested against a baseline representing the state-of-the-art "time-sliced" method.

CONCLUSIONS

This method allows the collection of a large amount of time-stamped discoveries. These can be used for LBD evaluation, alleviating the long-standing issue of inadequate evaluation. It might also pave the way for more fine-grained LBD methods, which could exploit the diversity of these past discoveries to train supervised models. Finally the dataset (or some future version of it inspired by our method) could be used as a methodological tool for systematic reviews. We provide an online exploration tool in this perspective, available at https://brainmend.adaptcentre.ie/ .

摘要

背景

基于文献的发现(LBD)旨在通过对生物医学文献进行文本挖掘,帮助研究人员识别值得进一步研究的概念之间的关系。尽管 LBD 文献丰富,并且该领域被认为已经成熟,但 LBD 方法的评估标准实践在方法上存在缺陷,并没有与该领域同步发展。缺乏经过适当设计和具有一定规模的基准数据集阻碍了该领域的发展及其向可被生物医学专家使用的应用程序的发展。

结果

这项工作提出了一种从生物医学文献中挖掘过去发现的方法。它利用发现的影响力,使用描述性统计来检测一段时间内关系的流行度的变化。该方法的有效性通过基线进行测试,基线代表了最先进的“时间切片”方法。

结论

该方法允许收集大量带有时间戳的发现。这些发现可用于 LBD 评估,从而缓解评估不足的长期问题。它也可能为更细粒度的 LBD 方法铺平道路,这些方法可以利用这些过去发现的多样性来训练监督模型。最后,该数据集(或受我们方法启发的未来版本)可以作为系统综述的方法工具。我们提供了一个在线探索工具,可在 https://brainmend.adaptcentre.ie/ 上使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b574/11403870/a9ffb22c4d05/12859_2024_5881_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b574/11403870/8e02c2cf4aa6/12859_2024_5881_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b574/11403870/c6a1d30cee16/12859_2024_5881_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b574/11403870/aa069614c022/12859_2024_5881_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b574/11403870/168aae6e1352/12859_2024_5881_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b574/11403870/a9ffb22c4d05/12859_2024_5881_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b574/11403870/8e02c2cf4aa6/12859_2024_5881_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b574/11403870/c6a1d30cee16/12859_2024_5881_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b574/11403870/aa069614c022/12859_2024_5881_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b574/11403870/168aae6e1352/12859_2024_5881_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b574/11403870/a9ffb22c4d05/12859_2024_5881_Fig5_HTML.jpg

相似文献

1
Mining impactful discoveries from the biomedical literature.从生物医学文献中挖掘有影响力的发现。
BMC Bioinformatics. 2024 Sep 16;25(1):303. doi: 10.1186/s12859-024-05881-9.
2
Combining Literature Mining and Machine Learning for Predicting Biomedical Discoveries.结合文献挖掘和机器学习预测生物医学发现。
Methods Mol Biol. 2022;2496:123-140. doi: 10.1007/978-1-0716-2305-3_7.
3
BertSRC: transformer-based semantic relation classification.BertSRC:基于转换器的语义关系分类。
BMC Med Inform Decis Mak. 2022 Sep 6;22(1):234. doi: 10.1186/s12911-022-01977-5.
4
Context-driven automatic subgraph creation for literature-based discovery.用于基于文献的发现的上下文驱动自动子图创建
J Biomed Inform. 2015 Apr;54:141-57. doi: 10.1016/j.jbi.2015.01.014. Epub 2015 Feb 7.
5
Neural networks for open and closed Literature-based Discovery.基于文献的开放式和封闭式发现的神经网络。
PLoS One. 2020 May 15;15(5):e0232891. doi: 10.1371/journal.pone.0232891. eCollection 2020.
6
The effect of word sense disambiguation accuracy on literature based discovery.词义消歧准确性对基于文献的发现的影响。
BMC Med Inform Decis Mak. 2016 Jul 18;16 Suppl 1(Suppl 1):57. doi: 10.1186/s12911-016-0296-1.
7
Constructing a Graph Database for Semantic Literature-Based Discovery.构建用于基于语义文献发现的图形数据库。
Stud Health Technol Inform. 2015;216:1094.
8
Quantifying and filtering knowledge generated by literature based discovery.量化并筛选基于文献发现所产生的知识。
BMC Bioinformatics. 2017 May 31;18(Suppl 7):249. doi: 10.1186/s12859-017-1641-9.
9
Enhancing the accuracy of knowledge discovery: a supervised learning method.提高知识发现的准确性:一种有监督的学习方法。
BMC Bioinformatics. 2014;15 Suppl 12(Suppl 12):S9. doi: 10.1186/1471-2105-15-S12-S9. Epub 2014 Nov 6.
10
Serial KinderMiner (SKiM) discovers and annotates biomedical knowledge using co-occurrence and transformer models.使用共现和转换器模型,串行 KinderMiner (SKiM) 发现和注释生物医学知识。
BMC Bioinformatics. 2023 Nov 1;24(1):412. doi: 10.1186/s12859-023-05539-y.

本文引用的文献

1
A systematic review on literature-based discovery workflow.基于文献的发现工作流程的系统综述。
PeerJ Comput Sci. 2019 Nov 18;5:e235. doi: 10.7717/peerj-cs.235. eCollection 2019.
2
Neural networks for open and closed Literature-based Discovery.基于文献的开放式和封闭式发现的神经网络。
PLoS One. 2020 May 15;15(5):e0232891. doi: 10.1371/journal.pone.0232891. eCollection 2020.
3
PubTator central: automated concept annotation for biomedical full text articles.PubTator 中心:用于生物医学全文文章的自动概念标注。
Nucleic Acids Res. 2019 Jul 2;47(W1):W587-W593. doi: 10.1093/nar/gkz389.
4
Disentangling the evolution of MEDLINE bibliographic database: A complex network perspective.剖析 MEDLINE 书目数据库的演化:复杂网络视角。
J Biomed Inform. 2019 Jan;89:101-113. doi: 10.1016/j.jbi.2018.11.014. Epub 2018 Dec 7.
5
LION LBD: a literature-based discovery system for cancer biology.LION LBD:一个基于文献的癌症生物学发现系统。
Bioinformatics. 2019 May 1;35(9):1553-1561. doi: 10.1093/bioinformatics/bty845.
6
Gaps within the Biomedical Literature: Initial Characterization and Assessment of Strategies for Discovery.生物医学文献中的差距:发现策略的初步特征描述与评估
Front Res Metr Anal. 2017 May;2. doi: 10.3389/frma.2017.00003. Epub 2017 May 22.
7
A collaborative filtering-based approach to biomedical knowledge discovery.基于协同过滤的生物医学知识发现方法。
Bioinformatics. 2018 Feb 15;34(4):652-659. doi: 10.1093/bioinformatics/btx613.
8
Literature Based Discovery: Models, methods, and trends.文献基础发现:模型、方法和趋势。
J Biomed Inform. 2017 Oct;74:20-32. doi: 10.1016/j.jbi.2017.08.011. Epub 2017 Aug 31.
9
Treatment of infantile-onset spinal muscular atrophy with nusinersen: a phase 2, open-label, dose-escalation study.用nusinersen 治疗婴儿型脊肌萎缩症:一项 2 期、开放标签、剂量递增研究。
Lancet. 2016 Dec 17;388(10063):3017-3026. doi: 10.1016/S0140-6736(16)31408-8. Epub 2016 Dec 7.
10
A new evaluation methodology for literature-based discovery systems.一种基于文献的发现系统的新评估方法。
J Biomed Inform. 2009 Aug;42(4):633-43. doi: 10.1016/j.jbi.2008.12.001. Epub 2008 Dec 16.