• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Dyport:基于动态重要性的生物医学假说生成基准测试技术。

Dyport: dynamic importance-based biomedical hypothesis generation benchmarking technique.

机构信息

Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE, 19713, USA.

Department of Computer and Information Sciences, University of Delaware, Newark, DE, 19716, USA.

出版信息

BMC Bioinformatics. 2024 Jun 13;25(1):213. doi: 10.1186/s12859-024-05812-8.

DOI:10.1186/s12859-024-05812-8
PMID:38872097
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11177514/
Abstract

BACKGROUND

Automated hypothesis generation (HG) focuses on uncovering hidden connections within the extensive information that is publicly available. This domain has become increasingly popular, thanks to modern machine learning algorithms. However, the automated evaluation of HG systems is still an open problem, especially on a larger scale.

RESULTS

This paper presents a novel benchmarking framework Dyport for evaluating biomedical hypothesis generation systems. Utilizing curated datasets, our approach tests these systems under realistic conditions, enhancing the relevance of our evaluations. We integrate knowledge from the curated databases into a dynamic graph, accompanied by a method to quantify discovery importance. This not only assesses hypotheses accuracy but also their potential impact in biomedical research which significantly extends traditional link prediction benchmarks. Applicability of our benchmarking process is demonstrated on several link prediction systems applied on biomedical semantic knowledge graphs. Being flexible, our benchmarking system is designed for broad application in hypothesis generation quality verification, aiming to expand the scope of scientific discovery within the biomedical research community.

CONCLUSIONS

Dyport is an open-source benchmarking framework designed for biomedical hypothesis generation systems evaluation, which takes into account knowledge dynamics, semantics and impact. All code and datasets are available at: https://github.com/IlyaTyagin/Dyport .

摘要

背景

自动化假设生成(HG)专注于揭示公共领域中广泛信息内隐藏的联系。由于现代机器学习算法的出现,这个领域变得越来越流行。然而,自动化的 HG 系统评估仍然是一个开放的问题,特别是在更大的规模上。

结果

本文提出了一种新颖的基准框架 Dyport,用于评估生物医学假设生成系统。我们的方法利用经过精心整理的数据集,在现实条件下测试这些系统,提高了评估的相关性。我们将来自经过整理的数据库的知识集成到一个动态图中,并结合一种方法来量化发现的重要性。这不仅评估了假设的准确性,还评估了它们在生物医学研究中的潜在影响,这大大扩展了传统的链接预测基准。我们的基准测试过程在几个链接预测系统上进行了应用,这些系统应用于生物医学语义知识图。我们的基准测试系统具有灵活性,旨在验证假设生成质量,旨在扩大生物医学研究界的科学发现范围。

结论

Dyport 是一个用于生物医学假设生成系统评估的开源基准框架,它考虑了知识动态、语义和影响。所有的代码和数据集都可以在:https://github.com/IlyaTyagin/Dyport 上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be3e/11177514/b9e15600aa9f/12859_2024_5812_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be3e/11177514/4e072326ed0a/12859_2024_5812_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be3e/11177514/1fe6e2805fc3/12859_2024_5812_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be3e/11177514/e2b58fc48d6e/12859_2024_5812_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be3e/11177514/90ff2b39c787/12859_2024_5812_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be3e/11177514/ab0c20fe8dce/12859_2024_5812_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be3e/11177514/b9e15600aa9f/12859_2024_5812_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be3e/11177514/4e072326ed0a/12859_2024_5812_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be3e/11177514/1fe6e2805fc3/12859_2024_5812_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be3e/11177514/e2b58fc48d6e/12859_2024_5812_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be3e/11177514/90ff2b39c787/12859_2024_5812_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be3e/11177514/ab0c20fe8dce/12859_2024_5812_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be3e/11177514/b9e15600aa9f/12859_2024_5812_Fig6_HTML.jpg

相似文献

1
Dyport: dynamic importance-based biomedical hypothesis generation benchmarking technique.Dyport:基于动态重要性的生物医学假说生成基准测试技术。
BMC Bioinformatics. 2024 Jun 13;25(1):213. doi: 10.1186/s12859-024-05812-8.
2
Exploiting semantic patterns over biomedical knowledge graphs for predicting treatment and causative relations.利用生物医学知识图谱中的语义模式预测治疗和因果关系。
J Biomed Inform. 2018 Jun;82:189-199. doi: 10.1016/j.jbi.2018.05.003. Epub 2018 May 12.
3
Contrasting Multi-Source Temporal Knowledge Graphs for Biomedical Hypothesis Generation.用于生物医学假设生成的多源时态知识图谱对比
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):2102-2112. doi: 10.1109/TCBB.2024.3451051. Epub 2024 Dec 10.
4
Biomedical knowledge graph-optimized prompt generation for large language models.生物医学知识图谱优化的大语言模型提示生成。
Bioinformatics. 2024 Sep 2;40(9). doi: 10.1093/bioinformatics/btae560.
5
BioConceptVec: Creating and evaluating literature-based biomedical concept embeddings on a large scale.生物概念向量:在大规模上创建和评估基于文献的生物医学概念嵌入。
PLoS Comput Biol. 2020 Apr 23;16(4):e1007617. doi: 10.1371/journal.pcbi.1007617. eCollection 2020 Apr.
6
Biomedical relation extraction method based on ensemble learning and attention mechanism.基于集成学习和注意力机制的生物医学关系抽取方法。
BMC Bioinformatics. 2024 Oct 18;25(1):333. doi: 10.1186/s12859-024-05951-y.
7
OpenBioLink: a benchmarking framework for large-scale biomedical link prediction.OpenBioLink:大规模生物医学链接预测的基准测试框架。
Bioinformatics. 2020 Jul 1;36(13):4097-4098. doi: 10.1093/bioinformatics/btaa274.
8
Automated ontology generation framework powered by linked biomedical ontologies for disease-drug domain.基于链接生物医学本体的疾病-药物领域自动化本体生成框架。
Comput Methods Programs Biomed. 2018 Oct;165:117-128. doi: 10.1016/j.cmpb.2018.08.010. Epub 2018 Aug 16.
9
Developing a Knowledge Graph for Pharmacokinetic Natural Product-Drug Interactions.开发药代动力学天然产物-药物相互作用知识库。
J Biomed Inform. 2023 Apr;140:104341. doi: 10.1016/j.jbi.2023.104341. Epub 2023 Mar 17.
10
A service-oriented distributed semantic mediator: integrating multiscale biomedical information.一种面向服务的分布式语义中介器:整合多尺度生物医学信息。
IEEE Trans Inf Technol Biomed. 2012 Nov;16(6):1296-303. doi: 10.1109/TITB.2012.2215045. Epub 2012 Aug 23.

本文引用的文献

1
Is Novelty Predictable?新颖性可预测吗?
Cold Spring Harb Perspect Biol. 2024 Feb 1;16(2):a041469. doi: 10.1101/cshperspect.a041469.
2
Accelerating science with human-aware artificial intelligence.用具有人类意识的人工智能加速科学发展。
Nat Hum Behav. 2023 Oct;7(10):1682-1696. doi: 10.1038/s41562-023-01648-z. Epub 2023 Jul 13.
3
The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource.NHGRI-EBI GWAS 目录:知识库和存储资源。
Nucleic Acids Res. 2023 Jan 6;51(D1):D977-D985. doi: 10.1093/nar/gkac1010.
4
Comparative Toxicogenomics Database (CTD): update 2023.比较毒理学基因组数据库(CTD):2023 年更新。
Nucleic Acids Res. 2023 Jan 6;51(D1):D1257-D1262. doi: 10.1093/nar/gkac833.
5
BioGPT: generative pre-trained transformer for biomedical text generation and mining.BioGPT:用于生物医学文本生成和挖掘的生成式预训练转换器。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac409.
6
A review of biomedical datasets relating to drug discovery: a knowledge graph perspective.生物医学数据集在药物发现中的应用综述:知识图谱视角。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac404.
7
Large-Scale Validation of Hypothesis Generation Systems via Candidate Ranking.通过候选排序对假设生成系统进行大规模验证。
Proc IEEE Int Conf Big Data. 2018 Dec;2018:1494-1503. doi: 10.1109/bigdata.2018.8622637. Epub 2019 Jan 24.
8
CBAG: Conditional biomedical abstract generation.CBAG:条件生物医学文摘生成。
PLoS One. 2021 Jul 6;16(7):e0253905. doi: 10.1371/journal.pone.0253905. eCollection 2021.
9
AI-based language models powering drug discovery and development.基于人工智能的语言模型推动药物发现和开发。
Drug Discov Today. 2021 Nov;26(11):2593-2607. doi: 10.1016/j.drudis.2021.06.009. Epub 2021 Jun 30.
10
BioRel: towards large-scale biomedical relation extraction.BioRel:面向大规模生物医学关系抽取。
BMC Bioinformatics. 2020 Dec 16;21(Suppl 16):543. doi: 10.1186/s12859-020-03889-5.