• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GAME:用于模型评估的基因组应用程序编程接口。

GAME: Genomic API for Model Evaluation.

作者信息

Luthra Ishika, Priyadarshi Satyam, Guo Rui, Mahieu Lukas, Kempynck Niklas, Dooley Damion, Penzar Dmitry, Vorontsov Ilya, Sheng Yilun, Tu Xinming, Klie Adam, Drusinsky Shiron, Floren Alexander, Armand Ethan, Alasoo Kaur, Seelig Georg, Tewhey Ryan, Koo Peter, Agarwal Vikram, Gosai Sager, Pinello Luca, White Michael A, Lal Avantika, Zeitlinger Julia, Pollard Katherine S, Libbrecht Maxwell, Carter Hannah, Mostafavi Sara, Kulakovskiy Ivan, Hsiao Will, Aerts Stein, Zhou Jian, de Boer Carl G

机构信息

School of Biomedical Engineering, University of British Columbia, Vancouver, BC, Canada.

Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX, USA.

出版信息

bioRxiv. 2025 Jul 8:2025.07.04.663250. doi: 10.1101/2025.07.04.663250.

DOI:10.1101/2025.07.04.663250
PMID:40672207
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12265512/
Abstract

The rapid expansion of genomics datasets and the application of machine learning has produced sequence-to-activity genomics models with ever-expanding capabilities. However, benchmarking these models on practical applications has been challenging because individual projects evaluate their models in ad hoc ways, and there is substantial heterogeneity of both model architectures and benchmarking tasks. To address this challenge, we have created GAME, a system for large-scale, community-led standardized model benchmarking on user-defined evaluation tasks. We borrow concepts from the Application Programming Interface (API) paradigm to allow for seamless communication between pre-trained models and benchmarking tasks, ensuring consistent evaluation protocols. Because all models and benchmarks are inherently compatible in this framework, the continual addition of new models and new benchmarks is easy. We also developed a Matcher module powered by a large language model (LLM) to automate ambiguous task alignment between benchmarks and models. Containerization of these modules enhances reproducibility and facilitates the deployment of models and benchmarks across computing platforms. By focusing on predicting underlying biochemical phenomena (e.g. gene expression, open chromatin, DNA binding), we ensure that tasks remain technology-independent. We provide examples of benchmarks and models implementing this framework, and anticipate that the community will contribute their own, leading to an ever-expanding and evolving set of models and evaluation tasks. This resource will accelerate genomics research by illuminating the best models for a given task, motivating novel functional genomic benchmarks, and providing a more nuanced understanding of model abilities.

摘要

基因组学数据集的迅速扩展以及机器学习的应用催生了功能不断扩展的序列到活性基因组学模型。然而,在实际应用中对这些模型进行基准测试具有挑战性,因为各个项目以临时方式评估其模型,并且模型架构和基准测试任务存在很大的异质性。为应对这一挑战,我们创建了GAME,这是一个用于在用户定义的评估任务上进行大规模、社区主导的标准化模型基准测试的系统。我们借鉴了应用程序编程接口(API)范式的概念,以实现预训练模型与基准测试任务之间的无缝通信,确保评估协议的一致性。由于在此框架中所有模型和基准测试本质上都是兼容的,因此轻松添加新模型和新基准测试很容易。我们还开发了一个由大语言模型(LLM)驱动的匹配器模块,以自动完成基准测试与模型之间模糊的任务对齐。这些模块的容器化提高了可重复性,并便于在计算平台上部署模型和基准测试。通过专注于预测潜在的生化现象(例如基因表达、开放染色质、DNA结合),我们确保任务与技术无关。我们提供了实现此框架的基准测试和模型的示例,并预计社区将贡献自己的内容,从而导致模型和评估任务的不断扩展和发展。这一资源将通过阐明给定任务的最佳模型、激发新的功能基因组学基准测试以及提供对模型能力更细致入微的理解来加速基因组学研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a254/12265512/2ef7e2a1de1c/nihpp-2025.07.04.663250v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a254/12265512/64ba7aa93569/nihpp-2025.07.04.663250v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a254/12265512/2ef7e2a1de1c/nihpp-2025.07.04.663250v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a254/12265512/64ba7aa93569/nihpp-2025.07.04.663250v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a254/12265512/2ef7e2a1de1c/nihpp-2025.07.04.663250v1-f0002.jpg

相似文献

1
GAME: Genomic API for Model Evaluation.GAME:用于模型评估的基因组应用程序编程接口。
bioRxiv. 2025 Jul 8:2025.07.04.663250. doi: 10.1101/2025.07.04.663250.
2
Short-Term Memory Impairment短期记忆障碍
3
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
4
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
5
Systemic Inflammatory Response Syndrome全身炎症反应综合征
6
"In a State of Flow": A Qualitative Examination of Autistic Adults' Phenomenological Experiences of Task Immersion.“心流状态”:对自闭症成年人任务沉浸现象学体验的质性研究
Autism Adulthood. 2024 Sep 16;6(3):362-373. doi: 10.1089/aut.2023.0032. eCollection 2024 Sep.
7
Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.医疗专业人员在急症医院环境中团队合作教育的经验:对定性文献的系统综述
JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843.
8
A Scoping Review of the Observed and Perceived Functional Impacts Associated With Language and Learning Disorders in School-Aged Children.一项关于学龄儿童语言和学习障碍相关的观察到的和感知到的功能影响的范围综述。
Int J Lang Commun Disord. 2025 Jul-Aug;60(4):e70086. doi: 10.1111/1460-6984.70086.
9
Antidepressants for pain management in adults with chronic pain: a network meta-analysis.抗抑郁药治疗成人慢性疼痛的疼痛管理:一项网络荟萃分析。
Health Technol Assess. 2024 Oct;28(62):1-155. doi: 10.3310/MKRT2948.
10
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.

本文引用的文献

1
Rewriting regulatory DNA to dissect and reprogram gene expression.重写调控性DNA以剖析和重新编程基因表达。
Cell. 2025 Apr 14. doi: 10.1016/j.cell.2025.03.034.
2
Massively parallel characterization of transcriptional regulatory elements.转录调控元件的大规模并行表征
Nature. 2025 Mar;639(8054):411-420. doi: 10.1038/s41586-024-08430-9. Epub 2025 Jan 15.
3
Predicting RNA-seq coverage from DNA sequence as a unifying model of gene regulation.将DNA序列预测RNA测序覆盖度作为基因调控的统一模型。
Nat Genet. 2025 Apr;57(4):949-961. doi: 10.1038/s41588-024-02053-6. Epub 2025 Jan 8.
4
Machine-guided design of cell-type-targeting cis-regulatory elements.机器引导的细胞类型靶向顺式调控元件设计。
Nature. 2024 Oct;634(8036):1211-1220. doi: 10.1038/s41586-024-08070-z. Epub 2024 Oct 23.
5
A community effort to optimize sequence-based deep learning models of gene regulation.一项旨在优化基于序列的基因调控深度学习模型的社区努力。
Nat Biotechnol. 2024 Oct 11. doi: 10.1038/s41587-024-02414-w.
6
Personal transcriptome variation is poorly explained by current genomic deep learning models.当前的基因组深度学习模型对个体转录组变异的解释能力较差。
Nat Genet. 2023 Dec;55(12):2056-2059. doi: 10.1038/s41588-023-01574-w. Epub 2023 Nov 30.
7
Current approaches to genomic deep learning struggle to fully capture human genetic variation.当前用于基因组深度学习的方法难以完全捕捉人类遗传变异。
Nat Genet. 2023 Dec;55(12):2021-2022. doi: 10.1038/s41588-023-01517-5.
8
Benchmarking of deep neural networks for predicting personal gene expression from DNA sequence highlights shortcomings.用于从DNA序列预测个人基因表达的深度神经网络基准测试凸显了不足之处。
Nat Genet. 2023 Dec;55(12):2060-2064. doi: 10.1038/s41588-023-01524-6. Epub 2023 Nov 30.
9
Genomic benchmarks: a collection of datasets for genomic sequence classification.基因组基准测试:一组用于基因组序列分类的数据集。
BMC Genom Data. 2023 May 1;24(1):25. doi: 10.1186/s12863-023-01123-8.
10
Current sequence-based models capture gene expression determinants in promoters but mostly ignore distal enhancers.目前基于序列的模型可以捕捉启动子中的基因表达决定因素,但大多忽略了远端增强子。
Genome Biol. 2023 Mar 27;24(1):56. doi: 10.1186/s13059-023-02899-9.