• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DOME注册库:实施全社区关于报告生物学中监督式机器学习的建议。

DOME Registry: implementing community-wide recommendations for reporting supervised machine learning in biology.

作者信息

Attafi Omar Abdelghani, Clementel Damiano, Kyritsis Konstantinos, Capriotti Emidio, Farrell Gavin, Fragkouli Styliani-Christina, Castro Leyla Jael, Hatos András, Lenaerts Tom, Mazurenko Stanislav, Mozaffari Soroush, Pradelli Franco, Ruch Patrick, Savojardo Castrense, Turina Paola, Zambelli Federico, Piovesan Damiano, Monzon Alexander Miguel, Psomopoulos Fotis, Tosatto Silvio C E

机构信息

Department of Biomedical Sciences, University of Padova, Padova 35131, Italy.

Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki 570 01, Greece.

出版信息

Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae094.

DOI:10.1093/gigascience/giae094
PMID:
39661723
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11633452/
Abstract

Supervised machine learning (ML) is used extensively in biology and deserves closer scrutiny. The Data Optimization Model Evaluation (DOME) recommendations aim to enhance the validation and reproducibility of ML research by establishing standards for key aspects such as data handling and processing, optimization, evaluation, and model interpretability. The recommendations help to ensure that key details are reported transparently by providing a structured set of questions. Here, we introduce the DOME registry (URL: registry.dome-ml.org), a database that allows scientists to manage and access comprehensive DOME-related information on published ML studies. The registry uses external resources like ORCID, APICURON, and the Data Stewardship Wizard to streamline the annotation process and ensure comprehensive documentation. By assigning unique identifiers and DOME scores to publications, the registry fosters a standardized evaluation of ML methods. Future plans include continuing to grow the registry through community curation, improving the DOME score definition and encouraging publishers to adopt DOME standards, and promoting transparency and reproducibility of ML in the life sciences.

摘要

监督式机器学习(ML)在生物学中被广泛使用,值得更深入的审视。数据优化模型评估(DOME)建议旨在通过为数据处理、优化、评估和模型可解释性等关键方面制定标准,来提高ML研究的验证性和可重复性。这些建议通过提供一组结构化问题,有助于确保关键细节得到透明报告。在此,我们介绍DOME注册库(网址:registry.dome-ml.org),这是一个数据库,允许科学家管理和获取已发表的ML研究中与DOME相关的全面信息。该注册库使用ORCID、APICURON和数据管理向导等外部资源来简化注释过程并确保全面记录。通过为出版物分配唯一标识符和DOME分数,该注册库促进了对ML方法的标准化评估。未来计划包括通过社区管理继续扩大注册库,改进DOME分数定义并鼓励出版商采用DOME标准,以及提高生命科学中ML的透明度和可重复性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8df/11633452/857aaf7f521e/giae094fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8df/11633452/fa6da87eeef6/giae094fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8df/11633452/525d0acc0fc6/giae094fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8df/11633452/857aaf7f521e/giae094fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8df/11633452/fa6da87eeef6/giae094fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8df/11633452/525d0acc0fc6/giae094fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8df/11633452/857aaf7f521e/giae094fig3.jpg

相似文献

1
DOME Registry: implementing community-wide recommendations for reporting supervised machine learning in biology.DOME注册库:实施全社区关于报告生物学中监督式机器学习的建议。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae094.
2
Resource Disambiguator for the Web: Extracting Biomedical Resources and Their Citations from the Scientific Literature.网络资源消歧器:从科学文献中提取生物医学资源及其引用信息
PLoS One. 2016 Jan 5;11(1):e0146300. doi: 10.1371/journal.pone.0146300. eCollection 2016.
3
DOME: recommendations for supervised machine learning validation in biology.DOME:生物学中监督式机器学习验证的建议
Nat Methods. 2021 Oct;18(10):1122-1127. doi: 10.1038/s41592-021-01205-4.
4
APICURON: a database to credit and acknowledge the work of biocurators.APICURON:一个为生物信息注释员的工作提供信用和认可的数据库。
Database (Oxford). 2021 Apr 21;2021. doi: 10.1093/database/baab019.
5
Critical Care Network in the State of Qatar.卡塔尔国重症监护网络。
Qatar Med J. 2019 Nov 7;2019(2):2. doi: 10.5339/qmj.2019.qccc.2. eCollection 2019.
6
BioSharing: curated and crowd-sourced metadata standards, databases and data policies in the life sciences.生物数据共享:生命科学领域经整理和众包的元数据标准、数据库及数据政策。
Database (Oxford). 2016 May 17;2016. doi: 10.1093/database/baw075. Print 2016.
7
Identifiers.org and MIRIAM Registry: community resources to provide persistent identification.Identifiers.org 和 MIRIAM 注册表:为提供持久标识提供社区资源。
Nucleic Acids Res. 2012 Jan;40(Database issue):D580-6. doi: 10.1093/nar/gkr1097. Epub 2011 Dec 2.
8
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
9
Data stewardship and curation practices in AI-based genomics and automated microscopy image analysis for high-throughput screening studies: promoting robust and ethical AI applications.基于人工智能的基因组学和用于高通量筛选研究的自动显微镜图像分析中的数据管理与整理实践:推动可靠且符合伦理的人工智能应用。
Hum Genomics. 2025 Feb 23;19(1):16. doi: 10.1186/s40246-025-00716-x.
10
Standardized Reporting of Machine Learning Applications in Urology: The STREAM-URO Framework.机器学习在泌尿外科中的标准化报告:STREAM-URO 框架。
Eur Urol Focus. 2021 Jul;7(4):672-682. doi: 10.1016/j.euf.2021.07.004. Epub 2021 Aug 3.

引用本文的文献

1
Structural bioinformatics for rational drug design.用于合理药物设计的结构生物信息学。
Res Pract Thromb Haemost. 2025 Jan 23;9(1):102691. doi: 10.1016/j.rpth.2025.102691. eCollection 2025 Jan.

本文引用的文献

1
A knowledge graph approach to predict and interpret disease-causing gene interactions.一种基于知识图谱的疾病相关基因互作预测与解释方法。
BMC Bioinformatics. 2023 Aug 29;24(1):324. doi: 10.1186/s12859-023-05451-5.
2
FAIR for AI: An interdisciplinary and international community building perspective.公平对待人工智能:跨学科和国际化的社区建设视角。
Sci Data. 2023 Jul 26;10(1):487. doi: 10.1038/s41597-023-02298-6.
3
Faster and more accurate pathogenic combination predictions with VarCoPP2.0.利用 VarCoPP2.0 实现更快更准确的病原体组合预测。
BMC Bioinformatics. 2023 May 1;24(1):179. doi: 10.1186/s12859-023-05291-3.
4
The AIMe registry for artificial intelligence in biomedical research.用于生物医学研究中人工智能的AIMe注册库。
Nat Methods. 2021 Oct;18(10):1128-1131. doi: 10.1038/s41592-021-01241-0.
5
DOME: recommendations for supervised machine learning validation in biology.DOME:生物学中监督式机器学习验证的建议
Nat Methods. 2021 Oct;18(10):1122-1127. doi: 10.1038/s41592-021-01205-4.
6
APICURON: a database to credit and acknowledge the work of biocurators.APICURON:一个为生物信息注释员的工作提供信用和认可的数据库。
Database (Oxford). 2021 Apr 21;2021. doi: 10.1093/database/baab019.
7
A comparison of deep learning performance against health-care professionals in detecting diseases from medical imaging: a systematic review and meta-analysis.深度学习在医学影像疾病检测方面的性能与医疗保健专业人员的比较:系统评价和荟萃分析。
Lancet Digit Health. 2019 Oct;1(6):e271-e297. doi: 10.1016/S2589-7500(19)30123-2. Epub 2019 Sep 25.
8
Integration of mechanistic immunological knowledge into a machine learning pipeline improves predictions.将机械免疫学知识整合到机器学习流程中可改善预测效果。
Nat Mach Intell. 2020 Oct;2(10):619-628. doi: 10.1038/s42256-020-00232-8. Epub 2020 Oct 12.
9
Transparency and reproducibility in artificial intelligence.人工智能中的透明度和可重复性。
Nature. 2020 Oct;586(7829):E14-E16. doi: 10.1038/s41586-020-2766-y. Epub 2020 Oct 14.
10
Setting the standards for machine learning in biology.设定生物学中机器学习的标准。
Nat Rev Mol Cell Biol. 2019 Nov;20(11):659-660. doi: 10.1038/s41580-019-0176-5.