• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Codabench:灵活、易用且可重现的元基准测试平台。

Codabench: Flexible, easy-to-use, and reproducible meta-benchmark platform.

作者信息

Xu Zhen, Escalera Sergio, Pavão Adrien, Richard Magali, Tu Wei-Wei, Yao Quanming, Zhao Huan, Guyon Isabelle

机构信息

4Paradigm, Beijing 100085, China.

Computer Vision Center, Universitat de Barcelona, 08007 Barcelona, Spain.

出版信息

Patterns (N Y). 2022 Jun 24;3(7):100543. doi: 10.1016/j.patter.2022.100543. eCollection 2022 Jul 8.

DOI:10.1016/j.patter.2022.100543
PMID:35845844
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9278500/
Abstract

Obtaining a standardized benchmark of computational methods is a major issue in data-science communities. Dedicated frameworks enabling fair benchmarking in a unified environment are yet to be developed. Here, we introduce Codabench, a meta-benchmark platform that is open sourced and community driven for benchmarking algorithms or software agents versus datasets or tasks. A public instance of Codabench is open to everyone free of charge and allows benchmark organizers to fairly compare submissions under the same setting (software, hardware, data, algorithms), with custom protocols and data formats. Codabench has unique features facilitating easy organization of flexible and reproducible benchmarks, such as the possibility of reusing templates of benchmarks and supplying compute resources on demand. Codabench has been used internally and externally on various applications, receiving more than 130 users and 2,500 submissions. As illustrative use cases, we introduce four diverse benchmarks covering graph machine learning, cancer heterogeneity, clinical diagnosis, and reinforcement learning.

摘要

获得计算方法的标准化基准是数据科学领域的一个主要问题。目前尚未开发出能在统一环境中实现公平基准测试的专用框架。在此,我们介绍Codabench,这是一个开源且由社区驱动的元基准测试平台,用于针对数据集或任务对算法或软件代理进行基准测试。Codabench的公共实例对所有人免费开放,允许基准测试组织者在相同设置(软件、硬件、数据、算法)下,使用自定义协议和数据格式公平地比较提交的内容。Codabench具有独特的功能,便于轻松组织灵活且可重复的基准测试,例如可以重复使用基准测试模板并按需提供计算资源。Codabench已在内部和外部用于各种应用,拥有超过130名用户和2500份提交内容。作为示例用例,我们介绍了四个不同的基准测试,涵盖图机器学习、癌症异质性、临床诊断和强化学习。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ee/9278500/35b7116100ae/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ee/9278500/edbb398c2508/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ee/9278500/0e8f47292d58/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ee/9278500/c4e9c1e7a849/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ee/9278500/35b7116100ae/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ee/9278500/edbb398c2508/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ee/9278500/0e8f47292d58/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ee/9278500/c4e9c1e7a849/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ee/9278500/35b7116100ae/gr4.jpg

相似文献

1
Codabench: Flexible, easy-to-use, and reproducible meta-benchmark platform.Codabench:灵活、易用且可重现的元基准测试平台。
Patterns (N Y). 2022 Jun 24;3(7):100543. doi: 10.1016/j.patter.2022.100543. eCollection 2022 Jul 8.
2
DECONbench: a benchmarking platform dedicated to deconvolution methods for tumor heterogeneity quantification.DECONbench:一个专门用于肿瘤异质性定量反卷积方法的基准测试平台。
BMC Bioinformatics. 2021 Oct 2;22(1):473. doi: 10.1186/s12859-021-04381-4.
3
PMLB: a large benchmark suite for machine learning evaluation and comparison.PMLB:一个用于机器学习评估和比较的大型基准测试套件。
BioData Min. 2017 Dec 11;10:36. doi: 10.1186/s13040-017-0154-4. eCollection 2017.
4
Benchmarking machine learning models on multi-centre eICU critical care dataset.基于多中心 eICU 重症监护数据集的机器学习模型基准测试。
PLoS One. 2020 Jul 2;15(7):e0235424. doi: 10.1371/journal.pone.0235424. eCollection 2020.
5
PMLB v1.0: an open-source dataset collection for benchmarking machine learning methods.PMLB v1.0:用于基准测试机器学习方法的开源数据集集合。
Bioinformatics. 2022 Jan 12;38(3):878-880. doi: 10.1093/bioinformatics/btab727.
6
LEMMI: a continuous benchmarking platform for metagenomics classifiers.LEMMI:用于宏基因组分类器的连续基准测试平台。
Genome Res. 2020 Aug;30(8):1208-1216. doi: 10.1101/gr.260398.119. Epub 2020 Jul 2.
7
A Web Resource for Standardized Benchmark Datasets, Metrics, and Rosetta Protocols for Macromolecular Modeling and Design.一个用于大分子建模与设计的标准化基准数据集、指标和罗塞塔协议的网络资源。
PLoS One. 2015 Sep 3;10(9):e0130433. doi: 10.1371/journal.pone.0130433. eCollection 2015.
8
Recommendations for machine learning benchmarks in neuroimaging.神经影像学中机器学习基准的建议。
Neuroimage. 2022 Aug 15;257:119298. doi: 10.1016/j.neuroimage.2022.119298. Epub 2022 May 10.
9
Deep reinforcement learning enables better bias control in benchmark for virtual screening.深度强化学习可在虚拟筛选基准测试中实现更好的偏差控制。
Comput Biol Med. 2024 Mar;171:108165. doi: 10.1016/j.compbiomed.2024.108165. Epub 2024 Feb 15.
10
NeuroPycon: An open-source python toolbox for fast multi-modal and reproducible brain connectivity pipelines.NeuroPycon:一个开源的 Python 工具包,用于快速进行多模态和可重复的脑连接管道。
Neuroimage. 2020 Oct 1;219:117020. doi: 10.1016/j.neuroimage.2020.117020. Epub 2020 Jun 6.

引用本文的文献

1
LCD benchmark: long clinical document benchmark on mortality prediction for language models.LCD基准:用于语言模型死亡率预测的长临床文档基准。
J Am Med Inform Assoc. 2025 Feb 1;32(2):285-295. doi: 10.1093/jamia/ocae287.
2
Challenges and perspectives in computational deconvolution of genomics data.计算基因组学数据去卷积的挑战与展望。
Nat Methods. 2024 Mar;21(3):391-400. doi: 10.1038/s41592-023-02166-6. Epub 2024 Feb 19.

本文引用的文献

1
DECONbench: a benchmarking platform dedicated to deconvolution methods for tumor heterogeneity quantification.DECONbench:一个专门用于肿瘤异质性定量反卷积方法的基准测试平台。
BMC Bioinformatics. 2021 Oct 2;22(1):473. doi: 10.1186/s12859-021-04381-4.
2
Benchmarking of cell type deconvolution pipelines for transcriptomics data.基于转录组数据的细胞类型去卷积分析流水线的基准测试
Nat Commun. 2020 Nov 6;11(1):5650. doi: 10.1038/s41467-020-19015-1.
3
Guidelines for cell-type heterogeneity quantification based on a comparative analysis of reference-free DNA methylation deconvolution software.
基于无参 DNA 甲基化去卷积软件比较分析的细胞类型异质性定量指南。
BMC Bioinformatics. 2020 Jan 13;21(1):16. doi: 10.1186/s12859-019-3307-2.
4
Molecular Geometry Prediction using a Deep Generative Graph Neural Network.基于深度生成图神经网络的分子几何结构预测。
Sci Rep. 2019 Dec 31;9(1):20381. doi: 10.1038/s41598-019-56773-5.
5
Assessing reproducibility of matrix factorization methods in independent transcriptomes.评估基质分解方法在独立转录组中的可重复性。
Bioinformatics. 2019 Nov 1;35(21):4307-4313. doi: 10.1093/bioinformatics/btz225.