• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

改革:基于共识的机器学习科学建议。

REFORMS: Consensus-based Recommendations for Machine-learning-based Science.

机构信息

Department of Computer Science, Princeton University, Princeton, NJ 08544, USA.

Center for Information Technology Policy, Princeton University, Princeton, NJ 08544, USA.

出版信息

Sci Adv. 2024 May 3;10(18):eadk3452. doi: 10.1126/sciadv.adk3452. Epub 2024 May 1.

DOI:10.1126/sciadv.adk3452
PMID:38691601
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11092361/
Abstract

Machine learning (ML) methods are proliferating in scientific research. However, the adoption of these methods has been accompanied by failures of validity, reproducibility, and generalizability. These failures can hinder scientific progress, lead to false consensus around invalid claims, and undermine the credibility of ML-based science. ML methods are often applied and fail in similar ways across disciplines. Motivated by this observation, our goal is to provide clear recommendations for conducting and reporting ML-based science. Drawing from an extensive review of past literature, we present the REFORMS checklist (recommendations for machine-learning-based science). It consists of 32 questions and a paired set of guidelines. REFORMS was developed on the basis of a consensus of 19 researchers across computer science, data science, mathematics, social sciences, and biomedical sciences. REFORMS can serve as a resource for researchers when designing and implementing a study, for referees when reviewing papers, and for journals when enforcing standards for transparency and reproducibility.

摘要

机器学习 (ML) 方法在科学研究中迅速普及。然而,这些方法的采用伴随着有效性、可重复性和通用性的失败。这些失败可能会阻碍科学进步,导致无效主张周围形成虚假共识,并破坏基于机器学习的科学的可信度。ML 方法在不同学科中经常以类似的方式应用和失败。受此观察的启发,我们的目标是为基于机器学习的科学提供明确的建议。我们从对过去文献的广泛回顾中,提出了基于 ML 的科学的 REFORMS 清单 (REcommendations for machine-learning-based science)。它由 32 个问题和一对指导方针组成。REFORMS 是基于来自计算机科学、数据科学、数学、社会科学和生物医学科学的 19 名研究人员的共识而开发的。REFORMS 可以作为研究人员在设计和实施研究时、评审人员在评审论文时以及期刊在执行透明度和可重复性标准时的资源。

相似文献

1
REFORMS: Consensus-based Recommendations for Machine-learning-based Science.改革:基于共识的机器学习科学建议。
Sci Adv. 2024 May 3;10(18):eadk3452. doi: 10.1126/sciadv.adk3452. Epub 2024 May 1.
2
The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.成年自闭症患者的就业生活经历:系统检索与综述
Autism Adulthood. 2024 Dec 2;6(4):495-509. doi: 10.1089/aut.2022.0114. eCollection 2024 Dec.
3
The Reporting Quality of Machine Learning Studies on Pediatric Diabetes Mellitus: Systematic Review.机器学习在儿科糖尿病研究中的报告质量:系统评价。
J Med Internet Res. 2024 Jan 19;26:e47430. doi: 10.2196/47430.
4
Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.医疗专业人员在急症医院环境中团队合作教育的经验:对定性文献的系统综述
JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843.
5
Approaches for predicting dairy cattle methane emissions: from traditional methods to machine learning.预测奶牛甲烷排放的方法:从传统方法到机器学习。
J Anim Sci. 2024 Jan 3;102. doi: 10.1093/jas/skae219.
6
Do peer reviewers comment on reporting items as instructed by the journal? A secondary analysis of two randomized trials.同行评审员是否按照期刊的要求对报告项目进行评论?两项随机试验的二次分析。
J Clin Epidemiol. 2025 May 8;183:111818. doi: 10.1016/j.jclinepi.2025.111818.
7
Diagnosis and management of dental caries throughout life.一生当中龋齿的诊断与管理。
NIH Consens Statement. 2001;18(1):1-23.
8
Sexual Harassment and Prevention Training性骚扰与预防培训
9
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
10
Short-Term Memory Impairment短期记忆障碍

引用本文的文献

1
Protocol for development of a checklist and guideline for transparent reporting of cluster analyses (TRoCA).制定聚类分析透明报告清单及指南(TRoCA)的方案
BMJ Open. 2025 Aug 21;15(8):e099609. doi: 10.1136/bmjopen-2025-099609.
2
Hierarchical, Interactive, and Dynamic Predictive Capacity of Current Biological, Psychological, Social, and Environmental Measurements in Depression, Anxiety, ADHD, and Social Quality across the Lifespan.当前生物、心理、社会和环境测量在整个生命周期中对抑郁、焦虑、注意力缺陷多动障碍和社会质量的分层、交互和动态预测能力。
Res Sq. 2025 Jul 30:rs.3.rs-7060126. doi: 10.21203/rs.3.rs-7060126/v1.
3
Review and recommendations for using artificial intelligence in intracoronary optical coherence tomography analysis.冠状动脉内光学相干断层扫描分析中使用人工智能的综述与建议
Eur Heart J Digit Health. 2025 May 15;6(4):529-539. doi: 10.1093/ehjdh/ztaf053. eCollection 2025 Jul.
4
Unraveling overoptimism and publication bias in ML-driven science.揭示机器学习驱动的科学中的过度乐观和发表偏倚。
Patterns (N Y). 2025 Feb 25;6(4):101185. doi: 10.1016/j.patter.2025.101185. eCollection 2025 Apr 11.
5
Why an overreliance on AI-driven modelling is bad for science.为何过度依赖人工智能驱动的建模对科学不利。
Nature. 2025 Apr;640(8058):312-314. doi: 10.1038/d41586-025-01067-2.
6
Public Disclosure of Results From Artificial Intelligence/Machine Learning Research in Health Care: Comprehensive Analysis of ClinicalTrials.gov, PubMed, and Scopus Data (2010-2023).医疗保健领域人工智能/机器学习研究结果的公开披露:对ClinicalTrials.gov、PubMed和Scopus数据的综合分析(2010 - 2023年)
J Med Internet Res. 2025 Mar 21;27:e60148. doi: 10.2196/60148.
7
How can we make sound replication decisions?我们如何做出合理的复制决策?
Proc Natl Acad Sci U S A. 2025 Feb 4;122(5):e2401236121. doi: 10.1073/pnas.2401236121. Epub 2025 Jan 27.
8
Avoiding common machine learning pitfalls.避免常见的机器学习陷阱。
Patterns (N Y). 2024 Aug 28;5(10):101046. doi: 10.1016/j.patter.2024.101046. eCollection 2024 Oct 11.
9
The reanimation of pseudoscience in machine learning and its ethical repercussions.机器学习中伪科学的复兴及其伦理影响。
Patterns (N Y). 2024 Aug 1;5(9):101027. doi: 10.1016/j.patter.2024.101027. eCollection 2024 Sep 13.
10
Generative Artificial Intelligence for Health Technology Assessment: Opportunities, Challenges, and Policy Considerations: An ISPOR Working Group Report.用于卫生技术评估的生成式人工智能:机遇、挑战及政策考量:一份ISPOR工作组报告
Value Health. 2025 Feb;28(2):175-183. doi: 10.1016/j.jval.2024.10.3846. Epub 2024 Nov 12.

本文引用的文献

1
Artificial intelligence and illusions of understanding in scientific research.人工智能与科研中的理解错觉。
Nature. 2024 Mar;627(8002):49-58. doi: 10.1038/s41586-024-07146-0. Epub 2024 Mar 6.
2
Prediction-powered inference.预测驱动的推理。
Science. 2023 Nov 10;382(6671):669-674. doi: 10.1126/science.adi6000. Epub 2023 Nov 9.
3
Leakage and the reproducibility crisis in machine-learning-based science.基于机器学习的科学中的漏洞与可重复性危机。
Patterns (N Y). 2023 Aug 4;4(9):100804. doi: 10.1016/j.patter.2023.100804. eCollection 2023 Sep 8.
4
Successes and Struggles with Computational Reproducibility: Lessons from the Fragile Families Challenge.计算可重复性的成功与挑战:来自脆弱家庭挑战的经验教训。
Socius. 2019 Jan-Dec;5. doi: 10.1177/2378023119849803. Epub 2019 Sep 10.
5
Overfitting to 'predict' suicidal ideation.过度拟合以“预测”自杀意念。
Nat Hum Behav. 2023 May;7(5):680-681. doi: 10.1038/s41562-023-01560-6. Epub 2023 Apr 6.
6
Systematic review finds "spin" practices and poor reporting standards in studies on machine learning-based prediction models.系统评价发现基于机器学习的预测模型研究中存在“歪曲”做法和较差的报告标准。
J Clin Epidemiol. 2023 Jun;158:99-110. doi: 10.1016/j.jclinepi.2023.03.024. Epub 2023 Apr 5.
7
Researcher reasoning meets computational capacity: Machine learning for social science.研究人员推理与计算能力的结合:用于社会科学的机器学习。
Soc Sci Res. 2022 Nov;108:102807. doi: 10.1016/j.ssresearch.2022.102807. Epub 2022 Oct 29.
8
Mapping of machine learning approaches for description, prediction, and causal inference in the social and health sciences.社会科学与健康科学中用于描述、预测及因果推断的机器学习方法映射
Sci Adv. 2022 Oct 21;8(42):eabk1942. doi: 10.1126/sciadv.abk1942. Epub 2022 Oct 19.
9
Retraction Note: A mechanistic model of the neural entropy increase elicited by psychedelic drugs.撤稿说明:致幻药物引发神经熵增加的机制模型。
Sci Rep. 2022 Sep 15;12(1):15500. doi: 10.1038/s41598-022-20093-y.
10
Deep learning for automatic brain tumour segmentation on MRI: evaluation of recommended reporting criteria via a reproduction and replication study.深度学习在 MRI 上自动脑肿瘤分割中的应用:通过再现和复制研究评估推荐报告标准。
BMJ Open. 2022 Jul 18;12(7):e059000. doi: 10.1136/bmjopen-2021-059000.