• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于检查学习模型可解释性和泛化性的外部验证扩展

Extensions of the External Validation for Checking Learned Model Interpretability and Generalizability.

作者信息

Ho Sung Yang, Phua Kimberly, Wong Limsoon, Bin Goh Wilson Wen

机构信息

School of Biological Sciences, Nanyang Technological University, Singapore 637551, Singapore.

Department of Computer Science, National University of Singapore, Singapore 117417, Singapore.

出版信息

Patterns (N Y). 2020 Nov 13;1(8):100129. doi: 10.1016/j.patter.2020.100129.

DOI:10.1016/j.patter.2020.100129
PMID:33294870
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7691387/
Abstract

We discuss the validation of machine learning models, which is standard practice in determining model efficacy and generalizability. We argue that internal validation approaches, such as cross-validation and bootstrap, cannot guarantee the quality of a machine learning model due to potentially biased training data and the complexity of the validation procedure itself. For better evaluating the generalization ability of a learned model, we suggest leveraging on external data sources from elsewhere as validation datasets, namely external validation. Due to the lack of research attractions on external validation, especially a well-structured and comprehensive study, we discuss the necessity for external validation and propose two extensions of the external validation approach that may help reveal the true domain-relevant model from a candidate set. Moreover, we also suggest a procedure to check whether a set of validation datasets is valid and introduce statistical reference points for detecting external data problems.

摘要

我们讨论机器学习模型的验证,这是确定模型有效性和通用性的标准做法。我们认为,诸如交叉验证和自助法等内部验证方法,由于潜在的有偏差的训练数据以及验证过程本身的复杂性,无法保证机器学习模型的质量。为了更好地评估学习模型的泛化能力,我们建议利用来自其他地方的外部数据源作为验证数据集,即外部验证。由于对外部验证缺乏研究吸引力,尤其是缺乏结构良好且全面的研究,我们讨论了外部验证的必要性,并提出了外部验证方法的两种扩展,这可能有助于从候选集中揭示真正与领域相关的模型。此外,我们还建议了一个程序来检查一组验证数据集是否有效,并引入统计参考点以检测外部数据问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae2/7691387/30121e741ff8/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae2/7691387/cd93cb770586/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae2/7691387/be2440d7ab40/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae2/7691387/ab145e74ed0d/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae2/7691387/30121e741ff8/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae2/7691387/cd93cb770586/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae2/7691387/be2440d7ab40/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae2/7691387/ab145e74ed0d/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae2/7691387/30121e741ff8/gr4.jpg

相似文献

1
Extensions of the External Validation for Checking Learned Model Interpretability and Generalizability.用于检查学习模型可解释性和泛化性的外部验证扩展
Patterns (N Y). 2020 Nov 13;1(8):100129. doi: 10.1016/j.patter.2020.100129.
2
Machine learning-based radiomics model to predict benign and malignant PI-RADS v2.1 category 3 lesions: a retrospective multi-center study.基于机器学习的放射组学模型预测 PI-RADS v2.1 分类 3 级良恶性病变:一项回顾性多中心研究。
BMC Med Imaging. 2023 Mar 29;23(1):47. doi: 10.1186/s12880-023-01002-9.
3
Prediction Model of Osteonecrosis of the Femoral Head After Femoral Neck Fracture: Machine Learning-Based Development and Validation Study.股骨颈骨折后股骨头坏死的预测模型:基于机器学习的开发与验证研究
JMIR Med Inform. 2021 Nov 19;9(11):e30079. doi: 10.2196/30079.
4
Generalizability of machine learning for classification of schizophrenia based on resting-state functional MRI data.基于静息态功能磁共振成像数据的机器学习在精神分裂症分类中的泛化能力。
Hum Brain Mapp. 2020 Jan;41(1):172-184. doi: 10.1002/hbm.24797. Epub 2019 Oct 1.
5
Prediction of 30-day mortality in heart failure patients with hypoxic hepatitis: Development and external validation of an interpretable machine learning model.缺氧性肝炎所致心力衰竭患者30天死亡率的预测:一种可解释机器学习模型的开发与外部验证
Front Cardiovasc Med. 2022 Oct 28;9:1035675. doi: 10.3389/fcvm.2022.1035675. eCollection 2022.
6
The importance of being external. methodological insights for the external validation of machine learning models in medicine.重视外部性。医学中机器学习模型外部验证的方法学见解。
Comput Methods Programs Biomed. 2021 Sep;208:106288. doi: 10.1016/j.cmpb.2021.106288. Epub 2021 Jul 22.
7
Internal and External Validation of the Generalizability of Machine Learning Algorithms in Predicting Non-home Discharge Disposition Following Primary Total Knee Joint Arthroplasty.机器学习算法在预测初次全膝关节置换术后非居家出院处置中的可推广性的内部和外部验证。
J Arthroplasty. 2023 Oct;38(10):1973-1981. doi: 10.1016/j.arth.2023.01.065. Epub 2023 Feb 9.
8
Differences in cohort study data affect external validation of artificial intelligence models for predictive diagnostics of dementia - lessons for translation into clinical practice.队列研究数据的差异会影响用于痴呆症预测诊断的人工智能模型的外部验证——对转化为临床实践的启示。
EPMA J. 2020 Jun 22;11(3):367-376. doi: 10.1007/s13167-020-00216-z. eCollection 2020 Sep.
9
A personalized prediction model for urinary tract infections in type 2 diabetes mellitus using machine learning.一种使用机器学习的2型糖尿病患者尿路感染个性化预测模型。
Front Pharmacol. 2024 Jan 5;14:1259596. doi: 10.3389/fphar.2023.1259596. eCollection 2023.
10
Development and validation of explainable machine-learning models for carotid atherosclerosis early screening.开发和验证可解释的机器学习模型,用于颈动脉粥样硬化的早期筛查。
J Transl Med. 2023 May 29;21(1):353. doi: 10.1186/s12967-023-04093-8.

引用本文的文献

1
Fraction-based Linear Extrapolation (FLEX) Method for Predicting Human Pharmacokinetic Clearance: Advanced Allometric Scaling Method and Machine Learning Approach.基于分数的线性外推法(FLEX)预测人体药代动力学清除率:先进的异速生长标度法和机器学习方法
Pharm Res. 2025 Sep 10. doi: 10.1007/s11095-025-03922-3.
2
Application of machine learning in early childhood development research: a scoping review.机器学习在幼儿发展研究中的应用:一项范围综述。
BMJ Open. 2025 Aug 19;15(8):e100358. doi: 10.1136/bmjopen-2025-100358.
3
Combining Self-Reported Information with Radiographic Bone Loss to Screen Periodontitis: A Performance Study.

本文引用的文献

1
Avoid Oversimplifications in Machine Learning: Going beyond the Class-Prediction Accuracy.避免机器学习中的过度简化:超越类别预测准确率
Patterns (N Y). 2020 May 8;1(2):100025. doi: 10.1016/j.patter.2020.100025.
2
Inflated performance measures in enhancer-promoter interaction-prediction methods.增强子-启动子相互作用预测方法中夸大的性能指标。
Nat Genet. 2019 Aug;51(8):1196-1198. doi: 10.1038/s41588-019-0434-7.
3
Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study.
结合自我报告信息与影像学骨丧失情况筛查牙周炎:一项效能研究。
J Clin Med. 2025 Jun 26;14(13):4531. doi: 10.3390/jcm14134531.
4
An open-access EEG dataset for speech decoding: Exploring the role of articulation and coarticulation.一个用于语音解码的开放获取脑电图数据集:探索发音和协同发音的作用。
Sci Data. 2025 Jun 17;12(1):1017. doi: 10.1038/s41597-025-05187-2.
5
External validation of machine learning models-registered models and adaptive sample splitting.机器学习模型的外部验证——注册模型与自适应样本分割
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf036.
6
Artificial intelligence for children with attention deficit/hyperactivity disorder: a scoping review.人工智能在注意力缺陷多动障碍儿童中的应用:一项范围综述。
Exp Biol Med (Maywood). 2025 Apr 24;250:10238. doi: 10.3389/ebm.2025.10238. eCollection 2025.
7
The Role of Artificial Intelligence in Epiretinal Membrane Care: A Scoping Review.人工智能在视网膜前膜护理中的作用:一项范围综述
Ophthalmol Sci. 2024 Dec 20;5(4):100689. doi: 10.1016/j.xops.2024.100689. eCollection 2025 Jul-Aug.
8
Deep learning in the discovery of antiviral peptides and peptidomimetics: databases and prediction tools.抗病毒肽和肽模拟物发现中的深度学习:数据库与预测工具
Mol Divers. 2025 Mar 28. doi: 10.1007/s11030-025-11173-y.
9
Deep learning analysis for rheumatologic imaging: current trends, future directions, and the role of human.风湿病影像学的深度学习分析:当前趋势、未来方向及人的作用
J Rheum Dis. 2025 Apr 1;32(2):73-88. doi: 10.4078/jrd.2024.0128. Epub 2025 Jan 20.
10
Machine Learning-Assisted High-Throughput Screening for Electrocatalytic Hydrogen Evolution Reaction.机器学习辅助的电催化析氢反应高通量筛选
Molecules. 2025 Feb 7;30(4):759. doi: 10.3390/molecules30040759.
深度学习模型检测胸片肺炎的可变泛化性能:一项横断面研究。
PLoS Med. 2018 Nov 6;15(11):e1002683. doi: 10.1371/journal.pmed.1002683. eCollection 2018 Nov.
4
Bootstrapping the out-of-sample predictions for efficient and accurate cross-validation.对样本外预测进行自抽样以实现高效且准确的交叉验证。
Mach Learn. 2018;107(12):1895-1922. doi: 10.1007/s10994-018-5714-4. Epub 2018 May 9.
5
Can Peripheral Blood-Derived Gene Expressions Characterize Individuals at Ultra-high Risk for Psychosis?外周血来源的基因表达能否表征处于超高精神分裂症风险的个体?
Comput Psychiatr. 2017 Dec 1;1:168-183. doi: 10.1162/CPSY_a_00007. eCollection 2017 Dec.
6
Turning straw into gold: building robustness into gene signature inference.点石成金:提高基因特征推断稳健性。
Drug Discov Today. 2019 Jan;24(1):31-36. doi: 10.1016/j.drudis.2018.08.002. Epub 2018 Aug 4.
7
Why breast cancer signatures are no better than random signatures explained.为什么乳腺癌特征与随机特征并无差异得到解释。
Drug Discov Today. 2018 Nov;23(11):1818-1823. doi: 10.1016/j.drudis.2018.05.036. Epub 2018 Jun 1.
8
Dealing with Confounders in Omics Analysis.处理组学分析中的混杂因素。
Trends Biotechnol. 2018 May;36(5):488-498. doi: 10.1016/j.tibtech.2018.01.013. Epub 2018 Feb 20.
9
Using and understanding cross-validation strategies. Perspectives on Saeb et al.使用和理解交叉验证策略。Saeb 等人的观点。
Gigascience. 2017 May 1;6(5):1-6. doi: 10.1093/gigascience/gix020.
10
Cancer reproducibility project releases first results.癌症可重复性项目公布首批结果。
Nature. 2017 Jan 18;541(7637):269-270. doi: 10.1038/541269a.