• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习模型在小样本量早期研究中的验证。

Machine Learning Model Validation for Early Stage Studies with Small Sample Sizes.

出版信息

Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:2314-2319. doi: 10.1109/EMBC46164.2021.9629697.

DOI:10.1109/EMBC46164.2021.9629697
PMID:34891749
Abstract

In early stage biomedical studies, small datasets are common due to the high cost and difficulty of sample collection with human subjects. This complicates the validation of machine learning models, which are best suited for large datasets. In this work, we examined feature selection techniques, validation frameworks, and learning curve fitting for small simulated datasets with known underlying discriminability, with the aim of identifying a protocol for estimating and interpreting early stage model performance and for planning future studies. Of a variety of examined validation configurations, a nested cross-validation framework provided the most accurate reflection of the selected features' discriminability, but the relevant features were often not properly identified during the feature selection stage for datasets with small sample sizes. Ultimately, we recommend that: (1) filter-based feature selection methods should be used to minimize overfitting to noise-based features, (2) statistical exploration should be conducted on datasets as a whole to estimate the level of discriminability and the feasibility of the classification problems, and (3) learning curves should be employed using nested cross-validation performance estimates for forecasting accuracy at larger sample sizes and estimating the required number of samples to converge towards best performance. This work should serve as a guideline for researchers incorporating machine learning in small-scale pilot studies.

摘要

在早期的生物医学研究中,由于人类样本采集的成本高、难度大,小数据集很常见。这使得机器学习模型的验证变得复杂,因为机器学习模型最适合于大数据集。在这项工作中,我们研究了特征选择技术、验证框架和针对具有已知可区分性的小型模拟数据集的学习曲线拟合,目的是确定一种用于估计和解释早期模型性能以及规划未来研究的方案。在所检查的各种验证配置中,嵌套交叉验证框架最能准确反映所选特征的可区分性,但对于样本量较小的数据集,在特征选择阶段,相关特征往往无法正确识别。最终,我们建议:(1)应使用基于过滤器的特征选择方法,以最小化对基于噪声的特征的过拟合;(2)应在整个数据集上进行统计探索,以估计可区分性水平和分类问题的可行性;(3)应使用嵌套交叉验证性能估计值来绘制学习曲线,以便预测更大样本量下的准确性,并估计达到最佳性能所需的样本数量。这项工作应该为研究人员在小规模试点研究中纳入机器学习提供指导。

相似文献

1
Machine Learning Model Validation for Early Stage Studies with Small Sample Sizes.机器学习模型在小样本量早期研究中的验证。
Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:2314-2319. doi: 10.1109/EMBC46164.2021.9629697.
2
Machine learning algorithm validation with a limited sample size.机器学习算法在有限样本量下的验证。
PLoS One. 2019 Nov 7;14(11):e0224365. doi: 10.1371/journal.pone.0224365. eCollection 2019.
3
Toward Generalizable Machine Learning Models in Speech, Language, and Hearing Sciences: Estimating Sample Size and Reducing Overfitting.迈向语音、语言和听力科学中的通用机器学习模型:估计样本量并减少过拟合。
J Speech Lang Hear Res. 2024 Mar 11;67(3):753-781. doi: 10.1044/2023_JSLHR-23-00273. Epub 2024 Feb 22.
4
Bias correction for selecting the minimal-error classifier from many machine learning models.从众多机器学习模型中选择最小错误分类器的偏差校正。
Bioinformatics. 2014 Nov 15;30(22):3152-8. doi: 10.1093/bioinformatics/btu520. Epub 2014 Aug 1.
5
GeFeS: A generalized wrapper feature selection approach for optimizing classification performance.GeFeS:一种用于优化分类性能的广义包装特征选择方法。
Comput Biol Med. 2020 Oct;125:103974. doi: 10.1016/j.compbiomed.2020.103974. Epub 2020 Aug 20.
6
Evaluation of a decided sample size in machine learning applications.机器学习应用中确定样本量的评估。
BMC Bioinformatics. 2023 Feb 14;24(1):48. doi: 10.1186/s12859-023-05156-9.
7
Integration of multiple types of genetic markers for neuroblastoma may contribute to improved prediction of the overall survival.多种类型的遗传标记物的整合可能有助于提高对总体生存的预测。
Biol Direct. 2018 Sep 20;13(1):17. doi: 10.1186/s13062-018-0222-9.
8
Quantifying performance of machine learning methods for neuroimaging data.量化机器学习方法在神经影像学数据中的性能。
Neuroimage. 2019 Oct 1;199:351-365. doi: 10.1016/j.neuroimage.2019.05.082. Epub 2019 Jun 5.
9
Impact of train/test sample regimen on performance estimate stability of machine learning in cardiovascular imaging.机器学习在心血管成像中,训练/测试样本方案对性能估计稳定性的影响。
Sci Rep. 2021 Jul 14;11(1):14490. doi: 10.1038/s41598-021-93651-5.
10
Cross-validation and out-of-sample testing of physical activity intensity predictions with a wrist-worn accelerometer.腕戴加速度计的体力活动强度预测的交叉验证和样本外测试。
J Appl Physiol (1985). 2018 May 1;124(5):1284-1293. doi: 10.1152/japplphysiol.00760.2017. Epub 2018 Jan 25.

引用本文的文献

1
UpStory: the uppsala storytelling dataset.UpStory:乌普萨拉故事数据集。
Front Robot AI. 2025 Jul 21;12:1547578. doi: 10.3389/frobt.2025.1547578. eCollection 2025.
2
Evaluating the Utility of Wearable Sensors for the Early Diagnosis of Parkinson Disease: Systematic Review.评估可穿戴传感器在帕金森病早期诊断中的效用:系统评价
J Med Internet Res. 2025 Jul 21;27:e69422. doi: 10.2196/69422.
3
Unraveling overoptimism and publication bias in ML-driven science.揭示机器学习驱动的科学中的过度乐观和发表偏倚。
Patterns (N Y). 2025 Feb 25;6(4):101185. doi: 10.1016/j.patter.2025.101185. eCollection 2025 Apr 11.
4
The influence of different factors on the bond strength of lithium disilicate-reinforced glass-ceramics to Resin: a machine learning analysis.不同因素对二硅酸锂增强微晶玻璃与树脂粘结强度的影响:机器学习分析
BMC Oral Health. 2025 Feb 18;25(1):256. doi: 10.1186/s12903-025-05590-6.
5
From pixels to prognosis: radiomics and AI in Alzheimer's disease management.从像素到预后:阿尔茨海默病管理中的放射组学与人工智能
Front Neurol. 2025 Jan 29;16:1536463. doi: 10.3389/fneur.2025.1536463. eCollection 2025.
6
Age group classification based on optical measurement of brain pulsation using machine learning.基于机器学习的脑搏动光学测量的年龄组分类
Sci Rep. 2025 Jan 25;15(1):3166. doi: 10.1038/s41598-025-87645-w.
7
Machine Learning Predicts Phenoconversion from Polysomnography in Isolated REM Sleep Behavior Disorder.机器学习通过多导睡眠图预测孤立性快速眼动睡眠行为障碍中的表型转换。
Brain Sci. 2024 Aug 28;14(9):871. doi: 10.3390/brainsci14090871.
8
Radiomic- and dosiomic-based clustering development for radio-induced neurotoxicity in pediatric medulloblastoma.基于放射组学和剂量组学的聚类开发用于儿童髓母细胞瘤的放射性神经毒性。
Childs Nerv Syst. 2024 Aug;40(8):2301-2310. doi: 10.1007/s00381-024-06416-6. Epub 2024 Apr 20.
9
Improving model transferability for clinical note section classification models using continued pretraining.使用持续预训练提高临床笔记部分分类模型的可转移性。
J Am Med Inform Assoc. 2023 Dec 22;31(1):89-97. doi: 10.1093/jamia/ocad190.
10
Patterns of risk-Using machine learning and structural neuroimaging to identify pedophilic offenders.风险模式——利用机器学习和结构性神经成像识别恋童癖罪犯。
Front Psychiatry. 2023 Apr 20;14:1001085. doi: 10.3389/fpsyt.2023.1001085. eCollection 2023.