现代建模技术对数据需求极大：一项用于预测二分结局的模拟研究。

Modern modelling techniques are data hungry: a simulation study for predicting dichotomous endpoints.

作者信息

van der Ploeg Tjeerd, Austin Peter C, Steyerberg Ewout W

机构信息

Department of Science, Medical Center Alkmaar/Inholland University, Alkmaar, The Netherlands.

出版信息

BMC Med Res Methodol. 2014 Dec 22;14:137. doi: 10.1186/1471-2288-14-137.

DOI:10.1186/1471-2288-14-137

PMID:25532820

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4289553/

Abstract

BACKGROUND

Modern modelling techniques may potentially provide more accurate predictions of binary outcomes than classical techniques. We aimed to study the predictive performance of different modelling techniques in relation to the effective sample size ("data hungriness").

METHODS

We performed simulation studies based on three clinical cohorts: 1282 patients with head and neck cancer (with 46.9% 5 year survival), 1731 patients with traumatic brain injury (22.3% 6 month mortality) and 3181 patients with minor head injury (7.6% with CT scan abnormalities). We compared three relatively modern modelling techniques: support vector machines (SVM), neural nets (NN), and random forests (RF) and two classical techniques: logistic regression (LR) and classification and regression trees (CART). We created three large artificial databases with 20 fold, 10 fold and 6 fold replication of subjects, where we generated dichotomous outcomes according to different underlying models. We applied each modelling technique to increasingly larger development parts (100 repetitions). The area under the ROC-curve (AUC) indicated the performance of each model in the development part and in an independent validation part. Data hungriness was defined by plateauing of AUC and small optimism (difference between the mean apparent AUC and the mean validated AUC <0.01).

RESULTS

We found that a stable AUC was reached by LR at approximately 20 to 50 events per variable, followed by CART, SVM, NN and RF models. Optimism decreased with increasing sample sizes and the same ranking of techniques. The RF, SVM and NN models showed instability and a high optimism even with >200 events per variable.

CONCLUSIONS

Modern modelling techniques such as SVM, NN and RF may need over 10 times as many events per variable to achieve a stable AUC and a small optimism than classical modelling techniques such as LR. This implies that such modern techniques should only be used in medical prediction problems if very large data sets are available.

摘要

背景

与传统技术相比，现代建模技术可能会对二元结局做出更准确的预测。我们旨在研究不同建模技术在有效样本量（“数据饥渴度”）方面的预测性能。

方法

我们基于三个临床队列进行了模拟研究：1282例头颈癌患者（5年生存率为46.9%）、1731例创伤性脑损伤患者（6个月死亡率为22.3%）和3181例轻度头部损伤患者（7.6%有CT扫描异常）。我们比较了三种相对现代的建模技术：支持向量机（SVM）、神经网络（NN）和随机森林（RF），以及两种传统技术：逻辑回归（LR）和分类与回归树（CART）。我们创建了三个大型人工数据库，其中受试者重复20倍、10倍和6倍，在这些数据库中，我们根据不同的基础模型生成二分结局。我们将每种建模技术应用于越来越大的开发部分（100次重复）。ROC曲线下面积（AUC）表明了每个模型在开发部分和独立验证部分的性能。数据饥渴度由AUC的平稳状态和较小的乐观度（平均表观AUC与平均验证AUC之间的差异<0.01）定义。

结果

我们发现，LR在每个变量约20至50个事件时达到稳定的AUC，其次是CART、SVM、NN和RF模型。乐观度随着样本量的增加而降低，技术排名相同。即使每个变量>200个事件，RF、SVM和NN模型仍表现出不稳定性和较高的乐观度。

结论

与LR等传统建模技术相比，SVM、NN和RF等现代建模技术可能需要每个变量10倍以上的事件才能实现稳定的AUC和较小的乐观度。这意味着，只有在有非常大的数据集时，此类现代技术才应应用于医学预测问题。

相似文献

Modern modelling techniques are data hungry: a simulation study for predicting dichotomous endpoints.

BMC Med Res Methodol. 2014 Dec 22;14:137. doi: 10.1186/1471-2288-14-137.

Modern modeling techniques had limited external validity in predicting mortality from traumatic brain injury.

J Clin Epidemiol. 2016 Oct;78:83-89. doi: 10.1016/j.jclinepi.2016.03.002. Epub 2016 Mar 14.

A comparison of three computational modelling methods for the prediction of virological response to combination HIV therapy.

Artif Intell Med. 2009 Sep;47(1):63-74. doi: 10.1016/j.artmed.2009.05.002. Epub 2009 Jun 12.

Prediction of survival with alternative modeling techniques using pseudo values.

PLoS One. 2014 Jun 20;9(6):e100234. doi: 10.1371/journal.pone.0100234. eCollection 2014.

Performance comparison between Logistic regression, decision trees, and multilayer perceptron in predicting peripheral neuropathy in type 2 diabetes mellitus.

Chin Med J (Engl). 2012 Mar;125(5):851-7.

Seminal quality prediction using data mining methods.

Technol Health Care. 2014;22(4):531-45. doi: 10.3233/THC-140816.

A reliable method for colorectal cancer prediction based on feature selection and support vector machine.

Med Biol Eng Comput. 2019 Apr;57(4):901-912. doi: 10.1007/s11517-018-1930-0. Epub 2018 Nov 26.

Machine Learning Models of Survival Prediction in Trauma Patients.

J Clin Med. 2019 Jun 5;8(6):799. doi: 10.3390/jcm8060799.

Prediction of intracranial findings on CT-scans by alternative modelling techniques.

BMC Med Res Methodol. 2011 Oct 25;11:143. doi: 10.1186/1471-2288-11-143.

A comparative study on feature selection for a risk prediction model for colorectal cancer.

Comput Methods Programs Biomed. 2019 Aug;177:219-229. doi: 10.1016/j.cmpb.2019.06.001. Epub 2019 Jun 4.

引用本文的文献

Construction of a predictive model for cognitive impairment among older adults in Northwest China.

Front Aging Neurosci. 2025 Jul 31;17:1487838. doi: 10.3389/fnagi.2025.1487838. eCollection 2025.

Prognostic models for large cell neuroendocrine lung carcinoma: a machine learning and regression approach.

Transl Lung Cancer Res. 2025 Jul 31;14(7):2470-2482. doi: 10.21037/tlcr-2025-130. Epub 2025 Jul 28.

Development and validation of a novel model based on clinical characteristics to predict natural disease course progression in patients with stricturing Crohn's disease.

Therap Adv Gastroenterol. 2025 Jul 28;18:17562848251358705. doi: 10.1177/17562848251358705. eCollection 2025.

Comparing variable and feature selection strategies for prediction - protocol of a simulation study in low-dimensional transplantation data.

PLoS One. 2025 Aug 1;20(8):e0328696. doi: 10.1371/journal.pone.0328696. eCollection 2025.

Herbify: an ensemble deep learning framework integrating convolutional neural networks and vision transformers for precise herb identification.

Plant Methods. 2025 Jul 27;21(1):104. doi: 10.1186/s13007-025-01421-5.

AI-based Assessment of Risk Factors for Coronary Heart Disease in Patients With Diabetes Mellitus and Construction of a Prediction Model for a Treatment Regimen.

Rev Cardiovasc Med. 2025 Jun 25;26(6):36293. doi: 10.31083/RCM36293. eCollection 2025 Jun.

A decomposition of Fisher's information to inform sample size for developing or updating fair and precise clinical prediction models for individual risk-part 1: binary outcomes.

Diagn Progn Res. 2025 Jul 8;9(1):14. doi: 10.1186/s41512-025-00193-9.

Predicting outcomes after moderate and severe traumatic brain injury using artificial intelligence: a systematic review.

NPJ Digit Med. 2025 Jun 18;8(1):373. doi: 10.1038/s41746-025-01714-y.

Risk prediction models for sarcopenia in elderly people: a systematic review and meta-analysis.

Front Med (Lausanne). 2025 Jun 2;12:1589583. doi: 10.3389/fmed.2025.1589583. eCollection 2025.

Cross-cohort analysis identifies shared gut microbial signatures and validates microbial risk scores for colorectal cancer.

J Transl Med. 2025 Jun 17;23(1):676. doi: 10.1186/s12967-025-06676-z.

本文引用的文献

Prediction of survival with alternative modeling techniques using pseudo values.

PLoS One. 2014 Jun 20;9(6):e100234. doi: 10.1371/journal.pone.0100234. eCollection 2014.

Prognosis Research Strategy (PROGRESS) 3: prognostic model research.

PLoS Med. 2013;10(2):e1001381. doi: 10.1371/journal.pmed.1001381. Epub 2013 Feb 5.

Regression trees for predicting mortality in patients with cardiovascular disease: what improvement is achieved by using ensemble-based methods?

Biom J. 2012 Sep;54(5):657-73. doi: 10.1002/bimj.201100251. Epub 2012 Jul 6.

Prediction of intracranial findings on CT-scans by alternative modelling techniques.

BMC Med Res Methodol. 2011 Oct 25;11:143. doi: 10.1186/1471-2288-11-143.

Assessment of claims of improved prediction beyond the Framingham risk score.

JAMA. 2009 Dec 2;302(21):2345-52. doi: 10.1001/jama.2009.1757.

Impact of comorbidity on short-term mortality and overall survival of head and neck cancer patients.

Head Neck. 2010 Jun;32(6):728-36. doi: 10.1002/hed.21245.

Developing a prognostic model for traumatic brain injury--a missed opportunity?

PLoS Med. 2008 Aug 5;5(8):e168. doi: 10.1371/journal.pmed.0050168.

IMPACT database of traumatic brain injury: design and description.

J Neurotrauma. 2007 Feb;24(2):239-50. doi: 10.1089/neu.2006.0036.

Artificial neural network models for prediction of acute coronary syndromes using clinical data from the time of presentation.

Ann Emerg Med. 2005 Nov;46(5):431-9. doi: 10.1016/j.annemergmed.2004.09.012.

Internal validation of predictive models: efficiency of some procedures for logistic regression analysis.

J Clin Epidemiol. 2001 Aug;54(8):774-81. doi: 10.1016/s0895-4356(01)00341-9.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

现代建模技术对数据需求极大：一项用于预测二分结局的模拟研究。

Modern modelling techniques are data hungry: a simulation study for predicting dichotomous endpoints.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献