Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
Department of Development and Regeneration, KU Leuven, Leuven, Belgium.
Stat Med. 2019 Apr 30;38(9):1601-1619. doi: 10.1002/sim.8063. Epub 2019 Jan 6.
Multinomial Logistic Regression (MLR) has been advocated for developing clinical prediction models that distinguish between three or more unordered outcomes. We present a full-factorial simulation study to examine the predictive performance of MLR models in relation to the relative size of outcome categories, number of predictors and the number of events per variable. It is shown that MLR estimated by Maximum Likelihood yields overfitted prediction models in small to medium sized data. In most cases, the calibration and overall predictive performance of the multinomial prediction model is improved by using penalized MLR. Our simulation study also highlights the importance of events per variable in the multinomial context as well as the total sample size. As expected, our study demonstrates the need for optimism correction of the predictive performance measures when developing the multinomial logistic prediction model. We recommend the use of penalized MLR when prediction models are developed in small data sets or in medium sized data sets with a small total sample size (ie, when the sizes of the outcome categories are balanced). Finally, we present a case study in which we illustrate the development and validation of penalized and unpenalized multinomial prediction models for predicting malignancy of ovarian cancer.
多项逻辑回归(MLR)已被倡导用于开发能够区分三个或更多无序结果的临床预测模型。我们进行了全面析因模拟研究,以研究 MLR 模型在与结果类别相对大小、预测因子数量以及每个变量的事件数量的关系方面的预测性能。结果表明,最大似然法估计的 MLR 在中小规模数据中产生过度拟合的预测模型。在大多数情况下,通过使用惩罚 MLR 可以提高多项预测模型的校准和整体预测性能。我们的模拟研究还强调了在多项背景下每个变量的事件数量以及总样本量的重要性。正如预期的那样,我们的研究表明,在开发多项逻辑预测模型时,需要对预测性能指标进行乐观校正。当在小数据集或总样本量较小的中型数据集中(即,当结果类别大小平衡时)开发预测模型时,我们建议使用惩罚 MLR。最后,我们提出了一个案例研究,说明了惩罚和非惩罚多项预测模型在预测卵巢癌恶性程度方面的开发和验证。