Suppr超能文献

具有变量选择和变换的多变量分数多项式模型的稳定性:一项自助法研究。

Stability of multivariable fractional polynomial models with selection of variables and transformations: a bootstrap investigation.

作者信息

Royston P, Sauerbrei W

机构信息

MRC Clinical Trials Unit, 222 Euston Road, London NW1 2DA, U.K.

出版信息

Stat Med. 2003 Feb 28;22(4):639-59. doi: 10.1002/sim.1310.

Abstract

Sauerbrei and Royston have recently described an algorithm, based on fractional polynomials, for the simultaneous selection of variables and of suitable transformations for continuous predictors in a multivariable regression setting. They illustrated the approach by analyses of two breast cancer data sets. Here we extend their work by considering how to assess possible instability in such multivariable fractional polynomial models. We first apply the algorithm repeatedly in many bootstrap replicates. We then use log-linear models to investigate dependencies among the inclusion fractions for each predictor and among the simplified classes of fractional polynomial function chosen in the bootstrap samples. To further evaluate the results, we define measures of instability based on a decomposition of the variability of the bootstrap-selected functions in relation to a reference function from the original model. For each data set we are able to identify large, reasonably stable subsets of the bootstrap replications in which the functional forms of the predictors appear fairly stable. Despite the considerable flexibility of the family of fractional polynomials and the consequent risk of overfitting when several variables are considered, we conclude that the multivariable selection algorithm can find stable models.

摘要

绍布雷和罗伊斯顿最近描述了一种基于分数多项式的算法,用于在多变量回归设置中同时选择变量以及为连续预测变量选择合适的变换。他们通过对两个乳腺癌数据集的分析阐述了该方法。在此,我们通过考虑如何评估此类多变量分数多项式模型中可能存在的不稳定性来扩展他们的工作。我们首先在多个自助重抽样重复样本中反复应用该算法。然后我们使用对数线性模型来研究每个预测变量的纳入比例之间以及在自助样本中选择的分数多项式函数的简化类别之间的依赖性。为了进一步评估结果,我们基于相对于原始模型中的参考函数对自助选择函数的变异性进行分解来定义不稳定性度量。对于每个数据集,我们都能够识别出自助重抽样中的大型、相对稳定的子集,其中预测变量的函数形式看起来相当稳定。尽管分数多项式族具有相当大的灵活性,并且在考虑多个变量时存在过度拟合的风险,但我们得出结论,多变量选择算法能够找到稳定的模型。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验