
On estimation for accelerated failure time models with small or rare event survival data.

Affiliations

Institute of Statistical Research and Training, University of Dhaka, Dhaka, Bangladesh.

Department of Statistics, University of Dhaka, Dhaka, Bangladesh.

Publication information

BMC Med Res Methodol. 2022 Jun 11;22(1):169. doi: 10.1186/s12874-022-01638-1.

DOI:10.1186/s12874-022-01638-1
PMID:35689190
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9188212/
Abstract

BACKGROUND

Separation or monotone likelihood can arise when the accelerated failure time (AFT) model is fitted by maximum likelihood if the sample size is small and/or the censoring rate is high (rare events), or if the model contains at least one strong covariate. The result is an infinite estimate of at least one regression coefficient.

METHODS

This paper investigated the properties of the maximum likelihood estimator (MLE) of the regression parameters of AFT models in small-sample and/or rare-event situations and addressed the problems by introducing a penalized likelihood approach. The penalized likelihood function and the corresponding score equations are derived by adding to the existing likelihood function a penalty term originally proposed by Firth (Biometrika, 1993) for exponential family models. Further, a post-hoc adjustment of the intercept and scale parameters, which keeps them out of penalization, is discussed to ensure accurate prediction of survival probability. The penalized method is illustrated for the widely used log-location-scale family models (Weibull, log-normal and log-logistic distributions), and the models and methods are compared using an extensive simulation study.
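For orientation, the general form of Firth's penalty can be written as follows. This is a sketch in assumed notation ($\theta$ for the parameter vector, $I(\theta)$ for the information matrix, $\varepsilon_i$ for the error term), not a transcription of the paper's equations; in particular, the abstract states that the intercept and scale are kept out of penalization, which this generic form does not show.

```latex
% AFT model in log-location-scale form (assumed notation)
\log T_i = \beta_0 + x_i^{\top}\beta + \sigma\,\varepsilon_i

% Firth-type penalized log-likelihood
\ell^{*}(\theta) = \ell(\theta) + \tfrac{1}{2}\,\log\bigl|I(\theta)\bigr|

% Corresponding modified score for the r-th parameter
U_r^{*}(\theta) = U_r(\theta)
  + \tfrac{1}{2}\,\operatorname{tr}\!\left\{ I(\theta)^{-1}\,
    \frac{\partial I(\theta)}{\partial \theta_r} \right\}
```

Under separation the penalty term tends to minus infinity as a coefficient diverges, which is what keeps the penalized maximizer finite.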

RESULTS

The simulation study, performed separately for each of the log-location-scale models, showed that Firth's penalized likelihood succeeded in solving the problem of separation and achieving convergence, providing finite estimates of the regression coefficients, which the MLE often cannot provide. Furthermore, the proposed penalized method showed substantial improvement over the MLE, with smaller bias and mean squared error (MSE), narrower confidence intervals and reasonably accurate prediction of survival probabilities. The methods are illustrated using prostate cancer data in which separation exists, and the results supported the simulation findings.
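The contrast between a diverging MLE and a finite penalized estimate can be reproduced on a toy problem. The sketch below is not the authors' implementation: it assumes an exponential AFT model (a Weibull AFT model with the scale fixed at 1), uses the observed information in the penalty, penalizes both the intercept and the slope, and runs on hypothetical data in which every subject with x = 1 is censored. The names `DATA`, `profile_loglik` and `firth_fit` are illustrative.

```python
import math

# Hypothetical toy data: (time, event, x). Every subject with x = 1 is censored,
# so the ordinary likelihood is monotone in the coefficient of x.
DATA = [(2.0, 1, 0), (3.5, 1, 0), (1.2, 1, 0), (4.1, 0, 0), (2.8, 1, 0),
        (5.0, 0, 1), (6.3, 0, 1), (4.7, 0, 1), (7.2, 0, 1), (5.5, 0, 1)]


def profile_loglik(b1, data=DATA):
    """Exponential AFT log-likelihood at b1, maximized over the intercept b0."""
    n_events = sum(d for _, d, _ in data)
    exposure = sum(t * math.exp(-b1 * x) for t, _, x in data)
    b0 = math.log(exposure / n_events)  # closed-form maximizer for fixed b1
    return sum(-d * (b0 + b1 * x) - t * math.exp(-(b0 + b1 * x))
               for t, d, x in data)


def firth_fit(data=DATA, max_iter=100, tol=1e-10):
    """Newton iterations on the Firth-modified score (observed information)."""
    b0, b1 = 0.0, 0.0
    for _ in range(max_iter):
        w = [t * math.exp(-(b0 + b1 * x)) for t, _, x in data]
        # Information matrix I = X' diag(w) X for design columns (1, x).
        i00 = sum(w)
        i01 = sum(wi * x for wi, (_, _, x) in zip(w, data))
        i11 = sum(wi * x * x for wi, (_, _, x) in zip(w, data))
        det = i00 * i11 - i01 * i01
        v00, v01, v11 = i11 / det, -i01 / det, i00 / det  # entries of I^{-1}
        # Modified score: sum_i x_i * (w_i - d_i - h_i / 2), h_i = leverage.
        u0 = u1 = 0.0
        for wi, (_, d, x) in zip(w, data):
            h = wi * (v00 + 2.0 * v01 * x + v11 * x * x)
            r = wi - d - 0.5 * h
            u0 += r
            u1 += r * x
        step0 = v00 * u0 + v01 * u1
        step1 = v01 * u0 + v11 * u1
        b0, b1 = b0 + step0, b1 + step1
        if abs(step0) + abs(step1) < tol:
            break
    return b0, b1


if __name__ == "__main__":
    # The profile log-likelihood keeps rising with b1: no finite MLE exists.
    for b1 in (2.0, 5.0, 10.0, 20.0):
        print(b1, profile_loglik(b1))
    print(firth_fit())  # finite penalized estimates
```

In this two-group toy case, under the assumptions above, the penalized fit behaves as if half an event had been added to each group, so the slope estimate stays finite (about 2.94 here) even though the x = 1 group has no observed events.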

CONCLUSION

When the sample size is small (≤ 50) or the event is rare (i.e., the censoring proportion is high) and/or there is any evidence of separation in the data, we recommend using Firth's penalized likelihood method to fit the AFT model.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b46d/9188212/b84b8497f4fe/12874_2022_1638_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b46d/9188212/9bc07ad2a299/12874_2022_1638_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b46d/9188212/fa8ffa14272f/12874_2022_1638_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b46d/9188212/4d0d6f7bc7fe/12874_2022_1638_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b46d/9188212/42e6dca7b582/12874_2022_1638_Fig5_HTML.jpg

Similar articles

1
On estimation for accelerated failure time models with small or rare event survival data.
BMC Med Res Methodol. 2022 Jun 11;22(1):169. doi: 10.1186/s12874-022-01638-1.
2
Firth's logistic regression with rare events: accurate effect estimates and predictions?
Stat Med. 2017 Jun 30;36(14):2302-2317. doi: 10.1002/sim.7273. Epub 2017 Mar 12.
3
Performance of Firth- and logF-type penalized methods in risk prediction for small or sparse binary data.
BMC Med Res Methodol. 2017 Feb 23;17(1):33. doi: 10.1186/s12874-017-0313-9.
4
Penalized maximum likelihood inference under the mixture cure model in sparse data.
Stat Med. 2023 Jun 15;42(13):2134-2161. doi: 10.1002/sim.9715. Epub 2023 Mar 25.
5
Dealing with separation or near-to-separation in the model for multinomial response with application to childhood health seeking behavior data from a complex survey.
J Appl Stat. 2021 Sep 17;49(16):4254-4277. doi: 10.1080/02664763.2021.1977260. eCollection 2022.
6
Bring More Data!-A Good Advice? Removing Separation in Logistic Regression by Increasing Sample Size.
Int J Environ Res Public Health. 2019 Nov 22;16(23):4658. doi: 10.3390/ijerph16234658.
7
Information criteria for Firth's penalized partial likelihood approach in Cox regression models.
Stat Med. 2017 Sep 20;36(21):3422-3436. doi: 10.1002/sim.7368. Epub 2017 Jun 12.
8
To tune or not to tune, a case study of ridge logistic regression in small or sparse datasets.
BMC Med Res Methodol. 2021 Sep 30;21(1):199. doi: 10.1186/s12874-021-01374-y.
9
On hazard-based penalized likelihood estimation of accelerated failure time model with partly interval censoring.
Stat Methods Med Res. 2020 Dec;29(12):3804-3817. doi: 10.1177/0962280220942555. Epub 2020 Jul 20.
10
A Solution to Separation and Multicollinearity in Multiple Logistic Regression.
J Data Sci. 2008 Oct 1;6(4):515-531.

Cited by

1
Marital status and accidental mortality in male and female patients with cancer: results from the population-based cohort study.
BMC Public Health. 2025 Jul 2;25(1):2251. doi: 10.1186/s12889-025-23499-1.
2
Clinical Characteristics, Treatment and Prognosis of Primary Tracheal Adenoid Cystic Carcinoma: A Multicenter Retrospective Study.
Cancer Med. 2025 Apr;14(8):e70877. doi: 10.1002/cam4.70877.
3
ADAPT: Analysis of Microbiome Differential Abundance by Pooling Tobit Models.
Bioinformatics. 2024 Nov 1;40(11). doi: 10.1093/bioinformatics/btae661.
4
Early life adversity has sex-dependent effects on survival across the lifespan in rhesus macaques.
Philos Trans R Soc Lond B Biol Sci. 2024 Dec 16;379(1916):20220456. doi: 10.1098/rstb.2022.0456. Epub 2024 Oct 28.
5
AFFECT: an R package for accelerated functional failure time model with error-contaminated survival times and applications to gene expression data.
BMC Bioinformatics. 2024 Aug 13;25(1):265. doi: 10.1186/s12859-024-05831-5.
6
ADAPT: Analysis of Microbiome Differential Abundance by Pooling Tobit Models.
bioRxiv. 2024 May 17:2024.05.14.594186. doi: 10.1101/2024.05.14.594186.
7
Relative Burden of Cancer and Noncancer Mortality Among Long-Term Survivors of Breast, Prostate, and Colorectal Cancer in the US.
JAMA Netw Open. 2023 Jul 3;6(7):e2323115. doi: 10.1001/jamanetworkopen.2023.23115.
8
Does high [18F]FDG uptake always mean poor prognosis? Colon cancer with high-level microsatellite instability is associated with high [18F]FDG uptake on PET/CT.
Eur Radiol. 2023 Nov;33(11):7450-7460. doi: 10.1007/s00330-023-09832-5. Epub 2023 Jun 20.

References

1
Bias-reduced and separation-proof GEE with small or sparse longitudinal binary data.
Stat Med. 2019 Jun 30;38(14):2544-2560. doi: 10.1002/sim.8126. Epub 2019 Feb 22.
2
Firth's logistic regression with rare events: accurate effect estimates and predictions?
Stat Med. 2017 Jun 30;36(14):2302-2317. doi: 10.1002/sim.7273. Epub 2017 Mar 12.
3
A solution to the problem of separation in logistic regression.
Stat Med. 2002 Aug 30;21(16):2409-19. doi: 10.1002/sim.1047.
4
A solution to the problem of monotone likelihood in Cox regression.
Biometrics. 2001 Mar;57(1):114-9. doi: 10.1111/j.0006-341x.2001.00114.x.
5
The choice of treatment for cancer patients based on covariate information.
Bull Cancer. 1980;67(4):477-90.
6
Bias correction in maximum likelihood logistic regression.
Stat Med. 1983 Jan-Mar;2(1):71-8. doi: 10.1002/sim.4780020108.
7
Treatment effects in competing-risks analysis of prostate cancer data.
Biometrics. 1986 Mar;42(1):203-11.