• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于秩的方法在大规模生存数据下使用半参数加速失效时间模型的最优抽样。

Optimal subsampling for semi-parametric accelerated failure time models with massive survival data using a rank-based approach.

机构信息

Department of Statistics, University of Connecticut, Storrs, Connecticut, USA.

出版信息

Stat Med. 2024 Oct 30;43(24):4650-4666. doi: 10.1002/sim.10200. Epub 2024 Aug 20.

DOI:10.1002/sim.10200
PMID:39162171
Abstract

Subsampling is a practical strategy for analyzing vast survival data, which are progressively encountered across diverse research domains. While the optimal subsampling method has been applied to inferences for Cox models and parametric accelerated failure time (AFT) models, its application to semi-parametric AFT models with rank-based estimation have received limited attention. The challenges arise from the non-smooth estimating function for regression coefficients and the seemingly zero contribution from censored observations in estimating functions in the commonly seen form. To address these challenges, we develop optimal subsampling probabilities for both event and censored observations by expressing the estimating functions through a well-defined stochastic process. Meanwhile, we apply an induced smoothing procedure to the non-smooth estimating functions. As the optimal subsampling probabilities depend on the unknown regression coefficients, we employ a two-step procedure to obtain a feasible estimation method. An additional benefit of the method is its ability to resolve the issue of underestimation of the variance when the subsample size approaches the full sample size. We validate the performance of our estimators through a simulation study and apply the methods to analyze the survival time of lymphoma patients in the surveillance, epidemiology, and end results program.

摘要

抽样是分析大规模生存数据的一种实用策略,这些数据在不同的研究领域中逐渐出现。虽然已经将最优抽样方法应用于 Cox 模型和参数加速失效时间 (AFT) 模型的推断中,但将其应用于基于秩的估计的半参数 AFT 模型却受到了有限的关注。挑战来自于回归系数的非平滑估计函数以及在常见形式的估计函数中,由于截尾观察值的贡献似乎为零。为了解决这些挑战,我们通过定义良好的随机过程来表示估计函数,从而为事件和截尾观察值开发最优抽样概率。同时,我们将诱导平滑过程应用于非平滑估计函数。由于最优抽样概率取决于未知的回归系数,因此我们采用两步法来获得可行的估计方法。该方法的另一个优点是,当子样本大小接近总样本大小时,它能够解决方差低估的问题。我们通过模拟研究验证了我们的估计量的性能,并将这些方法应用于分析监测、流行病学和结果计划中的淋巴瘤患者的生存时间。

相似文献

1
Optimal subsampling for semi-parametric accelerated failure time models with massive survival data using a rank-based approach.基于秩的方法在大规模生存数据下使用半参数加速失效时间模型的最优抽样。
Stat Med. 2024 Oct 30;43(24):4650-4666. doi: 10.1002/sim.10200. Epub 2024 Aug 20.
2
Optimal subsampling for parametric accelerated failure time models with massive survival data.针对大规模生存数据的参数加速失效时间模型的最优抽样。
Stat Med. 2022 Nov 30;41(27):5421-5431. doi: 10.1002/sim.9576. Epub 2022 Sep 20.
3
Sampling-based estimation for massive survival data with additive hazards model.基于抽样的加性风险模型在海量生存数据分析中的估计。
Stat Med. 2021 Jan 30;40(2):441-450. doi: 10.1002/sim.8783. Epub 2020 Nov 3.
4
Regression models for interval censored data using parametric pseudo-observations.使用参数伪观测值对区间 censored 数据进行回归模型分析。
BMC Med Res Methodol. 2021 Feb 15;21(1):36. doi: 10.1186/s12874-021-01227-8.
5
A Monte Carlo method for variance estimation for estimators based on induced smoothing.一种基于诱导平滑的估计量方差估计的蒙特卡罗方法。
Biostatistics. 2015 Jan;16(1):179-88. doi: 10.1093/biostatistics/kxu021. Epub 2014 May 7.
6
Residual plots to reveal the functional form for covariates in parametric accelerated failure time models.用于揭示参数加速失效时间模型中协变量函数形式的残差图。
Lifetime Data Anal. 2015 Jul;21(3):353-78. doi: 10.1007/s10985-014-9311-y. Epub 2014 Oct 16.
7
Semiparametric additive time-varying coefficients model for longitudinal data with censored time origin.具有截尾时间原点的纵向数据的半参数加性时变系数模型。
Biometrics. 2023 Jun;79(2):695-710. doi: 10.1111/biom.13610. Epub 2022 Mar 20.
8
Accelerated failure time modeling via nonparametric mixtures.基于非参数混合模型的加速失效时间建模。
Biometrics. 2023 Mar;79(1):165-177. doi: 10.1111/biom.13556. Epub 2021 Sep 20.
9
Geographically weighted accelerated failure time model for spatial survival data: application to ovarian cancer survival data in New Jersey.地理加权加速失效时间模型在空间生存数据分析中的应用:以新泽西州卵巢癌生存数据为例。
BMC Med Res Methodol. 2024 Oct 15;24(1):239. doi: 10.1186/s12874-024-02346-8.
10
Bayesian variable selection in the accelerated failure time model with an application to the surveillance, epidemiology, and end results breast cancer data.贝叶斯变量选择在加速失效时间模型中的应用,及其在监测、流行病学和最终结果乳腺癌数据中的应用。
Stat Methods Med Res. 2018 Apr;27(4):971-990. doi: 10.1177/0962280215626947. Epub 2016 Jul 20.