Department of Statistics, National Chengchi University, Taipei, Taiwan, ROC.
BMC Bioinformatics. 2024 Aug 13;25(1):265. doi: 10.1186/s12859-024-05831-5.
Survival analysis has been used to characterize the time-to-event data. In medical studies, a typical application is to analyze the survival time of specific cancers by using high-dimensional gene expressions. The main challenges include the involvement of non-informaive gene expressions and possibly nonlinear relationship between survival time and gene expressions. Moreover, due to possibly imprecise data collection or wrong record, measurement error might be ubiquitous in the survival time and its censoring status. Ignoring measurement error effects may incur biased estimator and wrong conclusion.
To tackle those challenges and derive a reliable estimation with efficiently computational implementation, we develop the R package AFFECT, which is referred to Accelerated Functional Failure time model with Error-Contaminated survival Times.
This package aims to correct for measurement error effects in survival times and implements a boosting algorithm under corrected data to determine informative gene expressions as well as derive the corresponding nonlinear functions.
生存分析已被用于描述事件时间数据。在医学研究中,一个典型的应用是通过使用高维基因表达来分析特定癌症的生存时间。主要挑战包括非信息基因表达的参与以及生存时间和基因表达之间可能存在的非线性关系。此外,由于数据收集可能不精确或记录错误,生存时间及其删失状态可能普遍存在测量误差。忽略测量误差的影响可能会导致有偏差的估计值和错误的结论。
为了解决这些挑战,并以高效的计算实现可靠的估计,我们开发了 R 包 AFFECT,它是指带有误差污染生存时间的加速功能失效时间模型。
该软件包旨在纠正生存时间中的测量误差影响,并在纠正数据下实施提升算法,以确定有信息的基因表达,并得出相应的非线性函数。