School of Economics and Statistics, Guangzhou University, Guangzhou, China.
Department of Statistics, University of South Carolina, Columbia, South Carolina.
Stat Med. 2019 Jul 20;38(16):3026-3039. doi: 10.1002/sim.8165. Epub 2019 Apr 29.
Censored failure time data with a cured subgroup is frequently encountered in many scientific areas including the cancer screening research, tumorigenicity studies, and sociological surveys. Meanwhile, one may also encounter an extraordinary large number of risk factors in practice, such as patient's demographic characteristics, clinical measurements, and medical history, which makes variable selection an emerging need in the data analysis. Motivated by a medical study on prostate cancer screening, we develop a variable selection method in the semiparametric nonmixture or promotion time cure model when interval-censored data with a cured subgroup are present. Specifically, we propose a penalized likelihood approach with the use of the least absolute shrinkage and selection operator, adaptive least absolute shrinkage and selection operator, or smoothly clipped absolute deviation penalties, which can be easily accomplished via a novel penalized expectation-maximization algorithm. We assess the finite-sample performance of the proposed methodology through extensive simulations and analyze the prostate cancer screening data for illustration.
在许多科学领域,包括癌症筛查研究、肿瘤发生研究和社会学调查中,经常会遇到带有治愈亚组的删失失效时间数据。同时,在实践中也可能会遇到大量的风险因素,如患者的人口统计学特征、临床测量和病史,这使得变量选择在数据分析中成为一种新兴的需求。受前列腺癌筛查医学研究的启发,我们在存在间隔删失数据和治愈亚组的半参数非混合或促进时间治愈模型中开发了一种变量选择方法。具体来说,我们提出了一种基于惩罚似然的方法,使用最小绝对值收缩和选择算子、自适应最小绝对值收缩和选择算子或平滑裁剪绝对偏差惩罚,这些方法可以通过一种新颖的惩罚期望最大化算法轻松实现。我们通过广泛的模拟评估了所提出方法的有限样本性能,并通过分析前列腺癌筛查数据来说明。