Business School, Hunan University, Changsha, Hunan, China.
School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China.
PLoS One. 2021 Apr 9;16(4):e0249359. doi: 10.1371/journal.pone.0249359. eCollection 2021.
A novel penalty for the proportional hazards model under the interval-censored failure time data structure is discussed, with which the subject of variable selection is rarely studied. The penalty comes from an idea to approximate some information criterion, e.g., the BIC or AIC, and the core process is to smooth the ℓ0 norm. Compared with usual regularization methods, the proposed approach is free of heavily time-consuming hyperparameter tuning. The efficiency is further improved by fitting the model and selecting variables in one step. To achieve this, sieve likelihood is introduced, which simultaneously estimates the coefficients and baseline cumulative hazards function. Furthermore, it is shown that the three desired properties for penalties, i.e., continuity, sparsity, and unbiasedness, are all guaranteed. Numerical results show that the proposed sparse estimation method is of great accuracy and efficiency. Finally, the method is used on data of Nigerian children and the key factors that have effects on child mortality are found.
讨论了一种新的比例风险模型惩罚项,该惩罚项适用于区间删失失效时间数据结构,很少有研究涉及到变量选择问题。该惩罚项来源于对某些信息准则(如 BIC 或 AIC)的近似思想,核心过程是对 ℓ0 范数进行平滑。与常用的正则化方法相比,所提出的方法无需进行大量耗时的超参数调整。通过一步拟合模型和选择变量,进一步提高了效率。为此,引入了筛选似然函数,它同时估计系数和基线累积风险函数。此外,还证明了惩罚项的三个理想特性,即连续性、稀疏性和无偏性,都是有保证的。数值结果表明,所提出的稀疏估计方法具有很高的准确性和效率。最后,该方法应用于尼日利亚儿童的数据,找到了影响儿童死亡率的关键因素。