Jabari Koopaei Leila, Zamanzade Ehsan, Parvardeh Afshin, Wang Xinlei
Department of Statistics, Faculty of Mathematics and Statistics, University of Isfahan, Isfahan, Iran.
Department of Mathematics, University of Texas at Arlington, Arlington, Texas, USA.
Biom J. 2025 Apr;67(2):e70007. doi: 10.1002/bimj.70007.
The mean residual life (MRL) function plays an important role in the summary and analysis of survival data. The main advantage of this function is that it summarizes the information in units of time instead of a probability scale, which requires careful interpretation. Ranked set sampling (RSS) is a sampling technique designed for situations, where obtaining precise measurements of sample units is expensive or difficult, but ranking them without referring to their accurate values is cost-effective or easy. However, the practical application of RSS is hindered because each sample unit is required to assign a unique rank. To alleviate this difficulty, Frey developed a novel variation of RSS, called RSS-t, that records and utilizes the tie structure in the ranking process. In this paper, we propose several different nonparametric estimators for the MRL function based on RSS-t. Then, we compare the proposed estimators with their counterparts in simple random sampling (SRS) and RSS, where tie information is not utilized. We also implemented our proposed estimators on a real data set related to patient waiting times for liver transplantation, to show their applicability and efficiency in practice. Our results show that using ties information leads to an improved statistical inference for the MRL function, and therefore a smaller sample size is needed to reach a predetermined precision.
平均剩余寿命(MRL)函数在生存数据的汇总和分析中起着重要作用。该函数的主要优点是它以时间单位汇总信息,而不是概率尺度,这需要仔细解释。排序集抽样(RSS)是一种为这样的情况设计的抽样技术:在这种情况下,获得样本单元的精确测量成本高昂或困难,但在不参考其准确值的情况下对它们进行排序具有成本效益或容易。然而,RSS的实际应用受到阻碍,因为每个样本单元都需要分配一个唯一的秩。为了缓解这一困难,弗雷开发了一种RSS的新颖变体,称为RSS-t,它在排序过程中记录并利用了平局结构。在本文中,我们基于RSS-t为MRL函数提出了几种不同的非参数估计量。然后,我们将提出的估计量与其在简单随机抽样(SRS)和未利用平局信息的RSS中的对应估计量进行比较。我们还在与肝移植患者等待时间相关的真实数据集上实现了我们提出的估计量,以展示它们在实践中的适用性和效率。我们的结果表明,使用平局信息可以改进对MRL函数的统计推断,因此需要更小的样本量来达到预定的精度。