Department of Statistical Science, Southern Methodist University, Dallas, Texas, USA.
Department of Statistics, University of Johannesburg, Johannesburg, South Africa.
Stat Med. 2018 Nov 10;37(25):3679-3692. doi: 10.1002/sim.7858. Epub 2018 Jul 12.
It is important to properly correct for measurement error when estimating density functions associated with biomedical variables. These estimators that adjust for measurement error are broadly referred to as density deconvolution estimators. While most methods in the literature assume the distribution of the measurement error to be fully known, a recently proposed method based on the empirical phase function (EPF) can deal with the situation when the measurement error distribution is unknown. The EPF density estimator has only been considered in the context of additive and homoscedastic measurement error; however, the measurement error of many biomedical variables is heteroscedastic in nature. In this paper, we developed a phase function approach for density deconvolution when the measurement error has unknown distribution and is heteroscedastic. A weighted EPF (WEPF) is proposed where the weights are used to adjust for heteroscedasticity of measurement error. The asymptotic properties of the WEPF estimator are evaluated. Simulation results show that the weighting can result in large decreases in mean integrated squared error when estimating the phase function. The estimation of the weights from replicate observations is also discussed. Finally, the construction of a deconvolution density estimator using the WEPF is compared with an existing deconvolution estimator that adjusts for heteroscedasticity but assumes the measurement error distribution to be fully known. The WEPF estimator proves to be competitive, especially when considering that it relies on minimal assumption of the distribution of measurement error.
当估计与生物医学变量相关的密度函数时,正确校正测量误差非常重要。这些用于校正测量误差的估计器通常被称为密度反卷积估计器。尽管文献中的大多数方法都假设测量误差的分布是完全已知的,但最近提出的基于经验相位函数 (EPF) 的方法可以处理测量误差分布未知的情况。EPF 密度估计器仅在加性和同方差测量误差的情况下被考虑;然而,许多生物医学变量的测量误差本质上是异方差的。在本文中,我们开发了一种用于测量误差分布未知且异方差的密度反卷积的相位函数方法。提出了一种加权 EPF (WEPF),其中权重用于调整测量误差的异方差性。评估了 WEPF 估计器的渐近性质。模拟结果表明,在估计相位函数时,加权可以大大降低平均集成平方误差。还讨论了从重复观测中估计权重的问题。最后,比较了使用 WEPF 构建的反卷积密度估计器与另一种用于校正异方差但假设测量误差分布完全已知的现有反卷积估计器。WEPF 估计器证明是有竞争力的,特别是考虑到它依赖于对测量误差分布的最小假设。