LSC, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China.
Biostatistics and Bioinformatics Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland.
Biometrics. 2020 Dec;76(4):1147-1156. doi: 10.1111/biom.13236. Epub 2020 Mar 5.
This article concerns the problem of estimating a continuous distribution in a diseased or nondiseased population when only group-based test results on the disease status are available. The problem is challenging in that individual disease statuses are not observed and testing results are often subject to misclassification, with further complication that the misclassification may be differential as the group size and the number of the diseased individuals in the group vary. We propose a method to construct nonparametric estimation of the distribution and obtain its asymptotic properties. The performance of the distribution estimator is evaluated under various design considerations concerning group sizes and classification errors. The method is exemplified with data from the National Health and Nutrition Examination Survey study to estimate the distribution and diagnostic accuracy of C-reactive protein in blood samples in predicting chlamydia incidence.
本文研究了当仅获得基于群体的疾病状态检测结果时,如何对患病或未患病群体中的连续分布进行估计的问题。该问题具有挑战性,因为个体的疾病状态无法观察到,且检测结果通常存在错误分类,更复杂的是,当群体规模和群体中患病个体的数量发生变化时,错误分类可能存在差异。我们提出了一种方法来构建分布的非参数估计,并获得其渐近性质。该方法在考虑群体规模和分类错误的各种设计因素下,评估了分布估计器的性能。该方法通过来自国家健康和营养检查调查研究的数据示例,估计血液样本中 C-反应蛋白在预测衣原体发病率方面的分布和诊断准确性。