McMahan Christopher S, Tebbs Joshua M, Bilder Christopher R
Department of Statistics, University of South Carolina, Columbia, South Carolina 29208, USA.
Biometrics. 2012 Sep;68(3):793-804. doi: 10.1111/j.1541-0420.2011.01726.x. Epub 2011 Dec 29.
Array-based group-testing algorithms for case identification are widely used in infectious disease testing, drug discovery, and genetics. In this article, we generalize previous statistical work in array testing to account for heterogeneity among individuals being tested. We first derive closed-form expressions for the expected number of tests (efficiency) and misclassification probabilities (sensitivity, specificity, predictive values) for two-dimensional array testing in a heterogeneous population. We then propose two "informative" array construction techniques which exploit population heterogeneity in ways that can substantially improve testing efficiency when compared to classical approaches that regard the population as homogeneous. Furthermore, a useful byproduct of our methodology is that misclassification probabilities can be estimated on a per-individual basis. We illustrate our new procedures using chlamydia and gonorrhea testing data collected in Nebraska as part of the Infertility Prevention Project.
用于病例识别的基于阵列的分组检测算法广泛应用于传染病检测、药物发现和遗传学领域。在本文中,我们对先前阵列检测中的统计工作进行了推广,以考虑被检测个体之间的异质性。我们首先推导了异质人群中二维阵列检测的预期检测次数(效率)和错误分类概率(敏感性、特异性、预测值)的闭式表达式。然后,我们提出了两种“信息性”阵列构建技术,与将人群视为同质的经典方法相比,这两种技术利用人群异质性的方式可以大幅提高检测效率。此外,我们方法的一个有用副产品是错误分类概率可以在个体基础上进行估计。我们使用作为不孕预防项目一部分在内布拉斯加州收集的衣原体和淋病检测数据来说明我们的新程序。