Inácio de Carvalho Vanda, de Carvalho Miguel, Branscum Adam J
School of Mathematics, University of Edinburgh, Scotland, U.K.
College of Public Health and Human Sciences, Oregon State University, Oregon, U.S.A.
Biometrics. 2017 Dec;73(4):1279-1288. doi: 10.1111/biom.12686. Epub 2017 Apr 4.
A novel nonparametric regression model is developed for evaluating the covariate-specific accuracy of a continuous biological marker. Accurately screening diseased from nondiseased individuals and correctly diagnosing disease stage are critically important to health care on several fronts, including guiding recommendations about combinations of treatments and their intensities. The accuracy of a continuous medical test or biomarker varies by the cutoff threshold (c) used to infer disease status. Accuracy can be measured by the probability of testing positive for diseased individuals (the true positive probability or sensitivity, Se(c), of the test), and the true negative probability (specificity, Sp(c)) of the test. A commonly used summary measure of test accuracy is the Youden index, YI=max{Se(c)+Sp(c)-1:c∈ℝ}, which is popular due in part to its ease of interpretation and relevance to population health research. In addition, clinical practitioners benefit from having an estimate of the optimal cutoff that maximizes sensitivity plus specificity available as a byproduct of estimating YI. We develop a highly flexible nonparametric model to estimate YI and its associated optimal cutoff that can respond to unanticipated skewness, multimodality, and other complexities because data distributions are modeled using dependent Dirichlet process mixtures. Important theoretical results on the support properties of the model are detailed. Inferences are available for the covariate-specific Youden index and its corresponding optimal cutoff threshold. The value of our nonparametric regression model is illustrated using multiple simulation studies and data on the age-specific accuracy of glucose as a biomarker of diabetes.
开发了一种新型非参数回归模型,用于评估连续生物标志物的协变量特异性准确性。从非患病个体中准确筛查出患病个体并正确诊断疾病阶段,在多个方面对医疗保健至关重要,包括指导有关治疗组合及其强度的建议。连续医学检测或生物标志物的准确性会因用于推断疾病状态的截断阈值(c)而异。准确性可以通过患病个体检测呈阳性的概率(检测的真阳性概率或灵敏度,Se(c))以及检测的真阴性概率(特异性,Sp(c))来衡量。一种常用的检测准确性汇总指标是约登指数,YI = max{Se(c)+Sp(c)-1:c∈ℝ},它之所以受欢迎,部分原因在于其易于解释且与人群健康研究相关。此外,临床医生受益于能够获得作为估计YI副产品的使灵敏度加特异性最大化的最佳截断值估计。我们开发了一种高度灵活的非参数模型来估计YI及其相关的最佳截断值,该模型可以应对意外的偏度、多峰性和其他复杂性,因为数据分布是使用相依狄利克雷过程混合进行建模的。详细阐述了关于该模型支持属性的重要理论结果。可对协变量特异性约登指数及其相应的最佳截断阈值进行推断。使用多项模拟研究以及关于作为糖尿病生物标志物的血糖的年龄特异性准确性的数据,说明了我们非参数回归模型的价值。