Department of Statistics, University of South Carolina, Columbia, SC, 29208, USA.
Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104, USA.
Stat Med. 2018 Feb 20;37(4):557-571. doi: 10.1002/sim.7530. Epub 2017 Nov 2.
Many disease diagnoses involve subjective judgments by qualified raters. For example, through the inspection of a mammogram, MRI, or ultrasound image, the clinician himself becomes part of the measuring instrument. To reduce diagnostic errors and improve the quality of diagnoses, it is necessary to assess raters' diagnostic skills and to improve their skills over time. This paper focuses on a subjective binary classification process, proposing a hierarchical model linking data on rater opinions with patient true disease-development outcomes. The model allows for the quantification of the effects of rater diagnostic skills (bias and magnifier) and patient latent disease severity on the rating results. A Bayesian Markov chain Monte Carlo (MCMC) algorithm is developed to estimate these parameters. Linking to patient true disease outcomes, the rater-specific sensitivity and specificity can be estimated using MCMC samples. Cost theory is used to identify poor- and strong-performing raters and to guide adjustment of rater bias and diagnostic magnifier to improve the rating performance. Furthermore, diagnostic magnifier is shown as a key parameter to present a rater's diagnostic ability because a rater with a larger diagnostic magnifier has a uniformly better receiver operating characteristic (ROC) curve when varying the value of diagnostic bias. A simulation study is conducted to evaluate the proposed methods, and the methods are illustrated with a mammography example.
许多疾病诊断都涉及到有资质的评估者进行主观判断。例如,通过对乳房 X 光片、MRI 或超声图像的检查,临床医生自身就成为了测量仪器的一部分。为了减少诊断错误,提高诊断质量,有必要评估评估者的诊断技能,并随着时间的推移提高他们的技能。本文专注于主观的二分类过程,提出了一个层次模型,将评估者意见的数据与患者真实的疾病发展结果联系起来。该模型允许量化评估者诊断技能(偏差和放大率)和患者潜在疾病严重程度对评分结果的影响。开发了一种贝叶斯马尔可夫链蒙特卡罗(MCMC)算法来估计这些参数。通过与患者真实的疾病结果相联系,可以使用 MCMC 样本估计评估者特定的敏感性和特异性。成本理论用于识别表现不佳和表现良好的评估者,并指导调整评估者偏差和诊断放大率,以提高评分性能。此外,诊断放大率被视为一个关键参数来展示评估者的诊断能力,因为当诊断偏差值变化时,具有较大诊断放大率的评估者具有更均匀的更好的接收器操作特征(ROC)曲线。进行了模拟研究来评估所提出的方法,并通过乳房 X 光摄影示例说明了这些方法。