1 Department of Electrical and Electronic Engineering, University of Los Andes, Carrera 1 Este no. 19A-40, Bogotá 11001, Colombia.
AJR Am J Roentgenol. 2014 Jun;202(6):1272-80. doi: 10.2214/AJR.13.11509.
The purpose of this study was to compare the diagnostic accuracy achieved with and without the calibration method established by the DICOM standard in both medical-grade gray-scale displays and consumer-grade color displays.
This study involved 76 cases, six radiologists, three displays, and two display calibrations for a total of 2736 observations in a multireader-multicase factorial design. The evaluated conditions were interstitial opacities, pneumothorax, and nodules. CT was adopted as the reference standard. One medical-grade gray-scale display and two consumer-grade color displays were evaluated. Analyses of ROC curves, diagnostic accuracy (measured as AUC), accuracy of condition classification, and false-positive and false-negative rate comparisons were performed. The degree of agreement between readers was also evaluated.
No significant differences in image quality perception by the readers in the presence or absence of calibration were observed. Similar forms of the ROC curves were observed. No significant differences were detected in the observed variables (diagnostic accuracy, accuracy of condition classification, false-positive rates, false-negative rates, and image-quality perception). Strong agreement between readers was also determined for each display with and without calibration.
For the chest conditions and selected observers included in this study, no significant differences were observed between the three evaluated displays with respect to accuracy performance with and without calibration.
本研究旨在比较在医学级灰度显示器和消费级彩色显示器中使用和不使用 DICOM 标准建立的校准方法所达到的诊断准确性。
本研究涉及 76 例病例、6 名放射科医生、3 种显示器和 2 种显示器校准,在多读者多病例析因设计中总共进行了 2736 次观察。评估的条件是间质混浊、气胸和结节。采用 CT 作为参考标准。评估了一种医学级灰度显示器和两种消费级彩色显示器。进行了 ROC 曲线分析、诊断准确性(以 AUC 衡量)、条件分类准确性以及假阳性和假阴性率比较。还评估了读者之间的一致性程度。
在存在或不存在校准的情况下,读者对图像质量的感知没有显著差异。观察到的 ROC 曲线形式相似。在观察变量(诊断准确性、条件分类准确性、假阳性率、假阴性率和图像质量感知)方面未检测到显著差异。对于每个显示器,无论是校准还是未校准,读者之间都存在强烈的一致性。
对于本研究中包含的胸部情况和选定的观察者,在准确性表现方面,三种评估的显示器在使用和不使用校准的情况下没有显著差异。