Department of Radiology, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, Zhejiang, People's Republic of China.
Department of Radiological Sciences, University of California, Irvine, CA, USA.
Eur Radiol. 2022 Oct;32(10):6608-6618. doi: 10.1007/s00330-022-08899-w. Epub 2022 Jun 21.
To evaluate the diagnostic performance of Kaiser score (KS) adjusted with the apparent diffusion coefficient (ADC) (KS+) and machine learning (ML) modeling.
A dataset of 402 malignant and 257 benign lesions was identified. Two radiologists assigned the KS. If a lesion with KS > 4 had ADC > 1.4 × 10 mm/s, the KS was reduced by 4 to become KS+. In order to consider the full spectrum of ADC as a continuous variable, the KS and ADC values were used to train diagnostic models using 5 ML algorithms. The performance was evaluated using the ROC analysis, compared by the DeLong test. The sensitivity, specificity, and accuracy achieved using the threshold of KS > 4, KS+ > 4, and ADC ≤ 1.4 × 10 mm/s were obtained and compared by the McNemar test.
The ROC curves of KS, KS+, and all ML models had comparable AUC in the range of 0.883-0.921, significantly higher than that of ADC (0.837, p < 0.0001). The KS had sensitivity = 97.3% and specificity = 59.1%; and the KS+ had sensitivity = 95.5% with significantly improved specificity to 68.5% (p < 0.0001). However, when setting at the same sensitivity of 97.3%, KS+ could not improve specificity. In ML analysis, the logistic regression model had the best performance. At sensitivity = 97.3% and specificity = 65.3%, i.e., compared to KS, 16 false-positives may be avoided without affecting true cancer diagnosis (p = 0.0015).
Using dichotomized ADC to modify KS to KS+ can improve specificity, but at the price of lowered sensitivity. Machine learning algorithms may be applied to consider the ADC as a continuous variable to build more accurate diagnostic models.
• When using ADC to modify the Kaiser score to KS+, the diagnostic specificity according to the results of two independent readers was improved by 9.4-9.7%, at the price of slightly degraded sensitivity by 1.5-1.8%, and overall had improved accuracy by 2.6-2.9%. • When the KS and the continuous ADC values were combined to train models by machine learning algorithms, the diagnostic specificity achieved by the logistic regression model could be significantly improved from 59.1 to 65.3% (p = 0.0015), while maintaining at the high sensitivity of KS = 97.3%, and thus, the results demonstrated the potential of ML modeling to further evaluate the contribution of ADC. • When setting the sensitivity at the same levels, the modified KS+ and the original KS have comparable specificity; therefore, KS+ with consideration of ADC may not offer much practical help, and the original KS without ADC remains as an excellent robust diagnostic method.
评估 Kaiser 评分(KS)与表观扩散系数(ADC)调整(KS+)和机器学习(ML)模型的诊断性能。
确定了 402 个恶性和 257 个良性病变的数据集。两名放射科医生对 KS 进行了评估。如果 KS > 4 的病变 ADC > 1.4 × 10 mm/s,则 KS 降低 4 变为 KS+。为了考虑 ADC 作为连续变量的全部范围,使用 5 种 ML 算法使用 KS 和 ADC 值来训练诊断模型。使用 ROC 分析评估性能,并通过 DeLong 检验进行比较。通过 McNemar 检验获得并比较使用 KS > 4、KS+ > 4 和 ADC ≤ 1.4 × 10 mm/s 的阈值获得的灵敏度、特异性和准确性。
KS、KS+和所有 ML 模型的 ROC 曲线在 0.883-0.921 范围内具有可比的 AUC,明显高于 ADC(0.837,p < 0.0001)。KS 的灵敏度为 97.3%,特异性为 59.1%;KS+的灵敏度为 95.5%,特异性显著提高至 68.5%(p < 0.0001)。然而,当设定相同的灵敏度为 97.3%时,KS+不能提高特异性。在 ML 分析中,逻辑回归模型具有最佳性能。在灵敏度 = 97.3%和特异性 = 65.3%时,与 KS 相比,可能避免 16 个假阳性,而不会影响癌症的正确诊断(p = 0.0015)。
使用二分类 ADC 修改 KS 为 KS+可以提高特异性,但代价是敏感性略有降低。机器学习算法可用于将 ADC 视为连续变量,以构建更准确的诊断模型。
• 使用 ADC 修改 Kaiser 评分得到 KS+,根据两位独立读者的结果,诊断特异性提高了 9.4-9.7%,敏感性略有降低 1.5-1.8%,整体准确性提高了 2.6-2.9%。• 将 KS 和连续 ADC 值结合起来,通过机器学习算法进行模型训练,逻辑回归模型的诊断特异性可显著提高至 65.3%(p = 0.0015),同时保持 KS = 97.3%的高灵敏度,因此,结果表明 ML 建模有潜力进一步评估 ADC 的贡献。• 在设定相同灵敏度的情况下,修改后的 KS+和原始 KS 的特异性相当;因此,考虑 ADC 的 KS+并没有提供太多实际帮助,而没有 ADC 的原始 KS 仍然是一种出色的稳健诊断方法。