Department of Orthopaedics and Trauma, Medical University of Graz, Auenbruggerplatz 5, 8036, Graz, Austria.
ImageBiopsy Lab, Zehetnergasse 6/2/2, 1140, Vienna, Austria.
Knee Surg Sports Traumatol Arthrosc. 2023 Mar;31(3):1053-1062. doi: 10.1007/s00167-022-07220-y. Epub 2022 Nov 11.
The aims of this study were to (1) analyze the impact of an artificial intelligence (AI)-based computer system on the accuracy and agreement rate of board-certified orthopaedic surgeons (= senior readers) to detect X-ray features indicative of knee OA in comparison to unaided assessment and (2) compare the results to those of senior residents (= junior readers).
One hundred and twenty-four unilateral knee X-rays from the OAI study were analyzed regarding Kellgren-Lawrence grade, joint space narrowing (JSN), sclerosis and osteophyte OARSI grade by computerized methods. Images were rated for these parameters by three senior readers using two modalities: plain X-ray (unaided) and X-ray presented alongside reports from a computer-assisted detection system (aided). After exclusion of nine images with incomplete annotation, intraclass correlations between readers were calculated for both modalities among 115 images, and reader performance was compared to ground truth (OAI consensus). Accuracy, sensitivity and specificity were also calculated and the results were compared to those from a previous study on junior readers.
With the aided modality, senior reader agreement rates for KL grade (2.0-fold), sclerosis (1.42-fold), JSN (1.37-fold) and osteophyte OARSI grades (3.33-fold) improved significantly. Reader specificity and accuracy increased significantly for all features when using the aided modality compared to the gold standard. On the other hand, sensitivity only increased for OA diagnosis, whereas it decreased (without statistical significance) for all other features. With aided analysis, senior readers reached similar agreement and accuracy rates as junior readers, with both surpassing AI performance.
The introduction of AI-based computer-aided assessment systems can increase the agreement rate and overall accuracy for knee OA diagnosis among board-certified orthopaedic surgeons. Thus, use of this software may improve the standard of care for knee OA detection and diagnosis in the future.
Level II.
本研究旨在:(1) 分析基于人工智能 (AI) 的计算机系统对检测膝关节骨关节炎 (OA) 相关 X 射线特征的准确率和一致性的影响,比较其与非辅助评估结果的差异,并与资深读者的结果进行比较。
对 OAI 研究中的 124 例单侧膝关节 X 射线进行分析,评估 K-L 分级、关节间隙狭窄 (JSN)、硬化和骨赘 OARSI 分级。通过计算机方法对这些参数进行图像评分。三位资深读者使用两种模式(普通 X 射线(非辅助)和 X 射线,附有计算机辅助检测系统报告)对这些参数进行评分。在排除了 9 张图像标注不完整的图像后,对 115 张图像进行了两种模式下读者间的组内相关系数计算,并将读者的表现与金标准(OAI 共识)进行了比较。还计算了准确率、敏感度和特异度,并与之前对初级读者的研究结果进行了比较。
在辅助模式下,高级读者对 KL 分级(2 倍)、硬化(1.42 倍)、JSN(1.37 倍)和骨赘 OARSI 分级(3.33 倍)的一致性显著提高。与金标准相比,使用辅助模式时,所有特征的读者特异性和准确率均显著提高。另一方面,仅 OA 诊断的敏感性增加,而其他特征的敏感性则降低(无统计学意义)。使用辅助分析,高级读者的一致性和准确率与初级读者相似,两者均优于 AI 性能。
引入基于 AI 的计算机辅助评估系统可以提高骨科认证医师对膝关节 OA 诊断的一致性和总体准确率。因此,未来这种软件的使用可能会提高膝关节 OA 检测和诊断的护理标准。
2 级。