Department of Orthopaedic Surgery, Seoul National University Bundang Hospital, Seongnam-Si, Republic of Korea.
Department of Orthopaedic Surgery, Keimyung University Dongsan Hospital, Daegu, Republic of Korea.
BMC Musculoskelet Disord. 2023 Nov 8;24(1):869. doi: 10.1186/s12891-023-06951-4.
The Kellgren-Lawrence (KL) grading system is the most widely used method to classify the severity of osteoarthritis (OA) of the knee. However, due to ambiguity of terminology, the KL system showed inferior inter- and intra-observer reliability. For a more reliable evaluation, we recently developed novel deep learning (DL) software known as MediAI-OA to extract each radiographic feature of knee OA and to grade OA severity based on the KL system.
This research used data from the Osteoarthritis Initiative for training and validation of MediAI-OA. 44,193 radiographs and 810 radiographs were set as the training data and used as validation data, respectively. This AI model was developed to automatically quantify the degree of joint space narrowing (JSN) of medial and lateral tibiofemoral joint, to automatically detect osteophytes in four regions (medial distal femur, lateral distal femur, medial proximal tibia and lateral proximal tibia) of the knee joint, to classify the KL grade, and present the results of these three OA features together. The model was tested by using 400 test datasets, and the results were compared to the ground truth. The accuracy of the JSN quantification and osteophyte detection was evaluated. The KL grade classification performance was evaluated by precision, recall, F1 score, accuracy, and Cohen's kappa coefficient. In addition, we defined KL grade 2 or higher as clinically significant OA, and accuracy of OA diagnosis were obtained.
The mean squared error of JSN rate quantification was 0.067 and average osteophyte detection accuracy of the MediAI-OA was 0.84. The accuracy of KL grading was 0.83, and the kappa coefficient between the AI model and ground truth was 0.768, which demonstrated substantial consistency. The OA diagnosis accuracy of this software was 0.92.
The novel DL software known as MediAI-OA demonstrated satisfactory performance comparable to that of experienced orthopedic surgeons and radiologists for analyzing features of knee OA, KL grading and OA diagnosis. Therefore, reliable KL grading can be performed and the burden of the radiologist can be reduced by using MediAI-OA.
Kellgren-Lawrence(KL)分级系统是最广泛用于分类膝关节骨关节炎(OA)严重程度的方法。然而,由于术语模糊,KL 系统显示出较差的组内和组间可靠性。为了更可靠的评估,我们最近开发了一种新的深度学习(DL)软件,称为 MediAI-OA,用于提取膝关节 OA 的每个放射特征,并根据 KL 系统分级 OA 严重程度。
这项研究使用了来自 Osteoarthritis Initiative 的数据来训练和验证 MediAI-OA。44193 张射线照片和 810 张射线照片分别被设置为训练数据和验证数据。该 AI 模型旨在自动量化内侧和外侧胫股关节的关节间隙狭窄(JSN)程度,自动检测膝关节四个区域(内侧远端股骨、外侧远端股骨、内侧近端胫骨和外侧近端胫骨)的骨赘,对 KL 分级进行分类,并同时呈现这三个 OA 特征的结果。该模型使用 400 个测试数据集进行了测试,并将结果与真实值进行了比较。评估了 JSN 定量和骨赘检测的准确性。通过精度、召回率、F1 分数、准确性和 Cohen's kappa 系数评估 KL 分级分类性能。此外,我们将 KL 分级 2 或更高定义为临床显著 OA,并获得 OA 诊断的准确性。
JSN 率定量的均方误差为 0.067,MediAI-OA 的平均骨赘检测准确性为 0.84。KL 分级的准确性为 0.83,AI 模型与真实值之间的 Kappa 系数为 0.768,表明具有实质性一致性。该软件的 OA 诊断准确性为 0.92。
新型 DL 软件 MediAI-OA 表现出令人满意的性能,可与经验丰富的骨科医生和放射科医生相媲美,用于分析膝关节 OA、KL 分级和 OA 诊断的特征。因此,可以使用 MediAI-OA 进行可靠的 KL 分级,并减轻放射科医生的负担。