Department of Electrical and Computer Engineering, Duke University, Durham, USA.
Department of Radiology, Duke University, Durham, USA.
Comput Biol Med. 2021 Jun;133:104334. doi: 10.1016/j.compbiomed.2021.104334. Epub 2021 Mar 23.
A fully-automated deep learning algorithm matched performance of radiologists in assessment of knee osteoarthritis severity in radiographs using the Kellgren-Lawrence grading system.
To develop an automated deep learning-based algorithm that jointly uses Posterior-Anterior (PA) and Lateral (LAT) views of knee radiographs to assess knee osteoarthritis severity according to the Kellgren-Lawrence grading system.
We used a dataset of 9739 exams from 2802 patients from Multicenter Osteoarthritis Study (MOST). The dataset was divided into a training set of 2040 patients, a validation set of 259 patients and a test set of 503 patients. A novel deep learning-based method was utilized for assessment of knee OA in two steps: (1) localization of knee joints in the images, (2) classification according to the KL grading system. Our method used both PA and LAT views as the input to the model. The scores generated by the algorithm were compared to the grades provided in the MOST dataset for the entire test set as well as grades provided by 5 radiologists at our institution for a subset of the test set.
The model obtained a multi-class accuracy of 71.90% on the entire test set when compared to the ratings provided in the MOST dataset. The quadratic weighted Kappa coefficient for this set was 0.9066. The average quadratic weighted Kappa between all pairs of radiologists from our institution who took part in the study was 0.748. The average quadratic-weighted Kappa between the algorithm and the radiologists at our institution was 0.769.
The proposed model performed demonstrated equivalency of KL classification to MSK radiologists, but clearly superior reproducibility. Our model also agreed with radiologists at our institution to the same extent as the radiologists with each other. The algorithm could be used to provide reproducible assessment of knee osteoarthritis severity.
一种全自动深度学习算法在使用 Kellgren-Lawrence 分级系统评估 X 光片膝关节骨关节炎严重程度方面的表现与放射科医生相当。
开发一种基于自动深度学习的算法,该算法联合使用膝关节 X 光片的后前位(PA)和侧位(LAT)视图,根据 Kellgren-Lawrence 分级系统评估膝关节骨关节炎的严重程度。
我们使用了来自多中心骨关节炎研究(MOST)的 2802 名患者的 9739 次检查的数据集。该数据集分为训练集(2040 名患者)、验证集(259 名患者)和测试集(503 名患者)。我们使用一种新的基于深度学习的方法分两步评估膝关节 OA:(1)在图像中定位膝关节,(2)根据 KL 分级系统进行分类。我们的方法将 PA 和 LAT 视图都作为模型的输入。算法生成的分数与 MOST 数据集中提供的整个测试集的分数以及我们机构的 5 名放射科医生在测试集的一个子集上提供的分数进行了比较。
与 MOST 数据集中提供的评分相比,该模型在整个测试集上的多类准确率为 71.90%。对于该数据集,该模型的二次加权 Kappa 系数为 0.9066。参与研究的我们机构的所有放射科医生之间的平均二次加权 Kappa 系数为 0.748。该算法与我们机构的放射科医生之间的平均二次加权 Kappa 系数为 0.769。
该模型的 KL 分类与 MSK 放射科医生相当,但具有明显更好的可重复性。我们的模型与我们机构的放射科医生的一致性与放射科医生之间的一致性相同。该算法可用于提供膝关节骨关节炎严重程度的可重复评估。