Hung Truong Nguyen Khanh, Vy Vu Pham Thao, Tri Nguyen Minh, Hoang Le Ngoc, Tuan Le Van, Ho Quang Thai, Le Nguyen Quoc Khanh, Kang Jiunn-Horng
International Master/Ph.D. Program in Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan.
Department of Orthopedic and Trauma, Cho Ray Hospital, Ho Chi Minh City, Vietnam.
J Magn Reson Imaging. 2023 Mar;57(3):740-749. doi: 10.1002/jmri.28284. Epub 2022 Jun 1.
Timely diagnosis of meniscus injuries is key for preventing knee joint dysfunction and improving patient outcomes because it decreases morbidity and facilitates treatment planning.
To train and evaluate a deep learning model for automated detection of meniscus tears on knee magnetic resonance imaging (MRI).
Bicentric retrospective study.
In total, 584 knee MRI studies, divided among training (n = 234), testing (n = 200), and external validation (n = 150) data sets, were used in this study. The public data set MRNet was used as a second external validation data set to evaluate the performance of the model.
A 3 T, coronal, and sagittal images from T1-weighted proton density (PD) fast spin-echo (FSE) with fat saturation and T2-weighted FSE with fat saturation sequences.
The detection system for meniscus tear was based on the improved YOLOv4 model with Darknet-53 as the backbone. The performance of the model was also compared with that of three radiologists of varying levels of experience. The determination of the presence of a meniscus tear from surgery reports was used as the ground truth for the images.
Sensitivity, specificity, prevalence, positive predictive value, negative predictive value, accuracy, and receiver operating characteristic curve were used to evaluate the performance of the detection model. Two-way analysis of variance, Wilcoxon signed-rank test, and Tukey's multiple tests were used to evaluate differences in performance between the model and radiologists.
The overall accuracies for detecting meniscus tears using our model on the internal testing, internal validation, and external validation data sets were 95.4%, 95.8%, and 78.8%, respectively. One radiologist had significantly lower performance than our model in detecting meniscal tears (accuracy: 0.9025 ± 0.093 vs. 0.9580 ± 0.025).
The proposed model had high sensitivity, specificity, and accuracy for detecting meniscus tears on knee MRIs.
3 TECHNICAL EFFICACY: Stage 2.
半月板损伤的及时诊断是预防膝关节功能障碍和改善患者预后的关键,因为它能降低发病率并有助于治疗规划。
训练并评估一种用于在膝关节磁共振成像(MRI)上自动检测半月板撕裂的深度学习模型。
双中心回顾性研究。
本研究共使用了584例膝关节MRI检查,分为训练数据集(n = 234)、测试数据集(n = 200)和外部验证数据集(n = 150)。公开数据集MRNet用作第二个外部验证数据集以评估模型的性能。
来自具有脂肪抑制的T1加权质子密度(PD)快速自旋回波(FSE)序列以及具有脂肪抑制的T2加权FSE序列的3T冠状位和矢状位图像。
半月板撕裂检测系统基于以Darknet-53为骨干的改进YOLOv4模型。该模型的性能还与三位经验水平不同的放射科医生的性能进行了比较。根据手术报告确定半月板撕裂的存在作为图像的真实情况。
使用灵敏度、特异性、患病率、阳性预测值、阴性预测值、准确性和受试者操作特征曲线来评估检测模型的性能。使用双向方差分析、Wilcoxon符号秩检验和Tukey多重检验来评估模型与放射科医生之间性能的差异。
使用我们的模型在内部测试、内部验证和外部验证数据集上检测半月板撕裂的总体准确率分别为95.4%、95.8%和78.8%。一位放射科医生在检测半月板撕裂方面的表现明显低于我们的模型(准确率:0.9025±0.093 vs. 0.9580±0.025)。
所提出的模型在检测膝关节MRI上的半月板撕裂方面具有高灵敏度、特异性和准确性。
3 技术效能:2级