Department of Orthopedic Surgery, Shanghai Sixth People's Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, No. 600 Yishan Road, Shanghai, 200233, China.
Shanghai Aitrox Technology Corporation Limited, Shanghai, China.
Int J Comput Assist Radiol Surg. 2023 Aug;18(8):1451-1458. doi: 10.1007/s11548-023-02838-w. Epub 2023 Jan 18.
The purpose of this study was to assess if radiologists assisted by deep learning (DL) algorithms can achieve diagnostic accuracy comparable to that of pre-surgical biopsies in benign-malignant differentiation of musculoskeletal tumors (MST).
We first conducted a systematic review of literature to get the respective overall diagnostic accuracies of fine-needle aspiration biopsy (FNAB) and core needle biopsy (CNB) in differentiating between benign and malignant MST, by synthesizing data from the articles meeting our inclusion criteria. To compared against the accuracies reported in literature, we then invited 4 radiologists, respectively with 2 (A), 6 (B), 7 (C), and 33 (D) years of experience in interpreting musculoskeletal MRI to perform diagnostic tests on our own dataset (n = 62), with and without assistance of a previously developed DL algorithm. The gold standard for benign-malignant differentiation was histopathologic confirmation or clinical/radiographic follow-up.
For FNAB, a meta-analysis containing 4604 samples met the inclusion criteria, with the overall diagnostic accuracy reported to be 0.77. For CNB, an overall accuracy of 0.86 was derived by synthesizing results from 7 original research articles containing a total of 587 samples. On our internal MST dataset, the invited radiologists, respectively, achieved diagnostic accuracies of 0.84 (A), 0.89 (B), 0.87 (C), and 0.90 (D), with the assistance of DL.
Use of DL algorithms on musculoskeletal dynamic contrast-enhanced MRI improved the benign-malignant differentiation accuracy of radiologists to a level comparable to that of pre-surgical biopsies. The developed DL algorithms have a potential to lower the risk of miss-diagnosing malignancy in radiological practice.
本研究旨在评估放射科医生是否可以借助深度学习(DL)算法,在肌肉骨骼肿瘤(MST)的良恶性鉴别中,达到与术前活检相当的诊断准确率。
我们首先进行了系统文献回顾,通过综合符合纳入标准的文章中的数据,获取细针抽吸活检(FNAB)和核心针活检(CNB)在鉴别良性和恶性 MST 方面的各自总体诊断准确率。为了与文献中报告的准确率进行比较,我们随后邀请了 4 名放射科医生,分别具有 2(A)、6(B)、7(C)和 33(D)年解读肌肉骨骼 MRI 的经验,在我们自己的数据集(n=62)上进行诊断测试,同时使用和不使用先前开发的 DL 算法。良恶性鉴别金标准为组织病理学证实或临床/影像学随访。
对于 FNAB,一项包含 4604 个样本的荟萃分析符合纳入标准,总体诊断准确率为 0.77。对于 CNB,通过综合来自 7 项包含总共 587 个样本的原始研究文章的结果,得出了 0.86 的总体准确率。在我们的内部 MST 数据集上,受邀的放射科医生分别在使用和不使用 DL 的情况下,获得了 0.84(A)、0.89(B)、0.87(C)和 0.90(D)的诊断准确率。
在肌肉骨骼动态对比增强 MRI 上使用 DL 算法可提高放射科医生的良恶性鉴别准确率,达到与术前活检相当的水平。所开发的 DL 算法有可能降低放射科实践中漏诊恶性肿瘤的风险。