Suppr超能文献

深度学习软件能否提高不同经验水平的放射科医生在评估双参数前列腺MRI时的一致性和表现?

Does deep learning software improve the consistency and performance of radiologists with various levels of experience in assessing bi-parametric prostate MRI?

作者信息

Arslan Aydan, Alis Deniz, Erdemli Servet, Seker Mustafa Ege, Zeybel Gokberk, Sirolu Sabri, Kurtcan Serpil, Karaarslan Ercan

机构信息

Department of Radiology, Umraniye Training and Research Hospital, Istanbul, Turkey.

Department of Radiology, School of Medicine, Acibadem Mehmet Ali Aydinlar University, Istanbul, Turkey.

出版信息

Insights Imaging. 2023 Mar 20;14(1):48. doi: 10.1186/s13244-023-01386-w.

Abstract

OBJECTIVE

To investigate whether commercially available deep learning (DL) software improves the Prostate Imaging-Reporting and Data System (PI-RADS) scoring consistency on bi-parametric MRI among radiologists with various levels of experience; to assess whether the DL software improves the performance of the radiologists in identifying clinically significant prostate cancer (csPCa).

METHODS

We retrospectively enrolled consecutive men who underwent bi-parametric prostate MRI at a 3 T scanner due to suspicion of PCa. Four radiologists with 2, 3, 5, and > 20 years of experience evaluated the bi-parametric prostate MRI scans with and without the DL software. Whole-mount pathology or MRI/ultrasound fusion-guided biopsy was the reference. The area under the receiver operating curve (AUROC) was calculated for each radiologist with and without the DL software and compared using De Long's test. In addition, the inter-rater agreement was investigated using kappa statistics.

RESULTS

In all, 153 men with a mean age of 63.59 ± 7.56 years (range 53-80) were enrolled in the study. In the study sample, 45 men (29.80%) had clinically significant PCa. During the reading with the DL software, the radiologists changed their initial scores in 1/153 (0.65%), 2/153 (1.3%), 0/153 (0%), and 3/153 (1.9%) of the patients, yielding no significant increase in the AUROC (p > 0.05). Fleiss' kappa scores among the radiologists were 0.39 and 0.40 with and without the DL software (p = 0.56).

CONCLUSIONS

The commercially available DL software does not increase the consistency of the bi-parametric PI-RADS scoring or csPCa detection performance of radiologists with varying levels of experience.

摘要

目的

研究商用深度学习(DL)软件是否能提高不同经验水平的放射科医生在双参数磁共振成像(MRI)上对前列腺影像报告和数据系统(PI-RADS)评分的一致性;评估DL软件是否能提高放射科医生识别临床显著性前列腺癌(csPCa)的能力。

方法

我们回顾性纳入了因怀疑患有前列腺癌而在3T扫描仪上接受双参数前列腺MRI检查的连续男性患者。四名分别具有2年、3年、5年和超过20年经验的放射科医生对有和没有DL软件辅助的双参数前列腺MRI扫描进行评估。全层病理检查或MRI/超声融合引导活检作为参考标准。计算每位放射科医生在有和没有DL软件辅助情况下的受试者操作特征曲线下面积(AUROC),并使用德龙检验进行比较。此外,使用kappa统计量研究评分者间的一致性。

结果

共有153名平均年龄为63.59±7.56岁(范围53 - 80岁)的男性纳入本研究。在研究样本中,45名男性(29.80%)患有临床显著性前列腺癌。在使用DL软件阅片过程中,放射科医生对1/153(0.65%)、2/153(1.3%)、0/153(0%)和3/153(1.9%)的患者更改了初始评分,AUROC没有显著增加(p>0.05)。有和没有DL软件辅助时,放射科医生之间的Fleiss' kappa评分分别为0.39和0.40(p = 0.56)。

结论

商用DL软件并不能提高不同经验水平放射科医生在双参数PI-RADS评分的一致性或csPCa检测能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dd0/10027972/eab2e03ede18/13244_2023_1386_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验