Medical Imaging Centre, Department of Radiology Tampere University Hospital, Teiskontie 35, 33520, Tampere, Finland.
The Graduate School, Chiang Mai University, 239 Huaykaew Road, Suthep, Mueang, Chiang Mai, Thailand.
Sci Rep. 2023 Aug 29;13(1):14159. doi: 10.1038/s41598-023-40516-8.
Preoperative radiological identification of mandibular canals is essential for maxillofacial surgery. This study demonstrates the reproducibility of a deep learning system (DLS) by evaluating its localisation performance on 165 heterogeneous cone beam computed tomography (CBCT) scans from 72 patients in comparison to an experienced radiologist's annotations. We evaluated the performance of the DLS using the symmetric mean curve distance (SMCD), the average symmetric surface distance (ASSD), and the Dice similarity coefficient (DSC). The reproducibility of the SMCD was assessed using the within-subject coefficient of repeatability (RC). Three other experts rated the diagnostic validity twice using a 0-4 Likert scale. The reproducibility of the Likert scoring was assessed using the repeatability measure (RM). The RC of SMCD was 0.969 mm, the median (interquartile range) SMCD and ASSD were 0.643 (0.186) mm and 0.351 (0.135) mm, respectively, and the mean (standard deviation) DSC was 0.548 (0.138). The DLS performance was most affected by postoperative changes. The RM of the Likert scoring was 0.923 for the radiologist and 0.877 for the DLS. The mean (standard deviation) Likert score was 3.94 (0.27) for the radiologist and 3.84 (0.65) for the DLS. The DLS demonstrated proficient qualitative and quantitative reproducibility, temporal generalisability, and clinical validity.
术前下颌管的影像学识别对于颌面外科至关重要。本研究通过评估深度学习系统(DLS)在 72 名患者的 165 个异质锥形束 CT(CBCT)扫描中的定位性能,并与经验丰富的放射科医生的注释进行比较,证明了该系统的可重复性。我们使用对称平均曲线距离(SMCD)、平均对称表面距离(ASSD)和 Dice 相似系数(DSC)评估 DLS 的性能。使用受试者内重复性系数(RC)评估 SMCD 的重复性。另外 3 位专家使用 0-4 级 Likert 量表对诊断有效性进行了两次评分。使用可重复性度量(RM)评估 Likert 评分的重复性。SMCD 的 RC 为 0.969mm,SMCD 和 ASSD 的中位数(四分位距)分别为 0.643(0.186)mm 和 0.351(0.135)mm,平均(标准差)DSC 为 0.548(0.138)。DLS 的性能受术后变化的影响最大。放射科医生和 DLS 的 RM 分别为 0.923 和 0.877。放射科医生和 DLS 的平均(标准差)Likert 评分分别为 3.94(0.27)和 3.84(0.65)。DLS 表现出良好的定性和定量重复性、时间泛化能力和临床有效性。