Tunthanathip Thara, Oearsakul Thakul, Taweesomboonyat Chin, Sanghan Nuttha, Duangsoithong Rakkrit
1Division of Neurosurgery, Department of Surgery, Faculty of Medicine, Prince of Songkla University, Hat Yai, Songkhla.
2Division of Neuroradiology, Department of Radiology, Faculty of Medicine, Prince of Songkla University, Hat Yai, Songkhla; and.
Neurosurg Focus. 2025 Jul 1;59(1):E11. doi: 10.3171/2025.4.FOCUS24940.
Endoscopic endonasal transsphenoidal surgery (EETS) is a minimally invasive procedure that accesses the sellar and parasellar regions. Various anatomical structures must be identified during the operation, particularly the sella turcica and internal carotid artery (ICA) bilaterally. In the present retrospective cohort study, authors aimed to evaluate the performance of a deep learning (DL) model in detecting the sella turcica and ICA bilaterally in EETS video footage, with the goal of recognizing crucial landmarks and preventing potentially fatal injury.
The authors collected images from the endoscopic video footage of 98 patients who had undergone EETS from January 2015 to June 2024. The ICAs and sella turcica were labeled by neurosurgeons, and the entire dataset was divided into training, validation, and test datasets at a ratio of 7:2:1. The model for ICA and sella turcica detection was trained using the YOLOv5s object detection architecture, and precision, recall, mean average precision (mAP)@0.5, and mAP@0.5:0.95 were reported during the validation process. Moreover, the confusion matrix and area under the receiver operating characteristic curve (AUC) were assessed from the model using unseen images from the test dataset.
The DL model had precision, recall, mAP@0.5, and mAP@0.5:0.95 of 0.942, 0.955, 0.969, and 0.617, respectively, for all objects in the training processes with validation. For testing the model with unseen images, the AUC was 0.97 (95% CI 0.95-0.98), whereas average precision was 0.99 (95% CI 0.99-1.00). For ICA detection with a multiclass approach, the AUCs were 0.98 (95% CI 0.97-0.99) for the absence of any ICA, 0.93 (95% CI 0.91-0.95) for 1 ICA in the images, and 0.95 (95% CI 0.93-0.96) for both ICAs in the image. Additionally, accuracy for the ICA and sella turcica was 0.958 and 0.965, respectively.
Complex anatomical landmarks should be recognized during EETS. The computer vision model was effective in detecting the sella turcica and ICA bilaterally, as well as in identifying and avoiding fatal complications. For the model to generalize with reliability, it requires novel, unseen data from various settings to refine it and facilitate transfer learning.
鼻内镜下经蝶窦手术(EETS)是一种进入蝶鞍和鞍旁区域的微创手术。手术过程中必须识别各种解剖结构,特别是双侧的蝶鞍和颈内动脉(ICA)。在本回顾性队列研究中,作者旨在评估深度学习(DL)模型在EETS视频片段中双侧检测蝶鞍和ICA的性能,以识别关键标志并预防潜在的致命损伤。
作者收集了2015年1月至2024年6月期间接受EETS的98例患者的内镜视频图像。颈内动脉和蝶鞍由神经外科医生标记,整个数据集按7:2:1的比例分为训练集、验证集和测试集。使用YOLOv5s目标检测架构训练颈内动脉和蝶鞍检测模型,并在验证过程中报告精度、召回率、平均精度均值(mAP)@0.5和mAP@0.5:0.95。此外,使用测试数据集的未见图像评估模型的混淆矩阵和受试者操作特征曲线下面积(AUC)。
在验证的训练过程中,DL模型对所有对象的精度、召回率、mAP@0.5和mAP@0.5:0.95分别为0.942、0.955、0.969和0.617。对于用未见图像测试模型,AUC为0.97(95%CI 0.95 - 0.98),而平均精度为0.99(95%CI 0.99 - 1.00)。对于采用多类方法的颈内动脉检测,图像中无任何颈内动脉时的AUC为0.98(95%CI 0.97 - 0.99),图像中有1条颈内动脉时为0.93(95%CI 0.91 - 0.95),图像中有两条颈内动脉时为0.95(95%CI 0.93 - 0.96)。此外,颈内动脉和蝶鞍的准确率分别为0.958和0.965。
在EETS过程中应识别复杂的解剖标志。计算机视觉模型在双侧检测蝶鞍和颈内动脉以及识别和避免致命并发症方面是有效的。为了使模型可靠地泛化,它需要来自各种设置的新颖未见数据来对其进行优化并促进迁移学习。