Department of Computer Science, University of California, Irvine, CA, USA.
Institute for Genomics and Bioinformatics, University of California, Irvine, CA, USA.
Transl Vis Sci Technol. 2023 Jan 3;12(1):20. doi: 10.1167/tvst.12.1.20.
To evaluate the potential for artificial intelligence-based video analysis to determine surgical instrument characteristics when moving in the three-dimensional vitreous space.
We designed and manufactured a model eye in which we recorded choreographed videos of many surgical instruments moving throughout the eye. We labeled each frame of the videos to describe the surgical tool characteristics: tool type, location, depth, and insertional laterality. We trained two different deep learning models to predict each of the tool characteristics and evaluated model performances on a subset of images.
The accuracy of the classification model on the training set is 84% for the x-y region, 97% for depth, 100% for instrument type, and 100% for laterality of insertion. The accuracy of the classification model on the validation dataset is 83% for the x-y region, 96% for depth, 100% for instrument type, and 100% for laterality of insertion. The close-up detection model performs at 67 frames per second, with precision for most instruments higher than 75%, achieving a mean average precision of 79.3%.
We demonstrated that trained models can track surgical instrument movement in three-dimensional space and determine instrument depth, tip location, instrument insertional laterality, and instrument type. Model performance is nearly instantaneous and justifies further investigation into application to real-world surgical videos.
Deep learning offers the potential for software-based safety feedback mechanisms during surgery or the ability to extract metrics of surgical technique that can direct research to optimize surgical outcomes.
评估基于人工智能的视频分析在确定三维玻璃体空间中手术器械特征时的潜力。
我们设计并制造了一个模型眼,在其中记录了许多手术器械在眼睛中移动的编排视频。我们标记了视频的每一帧,以描述手术工具的特征:工具类型、位置、深度和插入外侧性。我们训练了两个不同的深度学习模型来预测每个工具特征,并在图像子集上评估模型性能。
训练集上分类模型的准确性为 x-y 区域 84%,深度 97%,工具类型 100%,插入外侧性 100%。验证数据集上分类模型的准确性为 x-y 区域 83%,深度 96%,工具类型 100%,插入外侧性 100%。特写检测模型每秒可处理 67 帧,大多数仪器的精度高于 75%,平均精度达到 79.3%。
我们证明了训练有素的模型可以跟踪三维空间中的手术器械运动,并确定器械深度、尖端位置、器械插入外侧性和器械类型。模型性能几乎是即时的,这证明了进一步研究将其应用于真实手术视频的合理性。
译文准确流畅,专业术语翻译准确,符合中文表达习惯。