Jurosch Franziska, Zeller Janik, Wagner Lars, Özsoy Ege, Jell Alissa, Kolb Sven, Wilhelm Dirk
Technical University of Munich, School of Medicine and Health, TUM University Hospital, Research Group MITI, Munich, Germany.
Technical University of Munich, School of Computation, Information and Technology, Chair of Computer Aided Medical Procedures, Munich, Germany.
Int J Comput Assist Radiol Surg. 2025 Jun;20(6):1159-1166. doi: 10.1007/s11548-025-03344-x. Epub 2025 Apr 12.
Deep learning methods are commonly used to generate context understanding to support surgeons and medical professionals. By expanding the current focus beyond the operating room (OR) to postoperative workflows, new forms of assistance are possible. In this article, we propose a novel multi-target multi-camera tracking (MTMCT) architecture for postoperative phase recognition, location tracking, and automatic timestamp generation.
Three RGB cameras were used to create a multi-camera data set containing 19 reenacted postoperative patient flows. Patients and beds were annotated and used to train the custom MTMCT architecture. It includes bed and patient tracking for each camera and a postoperative patient state module to provide the postoperative phase, current location of the patient, and automatically generated timestamps.
The architecture demonstrates robust performance for single- and multi-patient scenarios by embedding medical domain-specific knowledge. In multi-patient scenarios, the state machine representing the postoperative phases has a traversal accuracy of , of timestamps are generated correctly, and the patient tracking IDF1 reaches . Comparative experiments show the effectiveness of using AFLink for matching partial trajectories in postoperative settings.
As our approach shows promising results, it lays the foundation for real-time surgeon support, enhancing clinical documentation and ultimately improving patient care.
深度学习方法通常用于生成上下文理解,以支持外科医生和医学专业人员。通过将当前重点从手术室(OR)扩展到术后工作流程,可以实现新的辅助形式。在本文中,我们提出了一种新颖的多目标多相机跟踪(MTMCT)架构,用于术后阶段识别、位置跟踪和自动时间戳生成。
使用三个RGB相机创建了一个包含19个模拟术后患者流程的多相机数据集。对患者和病床进行注释,并用于训练定制的MTMCT架构。它包括每个相机的病床和患者跟踪,以及一个术后患者状态模块,以提供术后阶段、患者当前位置和自动生成的时间戳。
通过嵌入医学领域特定知识,该架构在单患者和多患者场景中均表现出强大的性能。在多患者场景中,代表术后阶段的状态机遍历准确率为 ,时间戳生成正确的比例为 ,患者跟踪IDF1达到 。对比实验表明,在术后设置中使用AFLink匹配部分轨迹是有效的。
由于我们的方法显示出有前景的结果,它为实时外科医生支持奠定了基础,增强了临床记录,最终改善了患者护理。