SurGrID：通过场景图到图像扩散实现可控手术模拟

SurGrID: controllable surgical simulation via Scene Graph to Image Diffusion.

作者信息

Frisch Yannik, Sivakumar Ssharvien Kumar, Köksal Çağhan, Böhm Elsa, Wagner Felix, Gericke Adrian, Ghazaei Ghazal, Mukhopadhyay Anirban

机构信息

TU Darmstadt, Fraunhoferstr. 5, 64297, Darmstadt, Germany.

Universitätsmedizin Mainz, Langenbeckstr. 1, 55131, Mainz, Germany.

出版信息

Int J Comput Assist Radiol Surg. 2025 May 21. doi: 10.1007/s11548-025-03397-y.

DOI:10.1007/s11548-025-03397-y

PMID:40397229

Abstract

PURPOSE

Surgical simulation offers a promising addition to conventional surgical training. However, available simulation tools lack photorealism and rely on hard-coded behaviour. Denoising Diffusion Models are a promising alternative for high-fidelity image synthesis, but existing state-of-the-art conditioning methods fall short in providing precise control or interactivity over the generated scenes.

METHODS

We introduce SurGrID, a Scene Graph to Image Diffusion Model, allowing for controllable surgical scene synthesis by leveraging Scene Graphs. These graphs encode a surgical scene's components' spatial and semantic information, which are then translated into an intermediate representation using our novel pre-training step that explicitly captures local and global information.

RESULTS

Our proposed method improves the fidelity of generated images and their coherence with the graph input over the state of the art. Further, we demonstrate the simulation's realism and controllability in a user assessment study involving clinical experts.

CONCLUSION

Scene Graphs can be effectively used for precise and interactive conditioning of Denoising Diffusion Models for simulating surgical scenes, enabling high-fidelity and interactive control over the generated content.

摘要

目的

手术模拟为传统手术训练提供了一个很有前景的补充。然而，现有的模拟工具缺乏照片般的真实感，且依赖硬编码行为。去噪扩散模型是高保真图像合成的一个有前途的替代方案，但现有的最先进的条件方法在对生成的场景提供精确控制或交互性方面存在不足。

方法

我们引入了SurGrID，一种从场景图到图像的扩散模型，通过利用场景图实现可控的手术场景合成。这些图编码了手术场景中组件的空间和语义信息，然后使用我们新颖的预训练步骤将其转换为中间表示，该步骤明确捕获局部和全局信息。

结果

我们提出的方法提高了生成图像的保真度及其与图输入的一致性，优于现有技术。此外，我们在一项涉及临床专家的用户评估研究中展示了模拟的真实感和可控性。

结论

场景图可有效地用于对去噪扩散模型进行精确和交互式的条件设定，以模拟手术场景，从而实现对生成内容的高保真和交互式控制。

相似文献

SurGrID: controllable surgical simulation via Scene Graph to Image Diffusion.SurGrID：通过场景图到图像扩散实现可控手术模拟

Int J Comput Assist Radiol Surg. 2025 May 21. doi: 10.1007/s11548-025-03397-y.

VIIDA and InViDe: computational approaches for generating and evaluating inclusive image paragraphs for the visually impaired.VIIDA和InViDe：为视障人士生成和评估包容性图像段落的计算方法。

Disabil Rehabil Assist Technol. 2025 Jul;20(5):1470-1495. doi: 10.1080/17483107.2024.2437567. Epub 2024 Dec 11.

Exploring the Potential of Electroencephalography Signal-Based Image Generation Using Diffusion Models: Integrative Framework Combining Mixed Methods and Multimodal Analysis.利用扩散模型探索基于脑电图信号的图像生成潜力：结合混合方法和多模态分析的综合框架

JMIR Med Inform. 2025 Jun 25;13:e72027. doi: 10.2196/72027.

Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施：系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。

Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.

Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。

Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.

Factors that impact on the use of mechanical ventilation weaning protocols in critically ill adults and children: a qualitative evidence-synthesis.影响重症成人和儿童机械通气撤机方案使用的因素：一项定性证据综合分析

Cochrane Database Syst Rev. 2016 Oct 4;10(10):CD011812. doi: 10.1002/14651858.CD011812.pub2.

Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理（2025年结石病专家共识）

Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.

Interventions for recruiting smokers into cessation programmes.将吸烟者纳入戒烟计划的干预措施。

Cochrane Database Syst Rev. 2012 Dec 12;12(12):CD009187. doi: 10.1002/14651858.CD009187.pub2.

Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验：定性证据综合。

Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.

Immunogenicity and seroefficacy of pneumococcal conjugate vaccines: a systematic review and network meta-analysis.肺炎球菌结合疫苗的免疫原性和血清效力：系统评价和网络荟萃分析。

Health Technol Assess. 2024 Jul;28(34):1-109. doi: 10.3310/YWHA3079.

本文引用的文献

Latent Graph Representations for Critical View of Safety Assessment.潜在图表示在安全性评估关键视图中的应用。

IEEE Trans Med Imaging. 2024 Mar;43(3):1247-1258. doi: 10.1109/TMI.2023.3333034. Epub 2024 Mar 5.

A multimodal comparison of latent denoising diffusion probabilistic models and generative adversarial networks for medical image synthesis.基于潜在去噪扩散概率模型和生成对抗网络的医学图像合成的多模态比较。

Sci Rep. 2023 Jul 26;13(1):12098. doi: 10.1038/s41598-023-39278-0.

CaDIS: Cataract dataset for surgical RGB-image segmentation.CaDIS：用于手术 RGB 图像分割的白内障数据集。

Med Image Anal. 2021 Jul;71:102053. doi: 10.1016/j.media.2021.102053. Epub 2021 Mar 31.

A Comprehensive Survey on Graph Neural Networks.图神经网络综述。

IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):4-24. doi: 10.1109/TNNLS.2020.2978386. Epub 2021 Jan 4.

A systematic review of simulation-based training tools for technical and non-technical skills in ophthalmology.基于模拟的眼科技术和非技术技能培训工具的系统评价。

Eye (Lond). 2020 Oct;34(10):1737-1759. doi: 10.1038/s41433-020-0832-1. Epub 2020 Mar 13.

Mask R-CNN.Mask R-CNN。

IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):386-397. doi: 10.1109/TPAMI.2018.2844175. Epub 2018 Jun 5.

Surgical Simulation Training Reduces Intraoperative Cataract Surgery Complications Among Residents.手术模拟训练可减少住院医师白内障手术术中并发症。

Simul Healthc. 2018 Feb;13(1):11-15. doi: 10.1097/SIH.0000000000000255.

Operating Room Performance Improves after Proficiency-Based Virtual Reality Cataract Surgery Training.基于熟练度的虚拟现实白内障手术培训后，手术室绩效得到提升。

Ophthalmology. 2017 Apr;124(4):524-531. doi: 10.1016/j.ophtha.2016.11.015. Epub 2016 Dec 22.

Long-term outcomes of resident- versus attending-performed primary trabeculectomy with mitomycin C in a United States residency program.在美国住院医师培训项目中，居民医生和主治医生施行的小梁切除术联合丝裂霉素 C 的长期疗效比较。

Am J Ophthalmol. 2014 Jun;157(6):1190-201. doi: 10.1016/j.ajo.2014.02.028. Epub 2014 Feb 14.

Cataract and surgery for cataract.白内障及白内障手术

BMJ. 2006 Jul 15;333(7559):128-32. doi: 10.1136/bmj.333.7559.128.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

SurGrID：通过场景图到图像扩散实现可控手术模拟

SurGrID: controllable surgical simulation via Scene Graph to Image Diffusion.

作者信息

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献