使用稳定扩散进行逼真内窥镜图像生成的最小数据要求。 - Suppr | 超能文献

使用稳定扩散进行逼真内窥镜图像生成的最小数据要求。

Minimal data requirement for realistic endoscopic image generation with Stable Diffusion.

机构信息

Sano Centre for Computational Medicine, Krakow, Poland.

Department of Computer Science, University of Verona, Verona, Italy.

出版信息

Int J Comput Assist Radiol Surg. 2024 Mar;19(3):531-539. doi: 10.1007/s11548-023-03030-w. Epub 2023 Nov 7.

DOI:10.1007/s11548-023-03030-w

PMID:37934401

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10881618/

Abstract

PURPOSE

Computer-assisted surgical systems provide support information to the surgeon, which can improve the execution and overall outcome of the procedure. These systems are based on deep learning models that are trained on complex and challenging-to-annotate data. Generating synthetic data can overcome these limitations, but it is necessary to reduce the domain gap between real and synthetic data.

METHODS

We propose a method for image-to-image translation based on a Stable Diffusion model, which generates realistic images starting from synthetic data. Compared to previous works, the proposed method is better suited for clinical application as it requires a much smaller amount of input data and allows finer control over the generation of details by introducing different variants of supporting control networks.

RESULTS

The proposed method is applied in the context of laparoscopic cholecystectomy, using synthetic and real data from public datasets. It achieves a mean Intersection over Union of 69.76%, significantly improving the baseline results (69.76 vs. 42.21%).

CONCLUSIONS

The proposed method for translating synthetic images into images with realistic characteristics will enable the training of deep learning methods that can generalize optimally to real-world contexts, thereby improving computer-assisted intervention guidance systems.

摘要

目的

计算机辅助手术系统为外科医生提供支持信息，从而提高手术的执行效率和整体效果。这些系统基于经过复杂且难以标注数据训练的深度学习模型。生成合成数据可以克服这些限制，但需要缩小真实数据和合成数据之间的领域差距。

方法

我们提出了一种基于稳定扩散模型的图像到图像转换方法，该方法可以从合成数据生成逼真的图像。与之前的工作相比，该方法更适合临床应用，因为它需要的输入数据量要少得多，并且通过引入不同变体的支持控制网络，可以更好地控制细节的生成。

结果

该方法应用于腹腔镜胆囊切除术，使用来自公共数据集的合成数据和真实数据。它的交并比均值为 69.76%，明显优于基线结果（69.76 比 42.21%）。

结论

将合成图像转换为具有逼真特征的图像的方法将使能够训练出可以最优地推广到真实环境的深度学习方法，从而改进计算机辅助干预指导系统。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

使用稳定扩散进行逼真内窥镜图像生成的最小数据要求。

Minimal data requirement for realistic endoscopic image generation with Stable Diffusion.

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSIONS

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

使用稳定扩散进行逼真内窥镜图像生成的最小数据要求。

Minimal data requirement for realistic endoscopic image generation with Stable Diffusion.

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSIONS

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献