• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

白内障手术的时间一致的序列到序列翻译。

Temporally consistent sequence-to-sequence translation of cataract surgeries.

机构信息

Computer Science, Technical University Darmstadt, Fraunhoferstraße 5, 64283, Darmstadt, Hessen, Germany.

出版信息

Int J Comput Assist Radiol Surg. 2023 Jul;18(7):1217-1224. doi: 10.1007/s11548-023-02925-y. Epub 2023 May 23.

DOI:10.1007/s11548-023-02925-y
PMID:37219806
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10329626/
Abstract

PURPOSE

Image-to-image translation methods can address the lack of diversity in publicly available cataract surgery data. However, applying image-to-image translation to videos-which are frequently used in medical downstream applications-induces artifacts. Additional spatio-temporal constraints are needed to produce realistic translations and improve the temporal consistency of translated image sequences.

METHODS

We introduce a motion-translation module that translates optical flows between domains to impose such constraints. We combine it with a shared latent space translation model to improve image quality. Evaluations are conducted regarding translated sequences' image quality and temporal consistency, where we propose novel quantitative metrics for the latter. Finally, the downstream task of surgical phase classification is evaluated when retraining it with additional synthetic translated data.

RESULTS

Our proposed method produces more consistent translations than state-of-the-art baselines. Moreover, it stays competitive in terms of the per-image translation quality. We further show the benefit of consistently translated cataract surgery sequences for improving the downstream task of surgical phase prediction.

CONCLUSION

The proposed module increases the temporal consistency of translated sequences. Furthermore, imposed temporal constraints increase the usability of translated data in downstream tasks. This allows overcoming some of the hurdles of surgical data acquisition and annotation and enables improving models' performance by translating between existing datasets of sequential frames.

摘要

目的

图像到图像的翻译方法可以解决公共白内障手术数据缺乏多样性的问题。然而,将图像到图像的翻译应用于视频中——这在医疗下游应用中经常使用——会产生伪影。需要额外的时空约束来生成逼真的翻译并提高翻译图像序列的时间一致性。

方法

我们引入了一个运动翻译模块,该模块可以在域之间翻译光流以施加这种约束。我们将其与共享潜在空间翻译模型相结合,以提高图像质量。评估是关于翻译序列的图像质量和时间一致性进行的,我们为此提出了新的定量指标。最后,当使用额外的合成翻译数据进行重新训练时,评估手术阶段分类的下游任务。

结果

与最先进的基线相比,我们提出的方法产生了更一致的翻译。此外,它在逐图像翻译质量方面具有竞争力。我们进一步展示了一致翻译的白内障手术序列在提高手术阶段预测下游任务方面的好处。

结论

所提出的模块提高了翻译序列的时间一致性。此外,施加的时间约束增加了翻译数据在下游任务中的可用性。这允许克服手术数据采集和注释的一些障碍,并通过在现有序列帧数据集之间进行翻译来提高模型的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28c7/10329626/5af51f2bf110/11548_2023_2925_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28c7/10329626/5de972423bc7/11548_2023_2925_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28c7/10329626/1563c95530c1/11548_2023_2925_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28c7/10329626/fa30cc836e3d/11548_2023_2925_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28c7/10329626/5af51f2bf110/11548_2023_2925_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28c7/10329626/5de972423bc7/11548_2023_2925_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28c7/10329626/1563c95530c1/11548_2023_2925_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28c7/10329626/fa30cc836e3d/11548_2023_2925_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28c7/10329626/5af51f2bf110/11548_2023_2925_Fig4_HTML.jpg

相似文献

1
Temporally consistent sequence-to-sequence translation of cataract surgeries.白内障手术的时间一致的序列到序列翻译。
Int J Comput Assist Radiol Surg. 2023 Jul;18(7):1217-1224. doi: 10.1007/s11548-023-02925-y. Epub 2023 May 23.
2
Exploring semantic consistency in unpaired image translation to generate data for surgical applications.探索非配对图像翻译中的语义一致性,以生成用于手术应用的数据。
Int J Comput Assist Radiol Surg. 2024 Jun;19(6):985-993. doi: 10.1007/s11548-024-03079-1. Epub 2024 Feb 26.
3
Learning a Generative Motion Model From Image Sequences Based on a Latent Motion Matrix.基于潜在运动矩阵的基于图像序列的生成运动模型学习。
IEEE Trans Med Imaging. 2021 May;40(5):1405-1416. doi: 10.1109/TMI.2021.3056531. Epub 2021 Apr 30.
4
Unsupervised Image-to-Image Translation: A Review.无监督图像到图像翻译:综述。
Sensors (Basel). 2022 Nov 6;22(21):8540. doi: 10.3390/s22218540.
5
The role of unpaired image-to-image translation for stain color normalization in colorectal cancer histology classification.非配对图像到图像翻译在结直肠癌组织学分类中用于染色颜色归一化的作用。
Comput Methods Programs Biomed. 2023 Jun;234:107511. doi: 10.1016/j.cmpb.2023.107511. Epub 2023 Mar 26.
6
Progressively Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation.具有自适应层实例归一化的渐进式无监督生成注意力网络用于图像到图像的翻译
Sensors (Basel). 2023 Aug 1;23(15):6858. doi: 10.3390/s23156858.
7
C -GAN: Content-consistent generative adversarial networks for unsupervised domain adaptation in medical image segmentation.C-GAN:用于医学图像分割中无监督域自适应的内容一致生成对抗网络。
Med Phys. 2022 Oct;49(10):6491-6504. doi: 10.1002/mp.15944. Epub 2022 Aug 27.
8
Cycle consistent twin energy-based models for image-to-image translation.用于图像到图像翻译的循环一致孪生能量模型。
Med Image Anal. 2024 Jan;91:103031. doi: 10.1016/j.media.2023.103031. Epub 2023 Nov 19.
9
Temporally aware volumetric generative adversarial network-based MR image reconstruction with simultaneous respiratory motion compensation: Initial feasibility in 3D dynamic cine cardiac MRI.基于时间感知体积生成对抗网络的磁共振图像重建与呼吸运动同步补偿:3D 动态电影心脏 MRI 的初步可行性。
Magn Reson Med. 2021 Nov;86(5):2666-2683. doi: 10.1002/mrm.28912. Epub 2021 Jul 13.
10
Improvement of Image Quality of Cone-beam CT Images by Three-dimensional Generative Adversarial Network.基于三维生成对抗网络的锥形束 CT 图像质量改善。
Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:2843-2846. doi: 10.1109/EMBC46164.2021.9629952.

引用本文的文献

1
SASVi: segment any surgical video.SASVi:分割任何手术视频。
Int J Comput Assist Radiol Surg. 2025 May 20. doi: 10.1007/s11548-025-03408-y.
2
Applications of generative adversarial networks in the diagnosis, prognosis, and treatment of ophthalmic diseases.生成对抗网络在眼科疾病诊断、预后及治疗中的应用。
Graefes Arch Clin Exp Ophthalmol. 2025 Apr 22. doi: 10.1007/s00417-025-06830-9.

本文引用的文献

1
Synthetic data in machine learning for medicine and healthcare.机器学习在医学和医疗保健领域中的合成数据。
Nat Biomed Eng. 2021 Jun;5(6):493-497. doi: 10.1038/s41551-021-00751-8.
2
CATARACTS: Challenge on automatic tool annotation for cataRACT surgery.白内障:白内障手术自动工具标注挑战。
Med Image Anal. 2019 Feb;52:24-41. doi: 10.1016/j.media.2018.11.008. Epub 2018 Nov 16.