• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Learned representation-guided diffusion models for large-image generation.用于大图像生成的基于学习表征引导的扩散模型。
Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2024 Jun;2024:8532-8542. doi: 10.1109/cvpr52733.2024.00815. Epub 2024 Sep 16.
2
MLVICX: Multi-Level Variance-Covariance Exploration for Chest X-Ray Self-Supervised Representation Learning.MLVICX:用于胸部X光自监督表征学习的多级方差协方差探索
IEEE J Biomed Health Inform. 2024 Dec;28(12):7480-7490. doi: 10.1109/JBHI.2024.3455337. Epub 2024 Dec 5.
3
Positional encoding-guided transformer-based multiple instance learning for histopathology whole slide images classification.基于位置编码引导的基于Transformer的多实例学习用于组织病理学全切片图像分类。
Comput Methods Programs Biomed. 2025 Jan;258:108491. doi: 10.1016/j.cmpb.2024.108491. Epub 2024 Nov 9.
4
Txt2Img-MHN: Remote Sensing Image Generation From Text Using Modern Hopfield Networks.Txt2Img-MHN:使用现代霍普菲尔德网络从文本生成遥感图像。
IEEE Trans Image Process. 2023;32:5737-5750. doi: 10.1109/TIP.2023.3323799. Epub 2023 Oct 24.
5
Multimodal representations of biomedical knowledge from limited training whole slide images and reports using deep learning.利用深度学习从有限的训练全切片图像和报告中获取生物医学知识的多模态表示。
Med Image Anal. 2024 Oct;97:103303. doi: 10.1016/j.media.2024.103303. Epub 2024 Aug 14.
6
Survey on Self-Supervised Learning: Auxiliary Pretext Tasks and Contrastive Learning Methods in Imaging.影像领域自监督学习综述:辅助 pretext 任务与对比学习方法
Entropy (Basel). 2022 Apr 14;24(4):551. doi: 10.3390/e24040551.
7
Self-supervised in-domain representation learning for remote sensing image scene classification.用于遥感图像场景分类的自监督域内表示学习
Heliyon. 2024 Sep 14;10(19):e37962. doi: 10.1016/j.heliyon.2024.e37962. eCollection 2024 Oct 15.
8
Semi-supervised classifier guided by discriminator.基于判别器的半监督分类器。
Sci Rep. 2022 Aug 29;12(1):14665. doi: 10.1038/s41598-022-18947-6.
9
Guided synthesis of annotated lung CT images with pathologies using a multi-conditioned denoising diffusion probabilistic model (mDDPM).使用多条件去噪扩散概率模型(mDDPM)对带有病变的标注肺部CT图像进行引导合成。
Phys Med Biol. 2025 Mar 6;70(6). doi: 10.1088/1361-6560/adb9b3.
10
SSiT: Saliency-Guided Self-Supervised Image Transformer for Diabetic Retinopathy Grading.SSiT:基于显著度引导的自监督图像变换器在糖尿病视网膜病变分级中的应用。
IEEE J Biomed Health Inform. 2024 May;28(5):2806-2817. doi: 10.1109/JBHI.2024.3362878. Epub 2024 May 6.

引用本文的文献

1
: Generating Histopathology Cell Topology with a Diffusion Model.利用扩散模型生成组织病理学细胞拓扑结构
Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2025 Jun;2025:20979-20989. doi: 10.1109/cvpr52734.2025.01954. Epub 2025 Aug 13.
2
PixCell: A generative foundation model for digital histopathology images.PixCell:一种用于数字组织病理学图像的生成基础模型。
ArXiv. 2025 Jun 5:arXiv:2506.05127v1.
3
Autonomous learning of pathologists' cancer grading rules.病理学家癌症分级规则的自主学习
bioRxiv. 2025 Apr 7:2025.03.18.643999. doi: 10.1101/2025.03.18.643999.

用于大图像生成的基于学习表征引导的扩散模型。

Learned representation-guided diffusion models for large-image generation.

作者信息

Graikos Alexandros, Yellapragada Srikar, Le Minh-Quan, Kapse Saarthak, Prasanna Prateek, Saltz Joel, Samaras Dimitris

机构信息

Stony Brook University.

出版信息

Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2024 Jun;2024:8532-8542. doi: 10.1109/cvpr52733.2024.00815. Epub 2024 Sep 16.

DOI:10.1109/cvpr52733.2024.00815
PMID:39606708
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11601131/
Abstract

To synthesize high-fidelity samples, diffusion models typically require auxiliary data to guide the generation process. However, it is impractical to procure the painstaking patch-level annotation effort required in specialized domains like histopathology and satellite imagery; it is often performed by domain experts and involves hundreds of millions of patches. Modern-day self-supervised learning (SSL) representations encode rich semantic and visual information. In this paper, we posit that such representations are expressive enough to act as proxies to fine-grained human labels. We introduce a novel approach that trains diffusion models conditioned on embeddings from SSL. Our diffusion models successfully project these features back to high-quality histopathology and remote sensing images. In addition, we construct larger images by assembling spatially consistent patches inferred from SSL embeddings, preserving long-range dependencies. Augmenting real data by generating variations of real images improves downstream classifier accuracy for patch-level and larger, image-scale classification tasks. Our models are effective even on datasets not encountered during training, demonstrating their robustness and generalizability. Generating images from learned embeddings is agnostic to the source of the embeddings. The SSL embeddings used to generate a large image can either be extracted from a reference image, or sampled from an auxiliary model conditioned on any related modality (e.g. class labels, text, genomic data). As proof of concept, we introduce the text-to-large image synthesis paradigm where we successfully synthesize large pathology and satellite images out of text descriptions.

摘要

为了合成高保真样本,扩散模型通常需要辅助数据来指导生成过程。然而,在组织病理学和卫星图像等专业领域获取所需的细致的补丁级注释工作是不切实际的;这通常由领域专家进行,涉及数亿个补丁。现代自监督学习(SSL)表示编码了丰富的语义和视觉信息。在本文中,我们认为这种表示具有足够的表现力,可以作为细粒度人类标签的代理。我们引入了一种新颖的方法,该方法基于来自SSL的嵌入来训练扩散模型。我们的扩散模型成功地将这些特征投影回高质量的组织病理学和遥感图像。此外,我们通过组装从SSL嵌入推断出的空间一致的补丁来构建更大的图像,保留长程依赖性。通过生成真实图像的变体来增强真实数据,可以提高补丁级和更大的图像尺度分类任务的下游分类器准确性。我们的模型即使在训练期间未遇到的数据集上也很有效,证明了它们的鲁棒性和通用性。从学习到的嵌入生成图像与嵌入的来源无关。用于生成大图像的SSL嵌入可以从参考图像中提取,也可以从以任何相关模态(例如类别标签、文本、基因组数据)为条件的辅助模型中采样。作为概念验证,我们引入了文本到大型图像合成范式,在其中我们成功地从文本描述中合成了大型病理学和卫星图像。