基于变分自动编码器的合成训练数据生成的深度学习神经网络在纤维化组织中胶原纤维中心线的跟踪。

Collagen fiber centerline tracking in fibrotic tissue via deep neural networks with variational autoencoder-based synthetic training data generation.

机构信息

Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI 53706, USA; Laboratory for Optical and Computational Instrumentation, University of Wisconsin-Madison, Madison, WI 53706, USA; Morgridge Institute for Research, Madison, WI 53706, USA.

Department of Biomedical Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA; Laboratory for Optical and Computational Instrumentation, University of Wisconsin-Madison, Madison, WI 53706, USA; Morgridge Institute for Research, Madison, WI 53706, USA.

出版信息

Med Image Anal. 2023 Dec;90:102961. doi: 10.1016/j.media.2023.102961. Epub 2023 Sep 12.

DOI:10.1016/j.media.2023.102961

PMID:37802011

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10591913/

Abstract

The role of fibrillar collagen in the tissue microenvironment is critical in disease contexts ranging from cancers to chronic inflammations, as evidenced by many studies. Quantifying fibrillar collagen organization has become a powerful approach for characterizing the topology of collagen fibers and studying the role of collagen fibers in disease progression. We present a deep learning-based pipeline to quantify collagen fibers' topological properties in microscopy-based collagen images from pathological tissue samples. Our method leverages deep neural networks to extract collagen fiber centerlines and deep generative models to create synthetic training data, addressing the current shortage of large-scale annotations. As a part of this effort, we have created and annotated a collagen fiber centerline dataset, with the hope of facilitating further research in this field. Quantitative measurements such as fiber orientation, alignment, density, and length can be derived based on the centerline extraction results. Our pipeline comprises three stages. Initially, a variational autoencoder is trained to generate synthetic centerlines possessing controllable topological properties. Subsequently, a conditional generative adversarial network synthesizes realistic collagen fiber images from the synthetic centerlines, yielding a synthetic training set of image-centerline pairs. Finally, we train a collagen fiber centerline extraction network using both the original and synthetic data. Evaluation using collagen fiber images from pancreas, liver, and breast cancer samples collected via second-harmonic generation microscopy demonstrates our pipeline's superiority over several popular fiber centerline extraction tools. Incorporating synthetic data into training further enhances the network's generalizability. Our code is available at https://github.com/uw-loci/collagen-fiber-metrics.

摘要

在从癌症到慢性炎症等疾病情况下，纤维胶原在组织微环境中的作用至关重要，许多研究都证明了这一点。定量纤维胶原组织对于描述胶原纤维的拓扑结构以及研究胶原纤维在疾病进展中的作用是一种强有力的方法。我们提出了一种基于深度学习的方法，用于对病理组织样本的基于显微镜的胶原图像中的胶原纤维的拓扑性质进行定量。我们的方法利用深度神经网络提取胶原纤维中心线，并利用深度生成模型创建合成训练数据，以解决当前大规模注释的不足。作为这项工作的一部分，我们创建并注释了一个胶原纤维中心线数据集，希望能促进该领域的进一步研究。基于中心线提取结果，可以得出纤维方向、排列、密度和长度等定量测量结果。我们的流水线包含三个阶段。首先，训练变分自编码器生成具有可控拓扑性质的合成中心线。然后，条件生成对抗网络从合成中心线合成真实的胶原纤维图像，从而生成图像-中心线对的合成训练集。最后，我们使用原始数据和合成数据训练胶原纤维中心线提取网络。使用二次谐波显微镜采集的胰腺、肝脏和乳腺癌样本的胶原纤维图像进行评估，表明我们的流水线优于几种流行的纤维中心线提取工具。将合成数据纳入训练进一步提高了网络的泛化能力。我们的代码可在 https://github.com/uw-loci/collagen-fiber-metrics 上获取。

相似文献

Collagen fiber centerline tracking in fibrotic tissue via deep neural networks with variational autoencoder-based synthetic training data generation.基于变分自动编码器的合成训练数据生成的深度学习神经网络在纤维化组织中胶原纤维中心线的跟踪。

Med Image Anal. 2023 Dec;90:102961. doi: 10.1016/j.media.2023.102961. Epub 2023 Sep 12.

Coronary artery centerline extraction in cardiac CT angiography using a CNN-based orientation classifier.基于卷积神经网络的方向分类器的心脏 CT 血管造影中的冠状动脉中心线提取。

Med Image Anal. 2019 Jan;51:46-60. doi: 10.1016/j.media.2018.10.005. Epub 2018 Oct 22.

A deep learning generative model approach for image synthesis of plant leaves.深度学习生成模型在植物叶片图像合成中的应用

PLoS One. 2022 Nov 18;17(11):e0276972. doi: 10.1371/journal.pone.0276972. eCollection 2022.

Latent space autoencoder generative adversarial model for retinal image synthesis and vessel segmentation.用于视网膜图像合成与血管分割的潜在空间自动编码器生成对抗模型。

BMC Med Imaging. 2025 May 5;25(1):149. doi: 10.1186/s12880-025-01694-1.

Fibrillar Collagen Quantification With Curvelet Transform Based Computational Methods.基于曲波变换的计算方法对纤维状胶原蛋白进行定量分析

Front Bioeng Biotechnol. 2020 Apr 21;8:198. doi: 10.3389/fbioe.2020.00198. eCollection 2020.

Methods for Quantifying Fibrillar Collagen Alignment.定量纤维状胶原蛋白排列的方法。

Methods Mol Biol. 2017;1627:429-451. doi: 10.1007/978-1-4939-7113-8_28.

Brain Tumor Classification Using a Combination of Variational Autoencoders and Generative Adversarial Networks.使用变分自编码器和生成对抗网络相结合的脑肿瘤分类

Biomedicines. 2022 Jan 21;10(2):223. doi: 10.3390/biomedicines10020223.

Synthetic whole-slide image tile generation with gene expression profile-infused deep generative models.基于基因表达谱融合的深度生成模型的合成全幻灯片图像瓦片生成。

Cell Rep Methods. 2023 Jul 19;3(8):100534. doi: 10.1016/j.crmeth.2023.100534. eCollection 2023 Aug 28.

Multi-scale cascaded networks for synthesis of mammogram to decrease intensity distortion and increase model-based perceptual similarity.多尺度级联网络用于合成乳腺 X 线照片，以降低强度失真并提高基于模型的感知相似性。

Med Phys. 2023 Feb;50(2):837-853. doi: 10.1002/mp.16007. Epub 2022 Oct 24.

A convolutional neural network for segmentation of yeast cells without manual training annotations.一种无需手动训练注释的用于酵母细胞分割的卷积神经网络。

Bioinformatics. 2022 Feb 7;38(5):1427-1433. doi: 10.1093/bioinformatics/btab835.

引用本文的文献

Multiphoton imaging-based quantifiable collagen signatures for predicting outcomes in patients with pancreatic ductal adenocarcinoma.基于多光子成像的可量化胶原特征用于预测胰腺导管腺癌患者的预后

Sci Rep. 2025 Feb 5;15(1):4414. doi: 10.1038/s41598-025-88984-4.

A preliminary study into the emergence of tendon microstructure during postnatal development.出生后发育过程中肌腱微观结构形成的初步研究。

Matrix Biol Plus. 2024 Jan 26;21:100142. doi: 10.1016/j.mbplus.2024.100142. eCollection 2024 Feb.

本文引用的文献

Differentiation of pancreatic ductal adenocarcinoma and chronic pancreatitis using graph neural networks on histopathology and collagen fiber features.基于组织病理学和胶原纤维特征，利用图神经网络对胰腺导管腺癌和慢性胰腺炎进行鉴别。

J Pathol Inform. 2022 Nov 19;13:100158. doi: 10.1016/j.jpi.2022.100158. eCollection 2022.

Automation of generative adversarial network-based synthetic data-augmentation for maximizing the diagnostic performance with paranasal imaging.基于生成对抗网络的合成数据增强自动化，以最大限度地提高副鼻窦成像的诊断性能。

Sci Rep. 2022 Oct 27;12(1):18118. doi: 10.1038/s41598-022-22222-z.

Learning disentangled representations in the imaging domain.在成像领域中学习解缠表示。

Med Image Anal. 2022 Aug;80:102516. doi: 10.1016/j.media.2022.102516. Epub 2022 Jun 17.

Data Augmentation in High Dimensional Low Sample Size Setting Using a Geometry-Based Variational Autoencoder.基于几何的变分自编码器在高维低样本量情况下的数据增强

IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):2879-2896. doi: 10.1109/TPAMI.2022.3185773. Epub 2023 Feb 3.

Deep learning identification of stiffness markers in breast cancer.深度学习识别乳腺癌的硬度标志物。

Biomaterials. 2022 Jun;285:121540. doi: 10.1016/j.biomaterials.2022.121540. Epub 2022 Apr 27.

Dual-stream Multiple Instance Learning Network for Whole Slide Image Classification with Self-supervised Contrastive Learning.基于自监督对比学习的双流多实例学习网络用于全切片图像分类

Conf Comput Vis Pattern Recognit Workshops. 2021 Jun;2021:14318-14328. doi: 10.1109/CVPR46437.2021.01409. Epub 2021 Nov 13.

Variational Autoencoder for Image-Based Augmentation of Eye-Tracking Data.用于基于图像的眼动追踪数据增强的变分自编码器

J Imaging. 2021 May 3;7(5):83. doi: 10.3390/jimaging7050083.

Text Data Augmentation for Deep Learning.用于深度学习的文本数据增强

J Big Data. 2021;8(1):101. doi: 10.1186/s40537-021-00492-0. Epub 2021 Jul 19.

A review of medical image data augmentation techniques for deep learning applications.医学图像数据增强技术在深度学习应用中的综述。

J Med Imaging Radiat Oncol. 2021 Aug;65(5):545-563. doi: 10.1111/1754-9485.13261. Epub 2021 Jun 19.

A FIJI macro for quantifying pattern in extracellular matrix.斐济宏指令：用于量化细胞外基质中模式的方法。

Life Sci Alliance. 2021 Jan 27;4(3). doi: 10.26508/lsa.202000880. Print 2021 Mar.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。