无监督深度表示学习可实现脑影像遗传关联研究中的表型发现。

Unsupervised deep representation learning enables phenotype discovery for genetic association studies of brain imaging.

机构信息

McWilliams School of Biomedical Informatics, University of Texas Health Science Center, Houston, TX, 77030, USA.

Department of Computer Science and Engineering, Texas A&M University, College Station, TX, 77843, USA.

出版信息

Commun Biol. 2024 Apr 5;7(1):414. doi: 10.1038/s42003-024-06096-7.

DOI:10.1038/s42003-024-06096-7

PMID:38580839

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10997628/

Abstract

Understanding the genetic architecture of brain structure is challenging, partly due to difficulties in designing robust, non-biased descriptors of brain morphology. Until recently, brain measures for genome-wide association studies (GWAS) consisted of traditionally expert-defined or software-derived image-derived phenotypes (IDPs) that are often based on theoretical preconceptions or computed from limited amounts of data. Here, we present an approach to derive brain imaging phenotypes using unsupervised deep representation learning. We train a 3-D convolutional autoencoder model with reconstruction loss on 6130 UK Biobank (UKBB) participants' T1 or T2-FLAIR (T2) brain MRIs to create a 128-dimensional representation known as Unsupervised Deep learning derived Imaging Phenotypes (UDIPs). GWAS of these UDIPs in held-out UKBB subjects (n = 22,880 discovery and n = 12,359/11,265 replication cohorts for T1/T2) identified 9457 significant SNPs organized into 97 independent genetic loci of which 60 loci were replicated. Twenty-six loci were not reported in earlier T1 and T2 IDP-based UK Biobank GWAS. We developed a perturbation-based decoder interpretation approach to show that these loci are associated with UDIPs mapped to multiple relevant brain regions. Our results established unsupervised deep learning can derive robust, unbiased, heritable, and interpretable brain imaging phenotypes.

摘要

理解大脑结构的遗传结构具有挑战性，部分原因是难以设计稳健、无偏的大脑形态描述符。直到最近，全基因组关联研究（GWAS）的大脑测量方法还包括传统上由专家定义或软件衍生的图像衍生表型（IDP），这些表型通常基于理论假设或从有限数量的数据计算得出。在这里，我们提出了一种使用无监督深度表示学习来推导脑影像表型的方法。我们使用重建损失在 6130 名 UK Biobank（UKBB）参与者的 T1 或 T2-FLAIR（T2）脑 MRI 上训练一个 3D 卷积自动编码器模型，以创建一个 128 维的表示，称为无监督深度学习衍生的影像表型（UDIP）。在 UKBB 中保留的受试者（n=22880 个发现和 n=12359/11265 个复制队列，用于 T1/T2）中对这些 UDIP 进行 GWAS，确定了 9457 个显著的 SNP，这些 SNP 组织成 97 个独立的遗传位点，其中 60 个位点得到了复制。26 个位点在早期的 T1 和 T2 IDP 基于的 UK Biobank GWAS 中没有报道。我们开发了一种基于扰动的解码器解释方法，表明这些位点与映射到多个相关脑区的 UDIPs 相关。我们的结果表明，无监督深度学习可以推导出稳健、无偏、可遗传和可解释的脑影像表型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/203e/10997628/a9244d43adaa/42003_2024_6096_Fig1_HTML.jpg

相似文献

Unsupervised deep representation learning enables phenotype discovery for genetic association studies of brain imaging.无监督深度表示学习可实现脑影像遗传关联研究中的表型发现。

Commun Biol. 2024 Apr 5;7(1):414. doi: 10.1038/s42003-024-06096-7.

Efficient multi-phenotype genome-wide analysis identifies genetic associations for unsupervised deep-learning-derived high-dimensional brain imaging phenotypes.高效的多表型全基因组分析确定了与无监督深度学习衍生的高维脑成像表型的遗传关联。

medRxiv. 2024 Dec 8:2024.12.06.24318618. doi: 10.1101/2024.12.06.24318618.

Inferring the genetic relationships between unsupervised deep learning-derived imaging phenotypes and glioblastoma through multi-omics approaches.通过多组学方法推断无监督深度学习衍生的成像表型与胶质母细胞瘤之间的遗传关系。

Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf037.

Vision Transformer Autoencoders for Unsupervised Representation Learning: Capturing Local and Non-Local Features in Brain Imaging to Reveal Genetic Associations.用于无监督表征学习的视觉Transformer自动编码器：在脑成像中捕捉局部和非局部特征以揭示基因关联。

medRxiv. 2025 Mar 25:2025.03.24.25324549. doi: 10.1101/2025.03.24.25324549.

Exploring the genetic architecture of brain structure and ADHD using polygenic neuroimaging-derived scores.利用多基因神经影像学衍生分数探索脑结构与注意力缺陷多动障碍的遗传结构。

Am J Med Genet B Neuropsychiatr Genet. 2025 Jan;198(1):e32987. doi: 10.1002/ajmg.b.32987. Epub 2024 Jul 17.

TransferGWAS of T1-weighted brain MRI data from UK Biobank.来自英国生物银行的T1加权脑磁共振成像数据的转移全基因组关联研究

PLoS Genet. 2024 Dec 13;20(12):e1011332. doi: 10.1371/journal.pgen.1011332. eCollection 2024 Dec.

Supervised Phenotype Discovery From Multimodal Brain Imaging.基于多模态脑成像的监督式表型发现

IEEE Trans Med Imaging. 2023 Mar;42(3):834-849. doi: 10.1109/TMI.2022.3218720. Epub 2023 Mar 2.

Unsupervised representation learning on high-dimensional clinical data improves genomic discovery and prediction.基于高维临床数据的无监督表示学习可改善基因组发现和预测。

Nat Genet. 2024 Aug;56(8):1604-1613. doi: 10.1038/s41588-024-01831-6. Epub 2024 Jul 8.

Reliability of multi-site UK Biobank MRI brain phenotypes for the assessment of neuropsychiatric complications of SARS-CoV-2 infection: The COVID-CNS travelling heads study.多中心 UK Biobank 磁共振成像脑表型评估 SARS-CoV-2 感染神经精神并发症的可靠性：COVID-CNS 游走头部研究。

PLoS One. 2022 Sep 29;17(9):e0273704. doi: 10.1371/journal.pone.0273704. eCollection 2022.

Autoencoder-based phenotyping of ophthalmic images highlights genetic loci influencing retinal morphology and provides informative biomarkers.基于自动编码器的眼科图像表型分析突出了影响视网膜形态的基因位点，并提供了信息丰富的生物标志物。

Bioinformatics. 2024 Dec 26;41(1). doi: 10.1093/bioinformatics/btae732.

引用本文的文献

Brain-heart-eye axis revealed by multi-organ imaging, genetics and proteomics.多器官成像、遗传学和蛋白质组学揭示的脑-心-眼轴

medRxiv. 2025 Jun 9:2025.01.04.25319995. doi: 10.1101/2025.01.04.25319995.

medRxiv. 2025 Mar 25:2025.03.24.25324549. doi: 10.1101/2025.03.24.25324549.

Genetic Insights into Brain Morphology: a Genome-Wide Association Study of Cortical Thickness and T-Weighted MRI Gray Matter-White Matter Intensity Contrast.大脑形态学的遗传学见解：一项关于皮质厚度和T加权磁共振成像灰质-白质强度对比度的全基因组关联研究。

Neuroinformatics. 2025 Apr 1;23(2):26. doi: 10.1007/s12021-025-09722-9.

Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf037.

medRxiv. 2024 Dec 8:2024.12.06.24318618. doi: 10.1101/2024.12.06.24318618.

TransferGWAS of T1-weighted brain MRI data from UK Biobank.来自英国生物银行的T1加权脑磁共振成像数据的转移全基因组关联研究

PLoS Genet. 2024 Dec 13;20(12):e1011332. doi: 10.1371/journal.pgen.1011332. eCollection 2024 Dec.

EmbedGEM: a framework to evaluate the utility of embeddings for genetic discovery.EmbedGEM：一个用于评估嵌入在基因发现中的效用的框架。

Bioinform Adv. 2024 Sep 17;4(1):vbae135. doi: 10.1093/bioadv/vbae135. eCollection 2024.

Bioinformatics. 2024 Dec 26;41(1). doi: 10.1093/bioinformatics/btae732.

iGWAS: Image-based genome-wide association of self-supervised deep phenotyping of retina fundus images.iGWAS：基于图像的全基因组关联分析，对视网膜眼底图像进行自我监督的深度学习表型分析。

PLoS Genet. 2024 May 10;20(5):e1011273. doi: 10.1371/journal.pgen.1011273. eCollection 2024 May.

Machine Learning to Advance Human Genome-Wide Association Studies.机器学习在全基因组关联研究中的应用

Genes (Basel). 2023 Dec 25;15(1):34. doi: 10.3390/genes15010034.

本文引用的文献

Counter the weaponization of genetics research by extremists.对抗极端分子将基因研究武器化的行为。

Nature. 2022 Oct;610(7932):444-447. doi: 10.1038/d41586-022-03252-z.

Genetic variants associated with longitudinal changes in brain structure across the lifespan.与一生中大脑结构纵向变化相关的遗传变异。

Nat Neurosci. 2022 Apr;25(4):421-432. doi: 10.1038/s41593-022-01042-4. Epub 2022 Apr 5.

Accurate brain-age models for routine clinical MRI examinations.用于常规临床 MRI 检查的精确脑龄模型。

Neuroimage. 2022 Apr 1;249:118871. doi: 10.1016/j.neuroimage.2022.118871. Epub 2022 Jan 5.

A generalized linear mixed model association tool for biobank-scale data.一种用于生物样本库规模数据的广义线性混合模型关联工具。

Nat Genet. 2021 Nov;53(11):1616-1621. doi: 10.1038/s41588-021-00954-4. Epub 2021 Nov 4.

Vertex-wise multivariate genome-wide association study identifies 780 unique genetic loci associated with cortical morphology.基于顶点的多变量全基因组关联研究确定了 780 个与皮质形态相关的独特遗传位点。

Neuroimage. 2021 Dec 1;244:118603. doi: 10.1016/j.neuroimage.2021.118603. Epub 2021 Sep 21.

Three-Dimensional Convolutional Autoencoder Extracts Features of Structural Brain Images With a "Diagnostic Label-Free" Approach: Application to Schizophrenia Datasets.三维卷积自动编码器采用“无诊断标签”方法提取脑结构图像特征：应用于精神分裂症数据集。

Front Neurosci. 2021 Jul 7;15:652987. doi: 10.3389/fnins.2021.652987. eCollection 2021.

The genetic architecture of the human thalamus and its overlap with ten common brain disorders.人类丘脑的遗传结构及其与十种常见脑部疾病的重叠。

Nat Commun. 2021 May 18;12(1):2909. doi: 10.1038/s41467-021-23175-z.

A robust brain signature region approach for episodic memory performance in older adults.一种用于评估老年人情景记忆表现的强大脑区特征方法。

Brain. 2021 May 7;144(4):1038-1040. doi: 10.1093/brain/awab140.

ASD-SAENet: A Sparse Autoencoder, and Deep-Neural Network Model for Detecting Autism Spectrum Disorder (ASD) Using fMRI Data.ASD-SAENet：一种用于使用功能磁共振成像（fMRI）数据检测自闭症谱系障碍（ASD）的稀疏自动编码器和深度神经网络模型。

Front Comput Neurosci. 2021 Apr 8;15:654315. doi: 10.3389/fncom.2021.654315. eCollection 2021.

An expanded set of genome-wide association studies of brain imaging phenotypes in UK Biobank.在英国生物银行中进行的一套扩展的全基因组关联研究，用于研究脑影像学表型。

Nat Neurosci. 2021 May;24(5):737-745. doi: 10.1038/s41593-021-00826-4. Epub 2021 Apr 19.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

无监督深度表示学习可实现脑影像遗传关联研究中的表型发现。

Unsupervised deep representation learning enables phenotype discovery for genetic association studies of brain imaging.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献