Ke Jianfeng, Ge Tingjian, Melamed Rachel D
Department of Computer Science, University of Massachusetts Lowell, Lowell, MA 01854, USA.
Department of Biology, University of Massachusetts Lowell, Lowell, MA 01854, USA.
bioRxiv. 2025 May 14:2025.05.08.652857. doi: 10.1101/2025.05.08.652857.
Uncovering genes dysregulated in neuropsychiatric diseases promises to put forward therapeutic targets. By linking disease risk variants to tissue-specific gene regulation, the Genotype-Tissue Expression (GTEx) project has enabled discovery of disease genes and the brain tissues where these pathogenic effects occur. But, GTEx requires huge investment in obtaining human brain tissues, limiting the sample size and tissue diversity available. Crucially, this resource does not contain all key tissues where brain diseases are known to originate. Here, we propose a novel graph convolutional model that integrates the GTEx brain expression data with the Allen Brain Atlas, a resource spanning over 100 fine grained brain tissues that are annotated with an ontology of tissue organization. Our model, Brain Ontology Expression (BRONTE), allows completion of the GTEx data for 221 individuals to impute expanded expression in 103 uncollected detailed brain tissues. We show that expression data imputed by BRONTE recovers measured variation in expression across subjects, genes, and brain tissues. Most importantly, our method both recapitulates known relationships between genetic variations and gene expression, and simultaneously discovers both new genes impacted by genetic variants, and the tissues impacted. This allows us to put forward novel mechanisms of effect for genetic variation associated with 30 brain-related traits, supplying therapeutic targets in overlooked tissues of the brain.
揭示神经精神疾病中失调的基因有望提出治疗靶点。通过将疾病风险变异与组织特异性基因调控联系起来,基因型-组织表达(GTEx)项目使得能够发现疾病基因以及这些致病效应发生的脑组织。但是,GTEx在获取人类脑组织方面需要巨大投入,限制了可用的样本量和组织多样性。至关重要的是,该资源并不包含所有已知的脑部疾病起源的关键组织。在此,我们提出一种新颖的图卷积模型,该模型将GTEx脑表达数据与艾伦脑图谱整合,艾伦脑图谱是一个涵盖100多种细粒度脑组织的资源,这些脑组织用组织组织本体进行了注释。我们的模型,脑本体表达(BRONTE),允许完成221名个体的GTEx数据,以推断103种未收集的详细脑组织中的扩展表达。我们表明,BRONTE推断的表达数据恢复了跨受试者、基因和脑组织的表达测量变异。最重要的是,我们的方法既概括了遗传变异与基因表达之间的已知关系,又同时发现了受遗传变异影响的新基因以及受影响的组织。这使我们能够提出与30种脑相关性状相关的遗传变异效应的新机制,为大脑中被忽视的组织提供治疗靶点。