• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从深度自动编码器中嵌入的压缩转录空间中提取疾病模块。

Deriving disease modules from the compressed transcriptional space embedded in a deep autoencoder.

机构信息

Bioinformatics, Department of Physics, Chemistry and Biology, Linköping University, Linköping, Sweden.

Department of Biology, Center For Genomics and Systems Biology, New York University, New York, NY, 10008, USA.

出版信息

Nat Commun. 2020 Feb 12;11(1):856. doi: 10.1038/s41467-020-14666-6.

DOI:10.1038/s41467-020-14666-6
PMID:32051402
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7016183/
Abstract

Disease modules in molecular interaction maps have been useful for characterizing diseases. Yet biological networks, that commonly define such modules are incomplete and biased toward some well-studied disease genes. Here we ask whether disease-relevant modules of genes can be discovered without prior knowledge of a biological network, instead training a deep autoencoder from large transcriptional data. We hypothesize that modules could be discovered within the autoencoder representations. We find a statistically significant enrichment of genome-wide association studies (GWAS) relevant genes in the last layer, and to a successively lesser degree in the middle and first layers respectively. In contrast, we find an opposite gradient where a modular protein-protein interaction signal is strongest in the first layer, but then vanishing smoothly deeper in the network. We conclude that a data-driven discovery approach is sufficient to discover groups of disease-related genes.

摘要

分子互作图谱中的疾病模块在疾病表征中具有重要作用。然而,这些通常用于定义模块的生物网络并不完整,并且偏向于一些研究较为充分的疾病基因。在此,我们提出一个问题,即在不依赖生物网络的先验知识的情况下,是否可以通过从大量转录组数据中训练深度自动编码器来发现与疾病相关的基因模块。我们假设可以在自动编码器的表示中发现模块。我们发现,全基因组关联研究(GWAS)相关基因在最后一层的富集程度具有统计学意义,而在中间层和第一层的富集程度依次降低。相比之下,我们发现了一个相反的梯度,其中模块化的蛋白质-蛋白质相互作用信号在第一层最强,但在网络的更深层次则逐渐平滑消失。我们得出结论,数据驱动的发现方法足以发现与疾病相关的基因群。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd6d/7016183/b46217049213/41467_2020_14666_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd6d/7016183/bd92ed1953a9/41467_2020_14666_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd6d/7016183/49c4a73c5d89/41467_2020_14666_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd6d/7016183/c61619e857f8/41467_2020_14666_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd6d/7016183/77d4d01ba8ee/41467_2020_14666_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd6d/7016183/68fc53baefeb/41467_2020_14666_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd6d/7016183/b252ddf98806/41467_2020_14666_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd6d/7016183/b46217049213/41467_2020_14666_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd6d/7016183/bd92ed1953a9/41467_2020_14666_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd6d/7016183/49c4a73c5d89/41467_2020_14666_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd6d/7016183/c61619e857f8/41467_2020_14666_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd6d/7016183/77d4d01ba8ee/41467_2020_14666_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd6d/7016183/68fc53baefeb/41467_2020_14666_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd6d/7016183/b252ddf98806/41467_2020_14666_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd6d/7016183/b46217049213/41467_2020_14666_Fig7_HTML.jpg

相似文献

1
Deriving disease modules from the compressed transcriptional space embedded in a deep autoencoder.从深度自动编码器中嵌入的压缩转录空间中提取疾病模块。
Nat Commun. 2020 Feb 12;11(1):856. doi: 10.1038/s41467-020-14666-6.
2
Repositioning drugs by targeting network modules: a Parkinson's disease case study.通过靶向网络模块对药物进行再定位:帕金森病案例研究。
BMC Bioinformatics. 2017 Dec 28;18(Suppl 14):532. doi: 10.1186/s12859-017-1889-0.
3
Assessment of network module identification across complex diseases.评估复杂疾病中的网络模块识别。
Nat Methods. 2019 Sep;16(9):843-852. doi: 10.1038/s41592-019-0509-5. Epub 2019 Aug 30.
4
Systems biology analyses of gene expression and genome wide association study data in obstructive sleep apnea.阻塞性睡眠呼吸暂停中基因表达和全基因组关联研究数据的系统生物学分析
Pac Symp Biocomput. 2011:14-25. doi: 10.1142/9789814335058_0003.
5
Enriching Human Interactome with Functional Mutations to Detect High-Impact Network Modules Underlying Complex Diseases.通过功能突变丰富人类互作组,以检测复杂疾病相关的高影响力网络模块。
Genes (Basel). 2019 Nov 15;10(11):933. doi: 10.3390/genes10110933.
6
Genome-wide pathogenesis interpretation using a heat diffusion-based systems genetics method and implications for gene function annotation.基于热扩散的系统遗传学方法进行全基因组发病机制解析及其对基因功能注释的意义。
Mol Genet Genomic Med. 2020 Oct;8(10):e1456. doi: 10.1002/mgg3.1456. Epub 2020 Sep 1.
7
Network analysis of GWAS data.GWAS 数据的网络分析。
Curr Opin Genet Dev. 2013 Dec;23(6):602-10. doi: 10.1016/j.gde.2013.09.003. Epub 2013 Nov 26.
8
Feature related multi-view nonnegative matrix factorization for identifying conserved functional modules in multiple biological networks.基于特征相关的多视图非负矩阵分解识别多个生物网络中的保守功能模块。
BMC Bioinformatics. 2018 Oct 29;19(1):394. doi: 10.1186/s12859-018-2434-5.
9
Benchmarking network-based gene prioritization methods for cerebral small vessel disease.基于网络的脑小血管病基因优先级排序方法的基准测试
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab006.
10
Network models of genome-wide association studies uncover the topological centrality of protein interactions in complex diseases.全基因组关联研究的网络模型揭示了蛋白质相互作用在复杂疾病中的拓扑中心性。
J Am Med Inform Assoc. 2013 Jul-Aug;20(4):619-29. doi: 10.1136/amiajnl-2012-001519. Epub 2013 Jan 25.

引用本文的文献

1
Infusing structural assumptions into dimensionality reduction for single-cell RNA sequencing data to identify small gene sets.将结构假设融入单细胞RNA测序数据的降维过程中以识别小基因集。
Commun Biol. 2025 Mar 11;8(1):414. doi: 10.1038/s42003-025-07872-9.
2
Towards the Next Generation of Data-Driven Therapeutics Using Spatially Resolved Single-Cell Technologies and Generative AI.迈向使用空间分辨单细胞技术和生成式人工智能的下一代数据驱动疗法。
Eur J Immunol. 2025 Feb;55(2):e202451234. doi: 10.1002/eji.202451234.
3
Precise and interpretable neural networks reveal epigenetic signatures of aging across youth in health and disease.

本文引用的文献

1
Unveiling new disease, pathway, and gene associations via multi-scale neural network.通过多尺度神经网络揭示新的疾病、途径和基因关联。
PLoS One. 2020 Apr 6;15(4):e0231059. doi: 10.1371/journal.pone.0231059. eCollection 2020.
2
Assessment of network module identification across complex diseases.评估复杂疾病中的网络模块识别。
Nat Methods. 2019 Sep;16(9):843-852. doi: 10.1038/s41592-019-0509-5. Epub 2019 Aug 30.
3
A validated single-cell-based strategy to identify diagnostic and therapeutic targets in complex diseases.
精确且可解释的神经网络揭示了健康与疾病状态下青年期衰老的表观遗传特征。
Front Aging. 2025 Jan 23;5:1526146. doi: 10.3389/fragi.2024.1526146. eCollection 2024.
4
Latent space arithmetic on data embeddings from healthy multi-tissue human RNA-seq decodes disease modules.来自健康多组织人类RNA测序数据嵌入的潜在空间算法可解码疾病模块。
Patterns (N Y). 2024 Oct 31;5(11):101093. doi: 10.1016/j.patter.2024.101093. eCollection 2024 Nov 8.
5
CGMega: explainable graph neural network framework with attention mechanisms for cancer gene module dissection.CGMega:具有注意力机制的可解释图神经网络框架,用于癌症基因模块剖析。
Nat Commun. 2024 Jul 17;15(1):5997. doi: 10.1038/s41467-024-50426-6.
6
Structure-primed embedding on the transcription factor manifold enables transparent model architectures for gene regulatory network and latent activity inference.转录因子流形上的结构引导嵌入能够实现基因调控网络和潜在活动推断的透明模型架构。
Genome Biol. 2024 Jan 18;25(1):24. doi: 10.1186/s13059-023-03134-1.
7
XA4C: eXplainable representation learning via Autoencoders revealing Critical genes.XA4C:基于自动编码器的可解释表示学习揭示关键基因。
PLoS Comput Biol. 2023 Oct 2;19(10):e1011476. doi: 10.1371/journal.pcbi.1011476. eCollection 2023 Oct.
8
NCAE: data-driven representations using a deep network-coherent DNA methylation autoencoder identify robust disease and risk factor signatures.NCAE:使用深度网络一致 DNA 甲基化自动编码器进行数据驱动的表示,可识别稳健的疾病和风险因素特征。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad293.
9
Biologically informed variational autoencoders allow predictive modeling of genetic and drug-induced perturbations.生物学启发的变分自动编码器允许对遗传和药物诱导的扰动进行预测建模。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad387.
10
Identifying prognostic subgroups of luminal-A breast cancer using deep autoencoders and gene expressions.利用深度自动编码器和基因表达鉴定 luminal-A 型乳腺癌的预后亚组。
PLoS Comput Biol. 2023 May 30;19(5):e1011197. doi: 10.1371/journal.pcbi.1011197. eCollection 2023 May.
一种经过验证的基于单细胞的策略,用于鉴定复杂疾病中的诊断和治疗靶点。
Genome Med. 2019 Jul 30;11(1):47. doi: 10.1186/s13073-019-0657-3.
4
Decomposing Cell Identity for Transfer Learning across Cellular Measurements, Platforms, Tissues, and Species.跨细胞测量、平台、组织和物种进行迁移学习的细胞身份分解。
Cell Syst. 2019 May 22;8(5):395-411.e8. doi: 10.1016/j.cels.2019.04.004.
5
MultiPLIER: A Transfer Learning Framework for Transcriptomics Reveals Systemic Features of Rare Disease.MultiPLIER:一种转录组学的迁移学习框架,揭示了罕见病的系统特征。
Cell Syst. 2019 May 22;8(5):380-394.e4. doi: 10.1016/j.cels.2019.04.003.
6
Dr.VAE: improving drug response prediction via modeling of drug perturbation effects.VAE 博士:通过建模药物干扰效应来改善药物反应预测。
Bioinformatics. 2019 Oct 1;35(19):3743-3751. doi: 10.1093/bioinformatics/btz158.
7
High-performance medicine: the convergence of human and artificial intelligence.高性能医学:人机智能融合。
Nat Med. 2019 Jan;25(1):44-56. doi: 10.1038/s41591-018-0300-7. Epub 2019 Jan 7.
8
GSAE: an autoencoder with embedded gene-set nodes for genomics functional characterization.GSAE:一种带有嵌入式基因集节点的自动编码器,用于基因组功能表征。
BMC Syst Biol. 2018 Dec 21;12(Suppl 8):142. doi: 10.1186/s12918-018-0642-2.
9
Genomic data integration systematically biases interactome mapping.基因组数据整合系统地影响互作图谱绘制。
PLoS Comput Biol. 2018 Oct 17;14(10):e1006474. doi: 10.1371/journal.pcbi.1006474. eCollection 2018 Oct.
10
Massive mining of publicly available RNA-seq data from human and mouse.大规模挖掘人类和小鼠公共可用的 RNA-seq 数据。
Nat Commun. 2018 Apr 10;9(1):1366. doi: 10.1038/s41467-018-03751-6.