• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

scGraph2Vec:一种由图神经网络和单细胞组学数据增强的用于基因嵌入的深度生成模型。

scGraph2Vec: a deep generative model for gene embedding augmented by graph neural network and single-cell omics data.

作者信息

Lin Shiqi, Jia Peilin

机构信息

National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China.

Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.

出版信息

Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae108.

DOI:10.1093/gigascience/giae108
PMID:39704704
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11659981/
Abstract

BACKGROUND

Exploring the cellular processes of genes from the aspects of biological networks is of great interest to understanding the properties of complex diseases and biological systems. Biological networks, such as protein-protein interaction networks and gene regulatory networks, provide insights into the molecular basis of cellular processes and often form functional clusters in different tissue and disease contexts.

RESULTS

We present scGraph2Vec, a deep learning framework for generating informative gene embeddings. scGraph2Vec extends the variational graph autoencoder framework and integrates single-cell datasets and gene-gene interaction networks. We demonstrate that the gene embeddings are biologically interpretable and enable the identification of gene clusters representing functional or tissue-specific cellular processes. By comparing similar tools, we showed that scGraph2Vec clearly distinguished different gene clusters and aggregated more biologically functional genes. scGraph2Vec can be widely applied in diverse biological contexts. We illustrated that the embeddings generated by scGraph2Vec can infer disease-associated genes from genome-wide association study data (e.g., COVID-19 and Alzheimer's disease), identify additional driver genes in lung adenocarcinoma, and reveal regulatory genes responsible for maintaining or transitioning melanoma cell states.

CONCLUSIONS

scGraph2Vec not only reconstructs tissue-specific gene networks but also obtains a latent representation of genes implying their biological functions.

摘要

背景

从生物网络的角度探索基因的细胞过程对于理解复杂疾病和生物系统的特性具有重要意义。生物网络,如蛋白质-蛋白质相互作用网络和基因调控网络,为细胞过程的分子基础提供了见解,并且在不同的组织和疾病背景下常常形成功能簇。

结果

我们提出了scGraph2Vec,一种用于生成信息丰富的基因嵌入的深度学习框架。scGraph2Vec扩展了变分图自动编码器框架,并整合了单细胞数据集和基因-基因相互作用网络。我们证明基因嵌入具有生物学可解释性,能够识别代表功能或组织特异性细胞过程的基因簇。通过与类似工具的比较,我们表明scGraph2Vec能够清晰地区分不同的基因簇,并聚集更多具有生物学功能的基因。scGraph2Vec可广泛应用于各种生物学背景。我们举例说明了scGraph2Vec生成的嵌入可以从全基因组关联研究数据(如COVID-19和阿尔茨海默病)中推断疾病相关基因,识别肺腺癌中的其他驱动基因,并揭示负责维持或转变黑色素瘤细胞状态的调控基因。

结论

scGraph2Vec不仅可以重建组织特异性基因网络,还能获得暗示基因生物学功能的潜在表示。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/154c4382ca77/giae108fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/ee9f6597566a/giae108fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/1840bcc4de55/giae108fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/eeaa82e9646e/giae108fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/77c6d4730167/giae108fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/01c796d7e8ed/giae108fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/a1848ee3a815/giae108fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/e74af507732e/giae108fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/154c4382ca77/giae108fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/ee9f6597566a/giae108fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/1840bcc4de55/giae108fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/eeaa82e9646e/giae108fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/77c6d4730167/giae108fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/01c796d7e8ed/giae108fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/a1848ee3a815/giae108fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/e74af507732e/giae108fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e774/11659981/154c4382ca77/giae108fig8.jpg

相似文献

1
scGraph2Vec: a deep generative model for gene embedding augmented by graph neural network and single-cell omics data.scGraph2Vec:一种由图神经网络和单细胞组学数据增强的用于基因嵌入的深度生成模型。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae108.
2
GRLGRN: graph representation-based learning to infer gene regulatory networks from single-cell RNA-seq data.GRLGRN:基于图表示的学习方法,用于从单细胞RNA测序数据推断基因调控网络。
BMC Bioinformatics. 2025 Apr 18;26(1):108. doi: 10.1186/s12859-025-06116-1.
3
Towards simplified graph neural networks for identifying cancer driver genes in heterophilic networks.迈向用于在异质网络中识别癌症驱动基因的简化图神经网络
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae691.
4
A deep learning framework for predicting disease-gene associations with functional modules and graph augmentation.一种基于功能模块和图增强的深度学习框架,用于预测疾病-基因关联。
BMC Bioinformatics. 2024 Jun 14;25(1):214. doi: 10.1186/s12859-024-05841-3.
5
SFINN: inferring gene regulatory network from single-cell and spatial transcriptomic data with shared factor neighborhood and integrated neural network.SFINN:利用共享因子邻域和集成神经网络从单细胞和空间转录组数据推断基因调控网络。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae433.
6
FactVAE: a factorized variational autoencoder for single-cell multi-omics data integration analysis.FactVAE:用于单细胞多组学数据整合分析的因子分解变分自编码器。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf157.
7
Integrating scRNA-seq and scATAC-seq with inter-type attention heterogeneous graph neural networks.将单细胞RNA测序和单细胞染色质可及性测序与跨类型注意力异构图神经网络相结合。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae711.
8
Network-based analysis of Alzheimer's Disease genes using multi-omics network integration with graph diffusion.使用多组学网络整合和图扩散对阿尔茨海默病基因进行基于网络的分析。
J Biomed Inform. 2025 Apr;164:104797. doi: 10.1016/j.jbi.2025.104797. Epub 2025 Feb 22.
9
scMGATGRN: a multiview graph attention network-based method for inferring gene regulatory networks from single-cell transcriptomic data.scMGATGRN:一种基于多视图图注意力网络的方法,用于从单细胞转录组数据推断基因调控网络。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae526.
10
scGNN is a novel graph neural network framework for single-cell RNA-Seq analyses.scGNN 是一种用于单细胞 RNA-Seq 分析的新型图神经网络框架。
Nat Commun. 2021 Mar 25;12(1):1882. doi: 10.1038/s41467-021-22197-x.

本文引用的文献

1
scapGNN: A graph neural network-based framework for active pathway and gene module inference from single-cell multi-omics data.scapGNN:一种基于图神经网络的框架,用于从单细胞多组学数据中推断活性途径和基因模块。
PLoS Biol. 2023 Nov 13;21(11):e3002369. doi: 10.1371/journal.pbio.3002369. eCollection 2023 Nov.
2
WikiPathways 2024: next generation pathway database.WikiPathways 2024:下一代路径数据库。
Nucleic Acids Res. 2024 Jan 5;52(D1):D679-D689. doi: 10.1093/nar/gkad960.
3
The Reactome Pathway Knowledgebase 2024.Reactome 通路知识库 2024.
Nucleic Acids Res. 2024 Jan 5;52(D1):D672-D678. doi: 10.1093/nar/gkad1025.
4
SIMBA: single-cell embedding along with features.SIMBA:单细胞特征嵌入。
Nat Methods. 2024 Jun;21(6):1003-1013. doi: 10.1038/s41592-023-01899-8. Epub 2023 May 29.
5
Gene knockout inference with variational graph autoencoder learning single-cell gene regulatory networks.基于变分图自动编码器学习的单细胞基因调控网络的基因敲除推断。
Nucleic Acids Res. 2023 Jul 21;51(13):6578-6592. doi: 10.1093/nar/gkad450.
6
The journey from melanocytes to melanoma.从黑素细胞到黑色素瘤。
Nat Rev Cancer. 2023 Jun;23(6):372-390. doi: 10.1038/s41568-023-00565-7. Epub 2023 Apr 24.
7
Integrative Analysis of a Large Real-World Cohort of Small Cell Lung Cancer Identifies Distinct Genetic Subtypes and Insights into Histologic Transformation.整合分析大样本小细胞肺癌队列鉴定出不同的遗传亚型,并深入了解组织学转化。
Cancer Discov. 2023 Jul 7;13(7):1572-1591. doi: 10.1158/2159-8290.CD-22-0620.
8
siVAE: interpretable deep generative models for single-cell transcriptomes.siVAE:单细胞转录组的可解释深度生成模型。
Genome Biol. 2023 Feb 20;24(1):29. doi: 10.1186/s13059-023-02850-y.
9
The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest.2023 年的 STRING 数据库:针对任何感兴趣的测序基因组的蛋白质-蛋白质关联网络和功能富集分析。
Nucleic Acids Res. 2023 Jan 6;51(D1):D638-D646. doi: 10.1093/nar/gkac1000.
10
CCT6A and CHCHD2 Are Coamplified with EGFR and Associated with the Unfavorable Clinical Outcomes of Lung Adenocarcinoma.CCT6A 和 CHCHD2 与 EGFR 共扩增,并与肺腺癌的不良临床结局相关。
Dis Markers. 2022 Jul 28;2022:1560199. doi: 10.1155/2022/1560199. eCollection 2022.