• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于蛋白质语言模型构建的基因网络对空间转录组学进行推断。

Imputing spatial transcriptomics through gene network constructed from protein language model.

机构信息

School of Big Data and Software Engineering, Chongqing University, Chongqing, China.

School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China.

出版信息

Commun Biol. 2024 Oct 5;7(1):1271. doi: 10.1038/s42003-024-06964-2.

DOI:10.1038/s42003-024-06964-2
PMID:39369061
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11455941/
Abstract

Image-based spatial transcriptomic sequencing technologies have enabled the measurement of gene expression at single-cell resolution, but with a limited number of genes. Current computational approaches attempt to overcome these limitations by imputing missing genes, but face challenges regarding prediction accuracy and identification of cell populations due to the neglect of gene-gene relationships. In this context, we present stImpute, a method to impute spatial transcriptomics according to reference scRNA-seq data based on the gene network constructed from the protein language model ESM-2. Specifically, stImpute employs an autoencoder to create gene expression embeddings for both spatial transcriptomics and scRNA-seq data, which are used to identify the nearest neighboring cells between scRNA-seq and spatial transcriptomics datasets. According to the neighbored cells, the gene expressions of spatial transcriptomics cells are imputed through a graph neural network, where nodes are genes, and edges are based on cosine similarity between the ESM-2 embeddings of the gene-encoding proteins. The gene prediction uncertainty is further measured through a deep learning model. stImpute was shown to consistently outperform state-of-the-art methods across multiple datasets concerning imputation and clustering. stImpute also demonstrates robustness in producing consistent results that are insensitive to model parameters.

摘要

基于图像的空间转录组测序技术使我们能够以单细胞分辨率测量基因表达,但检测到的基因数量有限。目前的计算方法试图通过内插缺失的基因来克服这些限制,但由于忽略了基因-基因关系,它们在预测准确性和细胞群体识别方面面临挑战。在这种情况下,我们提出了 stImpute,这是一种根据参考 scRNA-seq 数据对空间转录组学进行内插的方法,该方法基于从蛋白质语言模型 ESM-2 构建的基因网络。具体来说,stImpute 使用自动编码器为空间转录组学和 scRNA-seq 数据创建基因表达嵌入,这些嵌入用于识别 scRNA-seq 和空间转录组学数据集之间的最近邻细胞。根据邻接细胞,通过图神经网络对空间转录组学细胞的基因表达进行内插,其中节点是基因,边是基于基因编码蛋白的 ESM-2 嵌入之间的余弦相似度。通过深度学习模型进一步测量基因预测的不确定性。stImpute 在多个数据集的内插和聚类方面均优于最先进的方法。stImpute 还表现出在产生一致结果方面的稳健性,对模型参数不敏感。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cde/11455941/670fc7c8e8c1/42003_2024_6964_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cde/11455941/d9e0365587cf/42003_2024_6964_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cde/11455941/b350adc688af/42003_2024_6964_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cde/11455941/1daa7c92eb67/42003_2024_6964_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cde/11455941/97422adc7a8e/42003_2024_6964_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cde/11455941/670fc7c8e8c1/42003_2024_6964_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cde/11455941/d9e0365587cf/42003_2024_6964_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cde/11455941/b350adc688af/42003_2024_6964_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cde/11455941/1daa7c92eb67/42003_2024_6964_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cde/11455941/97422adc7a8e/42003_2024_6964_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cde/11455941/670fc7c8e8c1/42003_2024_6964_Fig5_HTML.jpg

相似文献

1
Imputing spatial transcriptomics through gene network constructed from protein language model.基于蛋白质语言模型构建的基因网络对空间转录组学进行推断。
Commun Biol. 2024 Oct 5;7(1):1271. doi: 10.1038/s42003-024-06964-2.
2
scBOL: a universal cell type identification framework for single-cell and spatial transcriptomics data.scBOL:单细胞和空间转录组学数据的通用细胞类型识别框架。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae188.
3
SFINN: inferring gene regulatory network from single-cell and spatial transcriptomic data with shared factor neighborhood and integrated neural network.SFINN:利用共享因子邻域和集成神经网络从单细胞和空间转录组数据推断基因调控网络。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae433.
4
stDiff: a diffusion model for imputing spatial transcriptomics through single-cell transcriptomics.stDiff:一种通过单细胞转录组学推断空间转录组学的扩散模型。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae171.
5
Impeller: a path-based heterogeneous graph learning method for spatial transcriptomic data imputation.叶轮:基于路径的异质图学习方法,用于空间转录组学数据插补。
Bioinformatics. 2024 Jun 3;40(6). doi: 10.1093/bioinformatics/btae339.
6
scMGATGRN: a multiview graph attention network-based method for inferring gene regulatory networks from single-cell transcriptomic data.scMGATGRN:一种基于多视图图注意力网络的方法,用于从单细胞转录组数据推断基因调控网络。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae526.
7
scNPF: an integrative framework assisted by network propagation and network fusion for preprocessing of single-cell RNA-seq data.scNPF:一种基于网络传播和网络融合的综合框架,用于单细胞 RNA-seq 数据的预处理。
BMC Genomics. 2019 May 8;20(1):347. doi: 10.1186/s12864-019-5747-5.
8
GE-Impute: graph embedding-based imputation for single-cell RNA-seq data.GE-Impute:基于图嵌入的单细胞 RNA-seq 数据插补。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac313.
9
scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering.scBGEDA:基于双分图集成分聚类的对偶去噪自动编码器的单细胞聚类分析。
Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad075.
10
STdGCN: spatial transcriptomic cell-type deconvolution using graph convolutional networks.STdGCN:基于图卷积网络的空间转录组细胞类型去卷积。
Genome Biol. 2024 Aug 5;25(1):206. doi: 10.1186/s13059-024-03353-0.

引用本文的文献

1
spRefine Denoises and Imputes Spatial Transcriptomics with a Reference-Free Framework Powered by Genomic Language Model.spRefine:使用由基因组语言模型驱动的无参考框架对空间转录组学进行去噪和插补。
bioRxiv. 2025 Jul 7:2025.04.22.649977. doi: 10.1101/2025.04.22.649977.
2
Deep learning in single-cell and spatial transcriptomics data analysis: advances and challenges from a data science perspective.从数据科学视角看深度学习在单细胞和空间转录组学数据分析中的进展与挑战
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf136.

本文引用的文献

1
Reliable imputation of spatial transcriptomes with uncertainty estimation and spatial regularization.通过不确定性估计和空间正则化对空间转录组进行可靠插补。
Patterns (N Y). 2024 Jul 9;5(8):101021. doi: 10.1016/j.patter.2024.101021. eCollection 2024 Aug 9.
2
SIRV: spatial inference of RNA velocity at the single-cell resolution.SIRV:单细胞分辨率下RNA速度的空间推断
NAR Genom Bioinform. 2024 Aug 6;6(3):lqae100. doi: 10.1093/nargab/lqae100. eCollection 2024 Sep.
3
scGPT: toward building a foundation model for single-cell multi-omics using generative AI.
scGPT:迈向使用生成式人工智能构建单细胞多组学基础模型
Nat Methods. 2024 Aug;21(8):1470-1480. doi: 10.1038/s41592-024-02201-0. Epub 2024 Feb 26.
4
Repeated Omicron exposures override ancestral SARS-CoV-2 immune imprinting.多次感染奥密克戎会颠覆原始 SARS-CoV-2 免疫印记。
Nature. 2024 Jan;625(7993):148-156. doi: 10.1038/s41586-023-06753-7. Epub 2023 Nov 22.
5
Identification of mobile genetic elements with geNomad.使用 geNomad 识别移动遗传元件。
Nat Biotechnol. 2024 Aug;42(8):1303-1312. doi: 10.1038/s41587-023-01953-y. Epub 2023 Sep 21.
6
Predicting transcriptional outcomes of novel multigene perturbations with GEARS.用 GEARS 预测新型多基因扰动的转录结果。
Nat Biotechnol. 2024 Jun;42(6):927-935. doi: 10.1038/s41587-023-01905-6. Epub 2023 Aug 17.
7
Best practices for single-cell analysis across modalities.多模态单细胞分析的最佳实践。
Nat Rev Genet. 2023 Aug;24(8):550-572. doi: 10.1038/s41576-023-00586-w. Epub 2023 Mar 31.
8
Spatial-ID: a cell typing method for spatially resolved transcriptomics via transfer learning and spatial embedding.空间 ID:一种通过迁移学习和空间嵌入进行空间分辨转录组学的细胞分型方法。
Nat Commun. 2022 Dec 10;13(1):7640. doi: 10.1038/s41467-022-35288-0.
9
and - Novel Regulatory Factors of Postnatal Hypothalamic Neurogenesis.以及——出生后下丘脑神经发生的新型调控因子。
Front Neurosci. 2021 Nov 2;15:763856. doi: 10.3389/fnins.2021.763856. eCollection 2021.
10
Deep learning and alignment of spatially resolved single-cell transcriptomes with Tangram.基于 Tangram 的空间分辨单细胞转录组的深度学习和对齐。
Nat Methods. 2021 Nov;18(11):1352-1362. doi: 10.1038/s41592-021-01264-7. Epub 2021 Oct 28.