• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

spRefine:使用由基因组语言模型驱动的无参考框架对空间转录组学进行去噪和插补。

spRefine Denoises and Imputes Spatial Transcriptomics with a Reference-Free Framework Powered by Genomic Language Model.

作者信息

Liu Tianyu, Huang Tinglin, Jin Wengong, Chu Tinyi, Ying Rex, Zhao Hongyu

机构信息

Interdepartmental Program in Computational Biology & Bioinformatics, Yale University, New Haven, 06511, CT, USA.

Department of Biostatistics, Yale University, New Haven, 06511, CT, USA.

出版信息

bioRxiv. 2025 Jul 7:2025.04.22.649977. doi: 10.1101/2025.04.22.649977.

DOI:10.1101/2025.04.22.649977
PMID:40631230
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12236840/
Abstract

The analysis of spatial transcriptomics is hindered by high noise levels and missing gene measurements, challenges that are further compounded by the higher cost of spatial data compared to traditional single-cell data. To overcome this challenge, we introduce , a deep learning framework that leverages genomic language models to jointly denoise and impute spatial transcriptomic data. Our results demonstrate that spRefine yields more robust cell- and spot-level representations after denoising and imputation, substantially improving data integration. In addition, spRefine serves as a strong framework for model pre-training and the discovery of novel biological signals, as highlighted by multiple downstream applications across datasets of varying scales. Notably, spRefine enhances the accuracy of spatial ageing clock estimations and uncovers new aging-related relationships associated with key biological processes, such as neuronal function loss, which offers new insights for analyzing ageing effect with spatial transcriptomics.

摘要

空间转录组学的分析受到高噪声水平和基因测量缺失的阻碍,与传统单细胞数据相比,空间数据成本更高,这进一步加剧了这些挑战。为了克服这一挑战,我们引入了spRefine,这是一个深度学习框架,它利用基因组语言模型对空间转录组数据进行联合去噪和插补。我们的结果表明,spRefine在去噪和插补后产生了更稳健的细胞和斑点水平表征,显著改善了数据整合。此外,spRefine作为模型预训练和发现新生物信号的强大框架,不同规模数据集的多个下游应用突出了这一点。值得注意的是,spRefine提高了空间衰老时钟估计的准确性,并揭示了与关键生物过程(如神经元功能丧失)相关的新的衰老相关关系,这为利用空间转录组学分析衰老效应提供了新的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8894/12236840/d9dffdb8bc96/nihpp-2025.04.22.649977v3-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8894/12236840/9840b5e01c5b/nihpp-2025.04.22.649977v3-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8894/12236840/67245d5df48f/nihpp-2025.04.22.649977v3-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8894/12236840/22559a745a4c/nihpp-2025.04.22.649977v3-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8894/12236840/928c2a793b3d/nihpp-2025.04.22.649977v3-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8894/12236840/aeb35dff10ad/nihpp-2025.04.22.649977v3-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8894/12236840/d1cfa931339e/nihpp-2025.04.22.649977v3-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8894/12236840/d9dffdb8bc96/nihpp-2025.04.22.649977v3-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8894/12236840/9840b5e01c5b/nihpp-2025.04.22.649977v3-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8894/12236840/67245d5df48f/nihpp-2025.04.22.649977v3-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8894/12236840/22559a745a4c/nihpp-2025.04.22.649977v3-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8894/12236840/928c2a793b3d/nihpp-2025.04.22.649977v3-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8894/12236840/aeb35dff10ad/nihpp-2025.04.22.649977v3-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8894/12236840/d1cfa931339e/nihpp-2025.04.22.649977v3-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8894/12236840/d9dffdb8bc96/nihpp-2025.04.22.649977v3-f0007.jpg

相似文献

1
spRefine Denoises and Imputes Spatial Transcriptomics with a Reference-Free Framework Powered by Genomic Language Model.spRefine:使用由基因组语言模型驱动的无参考框架对空间转录组学进行去噪和插补。
bioRxiv. 2025 Jul 7:2025.04.22.649977. doi: 10.1101/2025.04.22.649977.
2
GatorST: A Versatile Contrastive Meta-Learning Framework for Spatial Transcriptomic Data Analysis.GatorST:用于空间转录组数据分析的通用对比元学习框架。
bioRxiv. 2025 Jul 19:2025.07.01.662625. doi: 10.1101/2025.07.01.662625.
3
stGRL: spatial domain identification, denoising, and imputation algorithm for spatial transcriptome data based on multi-task graph contrastive representation learning.stGRL:基于多任务图对比表示学习的空间转录组数据的空间域识别、去噪和插补算法
BMC Biol. 2025 Jul 1;23(1):177. doi: 10.1186/s12915-025-02290-z.
4
stGNN: Spatially Informed Cell-Type Deconvolution Based on Deep Graph Learning and Statistical Modeling.stGNN:基于深度图学习和统计建模的空间信息细胞类型反卷积
Interdiscip Sci. 2025 Jun 26. doi: 10.1007/s12539-025-00728-0.
5
Linking transcriptome and morphology in bone cells at cellular resolution with generative AI.利用生成式人工智能在细胞分辨率下将骨细胞中的转录组与形态学联系起来。
J Bone Miner Res. 2024 Dec 31;40(1):20-26. doi: 10.1093/jbmr/zjae151.
6
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
7
The Overlooked Role of Specimen Preparation in Bolstering Deep Learning-Enhanced Spatial Transcriptomics Workflows.样本制备在支持深度学习增强的空间转录组学工作流程中被忽视的作用。
medRxiv. 2023 Oct 9:2023.10.09.23296700. doi: 10.1101/2023.10.09.23296700.
8
Short-Term Memory Impairment短期记忆障碍
9
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能?开发一种互联网应用算法。
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.
10
Systemic treatments for metastatic cutaneous melanoma.转移性皮肤黑色素瘤的全身治疗
Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.

本文引用的文献

1
The Curated Cancer Cell Atlas provides a comprehensive characterization of tumors at single-cell resolution.《精心策划的癌细胞图谱》以单细胞分辨率对肿瘤进行了全面表征。
Nat Cancer. 2025 May 8. doi: 10.1038/s43018-025-00957-8.
2
scPRINT: pre-training on 50 million cells allows robust gene network predictions.scPRINT:在5000万个细胞上进行预训练可实现强大的基因网络预测。
Nat Commun. 2025 Apr 16;16(1):3607. doi: 10.1038/s41467-025-58699-1.
3
Optimizing Xenium In Situ data utility by quality assessment and best-practice analysis workflows.
通过质量评估和最佳实践分析工作流程优化Xenium原位数据效用。
Nat Methods. 2025 Apr;22(4):813-823. doi: 10.1038/s41592-025-02617-2. Epub 2025 Mar 13.
4
Genomic language models: opportunities and challenges.基因组语言模型:机遇与挑战。
Trends Genet. 2025 Apr;41(4):286-302. doi: 10.1016/j.tig.2024.11.013. Epub 2025 Jan 2.
5
Spatial transcriptomic clocks reveal cell proximity effects in brain ageing.空间转录组时钟揭示大脑衰老中的细胞邻近效应。
Nature. 2025 Feb;638(8049):160-171. doi: 10.1038/s41586-024-08334-8. Epub 2024 Dec 18.
6
CosGeneGate selects multi-functional and credible biomarkers for single-cell analysis.CosGeneGate 为单细胞分析选择多功能且可靠的生物标志物。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae626.
7
Protein language models learn evolutionary statistics of interacting sequence motifs.蛋白质语言模型学习相互作用序列基序的进化统计信息。
Proc Natl Acad Sci U S A. 2024 Nov 5;121(45):e2406285121. doi: 10.1073/pnas.2406285121. Epub 2024 Oct 28.
8
Modal-nexus auto-encoder for multi-modality cellular data integration and imputation.模态关联自动编码器用于多模态细胞数据的整合和插补。
Nat Commun. 2024 Oct 18;15(1):9021. doi: 10.1038/s41467-024-53355-6.
9
Imputing spatial transcriptomics through gene network constructed from protein language model.基于蛋白质语言模型构建的基因网络对空间转录组学进行推断。
Commun Biol. 2024 Oct 5;7(1):1271. doi: 10.1038/s42003-024-06964-2.
10
Reliable imputation of spatial transcriptomes with uncertainty estimation and spatial regularization.通过不确定性估计和空间正则化对空间转录组进行可靠插补。
Patterns (N Y). 2024 Jul 9;5(8):101021. doi: 10.1016/j.patter.2024.101021. eCollection 2024 Aug 9.