• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

scPRINT:在5000万个细胞上进行预训练可实现强大的基因网络预测。

scPRINT: pre-training on 50 million cells allows robust gene network predictions.

作者信息

Kalfon Jérémie, Samaran Jules, Peyré Gabriel, Cantini Laura

机构信息

Institut Pasteur, Université Paris Cité, CNRS UMR 3738, Machine Learning for Integrative Genomics group, F-75015, Paris, France.

CNRS and DMA de l'Ecole Normale Supérieure, CNRS, Ecole Normale Supérieure, Université PSL, 75005, Paris, France.

出版信息

Nat Commun. 2025 Apr 16;16(1):3607. doi: 10.1038/s41467-025-58699-1.

DOI:10.1038/s41467-025-58699-1
PMID:40240364
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12003772/
Abstract

A cell is governed by the interaction of myriads of macromolecules. Inferring such a network of interactions has remained an elusive milestone in cellular biology. Building on recent advances in large foundation models and their ability to learn without supervision, we present scPRINT, a large cell model for the inference of gene networks pre-trained on more than 50 million cells from the cellxgene database. Using innovative pretraining tasks and model architecture, scPRINT pushes large transformer models towards more interpretability and usability when uncovering the complex biology of the cell. Based on our atlas-level benchmarks, scPRINT demonstrates superior performance in gene network inference to the state of the art, as well as competitive zero-shot abilities in denoising, batch effect correction, and cell label prediction. On an atlas of benign prostatic hyperplasia, scPRINT highlights the profound connections between ion exchange, senescence, and chronic inflammation.

摘要

细胞受无数大分子相互作用的支配。推断这样一个相互作用网络一直是细胞生物学中难以实现的里程碑。基于大型基础模型的最新进展及其无监督学习能力,我们提出了scPRINT,这是一个用于推断基因网络的大型细胞模型,它在cellxgene数据库中超过5000万个细胞上进行了预训练。通过创新的预训练任务和模型架构,scPRINT在揭示细胞复杂生物学特性时,推动大型变压器模型朝着更高的可解释性和可用性发展。基于我们的图谱级基准,scPRINT在基因网络推理方面表现出优于现有技术的性能,以及在去噪、批次效应校正和细胞标签预测方面具有竞争力的零样本能力。在良性前列腺增生图谱上,scPRINT突出了离子交换、衰老和慢性炎症之间的深刻联系。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87ec/12003772/66f74bfaa747/41467_2025_58699_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87ec/12003772/f668e2c88daa/41467_2025_58699_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87ec/12003772/030a34da816c/41467_2025_58699_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87ec/12003772/3aa7dd7d6a93/41467_2025_58699_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87ec/12003772/6d6bcf5bc320/41467_2025_58699_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87ec/12003772/f8179a27f2ae/41467_2025_58699_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87ec/12003772/66f74bfaa747/41467_2025_58699_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87ec/12003772/f668e2c88daa/41467_2025_58699_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87ec/12003772/030a34da816c/41467_2025_58699_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87ec/12003772/3aa7dd7d6a93/41467_2025_58699_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87ec/12003772/6d6bcf5bc320/41467_2025_58699_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87ec/12003772/f8179a27f2ae/41467_2025_58699_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87ec/12003772/66f74bfaa747/41467_2025_58699_Fig6_HTML.jpg

相似文献

1
scPRINT: pre-training on 50 million cells allows robust gene network predictions.scPRINT:在5000万个细胞上进行预训练可实现强大的基因网络预测。
Nat Commun. 2025 Apr 16;16(1):3607. doi: 10.1038/s41467-025-58699-1.
2
Netter: re-ranking gene network inference predictions using structural network properties.内特尔:利用结构网络属性重新排列基因网络推理预测结果。
BMC Bioinformatics. 2016 Feb 9;17:76. doi: 10.1186/s12859-016-0913-0.
3
scGPT: toward building a foundation model for single-cell multi-omics using generative AI.scGPT:迈向使用生成式人工智能构建单细胞多组学基础模型
Nat Methods. 2024 Aug;21(8):1470-1480. doi: 10.1038/s41592-024-02201-0. Epub 2024 Feb 26.
4
HGATLink: single-cell gene regulatory network inference via the fusion of heterogeneous graph attention networks and transformer.HGATLink:通过异构图注意力网络与Transformer融合进行单细胞基因调控网络推断
BMC Bioinformatics. 2025 Feb 11;26(1):49. doi: 10.1186/s12859-025-06071-x.
5
GRLGRN: graph representation-based learning to infer gene regulatory networks from single-cell RNA-seq data.GRLGRN:基于图表示的学习方法,用于从单细胞RNA测序数据推断基因调控网络。
BMC Bioinformatics. 2025 Apr 18;26(1):108. doi: 10.1186/s12859-025-06116-1.
6
GCLink: a graph contrastive link prediction framework for gene regulatory network inference.GCLink:一种用于基因调控网络推断的图对比链接预测框架。
Bioinformatics. 2025 Mar 4;41(3). doi: 10.1093/bioinformatics/btaf074.
7
SFINN: inferring gene regulatory network from single-cell and spatial transcriptomic data with shared factor neighborhood and integrated neural network.SFINN:利用共享因子邻域和集成神经网络从单细胞和空间转录组数据推断基因调控网络。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae433.
8
SIGRN: Inferring Gene Regulatory Network with Soft Introspective Variational Autoencoders.SIGRN:使用软自省变分自编码器推断基因调控网络。
Int J Mol Sci. 2024 Nov 27;25(23):12741. doi: 10.3390/ijms252312741.
9
Integrating Prior Knowledge Using Transformer for Gene Regulatory Network Inference.利用Transformer整合先验知识进行基因调控网络推断
Adv Sci (Weinh). 2025 Jan;12(3):e2409990. doi: 10.1002/advs.202409990. Epub 2024 Nov 28.
10
Inferring cellular regulatory networks with Bayesian model averaging for linear regression (BMALR).使用贝叶斯模型平均法进行线性回归(BMALR)推断细胞调控网络。
Mol Biosyst. 2014 Aug;10(8):2023-30. doi: 10.1039/c4mb00053f.

引用本文的文献

1
Spatia: Multimodal Model for Prediction and Generation of Spatial Cell Phenotypes.Spatia:用于预测和生成空间细胞表型的多模态模型。
ArXiv. 2025 Jul 7:arXiv:2507.04704v1.
2
spRefine Denoises and Imputes Spatial Transcriptomics with a Reference-Free Framework Powered by Genomic Language Model.spRefine:使用由基因组语言模型驱动的无参考框架对空间转录组学进行去噪和插补。
bioRxiv. 2025 Jul 7:2025.04.22.649977. doi: 10.1101/2025.04.22.649977.
3
Advancing bioinformatics with large language models: components, applications and perspectives.

本文引用的文献

1
CZ CELLxGENE Discover: a single-cell data platform for scalable exploration, analysis and modeling of aggregated data.CZ CELLxGENE发现平台:一个用于对聚合数据进行可扩展探索、分析和建模的单细胞数据平台。
Nucleic Acids Res. 2025 Jan 6;53(D1):D886-D900. doi: 10.1093/nar/gkae1142.
2
Large-scale foundation model on single-cell transcriptomics.单细胞转录组学的大规模基础模型。
Nat Methods. 2024 Aug;21(8):1481-1491. doi: 10.1038/s41592-024-02305-7. Epub 2024 Jun 6.
3
IGFBP7 promotes endothelial cell repair in the recovery phase of acute lung injury.
利用大语言模型推进生物信息学:组件、应用与展望
ArXiv. 2025 Jan 31:arXiv:2401.04155v2.
IGFBP7 促进急性肺损伤恢复阶段的内皮细胞修复。
Clin Sci (Lond). 2024 Jul 3;138(13):797-815. doi: 10.1042/CS20240179.
4
Inferring gene regulatory networks from single-cell transcriptomics based on graph embedding.基于图嵌入的单细胞转录组学推断基因调控网络。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae291.
5
Improving protein-protein interaction prediction using protein language model and protein network features.利用蛋白质语言模型和蛋白质网络特征改进蛋白质-蛋白质相互作用预测。
Anal Biochem. 2024 Oct;693:115550. doi: 10.1016/j.ab.2024.115550. Epub 2024 Apr 26.
6
scGPT: toward building a foundation model for single-cell multi-omics using generative AI.scGPT:迈向使用生成式人工智能构建单细胞多组学基础模型
Nat Methods. 2024 Aug;21(8):1470-1480. doi: 10.1038/s41592-024-02201-0. Epub 2024 Feb 26.
7
Disentanglement of single-cell data with biolord.基于 Biolord 的单细胞数据解缠
Nat Biotechnol. 2024 Nov;42(11):1678-1683. doi: 10.1038/s41587-023-02079-x. Epub 2024 Jan 15.
8
Modeling gene regulatory networks using neural network architectures.使用神经网络架构对基因调控网络进行建模。
Nat Comput Sci. 2021 Jul;1(7):491-501. doi: 10.1038/s43588-021-00099-8. Epub 2021 Jul 22.
9
Deciphering driver regulators of cell fate decisions from single-cell transcriptomics data with CEFCON.利用 CEFCON 从单细胞转录组学数据中破译细胞命运决策的驱动调控因子。
Nat Commun. 2023 Dec 20;14(1):8459. doi: 10.1038/s41467-023-44103-3.
10
Iron accumulation drives fibrosis, senescence and the senescence-associated secretory phenotype.铁积累驱动纤维化、衰老和衰老相关分泌表型。
Nat Metab. 2023 Dec;5(12):2111-2130. doi: 10.1038/s42255-023-00928-2. Epub 2023 Dec 14.