• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于无上下文零样本深度集成的算法从单细胞转录组数据推断 2500 多种表面蛋白的丰度。

Imputing abundance of over 2,500 surface proteins from single-cell transcriptomes with context-agnostic zero-shot deep ensembles.

机构信息

Department of Pharmacology and Toxicology, Michigan State University, East Lansing, MI 48824, USA.

Department of Computer Science and Engineering, Michigan State University, East Lansing, MI 48824, USA.

出版信息

Cell Syst. 2024 Sep 18;15(9):869-884.e6. doi: 10.1016/j.cels.2024.08.006. Epub 2024 Sep 6.

DOI:10.1016/j.cels.2024.08.006
PMID:39243755
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11423933/
Abstract

Cell surface proteins serve as primary drug targets and cell identity markers. Techniques such as CITE-seq (cellular indexing of transcriptomes and epitopes by sequencing) have enabled the simultaneous quantification of surface protein abundance and transcript expression within individual cells. The published data have been utilized to train machine learning models for predicting surface protein abundance solely from transcript expression. However, the small scale of proteins predicted and the poor generalization ability of these computational approaches across diverse contexts (e.g., different tissues/disease states) impede their widespread adoption. Here, we propose SPIDER (surface protein prediction using deep ensembles from single-cell RNA sequencing), a context-agnostic zero-shot deep ensemble model, which enables large-scale protein abundance prediction and generalizes better to various contexts. Comprehensive benchmarking shows that SPIDER outperforms other state-of-the-art methods. Using the predicted surface abundance of >2,500 proteins from single-cell transcriptomes, we demonstrate the broad applications of SPIDER, including cell type annotation, biomarker/target identification, and cell-cell interaction analysis in hepatocellular carcinoma and colorectal cancer. A record of this paper's transparent peer review process is included in the supplemental information.

摘要

细胞表面蛋白是主要的药物靶点和细胞特征标志物。CITE-seq(通过测序对转录组和表位进行细胞索引)等技术能够在单个细胞内同时定量测量表面蛋白丰度和转录表达。已发表的数据被用于训练机器学习模型,仅根据转录表达预测表面蛋白丰度。然而,这些计算方法预测的蛋白数量较少,在不同的环境(例如不同的组织/疾病状态)中的泛化能力较差,限制了它们的广泛应用。在这里,我们提出了 SPIDER(使用单细胞 RNA 测序的深度集成进行表面蛋白预测),这是一种与上下文无关的零样本深度集成模型,能够实现大规模的蛋白丰度预测,并更好地推广到各种环境。全面的基准测试表明,SPIDER 优于其他最先进的方法。我们使用来自单细胞转录组的 >2500 种蛋白质的预测表面丰度,展示了 SPIDER 的广泛应用,包括肝细胞癌和结直肠癌中的细胞类型注释、生物标志物/靶点识别以及细胞间相互作用分析。本论文的透明同行评审过程记录包含在补充信息中。

相似文献

1
Imputing abundance of over 2,500 surface proteins from single-cell transcriptomes with context-agnostic zero-shot deep ensembles.基于无上下文零样本深度集成的算法从单细胞转录组数据推断 2500 多种表面蛋白的丰度。
Cell Syst. 2024 Sep 18;15(9):869-884.e6. doi: 10.1016/j.cels.2024.08.006. Epub 2024 Sep 6.
2
Imputing abundance of over 2500 surface proteins from single-cell transcriptomes with context-agnostic zero-shot deep ensembles.利用上下文无关的零样本深度集成模型从单细胞转录组中估算2500多种表面蛋白的丰度。
bioRxiv. 2024 Jul 31:2024.07.31.605432. doi: 10.1101/2024.07.31.605432.
3
DEMOC: a deep embedded multi-omics learning approach for clustering single-cell CITE-seq data.DEMOC:一种用于聚类单细胞 CITE-seq 数据的深度嵌入式多组学学习方法。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac347.
4
Ensemble learning models that predict surface protein abundance from single-cell multimodal omics data.从单细胞多模态组学数据中预测表面蛋白丰度的集成学习模型。
Methods. 2021 May;189:65-73. doi: 10.1016/j.ymeth.2020.10.001. Epub 2020 Oct 9.
5
Surface protein imputation from single cell transcriptomes by deep neural networks.基于深度神经网络的单细胞转录组表面蛋白推断。
Nat Commun. 2020 Jan 31;11(1):651. doi: 10.1038/s41467-020-14391-0.
6
scDM: A deep generative method for cell surface protein prediction with diffusion model.scDM:基于扩散模型的细胞表面蛋白深度生成预测方法。
J Mol Biol. 2024 Jun 15;436(12):168610. doi: 10.1016/j.jmb.2024.168610. Epub 2024 May 15.
7
A Targeted Multi-omic Analysis Approach Measures Protein Expression and Low-Abundance Transcripts on the Single-Cell Level.靶向多组学分析方法可在单细胞水平上测量蛋白质表达和低丰度转录本。
Cell Rep. 2020 Apr 7;31(1):107499. doi: 10.1016/j.celrep.2020.03.063.
8
The Conjugation of Antibodies for the Simultaneous Detection of Surface Proteins and Transcriptome Analysis at a Single-Cell Level.抗体偶联用于单细胞水平同时检测表面蛋白和转录组分析。
Methods Mol Biol. 2020;2184:31-45. doi: 10.1007/978-1-0716-0802-9_3.
9
A hybrid deep clustering approach for robust cell type profiling using single-cell RNA-seq data.基于单细胞 RNA-seq 数据的混合深度聚类方法进行稳健的细胞类型分析。
RNA. 2020 Oct;26(10):1303-1319. doi: 10.1261/rna.074427.119. Epub 2020 Jun 12.
10
Simultaneous Measurement of Surface Proteins and Gene Expression from Single Cells.从单细胞中同时测量表面蛋白和基因表达。
Methods Mol Biol. 2020;2111:35-46. doi: 10.1007/978-1-0716-0266-9_3.

引用本文的文献

1
DGAT: A Dual-Graph Attention Network for Inferring Spatial Protein Landscapes from Transcriptomics.DGAT:一种用于从转录组学推断空间蛋白质景观的双图注意力网络。
bioRxiv. 2025 Jul 9:2025.07.05.662121. doi: 10.1101/2025.07.05.662121.

本文引用的文献

1
Efficient Generation of Paired Single-Cell Multiomics Profiles by Deep Learning.深度学习高效生成配对单细胞多组学图谱。
Adv Sci (Weinh). 2023 Jul;10(21):e2301169. doi: 10.1002/advs.202301169. Epub 2023 Apr 28.
2
A multi-use deep learning method for CITE-seq and single-cell RNA-seq data integration with cell surface protein prediction and imputation.一种用于CITE-seq和单细胞RNA-seq数据整合以及细胞表面蛋白预测与插补的多用途深度学习方法。
Nat Mach Intell. 2022 Nov;4(11):940-952. doi: 10.1038/s42256-022-00545-w. Epub 2022 Oct 27.
3
Single-cell proteomics enabled by next-generation sequencing or mass spectrometry.基于下一代测序或质谱的单细胞蛋白质组学。
Nat Methods. 2023 Mar;20(3):363-374. doi: 10.1038/s41592-023-01791-5. Epub 2023 Mar 2.
4
The complex network of transcription factors, immune checkpoint inhibitors and stemness features in colorectal cancer: A recent update.结直肠癌中转录因子、免疫检查点抑制剂和干性特征的复杂网络:最新进展。
Semin Cancer Biol. 2023 Feb;89:1-17. doi: 10.1016/j.semcancer.2023.01.001. Epub 2023 Jan 6.
5
A single-cell atlas of the multicellular ecosystem of primary and metastatic hepatocellular carcinoma.原发性和转移性肝细胞癌的多细胞生态系统单细胞图谱。
Nat Commun. 2022 Aug 6;13(1):4594. doi: 10.1038/s41467-022-32283-3.
6
A Python library for probabilistic analysis of single-cell omics data.一个用于单细胞组学数据概率分析的Python库。
Nat Biotechnol. 2022 Feb;40(2):163-166. doi: 10.1038/s41587-021-01206-w.
7
Single-cell proteo-genomic reference maps of the hematopoietic system enable the purification and massive profiling of precisely defined cell states.单细胞蛋白质基因组参考图谱可对造血系统进行精确定义的细胞状态的纯化和大规模分析。
Nat Immunol. 2021 Dec;22(12):1577-1589. doi: 10.1038/s41590-021-01059-0. Epub 2021 Nov 22.
8
BAG3 induces α-SMA expression in human fibroblasts and its over-expression correlates with poorer survival in fibrotic cancer patients.BAG3 诱导人成纤维细胞中 α-SMA 的表达,其过表达与纤维化癌症患者的生存率降低相关。
J Cell Biochem. 2022 Jan;123(1):91-101. doi: 10.1002/jcb.30171. Epub 2021 Nov 6.
9
Single-cell sequencing unveils distinct immune microenvironments with CCR6-CCL20 crosstalk in human chronic pancreatitis.单细胞测序揭示人类慢性胰腺炎中 CCR6-CCL20 相互作用的独特免疫微环境。
Gut. 2022 Sep;71(9):1831-1842. doi: 10.1136/gutjnl-2021-324546. Epub 2021 Oct 26.
10
CD177 modulates the function and homeostasis of tumor-infiltrating regulatory T cells.CD177 调节肿瘤浸润调节性 T 细胞的功能和稳态。
Nat Commun. 2021 Oct 1;12(1):5764. doi: 10.1038/s41467-021-26091-4.