• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过将异质数据集映射到共同的细胞嵌入空间来实现单细胞数据的在线整合。

Online single-cell data integration through projecting heterogeneous datasets into a common cell-embedding space.

机构信息

MOE Key Laboratory of Bioinformatics, Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing, 100084, China.

Tsinghua-Peking Center for Life Sciences, Beijing, 100084, China.

出版信息

Nat Commun. 2022 Oct 17;13(1):6118. doi: 10.1038/s41467-022-33758-z.

DOI:10.1038/s41467-022-33758-z
PMID:36253379
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9574176/
Abstract

Computational tools for integrative analyses of diverse single-cell experiments are facing formidable new challenges including dramatic increases in data scale, sample heterogeneity, and the need to informatively cross-reference new data with foundational datasets. Here, we present SCALEX, a deep-learning method that integrates single-cell data by projecting cells into a batch-invariant, common cell-embedding space in a truly online manner (i.e., without retraining the model). SCALEX substantially outperforms online iNMF and other state-of-the-art non-online integration methods on benchmark single-cell datasets of diverse modalities, (e.g., single-cell RNA sequencing, scRNA-seq, single-cell assay for transposase-accessible chromatin use sequencing, scATAC-seq), especially for datasets with partial overlaps, accurately aligning similar cell populations while retaining true biological differences. We showcase SCALEX's advantages by constructing continuously expandable single-cell atlases for human, mouse, and COVID-19 patients, each assembled from diverse data sources and growing with every new data. The online data integration capacity and superior performance makes SCALEX particularly appropriate for large-scale single-cell applications to build upon previous scientific insights.

摘要

用于综合分析各种单细胞实验的计算工具正面临着严峻的新挑战,包括数据规模的急剧增加、样本异质性以及需要将新数据与基础数据集进行有意义的交叉参考。在这里,我们提出了 SCALEX,这是一种深度学习方法,它以真正的在线方式(即无需重新训练模型)将细胞投影到批次不变的通用细胞嵌入空间中,从而整合单细胞数据。SCALEX 在各种模式的基准单细胞数据集(例如单细胞 RNA 测序、scRNA-seq、单细胞转座酶可及染色质测序、scATAC-seq)上的在线 iNMF 和其他最先进的非在线集成方法上表现出色,尤其是对于具有部分重叠的数据集,它能够准确地对齐相似的细胞群,同时保留真实的生物学差异。我们通过为人类、小鼠和 COVID-19 患者构建可连续扩展的单细胞图谱来展示 SCALEX 的优势,每个图谱都是由不同的数据源组装而成,并随着每一个新数据的加入而不断增长。在线数据集成能力和卓越的性能使 SCALEX 特别适合大规模的单细胞应用,以建立在以前的科学见解的基础上。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51d0/9576707/9827da0d27d3/41467_2022_33758_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51d0/9576707/8560cc14a1ca/41467_2022_33758_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51d0/9576707/fc0c5cc7b867/41467_2022_33758_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51d0/9576707/ef09f21909b5/41467_2022_33758_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51d0/9576707/139453b8cdad/41467_2022_33758_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51d0/9576707/9827da0d27d3/41467_2022_33758_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51d0/9576707/8560cc14a1ca/41467_2022_33758_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51d0/9576707/fc0c5cc7b867/41467_2022_33758_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51d0/9576707/ef09f21909b5/41467_2022_33758_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51d0/9576707/139453b8cdad/41467_2022_33758_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51d0/9576707/9827da0d27d3/41467_2022_33758_Fig5_HTML.jpg

相似文献

1
Online single-cell data integration through projecting heterogeneous datasets into a common cell-embedding space.通过将异质数据集映射到共同的细胞嵌入空间来实现单细胞数据的在线整合。
Nat Commun. 2022 Oct 17;13(1):6118. doi: 10.1038/s41467-022-33758-z.
2
CAraCAl: CAMML with the integration of chromatin accessibility.CAraCAl:整合染色质可及性的 CAMML。
BMC Bioinformatics. 2024 Jun 13;25(1):212. doi: 10.1186/s12859-024-05833-3.
3
SAILER: scalable and accurate invariant representation learning for single-cell ATAC-seq processing and integration.SAILER:用于单细胞 ATAC-seq 处理和整合的可扩展且准确的不变表示学习。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i317-i326. doi: 10.1093/bioinformatics/btab303.
4
EpiScanpy: integrated single-cell epigenomic analysis.EpiScanpy:整合的单细胞表观基因组分析。
Nat Commun. 2021 Sep 1;12(1):5228. doi: 10.1038/s41467-021-25131-3.
5
Assessment of computational methods for the analysis of single-cell ATAC-seq data.单细胞 ATAC-seq 数据分析的计算方法评估。
Genome Biol. 2019 Nov 18;20(1):241. doi: 10.1186/s13059-019-1854-5.
6
Multiscale PHATE identifies multimodal signatures of COVID-19.多尺度 PHATE 识别 COVID-19 的多峰特征。
Nat Biotechnol. 2022 May;40(5):681-691. doi: 10.1038/s41587-021-01186-x. Epub 2022 Feb 28.
7
Translator: A fer earning Approach to Facilitate Single-Cell AC-Seq Data Analysis frm eference Dataset.翻译:一种基于收益的方法,可从参考数据集促进单细胞 AC-Seq 数据分析。
J Comput Biol. 2022 Jul;29(7):619-633. doi: 10.1089/cmb.2021.0596. Epub 2022 May 17.
8
Joint analysis of scATAC-seq datasets using epiConv.使用 epiConv 对 scATAC-seq 数据集进行联合分析。
BMC Bioinformatics. 2022 Jul 29;23(1):309. doi: 10.1186/s12859-022-04858-w.
9
BABEL enables cross-modality translation between multiomic profiles at single-cell resolution.BABEL 实现了单细胞分辨率下多组学谱之间的跨模态翻译。
Proc Natl Acad Sci U S A. 2021 Apr 13;118(15). doi: 10.1073/pnas.2023070118.
10
Destin: toolkit for single-cell analysis of chromatin accessibility.Destin:用于染色质可及性单细胞分析的工具包。
Bioinformatics. 2019 Oct 1;35(19):3818-3820. doi: 10.1093/bioinformatics/btz141.

引用本文的文献

1
scMGCL: accurate and efficient integration representation of single-cell multi-omics data.scMGCL:单细胞多组学数据的准确高效整合表示
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf392.
2
stClinic dissects clinically relevant niches by integrating spatial multi-slice multi-omics data in dynamic graphs.stClinic通过在动态图中整合空间多层多组学数据来剖析临床相关生态位。
Nat Commun. 2025 Jun 16;16(1):5317. doi: 10.1038/s41467-025-60575-x.
3
scMODAL: a general deep learning framework for comprehensive single-cell multi-omics data alignment with feature links.

本文引用的文献

1
scJoint integrates atlas-scale single-cell RNA-seq and ATAC-seq data with transfer learning.scJoint 集成了图谱尺度单细胞 RNA-seq 和 ATAC-seq 数据,并结合了迁移学习。
Nat Biotechnol. 2022 May;40(5):703-710. doi: 10.1038/s41587-021-01161-6. Epub 2022 Jan 20.
2
Benchmarking atlas-level data integration in single-cell genomics.单细胞基因组学中图谱级数据整合的基准测试。
Nat Methods. 2022 Jan;19(1):41-50. doi: 10.1038/s41592-021-01336-8. Epub 2021 Dec 23.
3
COVID-19 immune features revealed by a large-scale single-cell transcriptome atlas.
scMODAL:一个用于通过特征链接进行全面单细胞多组学数据比对的通用深度学习框架。
Nat Commun. 2025 May 29;16(1):4994. doi: 10.1038/s41467-025-60333-z.
4
Cross-tissue multicellular coordination and its rewiring in cancer.跨组织多细胞协调及其在癌症中的重新布线。
Nature. 2025 May 28. doi: 10.1038/s41586-025-09053-4.
5
Single cell transcriptomics correlate avian coronavirus prime vaccination efficacy with antigen-presenting cell preference.单细胞转录组学将禽冠状病毒初次疫苗接种效果与抗原呈递细胞偏好相关联。
NPJ Vaccines. 2025 May 18;10(1):99. doi: 10.1038/s41541-025-01154-5.
6
Deciphering Immunosenescence From Child to Frailty: Transcriptional Changes, Inflammation Dynamics, and Adaptive Immune Alterations.解读从儿童到衰弱阶段的免疫衰老:转录变化、炎症动态及适应性免疫改变
Aging Cell. 2025 Jul;24(7):e70082. doi: 10.1111/acel.70082. Epub 2025 Apr 26.
7
MIST: An interpretable and flexible deep learning framework for single-T cell transcriptome and receptor analysis.MIST:用于单细胞转录组和受体分析的可解释且灵活的深度学习框架。
Sci Adv. 2025 Apr 4;11(14):eadr7134. doi: 10.1126/sciadv.adr7134.
8
SCITUNA: single-cell data integration tool using network alignment.SCITUNA:使用网络比对的单细胞数据整合工具。
BMC Bioinformatics. 2025 Mar 27;26(1):92. doi: 10.1186/s12859-025-06087-3.
9
A Gene-Expression Based Comparison of Murine and Human Inhibitory Interneurons in the Cerebellar Cortex and Nuclei.基于基因表达的小鼠和人类小脑皮质及核团中抑制性中间神经元的比较
Cerebellum. 2025 Feb 28;24(2):55. doi: 10.1007/s12311-025-01809-y.
10
UniMap: Type-Level Integration Enhances Biological Preservation and Interpretability in Single-Cell Annotation.UniMap:类型级集成增强了单细胞注释中的生物学保留和可解释性。
Adv Sci (Weinh). 2025 Apr;12(16):e2410790. doi: 10.1002/advs.202410790. Epub 2025 Feb 27.
大规模单细胞转录组图谱揭示的新冠病毒免疫特征
Cell. 2021 Nov 11;184(23):5838. doi: 10.1016/j.cell.2021.10.023.
4
EpiScanpy: integrated single-cell epigenomic analysis.EpiScanpy:整合的单细胞表观基因组分析。
Nat Commun. 2021 Sep 1;12(1):5228. doi: 10.1038/s41467-021-25131-3.
5
Integrated analysis of multimodal single-cell data.多模态单细胞数据的综合分析。
Cell. 2021 Jun 24;184(13):3573-3587.e29. doi: 10.1016/j.cell.2021.04.048. Epub 2021 May 31.
6
Comprehensive analysis of single cell ATAC-seq data with SnapATAC.利用 SnapATAC 对单细胞 ATAC-seq 数据进行全面分析。
Nat Commun. 2021 Feb 26;12(1):1337. doi: 10.1038/s41467-021-21583-9.
7
Cell-Type-Specific Immune Dysregulation in Severely Ill COVID-19 Patients.COVID-19 重症患者的细胞类型特异性免疫失调。
Cell Rep. 2021 Jan 5;34(1):108590. doi: 10.1016/j.celrep.2020.108590. Epub 2020 Dec 16.
8
Longitudinal Multi-omics Analyses Identify Responses of Megakaryocytes, Erythroid Cells, and Plasmablasts as Hallmarks of Severe COVID-19.纵向多组学分析鉴定出巨核细胞、红细胞和浆细胞反应为严重 COVID-19 的特征。
Immunity. 2020 Dec 15;53(6):1296-1314.e9. doi: 10.1016/j.immuni.2020.11.017. Epub 2020 Nov 26.
9
Single-cell transcriptome profiling of an adult human cell atlas of 15 major organs.人类 15 大主要器官单细胞转录组图谱绘制。
Genome Biol. 2020 Dec 7;21(1):294. doi: 10.1186/s13059-020-02210-0.
10
A human cell atlas of fetal gene expression.人类胎儿基因表达细胞图谱。
Science. 2020 Nov 13;370(6518). doi: 10.1126/science.aba7721.