• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RCA2:一种可扩展的监督聚类算法,可减少 scRNA-seq 数据中的批次效应。

RCA2: a scalable supervised clustering algorithm that reduces batch effects in scRNA-seq data.

机构信息

Laboratory of Systems Biology and Data Analytics, Genome Institute of Singapore, A*STAR, 60 Biopolis St, 138672, Singapore.

DUKE-NUS Medical School, 8 College Rd, 169857, Singapore.

出版信息

Nucleic Acids Res. 2021 Sep 7;49(15):8505-8519. doi: 10.1093/nar/gkab632.

DOI:10.1093/nar/gkab632
PMID:34320202
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8344557/
Abstract

The transcriptomic diversity of cell types in the human body can be analysed in unprecedented detail using single cell (SC) technologies. Unsupervised clustering of SC transcriptomes, which is the default technique for defining cell types, is prone to group cells by technical, rather than biological, variation. Compared to de-novo (unsupervised) clustering, we demonstrate using multiple benchmarks that supervised clustering, which uses reference transcriptomes as a guide, is robust to batch effects and data quality artifacts. Here, we present RCA2, the first algorithm to combine reference projection (batch effect robustness) with graph-based clustering (scalability). In addition, RCA2 provides a user-friendly framework incorporating multiple commonly used downstream analysis modules. RCA2 also provides new reference panels for human and mouse and supports generation of custom panels. Furthermore, RCA2 facilitates cell type-specific QC, which is essential for accurate clustering of data from heterogeneous tissues. We demonstrate the advantages of RCA2 on SC data from human bone marrow, healthy PBMCs and PBMCs from COVID-19 patients. Scalable supervised clustering methods such as RCA2 will facilitate unified analysis of cohort-scale SC datasets.

摘要

使用单细胞 (SC) 技术,可以以前所未有的细节分析人体细胞类型的转录组多样性。无监督聚类 SC 转录组,这是定义细胞类型的默认技术,容易根据技术而不是生物学变化将细胞分组。与从头开始(无监督)聚类相比,我们使用多个基准证明,使用参考转录组作为指导的有监督聚类对批次效应和数据质量伪影具有鲁棒性。在这里,我们提出了 RCA2,这是第一个将参考投影(批处理效应稳健性)与基于图的聚类(可扩展性)相结合的算法。此外,RCA2 提供了一个用户友好的框架,其中包含多个常用的下游分析模块。RCA2 还提供了人类和小鼠的新参考面板,并支持生成自定义面板。此外,RCA2 促进细胞类型特异性 QC,这对于从异质组织中准确聚类数据至关重要。我们在来自人类骨髓、健康 PBMC 和 COVID-19 患者 PBMC 的 SC 数据上展示了 RCA2 的优势。像 RCA2 这样的可扩展监督聚类方法将促进队列规模 SC 数据集的统一分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4748/8421225/dbc4cf47fecc/gkab632fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4748/8421225/138e3e970b37/gkab632fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4748/8421225/8f36738996a0/gkab632fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4748/8421225/24738febdd9f/gkab632fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4748/8421225/c439142a8d4a/gkab632fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4748/8421225/1eb89d4bf8b3/gkab632fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4748/8421225/f776eb916942/gkab632fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4748/8421225/dbc4cf47fecc/gkab632fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4748/8421225/138e3e970b37/gkab632fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4748/8421225/8f36738996a0/gkab632fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4748/8421225/24738febdd9f/gkab632fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4748/8421225/c439142a8d4a/gkab632fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4748/8421225/1eb89d4bf8b3/gkab632fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4748/8421225/f776eb916942/gkab632fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4748/8421225/dbc4cf47fecc/gkab632fig7.jpg

相似文献

1
RCA2: a scalable supervised clustering algorithm that reduces batch effects in scRNA-seq data.RCA2:一种可扩展的监督聚类算法,可减少 scRNA-seq 数据中的批次效应。
Nucleic Acids Res. 2021 Sep 7;49(15):8505-8519. doi: 10.1093/nar/gkab632.
2
Integrating Deep Supervised, Self-Supervised and Unsupervised Learning for Single-Cell RNA-seq Clustering and Annotation.将深度监督学习、自监督学习和无监督学习相结合进行单细胞 RNA-seq 聚类和注释。
Genes (Basel). 2020 Jul 14;11(7):792. doi: 10.3390/genes11070792.
3
Scedar: A scalable Python package for single-cell RNA-seq exploratory data analysis.Scedar:一个用于单细胞 RNA-seq 探索性数据分析的可扩展 Python 包。
PLoS Comput Biol. 2020 Apr 27;16(4):e1007794. doi: 10.1371/journal.pcbi.1007794. eCollection 2020 Apr.
4
Identifying cell types to interpret scRNA-seq data: how, why and more possibilities.鉴定细胞类型以解释 scRNA-seq 数据:方法、原因及更多可能性。
Brief Funct Genomics. 2020 Jul 29;19(4):286-291. doi: 10.1093/bfgp/elaa003.
5
An active learning approach for clustering single-cell RNA-seq data.一种用于聚类单细胞 RNA-seq 数据的主动学习方法。
Lab Invest. 2022 Mar;102(3):227-235. doi: 10.1038/s41374-021-00639-w. Epub 2021 Jul 9.
6
Contrastive self-supervised clustering of scRNA-seq data.单细胞 RNA 测序数据的对比自监督聚类。
BMC Bioinformatics. 2021 May 27;22(1):280. doi: 10.1186/s12859-021-04210-8.
7
dropClust: efficient clustering of ultra-large scRNA-seq data.dropClust:超大规模 scRNA-seq 数据的高效聚类。
Nucleic Acids Res. 2018 Apr 6;46(6):e36. doi: 10.1093/nar/gky007.
8
A hybrid deep clustering approach for robust cell type profiling using single-cell RNA-seq data.基于单细胞 RNA-seq 数据的混合深度聚类方法进行稳健的细胞类型分析。
RNA. 2020 Oct;26(10):1303-1319. doi: 10.1261/rna.074427.119. Epub 2020 Jun 12.
9
Supervised application of internal validation measures to benchmark dimensionality reduction methods in scRNA-seq data.监督应用内部验证措施,以基准化 scRNA-seq 数据的降维方法。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab304.
10
Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis.深度学习能够实现单细胞 RNA-seq 分析中具有批次效应去除功能的精确聚类。
Nat Commun. 2020 May 11;11(1):2338. doi: 10.1038/s41467-020-15851-3.

引用本文的文献

1
Mapping Cell Identity from scRNA-seq: A primer on computational methods.从单细胞RNA测序映射细胞身份:计算方法入门
Comput Struct Biotechnol J. 2025 Apr 2;27:1559-1569. doi: 10.1016/j.csbj.2025.03.051. eCollection 2025.
2
GRAMD1B is a regulator of lipid homeostasis, autophagic flux and phosphorylated tau.GRAMD1B是脂质稳态、自噬通量和磷酸化tau蛋白的调节因子。
Nat Commun. 2025 Apr 9;16(1):3312. doi: 10.1038/s41467-025-58585-w.
3
Single-cell RNA sequencing of peripheral blood links cell-type-specific regulation of splicing to autoimmune and inflammatory diseases.

本文引用的文献

1
DUBStepR is a scalable correlation-based feature selection method for accurately clustering single-cell data.DUBStepR 是一种可扩展的基于相关性的特征选择方法,用于准确地对单细胞数据进行聚类。
Nat Commun. 2021 Oct 6;12(1):5849. doi: 10.1038/s41467-021-26085-2.
2
No evidence that plasmablasts transdifferentiate into developing neutrophils in severe COVID-19 disease.没有证据表明在重症 COVID-19 疾病中浆母细胞会转分化为发育中的中性粒细胞。
Clin Transl Immunology. 2021 Jun 30;10(7):e1308. doi: 10.1002/cti2.1308. eCollection 2021.
3
Integrated analysis of multimodal single-cell data.
外周血单细胞RNA测序将剪接的细胞类型特异性调控与自身免疫性疾病和炎症性疾病联系起来。
Nat Genet. 2024 Dec;56(12):2739-2752. doi: 10.1038/s41588-024-02019-8. Epub 2024 Dec 3.
4
Optimization of diagnosis and treatment of hematological diseases via artificial intelligence.通过人工智能优化血液疾病的诊断与治疗
Front Med (Lausanne). 2024 Nov 7;11:1487234. doi: 10.3389/fmed.2024.1487234. eCollection 2024.
5
Quantification of escape from X chromosome inactivation with single-cell omics data reveals heterogeneity across cell types and tissues.单细胞组学数据定量分析 X 染色体失活逃逸揭示了细胞类型和组织间的异质性。
Cell Genom. 2024 Aug 14;4(8):100625. doi: 10.1016/j.xgen.2024.100625. Epub 2024 Jul 30.
6
Omada: robust clustering of transcriptomes through multiple testing.Omada:通过多重检验实现转录组的稳健聚类。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae039.
7
Systematic immune cell dysregulation and molecular subtypes revealed by single-cell RNA-seq of subjects with type 1 diabetes.单细胞 RNA-seq 揭示 1 型糖尿病患者的系统性免疫细胞失调和分子亚型。
Genome Med. 2024 Mar 27;16(1):45. doi: 10.1186/s13073-024-01300-z.
8
Longitudinal single cell atlas identifies complex temporal relationship between type I interferon response and COVID-19 severity.纵向单细胞图谱鉴定 I 型干扰素反应与 COVID-19 严重程度之间的复杂时间关系。
Nat Commun. 2024 Jan 18;15(1):567. doi: 10.1038/s41467-023-44524-0.
9
Phagocytosis-initiated tumor hybrid cells acquire a c-Myc-mediated quasi-polarization state for immunoevasion and distant dissemination.吞噬作用引发的肿瘤杂交细胞获得 c-Myc 介导的准极化状态以实现免疫逃逸和远处播散。
Nat Commun. 2023 Oct 17;14(1):6569. doi: 10.1038/s41467-023-42303-5.
10
Artificial Intelligence-Assisted Diagnostic Cytology and Genomic Testing for Hematologic Disorders.人工智能辅助诊断细胞学和血液病基因组检测。
Cells. 2023 Jun 30;12(13):1755. doi: 10.3390/cells12131755.
多模态单细胞数据的综合分析。
Cell. 2021 Jun 24;184(13):3573-3587.e29. doi: 10.1016/j.cell.2021.04.048. Epub 2021 May 31.
4
A single-cell atlas of the peripheral immune response in patients with severe COVID-19.严重 COVID-19 患者外周免疫反应的单细胞图谱。
Nat Med. 2020 Jul;26(7):1070-1076. doi: 10.1038/s41591-020-0944-y. Epub 2020 Jun 8.
5
Eleven grand challenges in single-cell data science.单细胞数据科学的 11 大挑战。
Genome Biol. 2020 Feb 7;21(1):31. doi: 10.1186/s13059-020-1926-6.
6
A benchmark of batch-effect correction methods for single-cell RNA sequencing data.单细胞 RNA 测序数据批次效应校正方法的基准测试。
Genome Biol. 2020 Jan 16;21(1):12. doi: 10.1186/s13059-019-1850-9.
7
Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression.使用正则化负二项式回归进行单细胞 RNA-seq 数据的归一化和方差稳定化。
Genome Biol. 2019 Dec 23;20(1):296. doi: 10.1186/s13059-019-1874-1.
8
scPred: accurate supervised method for cell-type classification from single-cell RNA-seq data.scPred:一种准确的有监督方法,用于对单细胞 RNA-seq 数据进行细胞类型分类。
Genome Biol. 2019 Dec 12;20(1):264. doi: 10.1186/s13059-019-1862-5.
9
Probabilistic cell-type assignment of single-cell RNA-seq for tumor microenvironment profiling.单细胞 RNA-seq 数据中肿瘤微环境细胞类型的概率分配。
Nat Methods. 2019 Oct;16(10):1007-1015. doi: 10.1038/s41592-019-0529-1. Epub 2019 Sep 9.
10
A general approach for detecting expressed mutations in AML cells using single cell RNA-sequencing.使用单细胞 RNA 测序检测 AML 细胞中表达突变的一般方法。
Nat Commun. 2019 Aug 14;10(1):3660. doi: 10.1038/s41467-019-11591-1.