• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估从单细胞RNA测序数据为细胞簇分配细胞类型标签的方法。

Evaluation of methods to assign cell type labels to cell clusters from single-cell RNA-sequencing data.

作者信息

Diaz-Mejia J Javier, Meng Elaine C, Pico Alexander R, MacParland Sonya A, Ketela Troy, Pugh Trevor J, Bader Gary D, Morris John H

机构信息

Princess Margaret Cancer Centre, University Health Network, Toronto, ON, M5G 2M9, Canada.

The Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada.

出版信息

F1000Res. 2019 Mar 15;8. doi: 10.12688/f1000research.18490.3. eCollection 2019.

DOI:10.12688/f1000research.18490.3
PMID:31508207
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6720041/
Abstract

Identification of cell type subpopulations from complex cell mixtures using single-cell RNA-sequencing (scRNA-seq) data includes automated steps from normalization to cell clustering. However, assigning cell type labels to cell clusters is often conducted manually, resulting in limited documentation, low reproducibility and uncontrolled vocabularies. This is partially due to the scarcity of reference cell type signatures and because some methods support limited cell type signatures. In this study, we benchmarked five methods representing first-generation enrichment analysis (ORA), second-generation approaches (GSEA and GSVA), machine learning tools (CIBERSORT) and network-based neighbor voting (METANEIGHBOR), for the task of assigning cell type labels to cell clusters from scRNA-seq data. We used five scRNA-seq datasets: human liver, 11 Tabula Muris mouse tissues, two human peripheral blood mononuclear cell datasets, and mouse retinal neurons, for which reference cell type signatures were available. The datasets span Drop-seq, 10X Chromium and Seq-Well technologies and range in size from ~3,700 to ~68,000 cells. Our results show that, in general, all five methods perform well in the task as evaluated by receiver operating characteristic curve analysis (average area under the curve (AUC) = 0.91, sd = 0.06), whereas precision-recall analyses show a wide variation depending on the method and dataset (average AUC = 0.53, sd = 0.24). We observed an influence of the number of genes in cell type signatures on performance, with smaller signatures leading more frequently to incorrect results. GSVA was the overall top performer and was more robust in cell type signature subsampling simulations, although different methods performed well using different datasets. METANEIGHBOR and GSVA were the fastest methods. CIBERSORT and METANEIGHBOR were more influenced than the other methods by analyses including only expected cell types. We provide an extensible framework that can be used to evaluate other methods and datasets at https://github.com/jdime/scRNAseq_cell_cluster_labeling.

摘要

利用单细胞RNA测序(scRNA-seq)数据从复杂细胞混合物中识别细胞类型亚群,包括从标准化到细胞聚类的自动化步骤。然而,给细胞簇分配细胞类型标签通常是手动进行的,这导致记录有限、可重复性低以及词汇不受控制。部分原因是参考细胞类型特征稀缺,且一些方法支持的细胞类型特征有限。在本研究中,我们对代表第一代富集分析(ORA)、第二代方法(GSEA和GSVA)、机器学习工具(CIBERSORT)和基于网络的邻居投票(METANEIGHBOR)的五种方法进行了基准测试,用于从scRNA-seq数据给细胞簇分配细胞类型标签的任务。我们使用了五个scRNA-seq数据集:人类肝脏、11个小鼠组织图谱小鼠组织、两个人类外周血单核细胞数据集以及小鼠视网膜神经元,这些数据集都有可用的参考细胞类型特征。这些数据集涵盖了Drop-seq、10X Chromium和Seq-Well技术,细胞数量从约3700个到约68000个不等。我们的结果表明,总体而言,通过接受者操作特征曲线分析评估,所有五种方法在该任务中表现良好(曲线下平均面积(AUC)=0.91,标准差=0.06),而精确召回分析显示,根据方法和数据集的不同存在很大差异(平均AUC =0.53,标准差=0.24)。我们观察到细胞类型特征中基因数量对性能有影响,特征越小越频繁导致错误结果。GSVA总体上表现最佳,在细胞类型特征二次抽样模拟中更稳健,尽管不同方法在不同数据集上表现良好。METANEIGHBOR和GSVA是最快的方法。与其他方法相比,CIBERSORT和METANEIGHBOR在仅包括预期细胞类型的分析中受影响更大。我们在https://github.com/jdime/scRNAseq_cell_cluster_labeling提供了一个可扩展框架,可用于评估其他方法和数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bba9/6792477/6c8874bd26b3/f1000research-8-22823-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bba9/6792477/cd2431563b01/f1000research-8-22823-g0000.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bba9/6792477/e1c25812b2fc/f1000research-8-22823-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bba9/6792477/4cc2d42026c3/f1000research-8-22823-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bba9/6792477/6887a883901d/f1000research-8-22823-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bba9/6792477/fce61cdc31f9/f1000research-8-22823-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bba9/6792477/6c8874bd26b3/f1000research-8-22823-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bba9/6792477/cd2431563b01/f1000research-8-22823-g0000.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bba9/6792477/e1c25812b2fc/f1000research-8-22823-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bba9/6792477/4cc2d42026c3/f1000research-8-22823-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bba9/6792477/6887a883901d/f1000research-8-22823-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bba9/6792477/fce61cdc31f9/f1000research-8-22823-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bba9/6792477/6c8874bd26b3/f1000research-8-22823-g0005.jpg

相似文献

1
Evaluation of methods to assign cell type labels to cell clusters from single-cell RNA-sequencing data.评估从单细胞RNA测序数据为细胞簇分配细胞类型标签的方法。
F1000Res. 2019 Mar 15;8. doi: 10.12688/f1000research.18490.3. eCollection 2019.
2
ACTINN: automated identification of cell types in single cell RNA sequencing.ACTINN:单细胞 RNA 测序中细胞类型的自动识别。
Bioinformatics. 2020 Jan 15;36(2):533-538. doi: 10.1093/bioinformatics/btz592.
3
A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa.一种用于隐性营养不良型大疱性表皮松解症的单细胞 RNA-seq 分析的多任务聚类方法。
PLoS Comput Biol. 2018 Apr 9;14(4):e1006053. doi: 10.1371/journal.pcbi.1006053. eCollection 2018 Apr.
4
Autoencoder-based cluster ensembles for single-cell RNA-seq data analysis.基于自动编码器的单细胞 RNA-seq 数据分析聚类集成。
BMC Bioinformatics. 2019 Dec 24;20(Suppl 19):660. doi: 10.1186/s12859-019-3179-5.
5
A novel graph-based k-partitioning approach improves the detection of gene-gene correlations by single-cell RNA sequencing.一种基于图的新 k 划分方法通过单细胞 RNA 测序提高了基因-基因相关性的检测。
BMC Genomics. 2022 Jan 7;23(1):35. doi: 10.1186/s12864-021-08235-4.
6
Benchmarking clustering algorithms on estimating the number of cell types from single-cell RNA-sequencing data.基于单细胞 RNA 测序数据评估细胞类型数量的聚类算法基准测试。
Genome Biol. 2022 Feb 8;23(1):49. doi: 10.1186/s13059-022-02622-0.
7
An active learning approach for clustering single-cell RNA-seq data.一种用于聚类单细胞 RNA-seq 数据的主动学习方法。
Lab Invest. 2022 Mar;102(3):227-235. doi: 10.1038/s41374-021-00639-w. Epub 2021 Jul 9.
8
scAnno: a deconvolution strategy-based automatic cell type annotation tool for single-cell RNA-sequencing data sets.scAnno:一种基于去卷积策略的单细胞 RNA 测序数据集自动细胞类型注释工具。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad179.
9
scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering.scBGEDA:基于双分图集成分聚类的对偶去噪自动编码器的单细胞聚类分析。
Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad075.
10
HArmonized single-cell RNA-seq Cell type Assisted Deconvolution (HASCAD).HArmonized single-cell RNA-seq Cell type Assisted Deconvolution (HASCAD). 协调单细胞 RNA-seq 细胞类型辅助去卷积 (HASCAD)。
BMC Med Genomics. 2023 Oct 31;16(Suppl 2):272. doi: 10.1186/s12920-023-01674-w.

引用本文的文献

1
Age, sex, and cell type-resolved hypothalamic gene expression across the pubertal transition in mice.年龄、性别和细胞类型解析的小鼠青春期过渡期间的下丘脑基因表达。
Biol Sex Differ. 2024 Oct 24;15(1):83. doi: 10.1186/s13293-024-00661-9.
2
Tumor hypoxia in immune infiltration and prognosis of bladder cancer.肿瘤缺氧在膀胱癌免疫浸润及预后中的作用
Transl Cancer Res. 2024 Jul 31;13(7):3273-3284. doi: 10.21037/tcr-23-2375. Epub 2024 Jul 26.
3
Challenges and opportunities to computationally deconvolve heterogeneous tissue with varying cell sizes using single-cell RNA-sequencing datasets.
使用单细胞 RNA 测序数据集对具有不同细胞大小的异质组织进行计算去卷积所面临的挑战和机遇。
Genome Biol. 2023 Dec 14;24(1):288. doi: 10.1186/s13059-023-03123-4.
4
GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership.GoM DE:利用允许成员等级的差异表达分析来解释序列计数数据中的结构。
Genome Biol. 2023 Oct 19;24(1):236. doi: 10.1186/s13059-023-03067-9.
5
Alzheimer's disease-induced phagocytic microglia express a specific profile of coding and non-coding RNAs.阿尔茨海默病诱导的吞噬小胶质细胞表达特定的编码和非编码 RNA 谱。
Alzheimers Dement. 2024 Feb;20(2):954-974. doi: 10.1002/alz.13502. Epub 2023 Oct 12.
6
GOWDL: gene ontology-driven wide and deep learning model for cell typing of scRNA-seq data.GOWDL:基于基因本体论的 scRNA-seq 数据细胞分型的广泛而深入的学习模型。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad332.
7
Mapping pediatric brain tumors to their origins in the developing cerebellum.将小儿脑瘤定位到发育中的小脑。
Neuro Oncol. 2023 Oct 3;25(10):1895-1909. doi: 10.1093/neuonc/noad124.
8
Decreased left heart flow in fetal lambs causes left heart hypoplasia and pro-fibrotic tissue remodeling.胎儿羊左心血流减少导致左心发育不良和促纤维化组织重塑。
Commun Biol. 2023 Jul 22;6(1):770. doi: 10.1038/s42003-023-05132-2.
9
Natural γδT17 cell development and functional acquisition is governed by the mTORC2--Maf-controlled mitochondrial fission pathway.天然γδT17细胞的发育和功能获得受mTORC2-Maf控制的线粒体分裂途径调控。
iScience. 2023 Apr 10;26(5):106630. doi: 10.1016/j.isci.2023.106630. eCollection 2023 May 19.
10
Predictive network analysis identifies JMJD6 and other potential key drivers in Alzheimer's disease.预测网络分析确定 JMJD6 和其他潜在的阿尔茨海默病关键驱动因素。
Commun Biol. 2023 May 15;6(1):503. doi: 10.1038/s42003-023-04791-5.