单细胞 RNA 测序分析中鉴定 K mAjor 细胞群体的 IKAP 方法。

IKAP-Identifying K mAjor cell Population groups in single-cell RNA-sequencing analysis.

机构信息

Bioinformatics and Computational Biology Laboratory, National Heart, Lung, and Blood Institute, National Institutes of Health, 12 South Drive, Bethesda, MD 20892, USA.

Hematology Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, 10 Center Drive, Bethesda, MD 20814, USA.

出版信息

Gigascience. 2019 Oct 1;8(10). doi: 10.1093/gigascience/giz121.

DOI:10.1093/gigascience/giz121

PMID:31574155

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6771546/

Abstract

BACKGROUND

In single-cell RNA-sequencing analysis, clustering cells into groups and differentiating cell groups by differentially expressed (DE) genes are 2 separate steps for investigating cell identity. However, the ability to differentiate between cell groups could be affected by clustering. This interdependency often creates a bottleneck in the analysis pipeline, requiring researchers to repeat these 2 steps multiple times by setting different clustering parameters to identify a set of cell groups that are more differentiated and biologically relevant.

FINDINGS

To accelerate this process, we have developed IKAP-an algorithm to identify major cell groups and improve differentiating cell groups by systematically tuning parameters for clustering. We demonstrate that, with default parameters, IKAP successfully identifies major cell types such as T cells, B cells, natural killer cells, and monocytes in 2 peripheral blood mononuclear cell datasets and recovers major cell types in a previously published mouse cortex dataset. These major cell groups identified by IKAP present more distinguishing DE genes compared with cell groups generated by different combinations of clustering parameters. We further show that cell subtypes can be identified by recursively applying IKAP within identified major cell types, thereby delineating cell identities in a multi-layered ontology.

CONCLUSIONS

By tuning the clustering parameters to identify major cell groups, IKAP greatly improves the automation of single-cell RNA-sequencing analysis to produce distinguishing DE genes and refine cell ontology using single-cell RNA-sequencing data.

摘要

背景

在单细胞 RNA 测序分析中，通过差异表达 (DE) 基因对细胞进行聚类和区分细胞群是两个独立的步骤，用于研究细胞身份。然而，区分细胞群的能力可能会受到聚类的影响。这种相互依存关系经常在分析管道中造成瓶颈，需要研究人员通过设置不同的聚类参数多次重复这两个步骤，以确定一组更具差异性和生物学相关性的细胞群。

发现

为了加速这一过程，我们开发了 IKAP——一种通过系统调整聚类参数来识别主要细胞群和改善细胞群区分度的算法。我们证明，在默认参数下，IKAP 成功地识别了两个外周血单核细胞数据集和之前发表的小鼠皮质数据集的主要细胞类型，如 T 细胞、B 细胞、自然杀伤细胞和单核细胞。与通过不同聚类参数组合生成的细胞群相比，IKAP 识别的这些主要细胞群具有更多区分性的 DE 基因。我们进一步表明，通过在识别的主要细胞类型内递归应用 IKAP，可以识别细胞亚型，从而在多层次本体中描绘细胞身份。

结论

通过调整聚类参数来识别主要细胞群，IKAP 大大提高了单细胞 RNA 测序分析的自动化程度，使用单细胞 RNA 测序数据生成有区别的 DE 基因，并细化细胞本体。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb5/6771546/d4e8b4437e86/giz121fig1.jpg

相似文献

IKAP-Identifying K mAjor cell Population groups in single-cell RNA-sequencing analysis.单细胞 RNA 测序分析中鉴定 K mAjor 细胞群体的 IKAP 方法。

Gigascience. 2019 Oct 1;8(10). doi: 10.1093/gigascience/giz121.

MarcoPolo: a method to discover differentially expressed genes in single-cell RNA-seq data without depending on prior clustering.马可波罗法：一种无需依赖于先前聚类即可在单细胞 RNA-seq 数据中发现差异表达基因的方法。

Nucleic Acids Res. 2022 Jul 8;50(12):e71. doi: 10.1093/nar/gkac216.

Embracing the dropouts in single-cell RNA-seq analysis.拥抱单细胞 RNA-seq 分析中的离群值。

Nat Commun. 2020 Mar 3;11(1):1169. doi: 10.1038/s41467-020-14976-9.

SAIC: an iterative clustering approach for analysis of single cell RNA-seq data.SAIC：一种用于分析单细胞 RNA-seq 数据的迭代聚类方法。

BMC Genomics. 2017 Oct 3;18(Suppl 6):689. doi: 10.1186/s12864-017-4019-5.

Evaluating single-cell cluster stability using the Jaccard similarity index.使用 Jaccard 相似性指数评估单细胞聚类稳定性。

Bioinformatics. 2021 Aug 9;37(15):2212-2214. doi: 10.1093/bioinformatics/btaa956.

A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa.一种用于隐性营养不良型大疱性表皮松解症的单细胞 RNA-seq 分析的多任务聚类方法。

PLoS Comput Biol. 2018 Apr 9;14(4):e1006053. doi: 10.1371/journal.pcbi.1006053. eCollection 2018 Apr.

scConsensus: combining supervised and unsupervised clustering for cell type identification in single-cell RNA sequencing data.scConsensus：在单细胞 RNA 测序数据中结合监督和无监督聚类进行细胞类型识别。

BMC Bioinformatics. 2021 Apr 12;22(1):186. doi: 10.1186/s12859-021-04028-4.

VPAC: Variational projection for accurate clustering of single-cell transcriptomic data.VPAC：用于单细胞转录组数据精确聚类的变分投影。

BMC Bioinformatics. 2019 May 1;20(Suppl 7):0. doi: 10.1186/s12859-019-2742-4.

Clustering trees: a visualization for evaluating clusterings at multiple resolutions.聚类树：一种用于在多个分辨率下评估聚类的可视化方法。

Gigascience. 2018 Jul 1;7(7). doi: 10.1093/gigascience/giy083.

PanoView: An iterative clustering method for single-cell RNA sequencing data.PanoView：一种用于单细胞 RNA 测序数据的迭代聚类方法。

PLoS Comput Biol. 2019 Aug 30;15(8):e1007040. doi: 10.1371/journal.pcbi.1007040. eCollection 2019 Aug.

引用本文的文献

ANXA5: A Key Regulator of Immune Cell Infiltration in Hepatocellular Carcinoma.膜联蛋白 A5：调控肝癌免疫细胞浸润的关键分子。

Med Sci Monit. 2024 Jun 2;30:e943523. doi: 10.12659/MSM.943523.

Building and analyzing metacells in single-cell genomics data.在单细胞基因组学数据中构建和分析元细胞。

Mol Syst Biol. 2024 Jul;20(7):744-766. doi: 10.1038/s44320-024-00045-6. Epub 2024 May 29.

VC-resist glioblastoma cell state: vessel co-option as a key driver of chemoradiation resistance.耐血管性胶质母细胞瘤细胞状态：血管选择作为放化疗抵抗的关键驱动因素。

Nat Commun. 2024 Apr 29;15(1):3602. doi: 10.1038/s41467-024-47985-z.

scLENS: data-driven signal detection for unbiased scRNA-seq data analysis.scLENS：用于无偏单细胞RNA测序数据分析的数据驱动信号检测

Nat Commun. 2024 Apr 27;15(1):3575. doi: 10.1038/s41467-024-47884-3.

Generation of antigen-specific mature T cells from RAG1RAG2B2M stem cells by engineering their microenvironment.通过工程化微环境从 RAG1RAG2B2M 干细胞中生成抗原特异性成熟 T 细胞。

Nat Biomed Eng. 2024 Apr;8(4):461-478. doi: 10.1038/s41551-023-01146-7. Epub 2023 Dec 7.

Imaging and multi-omics datasets converge to define different neural progenitor origins for ATRT-SHH subgroups.影像学和多组学数据集的融合定义了 ATRT-SHH 亚组不同的神经前体细胞起源。

Nat Commun. 2023 Oct 20;14(1):6669. doi: 10.1038/s41467-023-42371-7.

Sub-Cluster Identification through Semi-Supervised Optimization of Rare-Cell Silhouettes (SCISSORS) in single-cell RNA-sequencing.基于单细胞 RNA 测序中稀有细胞轮廓的半监督优化（SCISSORS）的子聚类识别。

Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad449.

Dysregulated stem cell niches and altered lymphocyte recirculation cause B and T cell lymphopenia in WHIM syndrome.异常调控的干细胞龛和改变的淋巴细胞再循环导致 WHIM 综合征中 B 和 T 细胞减少。

Sci Immunol. 2022 Sep 23;7(75):eabo3170. doi: 10.1126/sciimmunol.abo3170.

Molecular signatures of in situ to invasive progression for basal-like breast cancers: An integrated mouse model and human DCIS study.基底样乳腺癌原位至浸润性进展的分子特征：一项整合的小鼠模型和人导管原位癌研究

NPJ Breast Cancer. 2022 Jul 18;8(1):83. doi: 10.1038/s41523-022-00450-w.

Deconvolution of the hematopoietic stem cell microenvironment reveals a high degree of specialization and conservation.造血干细胞微环境的反卷积揭示了高度的专业化和保守性。

iScience. 2022 Apr 8;25(5):104225. doi: 10.1016/j.isci.2022.104225. eCollection 2022 May 20.

本文引用的文献

CHETAH: a selective, hierarchical cell type identification method for single-cell RNA sequencing.CHETAH：一种用于单细胞 RNA 测序的选择性、层次化细胞类型识别方法。

Nucleic Acids Res. 2019 Sep 19;47(16):e95. doi: 10.1093/nar/gkz543.

Challenges in unsupervised clustering of single-cell RNA-seq data.无监督单细胞 RNA-seq 数据聚类的挑战。

Nat Rev Genet. 2019 May;20(5):273-282. doi: 10.1038/s41576-018-0088-9.

Integrating single-cell transcriptomic data across different conditions, technologies, and species.整合不同条件、技术和物种的单细胞转录组数据。

Nat Biotechnol. 2018 Jun;36(5):411-420. doi: 10.1038/nbt.4096. Epub 2018 Apr 2.

Cell type discovery using single-cell transcriptomics: implications for ontological representation.基于单细胞转录组学的细胞类型发现：对本体论表示的影响。

Hum Mol Genet. 2018 May 1;27(R1):R40-R47. doi: 10.1093/hmg/ddy100.

Cell type discovery and representation in the era of high-content single cell phenotyping.高通量单细胞表型分析时代的细胞类型发现和表示。

BMC Bioinformatics. 2017 Dec 21;18(Suppl 17):559. doi: 10.1186/s12859-017-1977-1.

The Human Cell Atlas.人类细胞图谱

Elife. 2017 Dec 5;6:e27041. doi: 10.7554/eLife.27041.

Identifying cell populations with scRNASeq.单细胞 RNA 测序鉴定细胞群体。

Mol Aspects Med. 2018 Feb;59:114-122. doi: 10.1016/j.mam.2017.07.002. Epub 2017 Jul 25.

SC3: consensus clustering of single-cell RNA-seq data.SC3：单细胞RNA测序数据的一致性聚类

Nat Methods. 2017 May;14(5):483-486. doi: 10.1038/nmeth.4236. Epub 2017 Mar 27.

PRROC: computing and visualizing precision-recall and receiver operating characteristic curves in R.PRROC：在R语言中计算和可视化精确率-召回率曲线及接收器操作特性曲线

Bioinformatics. 2015 Aug 1;31(15):2595-7. doi: 10.1093/bioinformatics/btv153. Epub 2015 Mar 24.

Brain structure. Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq.脑结构。单细胞 RNA 测序揭示的小鼠皮层和海马中的细胞类型。

Science. 2015 Mar 6;347(6226):1138-42. doi: 10.1126/science.aaa1934. Epub 2015 Feb 19.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

单细胞 RNA 测序分析中鉴定 K mAjor 细胞群体的 IKAP 方法。

IKAP-Identifying K mAjor cell Population groups in single-cell RNA-sequencing analysis.

机构信息

出版信息

BACKGROUND

FINDINGS

CONCLUSIONS

背景

发现

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献