主成分分析降维：单细胞转录谱的层次聚类

pcaReduce: hierarchical clustering of single cell transcriptional profiles.

作者信息

Žurauskienė Justina, Yau Christopher

机构信息

Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford, OX3 7BN, UK.

Department of Statistics, University of Oxford, 1 S. Parks Rd, Oxford, OX1 3TG, UK.

出版信息

BMC Bioinformatics. 2016 Mar 22;17:140. doi: 10.1186/s12859-016-0984-y.

DOI:10.1186/s12859-016-0984-y

PMID:27005807

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4802652/

Abstract

BACKGROUND

Advances in single cell genomics provide a way of routinely generating transcriptomics data at the single cell level. A frequent requirement of single cell expression analysis is the identification of novel patterns of heterogeneity across single cells that might explain complex cellular states or tissue composition. To date, classical statistical analysis tools have being routinely applied, but there is considerable scope for the development of novel statistical approaches that are better adapted to the challenges of inferring cellular hierarchies.

RESULTS

We have developed a novel agglomerative clustering method that we call pcaReduce to generate a cell state hierarchy where each cluster branch is associated with a principal component of variation that can be used to differentiate two cell states. Using two real single cell datasets, we compared our approach to other commonly used statistical techniques, such as K-means and hierarchical clustering. We found that pcaReduce was able to give more consistent clustering structures when compared to broad and detailed cell type labels.

CONCLUSIONS

Our novel integration of principal components analysis and hierarchical clustering establishes a connection between the representation of the expression data and the number of cell types that can be discovered. In doing so we found that pcaReduce performs better than either technique in isolation in terms of characterising putative cell states. Our methodology is complimentary to other single cell clustering techniques and adds to a growing palette of single cell bioinformatics tools for profiling heterogeneous cell populations.

摘要

背景

单细胞基因组学的进展提供了一种在单细胞水平上常规生成转录组学数据的方法。单细胞表达分析的一个常见要求是识别单细胞间新的异质性模式，这些模式可能解释复杂的细胞状态或组织组成。迄今为止，经典统计分析工具已被常规应用，但开发更适合推断细胞层次结构挑战的新型统计方法仍有很大空间。

结果

我们开发了一种新颖的凝聚聚类方法，称为pcaReduce，以生成细胞状态层次结构，其中每个聚类分支与可用于区分两种细胞状态的主要变异成分相关联。使用两个真实的单细胞数据集，我们将我们的方法与其他常用统计技术（如K均值和层次聚类）进行了比较。我们发现，与宽泛和详细的细胞类型标签相比，pcaReduce能够给出更一致的聚类结构。

结论

我们对主成分分析和层次聚类的新颖整合在表达数据的表示与可发现的细胞类型数量之间建立了联系。通过这样做，我们发现pcaReduce在表征假定的细胞状态方面比单独使用任何一种技术都表现得更好。我们的方法是对其他单细胞聚类技术的补充，并为分析异质细胞群体的单细胞生物信息学工具增添了新的内容。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b96/4802652/9dca905cc506/12859_2016_984_Fig1_HTML.jpg

相似文献

pcaReduce: hierarchical clustering of single cell transcriptional profiles.主成分分析降维：单细胞转录谱的层次聚类

BMC Bioinformatics. 2016 Mar 22;17:140. doi: 10.1186/s12859-016-0984-y.

SAIC: an iterative clustering approach for analysis of single cell RNA-seq data.SAIC：一种用于分析单细胞 RNA-seq 数据的迭代聚类方法。

BMC Genomics. 2017 Oct 3;18(Suppl 6):689. doi: 10.1186/s12864-017-4019-5.

A hybrid deep clustering approach for robust cell type profiling using single-cell RNA-seq data.基于单细胞 RNA-seq 数据的混合深度聚类方法进行稳健的细胞类型分析。

RNA. 2020 Oct;26(10):1303-1319. doi: 10.1261/rna.074427.119. Epub 2020 Jun 12.

Autoencoder-based cluster ensembles for single-cell RNA-seq data analysis.基于自动编码器的单细胞 RNA-seq 数据分析聚类集成。

BMC Bioinformatics. 2019 Dec 24;20(Suppl 19):660. doi: 10.1186/s12859-019-3179-5.

Dimension Reduction and Clustering Models for Single-Cell RNA Sequencing Data: A Comparative Study.降维与聚类模型在单细胞 RNA 测序数据中的应用：一项比较研究。

Int J Mol Sci. 2020 Mar 22;21(6):2181. doi: 10.3390/ijms21062181.

Dimensionality Reduction and Louvain Agglomerative Hierarchical Clustering for Cluster-Specified Frequent Biomarker Discovery in Single-Cell Sequencing Data.用于单细胞测序数据中聚类特定频繁生物标志物发现的降维和Louvain凝聚层次聚类

Front Genet. 2022 Feb 7;13:828479. doi: 10.3389/fgene.2022.828479. eCollection 2022.

jSRC: a flexible and accurate joint learning algorithm for clustering of single-cell RNA-sequencing data.jSRC：一种用于单细胞 RNA-seq 数据聚类的灵活准确的联合学习算法。

Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbaa433.

A critical assessment of clustering algorithms to improve cell clustering and identification in single-cell transcriptome study.对聚类算法的批判性评估，以提高单细胞转录组研究中的细胞聚类和鉴定。

Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad497.

DIMM-SC: a Dirichlet mixture model for clustering droplet-based single cell transcriptomic data.DIMM-SC：一种基于 Dirichlet 混合模型的用于聚类基于液滴的单细胞转录组学数据的方法。

Bioinformatics. 2018 Jan 1;34(1):139-146. doi: 10.1093/bioinformatics/btx490.

Collaborative Structure-Preserved Missing Data Imputation for Single-Cell RNA-Seq Clustering.单细胞 RNA-Seq 聚类的协作结构保留缺失数据插补。

IEEE/ACM Trans Comput Biol Bioinform. 2024 Sep-Oct;21(5):1480-1491. doi: 10.1109/TCBB.2024.3404013. Epub 2024 Oct 9.

引用本文的文献

RGCN-BA: relational graph convolutional network with batch awareness for single-cell RNA sequencing clustering.RGCN-BA：用于单细胞RNA测序聚类的具有批次感知的关系图卷积网络

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf378.

Soft graph clustering for single-cell RNA sequencing data.用于单细胞RNA测序数据的软图聚类

BMC Bioinformatics. 2025 Jul 25;26(1):195. doi: 10.1186/s12859-025-06231-z.

IGCLAPS: an interpretable graph contrastive learning method with adaptive positive sampling for scRNA-seq data analysis.IGCLAPS：一种用于单细胞RNA测序数据分析的具有自适应正样本采样的可解释图对比学习方法。

Bioinformatics. 2025 Jul 21. doi: 10.1093/bioinformatics/btaf411.

Differentiable graph clustering with structural grouping for single-cell RNA-seq data.用于单细胞RNA测序数据的具有结构分组的可微图聚类

Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf347.

scCCTR: An iterative selection-based semi-supervised clustering model for single-cell RNA-seq data.scCCTR：一种用于单细胞RNA测序数据的基于迭代选择的半监督聚类模型。

Comput Struct Biotechnol J. 2025 Mar 14;27:1090-1102. doi: 10.1016/j.csbj.2025.03.018. eCollection 2025.

scSAMAC: saliency-adjusted masking induced attention contrastive learning for single-cell clustering.scSAMAC：用于单细胞聚类的显著性调整掩膜诱导注意力对比学习

Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf128.

Limitations of Clustering with PCA and Correlated Noise.主成分分析（PCA）聚类及相关噪声的局限性

J Stat Comput Simul. 2024;94(10):2291-2319. doi: 10.1080/00949655.2024.2329976. Epub 2024 May 5.

aKNNO: single-cell and spatial transcriptomics clustering with an optimized adaptive k-nearest neighbor graph.aKNNO：使用优化的自适应k近邻图进行单细胞和空间转录组学聚类

Genome Biol. 2024 Aug 1;25(1):203. doi: 10.1186/s13059-024-03339-y.

Single-cell omics: experimental workflow, data analyses and applications.单细胞组学：实验工作流程、数据分析及应用

Sci China Life Sci. 2025 Jan;68(1):5-102. doi: 10.1007/s11427-023-2561-0. Epub 2024 Jul 23.

Clustering and visualization of single-cell RNA-seq data using path metrics.基于路径测度的单细胞 RNA-seq 数据聚类和可视化。

PLoS Comput Biol. 2024 May 29;20(5):e1012014. doi: 10.1371/journal.pcbi.1012014. eCollection 2024 May.

本文引用的文献

SC3: consensus clustering of single-cell RNA-seq data.SC3：单细胞RNA测序数据的一致性聚类

Nat Methods. 2017 May;14(5):483-486. doi: 10.1038/nmeth.4236. Epub 2017 Mar 27.

Single-cell transcriptomes reveal characteristic features of human pancreatic islet cell types.单细胞转录组揭示了人类胰岛细胞类型的特征。

EMBO Rep. 2016 Feb;17(2):178-87. doi: 10.15252/embr.201540946. Epub 2015 Dec 21.

SINCERA: A Pipeline for Single-Cell RNA-Seq Profiling Analysis.SINCERA：一种用于单细胞RNA测序分析的流程

PLoS Comput Biol. 2015 Nov 24;11(11):e1004575. doi: 10.1371/journal.pcbi.1004575. eCollection 2015 Nov.

Defining cell types and states with single-cell genomics.利用单细胞基因组学定义细胞类型和状态。

Genome Res. 2015 Oct;25(10):1491-8. doi: 10.1101/gr.190595.115.

Single-cell messenger RNA sequencing reveals rare intestinal cell types.单细胞信使 RNA 测序揭示罕见的肠道细胞类型。

Nature. 2015 Sep 10;525(7568):251-5. doi: 10.1038/nature14966. Epub 2015 Aug 19.

Computational assignment of cell-cycle stage from single-cell transcriptome data.基于单细胞转录组数据的细胞周期阶段的计算分配

Methods. 2015 Sep 1;85:54-61. doi: 10.1016/j.ymeth.2015.06.021. Epub 2015 Jul 2.

Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets.利用纳升液滴对单个细胞进行高度并行的全基因组表达谱分析。

Cell. 2015 May 21;161(5):1202-1214. doi: 10.1016/j.cell.2015.05.002.

Spatial reconstruction of single-cell gene expression data.单细胞基因表达数据的空间重建

Nat Biotechnol. 2015 May;33(5):495-502. doi: 10.1038/nbt.3192. Epub 2015 Apr 13.

High-throughput spatial mapping of single-cell RNA-seq data to tissue of origin.高通量单细胞 RNA-seq 数据到组织起源的空间图谱绘制。

Nat Biotechnol. 2015 May;33(5):503-9. doi: 10.1038/nbt.3209. Epub 2015 Apr 13.

Identification of cell types from single-cell transcriptomes using a novel clustering method.基于新型聚类方法的单细胞转录组细胞类型鉴定。

Bioinformatics. 2015 Jun 15;31(12):1974-80. doi: 10.1093/bioinformatics/btv088. Epub 2015 Feb 11.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

主成分分析降维：单细胞转录谱的层次聚类

pcaReduce: hierarchical clustering of single cell transcriptional profiles.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献