• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单细胞RNA测序(scRNA-seq)数据主成分分析(PCA)图中马蹄形效应的系统分析。

Systematic analysis on the horse-shoe-like effect in PCA plots of scRNA-seq data.

作者信息

Shah Najeebullah, Meng Qiuchen, Zou Ziheng, Zhang Xuegong

机构信息

MOE Key Lab of Bioinformatics & Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, Beijing 100084, China.

School of Life Sciences and Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China.

出版信息

Bioinform Adv. 2024 Jul 29;4(1):vbae109. doi: 10.1093/bioadv/vbae109. eCollection 2024.

DOI:10.1093/bioadv/vbae109
PMID:39132288
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11316618/
Abstract

MOTIVATION

In single-cell studies, principal component analysis (PCA) is widely used to reduce the dimensionality of dataset and visualize in 2D or 3D PC plots. Scientists often focus on different clusters within PC plot, overlooking the specific phenomenon, such as horse-shoe-like effect, that may reveal hidden knowledge about underlying biological dataset. This phenomenon remains largely unexplored in single-cell studies.

RESULTS

In this study, we investigated into the horse-shoe-like effect in PC plots using simulated and real scRNA-seq datasets. We systematically explain horse-shoe-like phenomenon from various inter-related perspectives. Initially, we establish an intuitive understanding with the help of simulated datasets. Then, we generalized the acquired knowledge on real biological scRNA-seq data. Experimental results provide logical explanations and understanding for the appearance of horse-shoe-like effect in PC plots. Furthermore, we identify a potential problem with a well-known theory of 'distance saturation property' attributed to induce horse-shoe phenomenon. Finally, we analyse a mathematical model for horse-shoe effect that suggests trigonometric solutions to estimated eigenvectors. We observe significant resemblance after comparing the results of mathematical model with simulated and real scRNA-seq datasets.

AVAILABILITY AND IMPLEMENTATION

The code for reproducing the results of this study is available at: https://github.com/najeebullahshah/PCA-Horse-Shoe.

摘要

动机

在单细胞研究中,主成分分析(PCA)被广泛用于降低数据集的维度并在二维或三维主成分图中进行可视化。科学家们通常关注主成分图中的不同聚类,而忽略了可能揭示基础生物学数据集隐藏知识的特定现象,如马蹄形效应。这种现象在单细胞研究中很大程度上仍未得到探索。

结果

在本研究中,我们使用模拟和真实的单细胞RNA测序(scRNA-seq)数据集研究了主成分图中的马蹄形效应。我们从各种相互关联的角度系统地解释了马蹄形现象。首先,我们借助模拟数据集建立了直观的理解。然后,我们将获得的知识推广到真实的生物学scRNA-seq数据上。实验结果为马蹄形效应在主成分图中的出现提供了逻辑解释和理解。此外,我们发现了一个归因于诱导马蹄形现象的著名“距离饱和特性”理论的潜在问题。最后,我们分析了一个马蹄形效应的数学模型,该模型提出了估计特征向量的三角解。在将数学模型的结果与模拟和真实的scRNA-seq数据集进行比较后,我们观察到了显著的相似性。

可用性和实现

用于重现本研究结果的代码可在以下网址获取:https://github.com/najeebullahshah/PCA-Horse-Shoe 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bed9/11316618/1a9dd6c2740e/vbae109f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bed9/11316618/187d5fbb238f/vbae109f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bed9/11316618/0e8de8e18d8d/vbae109f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bed9/11316618/b0204b3a38a6/vbae109f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bed9/11316618/e427c27e4a30/vbae109f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bed9/11316618/65c9401c73e7/vbae109f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bed9/11316618/1a9dd6c2740e/vbae109f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bed9/11316618/187d5fbb238f/vbae109f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bed9/11316618/0e8de8e18d8d/vbae109f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bed9/11316618/b0204b3a38a6/vbae109f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bed9/11316618/e427c27e4a30/vbae109f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bed9/11316618/65c9401c73e7/vbae109f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bed9/11316618/1a9dd6c2740e/vbae109f6.jpg

相似文献

1
Systematic analysis on the horse-shoe-like effect in PCA plots of scRNA-seq data.单细胞RNA测序(scRNA-seq)数据主成分分析(PCA)图中马蹄形效应的系统分析。
Bioinform Adv. 2024 Jul 29;4(1):vbae109. doi: 10.1093/bioadv/vbae109. eCollection 2024.
2
scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering.scBGEDA:基于双分图集成分聚类的对偶去噪自动编码器的单细胞聚类分析。
Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad075.
3
A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa.一种用于隐性营养不良型大疱性表皮松解症的单细胞 RNA-seq 分析的多任务聚类方法。
PLoS Comput Biol. 2018 Apr 9;14(4):e1006053. doi: 10.1371/journal.pcbi.1006053. eCollection 2018 Apr.
4
scHFC: a hybrid fuzzy clustering method for single-cell RNA-seq data optimized by natural computation.scHFC:一种基于自然计算优化的单细胞 RNA-seq 数据的混合模糊聚类方法。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab588.
5
CellVGAE: an unsupervised scRNA-seq analysis workflow with graph attention networks.CellVGAE:一种基于图注意网络的无监督 scRNA-seq 分析工作流程。
Bioinformatics. 2022 Feb 7;38(5):1277-1286. doi: 10.1093/bioinformatics/btab804.
6
ClusterMap: compare multiple single cell RNA-Seq datasets across different experimental conditions.ClusterMap:跨不同实验条件比较多个单细胞 RNA-Seq 数据集。
Bioinformatics. 2019 Sep 1;35(17):3038-3045. doi: 10.1093/bioinformatics/btz024.
7
Cell-level somatic mutation detection from single-cell RNA sequencing.单细胞 RNA 测序中单细胞体细胞突变检测
Bioinformatics. 2019 Nov 1;35(22):4679-4687. doi: 10.1093/bioinformatics/btz288.
8
scGCL: an imputation method for scRNA-seq data based on graph contrastive learning.scGCL:一种基于图对比学习的 scRNA-seq 数据插补方法。
Bioinformatics. 2023 Mar 1;39(3). doi: 10.1093/bioinformatics/btad098.
9
bayNorm: Bayesian gene expression recovery, imputation and normalization for single-cell RNA-sequencing data.bayNorm:用于单细胞 RNA-seq 数据的贝叶斯基因表达恢复、插补和标准化。
Bioinformatics. 2020 Feb 15;36(4):1174-1181. doi: 10.1093/bioinformatics/btz726.
10
scCNC: a method based on capsule network for clustering scRNA-seq data.scCNC:一种基于胶囊网络的 scRNA-seq 数据聚类方法。
Bioinformatics. 2022 Aug 2;38(15):3703-3709. doi: 10.1093/bioinformatics/btac393.

本文引用的文献

1
Correspondence analysis for dimension reduction, batch integration, and visualization of single-cell RNA-seq data.基于单细胞 RNA-seq 数据降维、批次整合和可视化的对应分析。
Sci Rep. 2023 Jan 21;13(1):1197. doi: 10.1038/s41598-022-26434-1.
2
How does the structure of data impact cell-cell similarity? Evaluating how structural properties influence the performance of proximity metrics in single cell RNA-seq data.数据结构如何影响细胞间的相似性?评估结构属性如何影响单细胞 RNA-seq 数据中邻近度量的性能。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac387.
3
Towards a comprehensive evaluation of dimension reduction methods for transcriptomic data visualization.
面向转录组数据可视化的降维方法综合评估。
Commun Biol. 2022 Jul 19;5(1):719. doi: 10.1038/s42003-022-03628-x.
4
Impact of Data Preprocessing on Integrative Matrix Factorization of Single Cell Data.数据预处理对单细胞数据整合矩阵分解的影响
Front Oncol. 2020 Jun 23;10:973. doi: 10.3389/fonc.2020.00973. eCollection 2020.
5
S100a4 upregulation in Pik3caH1047R;Trp53R270H;MMTV-Cre-driven mammary tumors promotes metastasis.S100a4 在 Pik3caH1047R;Trp53R270H;MMTV-Cre 驱动的乳腺肿瘤中的上调促进转移。
Breast Cancer Res. 2019 Dec 27;21(1):152. doi: 10.1186/s13058-019-1238-5.
6
Single-Cell RNA-Seq Reveals Cellular Heterogeneity of Pluripotency Transition and X Chromosome Dynamics during Early Mouse Development.单细胞 RNA 测序揭示了早期小鼠发育过程中多能性转换和 X 染色体动态的细胞异质性。
Cell Rep. 2019 Mar 5;26(10):2593-2607.e3. doi: 10.1016/j.celrep.2019.02.031.
7
Slingshot: cell lineage and pseudotime inference for single-cell transcriptomics.弹弓:单细胞转录组学的细胞谱系和伪时间推断。
BMC Genomics. 2018 Jun 19;19(1):477. doi: 10.1186/s12864-018-4772-0.
8
Single-cell mapping of gene expression landscapes and lineage in the zebrafish embryo.单细胞映射斑马鱼胚胎中的基因表达图谱和谱系。
Science. 2018 Jun 1;360(6392):981-987. doi: 10.1126/science.aar4362. Epub 2018 Apr 26.
9
Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors.通过匹配相互最近邻,纠正单细胞 RNA 测序数据中的批次效应。
Nat Biotechnol. 2018 Jun;36(5):421-427. doi: 10.1038/nbt.4091. Epub 2018 Apr 2.
10
Uncovering the Horseshoe Effect in Microbial Analyses.揭示微生物分析中的马蹄铁效应。
mSystems. 2017 Feb 21;2(1). doi: 10.1128/mSystems.00166-16. eCollection 2017 Jan-Feb.