• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单细胞RNA测序数据降维方法的比较

A Comparison for Dimensionality Reduction Methods of Single-Cell RNA-seq Data.

作者信息

Xiang Ruizhi, Wang Wencan, Yang Lei, Wang Shiyuan, Xu Chaohan, Chen Xiaowen

机构信息

College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China.

School of Optometry and Ophthalmology and Eye Hospital, Wenzhou Medical University, Wenzhou, China.

出版信息

Front Genet. 2021 Mar 23;12:646936. doi: 10.3389/fgene.2021.646936. eCollection 2021.

DOI:10.3389/fgene.2021.646936
PMID:33833778
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8021860/
Abstract

Single-cell RNA sequencing (scRNA-seq) is a high-throughput sequencing technology performed at the level of an individual cell, which can have a potential to understand cellular heterogeneity. However, scRNA-seq data are high-dimensional, noisy, and sparse data. Dimension reduction is an important step in downstream analysis of scRNA-seq. Therefore, several dimension reduction methods have been developed. We developed a strategy to evaluate the stability, accuracy, and computing cost of 10 dimensionality reduction methods using 30 simulation datasets and five real datasets. Additionally, we investigated the sensitivity of all the methods to hyperparameter tuning and gave users appropriate suggestions. We found that t-distributed stochastic neighbor embedding (t-SNE) yielded the best overall performance with the highest accuracy and computing cost. Meanwhile, uniform manifold approximation and projection (UMAP) exhibited the highest stability, as well as moderate accuracy and the second highest computing cost. UMAP well preserves the original cohesion and separation of cell populations. In addition, it is worth noting that users need to set the hyperparameters according to the specific situation before using the dimensionality reduction methods based on non-linear model and neural network.

摘要

单细胞RNA测序(scRNA-seq)是一种在单个细胞水平上进行的高通量测序技术,它有潜力帮助理解细胞异质性。然而,scRNA-seq数据是高维、有噪声且稀疏的数据。降维是scRNA-seq下游分析中的重要步骤。因此,已经开发了几种降维方法。我们开发了一种策略,使用30个模拟数据集和5个真实数据集来评估10种降维方法的稳定性、准确性和计算成本。此外,我们研究了所有方法对超参数调整的敏感性,并为用户提供了适当的建议。我们发现,t分布随机邻域嵌入(t-SNE)以最高的准确性和计算成本产生了最佳的整体性能。同时,均匀流形近似与投影(UMAP)表现出最高的稳定性,以及中等的准确性和第二高的计算成本。UMAP很好地保留了细胞群体的原始凝聚性和分离性。此外,值得注意的是,在使用基于非线性模型和神经网络的降维方法之前,用户需要根据具体情况设置超参数。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/8021860/30f53d3e712f/fgene-12-646936-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/8021860/5b70e8917fcc/fgene-12-646936-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/8021860/07ae0902f6f5/fgene-12-646936-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/8021860/53d4db565b30/fgene-12-646936-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/8021860/d6fdf142c431/fgene-12-646936-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/8021860/96b0b4004354/fgene-12-646936-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/8021860/d48226085670/fgene-12-646936-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/8021860/30f53d3e712f/fgene-12-646936-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/8021860/5b70e8917fcc/fgene-12-646936-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/8021860/07ae0902f6f5/fgene-12-646936-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/8021860/53d4db565b30/fgene-12-646936-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/8021860/d6fdf142c431/fgene-12-646936-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/8021860/96b0b4004354/fgene-12-646936-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/8021860/d48226085670/fgene-12-646936-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/8021860/30f53d3e712f/fgene-12-646936-g007.jpg

相似文献

1
A Comparison for Dimensionality Reduction Methods of Single-Cell RNA-seq Data.单细胞RNA测序数据降维方法的比较
Front Genet. 2021 Mar 23;12:646936. doi: 10.3389/fgene.2021.646936. eCollection 2021.
2
Dimensionality Reduction of Single-Cell RNA-Seq Data.单细胞 RNA-Seq 数据的降维处理。
Methods Mol Biol. 2021;2284:331-342. doi: 10.1007/978-1-0716-1307-8_18.
3
Visualizing Single-Cell RNA-seq Data with Semisupervised Principal Component Analysis.基于半监督主成分分析的单细胞 RNA-seq 数据可视化
Int J Mol Sci. 2020 Aug 12;21(16):5797. doi: 10.3390/ijms21165797.
4
Dimensionality reduction by UMAP reinforces sample heterogeneity analysis in bulk transcriptomic data.UMAP 通过降维增强了批量转录组数据中样本异质性分析。
Cell Rep. 2021 Jul 27;36(4):109442. doi: 10.1016/j.celrep.2021.109442.
5
K-nearest-neighbors induced topological PCA for single cell RNA-sequence data analysis.K 近邻诱导拓扑主成分分析在单细胞 RNA 测序数据分析中的应用。
Comput Biol Med. 2024 Jun;175:108497. doi: 10.1016/j.compbiomed.2024.108497. Epub 2024 Apr 24.
6
Analyzing Single Cell RNA Sequencing with Topological Nonnegative Matrix Factorization.使用拓扑非负矩阵分解分析单细胞RNA测序
J Comput Appl Math. 2024 Aug 1;445. doi: 10.1016/j.cam.2024.115842. Epub 2024 Feb 19.
7
Visualization of Single Cell RNA-Seq Data Using t-SNE in R.使用 R 中的 t-SNE 可视化单细胞 RNA-Seq 数据。
Methods Mol Biol. 2020;2117:159-167. doi: 10.1007/978-1-0716-0301-7_8.
8
nPCA: a linear dimensionality reduction method using a multilayer perceptron.nPCA:一种使用多层感知器的线性降维方法。
Front Genet. 2024 Jan 8;14:1290447. doi: 10.3389/fgene.2023.1290447. eCollection 2023.
9
Supervised application of internal validation measures to benchmark dimensionality reduction methods in scRNA-seq data.监督应用内部验证措施,以基准化 scRNA-seq 数据的降维方法。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab304.
10
K-Nearest-Neighbors Induced Topological PCA for Single Cell RNA-Sequence Data Analysis.用于单细胞RNA序列数据分析的K近邻诱导拓扑主成分分析
ArXiv. 2023 Oct 23:arXiv:2310.14521v1.

引用本文的文献

1
Establishing single cell RNA transcriptomics: a brief guide.建立单细胞RNA转录组学:简要指南。
Front Zool. 2025 Sep 2;22(1):25. doi: 10.1186/s12983-025-00579-x.
2
On the dilemma of using single EV analysis for liquid biopsy: the challenge of low abundance of tumor EVs in blood.关于在液体活检中使用单一细胞外囊泡分析的困境:血液中肿瘤细胞外囊泡丰度低的挑战。
Theranostics. 2025 Jul 24;15(16):8031-8048. doi: 10.7150/thno.115131. eCollection 2025.
3
Evaluating discrepancies in dimensionality reduction for time-series single-cell RNA-sequencing data.

本文引用的文献

1
Benchmarking algorithms for pathway activity transformation of single-cell RNA-seq data.单细胞RNA测序数据通路活性转化的基准算法
Comput Struct Biotechnol J. 2020 Oct 15;18:2953-2961. doi: 10.1016/j.csbj.2020.10.007. eCollection 2020.
2
Parameter tuning is a key part of dimensionality reduction via deep variational autoencoders for single cell RNA transcriptomics.参数调整是通过深度变分自编码器进行单细胞RNA转录组学降维的关键部分。
Pac Symp Biocomput. 2019;24:362-373.
3
Single-cell RNA-seq denoising using a deep count autoencoder.
评估时间序列单细胞RNA测序数据降维中的差异。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf287.
4
Precision targeting of β-catenin induces tumor reprogramming and immunity in hepatocellular cancers.精准靶向β-连环蛋白可诱导肝癌的肿瘤重编程和免疫反应。
Nat Commun. 2025 May 30;16(1):5009. doi: 10.1038/s41467-025-60457-2.
5
Methodological opportunities in genomic data analysis to advance health equity.基因组数据分析中促进健康公平的方法学机遇。
Nat Rev Genet. 2025 May 15. doi: 10.1038/s41576-025-00839-w.
6
SciGeneX: enhancing transcriptional analysis through gene module detection in single-cell and spatial transcriptomics data.SciGeneX:通过在单细胞和空间转录组学数据中进行基因模块检测来增强转录分析。
NAR Genom Bioinform. 2025 Apr 17;7(2):lqaf043. doi: 10.1093/nargab/lqaf043. eCollection 2025 Jun.
7
The impact of dropouts in scRNAseq dense neighborhood analysis.单细胞RNA测序密集邻域分析中缺失数据的影响。
Comput Struct Biotechnol J. 2025 Mar 24;27:1278-1285. doi: 10.1016/j.csbj.2025.03.033. eCollection 2025.
8
Hepatic stellate cells control liver zonation, size and functions via R-spondin 3.肝星状细胞通过R-spondin 3控制肝脏的区域化、大小和功能。
Nature. 2025 Apr;640(8059):752-761. doi: 10.1038/s41586-025-08677-w. Epub 2025 Mar 12.
9
Research Trends and Dynamics in Single-cell RNA Sequencing for Musculoskeletal Diseases: A Scientometric and Visualization Study.肌肉骨骼疾病单细胞RNA测序的研究趋势与动态:一项科学计量与可视化研究
Int J Med Sci. 2025 Jan 1;22(3):528-550. doi: 10.7150/ijms.104697. eCollection 2025.
10
Transcriptome size matters for single-cell RNA-seq normalization and bulk deconvolution.转录组大小对单细胞RNA测序标准化和批量反卷积很重要。
Nat Commun. 2025 Feb 1;16(1):1246. doi: 10.1038/s41467-025-56623-1.
基于深度计数自编码器的单细胞 RNA-seq 去噪。
Nat Commun. 2019 Jan 23;10(1):390. doi: 10.1038/s41467-018-07931-2.
4
GrandPrix: scaling up the Bayesian GPLVM for single-cell data.GrandPrix:针对单细胞数据扩展贝叶斯 GPLVM。
Bioinformatics. 2019 Jan 1;35(1):47-54. doi: 10.1093/bioinformatics/bty533.
5
Dimensionality reduction for visualizing single-cell data using UMAP.使用UMAP进行单细胞数据可视化的降维方法。
Nat Biotechnol. 2018 Dec 3. doi: 10.1038/nbt.4314.
6
Interpretable dimensionality reduction of single cell transcriptome data with deep generative models.基于深度生成模型的单细胞转录组数据的可解释维度约简。
Nat Commun. 2018 May 21;9(1):2002. doi: 10.1038/s41467-018-04368-5.
7
Splatter: simulation of single-cell RNA sequencing data.Splatter:单细胞 RNA 测序数据模拟。
Genome Biol. 2017 Sep 12;18(1):174. doi: 10.1186/s13059-017-1305-0.
8
Single-cell RNA sequencing: Technical advancements and biological applications.单细胞 RNA 测序:技术进展与生物学应用。
Mol Aspects Med. 2018 Feb;59:36-46. doi: 10.1016/j.mam.2017.07.003. Epub 2017 Aug 1.
9
Visualization and analysis of single-cell RNA-seq data by kernel-based similarity learning.基于核函数相似性学习的单细胞 RNA-seq 数据可视化与分析。
Nat Methods. 2017 Apr;14(4):414-416. doi: 10.1038/nmeth.4207. Epub 2017 Mar 6.
10
Massively parallel digital transcriptional profiling of single cells.大规模平行数字化单细胞转录组分析。
Nat Commun. 2017 Jan 16;8:14049. doi: 10.1038/ncomms14049.