• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单细胞RNA测序数据分析方法组合的比较。

Comparison of scRNA-seq data analysis method combinations.

作者信息

Xu Li, Xue Tong, Ding Weiyue, Shen Linshan

出版信息

Brief Funct Genomics. 2022 Nov 17;21(6):433-440. doi: 10.1093/bfgp/elac027.

DOI:10.1093/bfgp/elac027
PMID:36124658
Abstract

Single-cell ribonucleic acid (RNA)-sequencing (scRNA-seq) data analysis refers to the use of appropriate methods to analyze the dataset generated by RNA-sequencing performed on the single-cell transcriptome. It usually contains three steps: normalization to eliminate the technical noise, dimensionality reduction to facilitate visual understanding and data compression and clustering to divide the data into several similarity-based clusters. In addition, the gene expression data contain a large number of zero counts. These zero counts are considered relevant to random dropout events induced by multiple factors in the sequencing experiments, such as low RNA input, and the stochastic nature of the gene expression pattern at the single-cell level. The zero counts can be eliminated only through the analysis of the scRNA-seq data, and although many methods have been proposed to this end, there is still a lack of research on the combined effect of existing methods. In this paper, we summarize the two kinds of normalization, two kinds of dimension reduction and three kinds of clustering methods widely used in the current mainstream scRNA-seq data analysis. Furthermore, we propose to combine these methods into 12 technology combinations, each with a whole set of scRNA-seq data analysis processes. We evaluated the proposed combinations using Goolam, a publicly available scRNA-seq, by comparing the final clustering results and found the most suitable collection scheme of these classic methods. Our results showed that using appropriate technology combinations can improve the efficiency and accuracy of the scRNA-seq data analysis. The combinations not only satisfy the basic requirements of noise reduction, dimension reduction and cell clustering but also ensure preserving the heterogeneity of cells in downstream analysis. The dataset, Goolam, used in the study can be obtained from the ArrayExpress database under the accession number E-MTAB-3321.

摘要

单细胞核糖核酸(RNA)测序(scRNA-seq)数据分析是指运用适当方法分析对单细胞转录组进行RNA测序所生成的数据集。它通常包含三个步骤:归一化以消除技术噪声、降维以便于直观理解和数据压缩,以及聚类以将数据划分为几个基于相似度的簇。此外,基因表达数据包含大量零计数。这些零计数被认为与测序实验中多种因素引发的随机丢失事件相关,比如低RNA输入以及单细胞水平上基因表达模式的随机性。只有通过scRNA-seq数据分析才能消除零计数,尽管为此已经提出了许多方法,但对于现有方法的组合效应仍缺乏研究。在本文中,我们总结了当前主流scRNA-seq数据分析中广泛使用的两类归一化方法、两类降维方法和三类聚类方法。此外,我们提议将这些方法组合成12种技术组合,每种组合都有一整套scRNA-seq数据分析流程。我们通过比较最终聚类结果,使用公开可用的scRNA-seq数据集Goolam对所提出的组合进行了评估,并找到了这些经典方法最合适的组合方案。我们的结果表明,使用适当的技术组合可以提高scRNA-seq数据分析的效率和准确性。这些组合不仅满足降噪、降维和细胞聚类的基本要求,还能确保在下游分析中保留细胞的异质性。该研究中使用的数据集Goolam可从ArrayExpress数据库获取,登录号为E-MTAB-3321。

相似文献

1
Comparison of scRNA-seq data analysis method combinations.单细胞RNA测序数据分析方法组合的比较。
Brief Funct Genomics. 2022 Nov 17;21(6):433-440. doi: 10.1093/bfgp/elac027.
2
scDSSC: Deep Sparse Subspace Clustering for scRNA-seq Data.scDSSC:用于 scRNA-seq 数据的深度稀疏子空间聚类。
PLoS Comput Biol. 2022 Dec 19;18(12):e1010772. doi: 10.1371/journal.pcbi.1010772. eCollection 2022 Dec.
3
Attention-based deep clustering method for scRNA-seq cell type identification.基于注意力机制的深度聚类方法在 scRNA-seq 细胞类型鉴定中的应用。
PLoS Comput Biol. 2023 Nov 10;19(11):e1011641. doi: 10.1371/journal.pcbi.1011641. eCollection 2023 Nov.
4
Machine learning and statistical methods for clustering single-cell RNA-sequencing data.机器学习和统计方法在单细胞 RNA 测序数据分析中的应用。
Brief Bioinform. 2020 Jul 15;21(4):1209-1223. doi: 10.1093/bib/bbz063.
5
Dimension Reduction and Clustering Models for Single-Cell RNA Sequencing Data: A Comparative Study.降维与聚类模型在单细胞 RNA 测序数据中的应用:一项比较研究。
Int J Mol Sci. 2020 Mar 22;21(6):2181. doi: 10.3390/ijms21062181.
6
scHFC: a hybrid fuzzy clustering method for single-cell RNA-seq data optimized by natural computation.scHFC:一种基于自然计算优化的单细胞 RNA-seq 数据的混合模糊聚类方法。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab588.
7
A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa.一种用于隐性营养不良型大疱性表皮松解症的单细胞 RNA-seq 分析的多任务聚类方法。
PLoS Comput Biol. 2018 Apr 9;14(4):e1006053. doi: 10.1371/journal.pcbi.1006053. eCollection 2018 Apr.
8
Data Analysis in Single-Cell Transcriptome Sequencing.单细胞转录组测序中的数据分析
Methods Mol Biol. 2018;1754:311-326. doi: 10.1007/978-1-4939-7717-8_18.
9
A Personalized Low-Rank Subspace Clustering Method Based on Locality and Similarity Constraints for scRNA-seq Data Analysis.基于局部性和相似性约束的个性化低秩子空间聚类方法在 scRNA-seq 数据分析中的应用。
IEEE J Biomed Health Inform. 2023 May;27(5):2575-2584. doi: 10.1109/JBHI.2023.3247723. Epub 2023 May 4.
10
A new and effective two-step clustering approach for single cell RNA sequencing data.一种新的、有效的单细胞 RNA 测序数据两步聚类方法。
BMC Genomics. 2023 Nov 9;23(Suppl 6):864. doi: 10.1186/s12864-023-09577-x.

引用本文的文献

1
A lossless reference-free sequence compression algorithm leveraging grammatical, statistical, and substitution rules.一种利用语法、统计和替换规则的无损无参考序列压缩算法。
Brief Funct Genomics. 2025 Jan 15;24. doi: 10.1093/bfgp/elae050.