Suppr超能文献

RFCell:一种基于排列和随机森林的单细胞RNA测序聚类基因选择方法。

RFCell: A Gene Selection Approach for scRNA-seq Clustering Based on Permutation and Random Forest.

作者信息

Zhao Yuan, Fang Zhao-Yu, Lin Cui-Xiang, Deng Chao, Xu Yun-Pei, Li Hong-Dong

机构信息

Hunan Provincial Key Laboratory on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, China.

School of Mathematics and Statistics, Central South University, Changsha, China.

出版信息

Front Genet. 2021 Jul 27;12:665843. doi: 10.3389/fgene.2021.665843. eCollection 2021.

Abstract

In recent years, the application of single cell RNA-seq (scRNA-seq) has become more and more popular in fields such as biology and medical research. Analyzing scRNA-seq data can discover complex cell populations and infer single-cell trajectories in cell development. Clustering is one of the most important methods to analyze scRNA-seq data. In this paper, we focus on improving scRNA-seq clustering through gene selection, which also reduces the dimensionality of scRNA-seq data. Studies have shown that gene selection for scRNA-seq data can improve clustering accuracy. Therefore, it is important to select genes with cell type specificity. Gene selection not only helps to reduce the dimensionality of scRNA-seq data, but also can improve cell type identification in combination with clustering methods. Here, we proposed RFCell, a supervised gene selection method, which is based on permutation and random forest classification. We first use RFCell and three existing gene selection methods to select gene sets on 10 scRNA-seq data sets. Then, three classical clustering algorithms are used to cluster the cells obtained by these gene selection methods. We found that the gene selection performance of RFCell was better than other gene selection methods.

摘要

近年来,单细胞RNA测序(scRNA-seq)在生物学和医学研究等领域的应用越来越广泛。分析scRNA-seq数据可以发现复杂的细胞群体,并推断细胞发育过程中的单细胞轨迹。聚类是分析scRNA-seq数据的最重要方法之一。在本文中,我们专注于通过基因选择来改进scRNA-seq聚类,这也降低了scRNA-seq数据的维度。研究表明,对scRNA-seq数据进行基因选择可以提高聚类准确性。因此,选择具有细胞类型特异性的基因很重要。基因选择不仅有助于降低scRNA-seq数据的维度,还可以结合聚类方法提高细胞类型识别能力。在此,我们提出了RFCell,一种基于排列和随机森林分类的监督基因选择方法。我们首先使用RFCell和三种现有的基因选择方法在10个scRNA-seq数据集上选择基因集。然后,使用三种经典聚类算法对通过这些基因选择方法获得的细胞进行聚类。我们发现RFCell的基因选择性能优于其他基因选择方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a5/8354212/35b7e186e658/fgene-12-665843-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验