• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 QDE-SVM 的 scRNA-seq 数据基因特征选择和细胞类型分类方法。

On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data.

机构信息

Faculty of Information Science and Technology, Multimedia University, Bukit Beruang, Melaka, Malaysia.

出版信息

PLoS One. 2023 Oct 19;18(10):e0292961. doi: 10.1371/journal.pone.0292961. eCollection 2023.

DOI:10.1371/journal.pone.0292961
PMID:37856458
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10586655/
Abstract

Cell type identification is one of the fundamental tasks in single-cell RNA sequencing (scRNA-seq) studies. It is a key step to facilitate downstream interpretations such as differential expression, trajectory inference, etc. scRNA-seq data contains technical variations that could affect the interpretation of the cell types. Therefore, gene selection, also known as feature selection in data science, plays an important role in selecting informative genes for scRNA-seq cell type identification. Generally speaking, feature selection methods are categorized into filter-, wrapper-, and embedded-based approaches. From the existing literature, methods from filter- and embedded-based approaches are widely applied in scRNA-seq gene selection tasks. The wrapper-based method that gives promising results in other fields has yet been extensively utilized for selecting gene features from scRNA-seq data; in addition, most of the existing wrapper methods used in this field are clustering instead of classification-based. With a large number of annotated data available today, this study applied a classification-based approach as an alternative to the clustering-based wrapper method. In our work, a quantum-inspired differential evolution (QDE) wrapped with a classification method was introduced to select a subset of genes from twelve well-known scRNA-seq transcriptomic datasets to identify cell types. In particular, the QDE was combined with different machine-learning (ML) classifiers namely logistic regression, decision tree, support vector machine (SVM) with linear and radial basis function kernels, as well as extreme learning machine. The linear SVM wrapped with QDE, namely QDE-SVM, was chosen by referring to the feature selection results from the experiment. QDE-SVM showed a superior cell type classification performance among QDE wrapping with other ML classifiers as well as the recent wrapper methods (i.e., FSCAM, SSD-LAHC, MA-HS, and BSF). QDE-SVM achieved an average accuracy of 0.9559, while the other wrapper methods achieved average accuracies in the range of 0.8292 to 0.8872.

摘要

细胞类型鉴定是单细胞 RNA 测序 (scRNA-seq) 研究中的基本任务之一。它是促进下游解释(如差异表达、轨迹推断等)的关键步骤。scRNA-seq 数据包含可能影响细胞类型解释的技术变化。因此,基因选择(在数据科学中也称为特征选择)在为 scRNA-seq 细胞类型鉴定选择信息丰富的基因方面起着重要作用。一般来说,特征选择方法分为过滤型、包装型和嵌入式方法。从现有文献来看,过滤型和嵌入式方法的方法在 scRNA-seq 基因选择任务中得到了广泛应用。在其他领域中给出有希望结果的包装型方法尚未广泛用于从 scRNA-seq 数据中选择基因特征;此外,该领域中使用的大多数现有包装方法都是基于聚类而不是基于分类。有了大量现有的注释数据,本研究应用了一种基于分类的方法作为聚类包装方法的替代方法。在我们的工作中,引入了一种基于量子启发式差分进化 (QDE) 的分类方法,用于从 12 个著名的 scRNA-seq 转录组数据集选择一组基因,以识别细胞类型。特别是,QDE 与不同的机器学习 (ML) 分类器(即逻辑回归、决策树、带有线性和径向基函数核的支持向量机 (SVM) 以及极限学习机)相结合。通过参考实验中的特征选择结果,选择了 QDE 与其他 ML 分类器以及最近的包装方法(即 FSCAM、SSD-LAHC、MA-HS 和 BSF)包装的线性 SVM,即 QDE-SVM。QDE-SVM 与其他包装方法以及最近的包装方法(即 FSCAM、SSD-LAHC、MA-HS 和 BSF)相比,在细胞类型分类性能方面表现出优越的性能。QDE-SVM 的平均准确率为 0.9559,而其他包装方法的平均准确率在 0.8292 到 0.8872 之间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8491/10586655/af487e1f5b39/pone.0292961.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8491/10586655/8aa5e59b2d96/pone.0292961.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8491/10586655/3753b848679c/pone.0292961.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8491/10586655/af487e1f5b39/pone.0292961.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8491/10586655/8aa5e59b2d96/pone.0292961.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8491/10586655/3753b848679c/pone.0292961.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8491/10586655/af487e1f5b39/pone.0292961.g003.jpg

相似文献

1
On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data.基于 QDE-SVM 的 scRNA-seq 数据基因特征选择和细胞类型分类方法。
PLoS One. 2023 Oct 19;18(10):e0292961. doi: 10.1371/journal.pone.0292961. eCollection 2023.
2
FSCAM: CAM-Based Feature Selection for Clustering scRNA-seq.FSCAM:基于 CAM 的 scRNA-seq 聚类特征选择。
Interdiscip Sci. 2022 Jun;14(2):394-408. doi: 10.1007/s12539-021-00495-8. Epub 2022 Jan 14.
3
CTISL: a dynamic stacking multi-class classification approach for identifying cell types from single-cell RNA-seq data.CTISL:一种动态堆叠多类分类方法,用于从单细胞 RNA-seq 数据中识别细胞类型。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae063.
4
Deep enhanced constraint clustering based on contrastive learning for scRNA-seq data.基于对比学习的深度增强约束聚类算法在单细胞 RNA-seq 数据分析中的应用。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad222.
5
A hybrid deep clustering approach for robust cell type profiling using single-cell RNA-seq data.基于单细胞 RNA-seq 数据的混合深度聚类方法进行稳健的细胞类型分析。
RNA. 2020 Oct;26(10):1303-1319. doi: 10.1261/rna.074427.119. Epub 2020 Jun 12.
6
scDSSC: Deep Sparse Subspace Clustering for scRNA-seq Data.scDSSC:用于 scRNA-seq 数据的深度稀疏子空间聚类。
PLoS Comput Biol. 2022 Dec 19;18(12):e1010772. doi: 10.1371/journal.pcbi.1010772. eCollection 2022 Dec.
7
Machine learning and statistical methods for clustering single-cell RNA-sequencing data.机器学习和统计方法在单细胞 RNA 测序数据分析中的应用。
Brief Bioinform. 2020 Jul 15;21(4):1209-1223. doi: 10.1093/bib/bbz063.
8
Single-Cell RNA Sequencing Analysis: A Step-by-Step Overview.单细胞 RNA 测序分析:分步概述。
Methods Mol Biol. 2021;2284:343-365. doi: 10.1007/978-1-0716-1307-8_19.
9
Cell Type Annotation Model Selection: General-Purpose vs. Pattern-Aware Feature Gene Selection in Single-Cell RNA-Seq Data.细胞类型注释模型选择:单细胞 RNA-Seq 数据中的通用型与模式感知特征基因选择
Genes (Basel). 2023 Feb 26;14(3):596. doi: 10.3390/genes14030596.
10
Dimension Reduction and Clustering Models for Single-Cell RNA Sequencing Data: A Comparative Study.降维与聚类模型在单细胞 RNA 测序数据中的应用:一项比较研究。
Int J Mol Sci. 2020 Mar 22;21(6):2181. doi: 10.3390/ijms21062181.

引用本文的文献

1
Feature selection methods affect the performance of scRNA-seq data integration and querying.特征选择方法会影响单细胞RNA测序(scRNA-seq)数据整合与查询的性能。
Nat Methods. 2025 Apr;22(4):834-844. doi: 10.1038/s41592-025-02624-3. Epub 2025 Mar 13.

本文引用的文献

1
A Novel Algorithm for Feature Selection Using Penalized Regression with Applications to Single-Cell RNA Sequencing Data.一种使用惩罚回归进行特征选择的新算法及其在单细胞RNA测序数据中的应用
Biology (Basel). 2022 Oct 12;11(10):1495. doi: 10.3390/biology11101495.
2
SMaSH: a scalable, general marker gene identification framework for single-cell RNA-sequencing.SMaSH:一种用于单细胞 RNA 测序的可扩展的通用标记基因识别框架。
BMC Bioinformatics. 2022 Aug 8;23(1):328. doi: 10.1186/s12859-022-04860-2.
3
scEFSC: Accurate single-cell RNA-seq data analysis via ensemble consensus clustering based on multiple feature selections.
scEFSC:基于多重特征选择的集成共识聚类实现准确的单细胞RNA测序数据分析
Comput Struct Biotechnol J. 2022 Apr 27;20:2181-2197. doi: 10.1016/j.csbj.2022.04.023. eCollection 2022.
4
Triku: a feature selection method based on nearest neighbors for single-cell data.Triku:一种基于最近邻的单细胞数据分析特征选择方法。
Gigascience. 2022 Mar 12;11. doi: 10.1093/gigascience/giac017.
5
sc-REnF: An entropy guided robust feature selection for single-cell RNA-seq data.sc-REnF:一种用于单细胞RNA测序数据的熵引导鲁棒特征选择方法
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab517.
6
FSCAM: CAM-Based Feature Selection for Clustering scRNA-seq.FSCAM:基于 CAM 的 scRNA-seq 聚类特征选择。
Interdiscip Sci. 2022 Jun;14(2):394-408. doi: 10.1007/s12539-021-00495-8. Epub 2022 Jan 14.
7
Feature selection revisited in the single-cell era.单细胞时代的特征选择再探讨。
Genome Biol. 2021 Dec 1;22(1):321. doi: 10.1186/s13059-021-02544-3.
8
From bulk, single-cell to spatial RNA sequencing.从批量、单细胞到空间 RNA 测序。
Int J Oral Sci. 2021 Nov 15;13(1):36. doi: 10.1038/s41368-021-00146-0.
9
Single-cell analysis of the human pancreas in type 2 diabetes using multi-spectral imaging mass cytometry.使用多光谱成像质谱流式细胞术分析 2 型糖尿病患者的人胰腺单细胞。
Cell Rep. 2021 Nov 2;37(5):109919. doi: 10.1016/j.celrep.2021.109919.
10
RFCell: A Gene Selection Approach for scRNA-seq Clustering Based on Permutation and Random Forest.RFCell:一种基于排列和随机森林的单细胞RNA测序聚类基因选择方法。
Front Genet. 2021 Jul 27;12:665843. doi: 10.3389/fgene.2021.665843. eCollection 2021.