• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

scBKAP:基于二分 K-Means 的单细胞 RNA-Seq 数据聚类模型。

scBKAP: A Clustering Model for Single-Cell RNA-Seq Data Based on Bisecting K-Means.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2023 May-Jun;20(3):2007-2015. doi: 10.1109/TCBB.2022.3230098. Epub 2023 Jun 5.

DOI:10.1109/TCBB.2022.3230098
PMID:37015596
Abstract

Advances in single-cell RNA sequencing (scRNA-seq) technologies allow researchers to analyze the genome-wide transcription profile and to solve biological problems at the individual-cell resolution. However, existing clustering methods on scRNA-seq suffer from high dropout rate and curse of dimensionality in the data. Here, we propose a novel pipeline, scBKAP, the cornerstone of which is a single-cell bisecting K-means clustering method based on an autoencoder network and a dimensionality reduction model MPDR. Specially, scBKAP utilizes an autoencoder network to reconstruct gene expression values from scRNA-seq data to alleviate the dropout issue, and the MPDR model composed of the M3Drop feature selection algorithm and the PHATE dimensionality reduction algorithm to reduce the dimensions of reconstructed data. The dimensionality-reduced data are then fed into the bisecting K-means clustering algorithm to identify the clusters of cells. Comprehensive experiments demonstrate scBKAP's superior performance over nine state-of-the-art single-cell clustering methods on 21 public scRNA-seq datasets and simulated datasets. The source codes and datasets are available at https://github.com/YuBinLab-QUST/scBKAP/ and https://doi.org/10.24433/CO.4592131.v1.

摘要

单细胞 RNA 测序 (scRNA-seq) 技术的进步使研究人员能够分析全基因组转录谱,并以单细胞分辨率解决生物学问题。然而,现有的 scRNA-seq 聚类方法存在高缺失率和数据维度诅咒的问题。在这里,我们提出了一种新的流水线 scBKAP,其基石是基于自动编码器网络和降维模型 MPDR 的单细胞二分 K-均值聚类方法。特别地,scBKAP 利用自动编码器网络从 scRNA-seq 数据中重建基因表达值,以减轻缺失问题,而由 M3Drop 特征选择算法和 PHATE 降维算法组成的 MPDR 模型则用于降低重建数据的维度。然后,将降维后的数据输入二分 K-均值聚类算法以识别细胞簇。综合实验表明,在 21 个公共 scRNA-seq 数据集和模拟数据集上,scBKAP 在九种最先进的单细胞聚类方法中的性能更为优越。源代码和数据集可在 https://github.com/YuBinLab-QUST/scBKAP/ 和 https://doi.org/10.24433/CO.4592131.v1 上获得。

相似文献

1
scBKAP: A Clustering Model for Single-Cell RNA-Seq Data Based on Bisecting K-Means.scBKAP:基于二分 K-Means 的单细胞 RNA-Seq 数据聚类模型。
IEEE/ACM Trans Comput Biol Bioinform. 2023 May-Jun;20(3):2007-2015. doi: 10.1109/TCBB.2022.3230098. Epub 2023 Jun 5.
2
scHFC: a hybrid fuzzy clustering method for single-cell RNA-seq data optimized by natural computation.scHFC:一种基于自然计算优化的单细胞 RNA-seq 数据的混合模糊聚类方法。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab588.
3
scGMAI: a Gaussian mixture model for clustering single-cell RNA-Seq data based on deep autoencoder.scGMAI:基于深度自动编码器的单细胞 RNA-Seq 数据聚类的高斯混合模型。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa316.
4
scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering.scBGEDA:基于双分图集成分聚类的对偶去噪自动编码器的单细胞聚类分析。
Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad075.
5
scDCCA: deep contrastive clustering for single-cell RNA-seq data based on auto-encoder network.scDCCA:基于自动编码器网络的单细胞RNA测序数据深度对比聚类
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac625.
6
Dimensionality reduction and visualization of single-cell RNA-seq data with an improved deep variational autoencoder.基于改进深度变分自动编码器的单细胞 RNA-seq 数据降维和可视化。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad152.
7
Deep enhanced constraint clustering based on contrastive learning for scRNA-seq data.基于对比学习的深度增强约束聚类算法在单细胞 RNA-seq 数据分析中的应用。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad222.
8
scGAC: a graph attentional architecture for clustering single-cell RNA-seq data.scGAC:一种用于聚类单细胞 RNA-seq 数据的图注意力架构。
Bioinformatics. 2022 Apr 12;38(8):2187-2193. doi: 10.1093/bioinformatics/btac099.
9
SSNMDI: a novel joint learning model of semi-supervised non-negative matrix factorization and data imputation for clustering of single-cell RNA-seq data.SSNMDI:一种用于单细胞 RNA-seq 数据聚类的半监督非负矩阵分解和数据插补的新型联合学习模型。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad149.
10
Deep structural clustering for single-cell RNA-seq data jointly through autoencoder and graph neural network.基于自动编码器和图神经网络的单细胞 RNA-seq 数据深度结构聚类。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac018.

引用本文的文献

1
scPEDSSC: proximity enhanced deep sparse subspace clustering method for scRNA-seq data.scPEDSSC:用于单细胞RNA测序数据的邻近增强深度稀疏子空间聚类方法
PLoS Comput Biol. 2025 Apr 28;21(4):e1012924. doi: 10.1371/journal.pcbi.1012924. eCollection 2025 Apr.
2
Transfer learning for clustering single-cell RNA-seq data crossing-species and batch, case on uterine fibroids.跨物种和批次的单细胞 RNA-seq 数据聚类的迁移学习:以子宫肌瘤为例。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad426.
3
A universal framework for single-cell multi-omics data integration with graph convolutional networks.
基于图卷积网络的单细胞多组学数据集成通用框架
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad081.