• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于子空间和加权距离的小 scRNA-seq 数据聚类方法。

A clustering method for small scRNA-seq data based on subspace and weighted distance.

机构信息

Hunan Engineering & Technology Research Centre for Agricultural Big Data Analysis & Decision-Making, Hunan Agricultural University, Changsha, Hunan, China.

Hunan Agricultural University, College of Information and Intelligence, Changsha, Hunan, China.

出版信息

PeerJ. 2023 Jan 23;11:e14706. doi: 10.7717/peerj.14706. eCollection 2023.

DOI:10.7717/peerj.14706
PMID:36710872
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9879162/
Abstract

BACKGROUND

Identifying the cell types using unsupervised methods is essential for scRNA-seq research. However, conventional similarity measures introduce challenges to single-cell data clustering because of the high dimensional, high noise, and high dropout.

METHODS

We proposed a clustering method for small cRNA-seq data based on ubspace and eighted istance (SSWD), which follows the assumption that the sets of gene subspace composed of similar density-distributing genes can better distinguish cell groups. To accurately capture the intrinsic relationship among cells or genes, a new distance metric that combines Euclidean and Pearson distance through a weighting strategy was proposed. The relative Calinski-Harabasz (CH) index was used to estimate the cluster numbers instead of the CH index because it is comparable across degrees of freedom.

RESULTS

We compared SSWD with seven prevailing methods on eight publicly scRNA-seq datasets. The experimental results show that the SSWD has better clustering accuracy and the partitioning ability of cell groups. SSWD can be downloaded at https://github.com/ningzilan/SSWD.

摘要

背景

使用无监督方法识别细胞类型对于 scRNA-seq 研究至关重要。然而,由于高维、高噪声和高缺失,传统的相似性度量方法给单细胞数据聚类带来了挑战。

方法

我们提出了一种基于子空间和加权距离(SSWD)的小型 cRNA-seq 数据聚类方法,该方法遵循这样的假设,即由相似密度分布基因组成的基因子空间集可以更好地区分细胞群。为了准确捕捉细胞或基因之间的内在关系,我们通过加权策略提出了一种新的距离度量,它结合了欧几里得和皮尔逊距离。我们使用相对 Calinski-Harabasz(CH)指数来估计聚类数量,而不是 CH 指数,因为它在自由度方面具有可比性。

结果

我们在八个公开的 scRNA-seq 数据集上比较了 SSWD 与七种流行的方法。实验结果表明,SSWD 具有更好的聚类准确性和细胞群的划分能力。SSWD 可以在 https://github.com/ningzilan/SSWD 上下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ec0/9879162/cacb86a3fd7a/peerj-11-14706-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ec0/9879162/92249676dced/peerj-11-14706-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ec0/9879162/3f90502751d2/peerj-11-14706-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ec0/9879162/8bf362d6f4f6/peerj-11-14706-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ec0/9879162/15d662152735/peerj-11-14706-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ec0/9879162/5405ce28a630/peerj-11-14706-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ec0/9879162/cacb86a3fd7a/peerj-11-14706-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ec0/9879162/92249676dced/peerj-11-14706-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ec0/9879162/3f90502751d2/peerj-11-14706-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ec0/9879162/8bf362d6f4f6/peerj-11-14706-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ec0/9879162/15d662152735/peerj-11-14706-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ec0/9879162/5405ce28a630/peerj-11-14706-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ec0/9879162/cacb86a3fd7a/peerj-11-14706-g006.jpg

相似文献

1
A clustering method for small scRNA-seq data based on subspace and weighted distance.基于子空间和加权距离的小 scRNA-seq 数据聚类方法。
PeerJ. 2023 Jan 23;11:e14706. doi: 10.7717/peerj.14706. eCollection 2023.
2
A Personalized Low-Rank Subspace Clustering Method Based on Locality and Similarity Constraints for scRNA-seq Data Analysis.基于局部性和相似性约束的个性化低秩子空间聚类方法在 scRNA-seq 数据分析中的应用。
IEEE J Biomed Health Inform. 2023 May;27(5):2575-2584. doi: 10.1109/JBHI.2023.3247723. Epub 2023 May 4.
3
JLONMFSC: Clustering scRNA-seq data based on joint learning of non-negative matrix factorization and subspace clustering.JLONMFSC:基于非负矩阵分解和子空间聚类联合学习的 scRNA-seq 数据聚类。
Methods. 2024 Feb;222:1-9. doi: 10.1016/j.ymeth.2023.11.019. Epub 2023 Dec 19.
4
scDSSC: Deep Sparse Subspace Clustering for scRNA-seq Data.scDSSC:用于 scRNA-seq 数据的深度稀疏子空间聚类。
PLoS Comput Biol. 2022 Dec 19;18(12):e1010772. doi: 10.1371/journal.pcbi.1010772. eCollection 2022 Dec.
5
scNAME: neighborhood contrastive clustering with ancillary mask estimation for scRNA-seq data.scNAME:基于辅助掩模估计的 scRNA-seq 数据邻域对比聚类。
Bioinformatics. 2022 Mar 4;38(6):1575-1583. doi: 10.1093/bioinformatics/btac011.
6
Deep enhanced constraint clustering based on contrastive learning for scRNA-seq data.基于对比学习的深度增强约束聚类算法在单细胞 RNA-seq 数据分析中的应用。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad222.
7
GNN-based embedding for clustering scRNA-seq data.基于图神经网络的 scRNA-seq 数据聚类嵌入方法。
Bioinformatics. 2022 Jan 27;38(4):1037-1044. doi: 10.1093/bioinformatics/btab787.
8
scHFC: a hybrid fuzzy clustering method for single-cell RNA-seq data optimized by natural computation.scHFC:一种基于自然计算优化的单细胞 RNA-seq 数据的混合模糊聚类方法。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab588.
9
scMAGS: Marker gene selection from scRNA-seq data for spatial transcriptomics studies.scMAGS:从单细胞RNA测序数据中选择标记基因用于空间转录组学研究。
Comput Biol Med. 2023 Mar;155:106634. doi: 10.1016/j.compbiomed.2023.106634. Epub 2023 Feb 9.
10
scGAC: a graph attentional architecture for clustering single-cell RNA-seq data.scGAC:一种用于聚类单细胞 RNA-seq 数据的图注意力架构。
Bioinformatics. 2022 Apr 12;38(8):2187-2193. doi: 10.1093/bioinformatics/btac099.

引用本文的文献

1
ScAGCN: Graph Convolutional Network with Adaptive Aggregation Mechanism for scRNA-seq Data Dimensionality Reduction.ScAGCN:用于单细胞RNA测序数据降维的具有自适应聚合机制的图卷积网络
Interdiscip Sci. 2025 Apr 25. doi: 10.1007/s12539-025-00702-w.
2
scMEDAL for the interpretable analysis of single-cell transcriptomics data with batch effect visualization using a deep mixed effects autoencoder.scMEDAL:使用深度混合效应自动编码器进行单细胞转录组学数据的可解释分析及批次效应可视化
Res Sq. 2025 Mar 19:rs.3.rs-6081478. doi: 10.21203/rs.3.rs-6081478/v1.
3
scMEDAL for the interpretable analysis of single-cell transcriptomics data with batch effect visualization using a deep mixed effects autoencoder.

本文引用的文献

1
Entropy subspace separation-based clustering for noise reduction (ENCORE) of scRNA-seq data.基于熵子空间分离的 scRNA-seq 数据降噪聚类(ENCORE)。
Nucleic Acids Res. 2021 Feb 22;49(3):e18. doi: 10.1093/nar/gkaa1157.
2
Identification of cell types from single cell data using stable clustering.基于稳定聚类的单细胞数据中的细胞类型鉴定。
Sci Rep. 2020 Jul 23;10(1):12349. doi: 10.1038/s41598-020-66848-3.
3
Resolving single-cell heterogeneity from hundreds of thousands of cells through sequential hybrid clustering and NMF.
scMEDAL:用于通过深度混合效应自动编码器进行批量效应可视化的单细胞转录组学数据的可解释分析。
ArXiv. 2025 Mar 13:arXiv:2411.06635v3.
通过顺序混合聚类和 NMF 从数十万细胞中解析单细胞异质性。
Bioinformatics. 2020 Jun 1;36(12):3773-3780. doi: 10.1093/bioinformatics/btaa201.
4
Single-cell RNA-seq clustering: datasets, models, and algorithms.单细胞 RNA-seq 聚类:数据集、模型和算法。
RNA Biol. 2020 Jun;17(6):765-783. doi: 10.1080/15476286.2020.1728961. Epub 2020 Mar 1.
5
Tracing tumorigenesis in a solid tumor model at single-cell resolution.单细胞分辨率追踪实体瘤模型中的肿瘤发生。
Nat Commun. 2020 Feb 20;11(1):991. doi: 10.1038/s41467-020-14777-0.
6
Global Sensitivity Analysis of Bulk Properties of an Atomic Nucleus.原子核体性质的全局灵敏度分析。
Phys Rev Lett. 2019 Dec 20;123(25):252501. doi: 10.1103/PhysRevLett.123.252501.
7
SAME-clustering: Single-cell Aggregated Clustering via Mixture Model Ensemble.SAME 聚类:基于混合模型集成的单细胞聚集聚类。
Nucleic Acids Res. 2020 Jan 10;48(1):86-95. doi: 10.1093/nar/gkz959.
8
Clustering and classification methods for single-cell RNA-sequencing data.单细胞 RNA 测序数据的聚类和分类方法。
Brief Bioinform. 2020 Jul 15;21(4):1196-1208. doi: 10.1093/bib/bbz062.
9
Benchmarking single cell RNA-sequencing analysis pipelines using mixture control experiments.使用混合对照实验对标单细胞 RNA 测序分析流程。
Nat Methods. 2019 Jun;16(6):479-487. doi: 10.1038/s41592-019-0425-8. Epub 2019 May 27.
10
PanglaoDB: a web server for exploration of mouse and human single-cell RNA sequencing data.PanglaoDB:一个用于探索小鼠和人类单细胞 RNA 测序数据的网络服务器。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz046.