• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单细胞自我定标聚类与转录组测序数据。

Single Cell Self-Paced Clustering with Transcriptome Sequencing Data.

机构信息

School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China.

School of Computer Science and Technology, Harbin Institute of Technology Shenzhen, Shenzhen 518055, China.

出版信息

Int J Mol Sci. 2022 Mar 31;23(7):3900. doi: 10.3390/ijms23073900.

DOI:10.3390/ijms23073900
PMID:35409258
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8999118/
Abstract

Single cell RNA sequencing (scRNA-seq) allows researchers to explore tissue heterogeneity, distinguish unusual cell identities, and find novel cellular subtypes by providing transcriptome profiling for individual cells. Clustering analysis is usually used to predict cell class assignments and infer cell identities. However, the performance of existing single-cell clustering methods is extremely sensitive to the presence of noise data and outliers. Existing clustering algorithms can easily fall into local optimal solutions. There is still no consensus on the best performing method. To address this issue, we introduce a single cell self-paced clustering (scSPaC) method with F-norm based nonnegative matrix factorization (NMF) for scRNA-seq data and a sparse single cell self-paced clustering (sscSPaC) method with l21-norm based nonnegative matrix factorization for scRNA-seq data. We gradually add single cells from simple to complex to our model until all cells are selected. In this way, the influences of noisy data and outliers can be significantly reduced. The proposed method achieved the best performance on both simulation data and real scRNA-seq data. A case study about human clara cells and ependymal cells scRNA-seq data clustering shows that scSPaC is more advantageous near the clustering dividing line.

摘要

单细胞 RNA 测序 (scRNA-seq) 通过为单个细胞提供转录组谱分析,允许研究人员探索组织异质性、区分异常细胞身份和发现新的细胞亚型。聚类分析通常用于预测细胞类别分配和推断细胞身份。然而,现有的单细胞聚类方法的性能对噪声数据和异常值的存在非常敏感。现有的聚类算法很容易陷入局部最优解。目前还没有关于表现最好的方法的共识。为了解决这个问题,我们引入了一种基于 F 范数的基于非负矩阵分解 (NMF) 的单细胞自定步聚类 (scSPaC) 方法和一种基于 l21 范数的基于非负矩阵分解的稀疏单细胞自定步聚类 (sscSPaC) 方法,用于 scRNA-seq 数据。我们逐渐将单细胞从简单添加到复杂,直到选择所有细胞。通过这种方式,可以显著降低噪声数据和异常值的影响。该方法在模拟数据和真实 scRNA-seq 数据上均取得了最佳性能。一项关于人克拉拉细胞和室管膜细胞 scRNA-seq 数据聚类的案例研究表明,scSPaC 在聚类分界线附近更具优势。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f0d/8999118/855d5bd65ba4/ijms-23-03900-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f0d/8999118/5b0b0535ea4f/ijms-23-03900-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f0d/8999118/e42fb06bf55b/ijms-23-03900-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f0d/8999118/93ffa81dfe87/ijms-23-03900-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f0d/8999118/855d5bd65ba4/ijms-23-03900-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f0d/8999118/5b0b0535ea4f/ijms-23-03900-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f0d/8999118/e42fb06bf55b/ijms-23-03900-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f0d/8999118/93ffa81dfe87/ijms-23-03900-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f0d/8999118/855d5bd65ba4/ijms-23-03900-g004.jpg

相似文献

1
Single Cell Self-Paced Clustering with Transcriptome Sequencing Data.单细胞自我定标聚类与转录组测序数据。
Int J Mol Sci. 2022 Mar 31;23(7):3900. doi: 10.3390/ijms23073900.
2
JLONMFSC: Clustering scRNA-seq data based on joint learning of non-negative matrix factorization and subspace clustering.JLONMFSC:基于非负矩阵分解和子空间聚类联合学习的 scRNA-seq 数据聚类。
Methods. 2024 Feb;222:1-9. doi: 10.1016/j.ymeth.2023.11.019. Epub 2023 Dec 19.
3
Deep enhanced constraint clustering based on contrastive learning for scRNA-seq data.基于对比学习的深度增强约束聚类算法在单细胞 RNA-seq 数据分析中的应用。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad222.
4
scDSSC: Deep Sparse Subspace Clustering for scRNA-seq Data.scDSSC:用于 scRNA-seq 数据的深度稀疏子空间聚类。
PLoS Comput Biol. 2022 Dec 19;18(12):e1010772. doi: 10.1371/journal.pcbi.1010772. eCollection 2022 Dec.
5
A Personalized Low-Rank Subspace Clustering Method Based on Locality and Similarity Constraints for scRNA-seq Data Analysis.基于局部性和相似性约束的个性化低秩子空间聚类方法在 scRNA-seq 数据分析中的应用。
IEEE J Biomed Health Inform. 2023 May;27(5):2575-2584. doi: 10.1109/JBHI.2023.3247723. Epub 2023 May 4.
6
Dimension Reduction and Clustering Models for Single-Cell RNA Sequencing Data: A Comparative Study.降维与聚类模型在单细胞 RNA 测序数据中的应用:一项比较研究。
Int J Mol Sci. 2020 Mar 22;21(6):2181. doi: 10.3390/ijms21062181.
7
Multi-View Clustering With Graph Learning for scRNA-Seq Data.基于图学习的 scRNA-Seq 数据的多视图聚类。
IEEE/ACM Trans Comput Biol Bioinform. 2023 Nov-Dec;20(6):3535-3546. doi: 10.1109/TCBB.2023.3298334. Epub 2023 Dec 25.
8
SSNMDI: a novel joint learning model of semi-supervised non-negative matrix factorization and data imputation for clustering of single-cell RNA-seq data.SSNMDI:一种用于单细胞 RNA-seq 数据聚类的半监督非负矩阵分解和数据插补的新型联合学习模型。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad149.
9
Machine learning and statistical methods for clustering single-cell RNA-sequencing data.机器学习和统计方法在单细胞 RNA 测序数据分析中的应用。
Brief Bioinform. 2020 Jul 15;21(4):1209-1223. doi: 10.1093/bib/bbz063.
10
Single-cell data clustering based on sparse optimization and low-rank matrix factorization.基于稀疏优化和低秩矩阵分解的单细胞数据聚类。
G3 (Bethesda). 2021 Jun 17;11(6). doi: 10.1093/g3journal/jkab098.

引用本文的文献

1
Identification of kidney cell types in scRNA-seq and snRNA-seq data using machine learning algorithms.使用机器学习算法在单细胞RNA测序(scRNA-seq)和单细胞核RNA测序(snRNA-seq)数据中识别肾细胞类型。
Heliyon. 2024 Sep 27;10(19):e38567. doi: 10.1016/j.heliyon.2024.e38567. eCollection 2024 Oct 15.

本文引用的文献

1
Single-cell data clustering based on sparse optimization and low-rank matrix factorization.基于稀疏优化和低秩矩阵分解的单细胞数据聚类。
G3 (Bethesda). 2021 Jun 17;11(6). doi: 10.1093/g3journal/jkab098.
2
Dual self-paced multi-view clustering.双自步多视图聚类。
Neural Netw. 2021 Aug;140:184-192. doi: 10.1016/j.neunet.2021.02.022. Epub 2021 Mar 6.
3
SARS-CoV-2 Infection of Pluripotent Stem Cell-Derived Human Lung Alveolar Type 2 Cells Elicits a Rapid Epithelial-Intrinsic Inflammatory Response.SARS-CoV-2 感染多能干细胞衍生的人肺Ⅱ型肺泡细胞会引发快速的上皮细胞固有炎症反应。
Cell Stem Cell. 2020 Dec 3;27(6):962-973.e7. doi: 10.1016/j.stem.2020.09.013. Epub 2020 Sep 18.
4
Cumulus provides cloud-based data analysis for large-scale single-cell and single-nucleus RNA-seq.Cumulus 为大规模单细胞和单细胞核 RNA-seq 提供基于云的数据分析。
Nat Methods. 2020 Aug;17(8):793-798. doi: 10.1038/s41592-020-0905-x. Epub 2020 Jul 27.
5
Using transfer learning from prior reference knowledge to improve the clustering of single-cell RNA-Seq data.利用先验参考知识的迁移学习来改进单细胞 RNA-Seq 数据的聚类。
Sci Rep. 2019 Dec 30;9(1):20353. doi: 10.1038/s41598-019-56911-z.
6
High-throughput sequencing of the transcriptome and chromatin accessibility in the same cell.同一细胞中转录组和染色质可及性的高通量测序。
Nat Biotechnol. 2019 Dec;37(12):1452-1457. doi: 10.1038/s41587-019-0290-0. Epub 2019 Oct 14.
7
Clustering and classification methods for single-cell RNA-sequencing data.单细胞 RNA 测序数据的聚类和分类方法。
Brief Bioinform. 2020 Jul 15;21(4):1196-1208. doi: 10.1093/bib/bbz062.
8
Benchmarking single cell RNA-sequencing analysis pipelines using mixture control experiments.使用混合对照实验对标单细胞 RNA 测序分析流程。
Nat Methods. 2019 Jun;16(6):479-487. doi: 10.1038/s41592-019-0425-8. Epub 2019 May 27.
9
PanglaoDB: a web server for exploration of mouse and human single-cell RNA sequencing data.PanglaoDB:一个用于探索小鼠和人类单细胞 RNA 测序数据的网络服务器。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz046.
10
Semisoft clustering of single-cell data.单细胞数据的半软聚类。
Proc Natl Acad Sci U S A. 2019 Jan 8;116(2):466-471. doi: 10.1073/pnas.1817715116. Epub 2018 Dec 26.