• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

IDclust:用于通过单细胞转录组学和表观基因组学无监督识别细胞类型的迭代聚类法

IDclust: Iterative clustering for unsupervised identification of cell types with single cell transcriptomics and epigenomics.

作者信息

Prompsy Pacôme, Saichi Mélissa, Raimundo Félix, Vallot Céline

机构信息

CNRS UMR3244, Institut Curie, PSL Research University, 26 rue d'Ulm, 75005 Paris, France.

Department of Translational Research, Institut Curie, PSL Research University, 26 rue d'Ulm, 75005 Paris, France.

出版信息

NAR Genom Bioinform. 2024 Dec 18;6(4):lqae174. doi: 10.1093/nargab/lqae174. eCollection 2024 Dec.

DOI:10.1093/nargab/lqae174
PMID:39703425
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11655290/
Abstract

The increasing diversity of single-cell datasets require systematic cell type characterization. Clustering is a critical step in single-cell analysis, heavily influencing downstream analyses. However, current unsupervised clustering algorithms rely on biologically irrelevant parameters that require manual optimization and fail to capture hierarchical relationships between clusters. We developed IDclust, a framework that identifies clusters with significant biological features at multiple resolutions using biologically meaningful thresholds like fold change, adjusted -value and fraction of expressing cells. By iteratively processing and clustering subsets of the dataset, IDclust guarantees that all clusters found have significantly different features and stops only when no more interpretable cluster is found. It also creates a hierarchy of clusters, enabling visualization of the hierarchical relationships between different clusters. Analyzing multiple single-cell transcriptomic reference datasets, IDclust achieves superior clustering accuracy compared to state of the art algorithms. We showcase its utility by identifying previously unannotated clusters and identifying branching patterns in scATAC datasets. Using it's unsupervised nature and ability to analyze different -omics, we compare the resolution of different histone marks in multi-omic paired-tag dataset. Overall, IDclust automates single-cell exploration, facilitates cell type annotation and provides a biologically interpretable basis for clustering.

摘要

单细胞数据集日益增加的多样性需要系统的细胞类型表征。聚类是单细胞分析中的关键步骤,对下游分析有重大影响。然而,当前的无监督聚类算法依赖于生物学上不相关的参数,这些参数需要手动优化,并且无法捕捉聚类之间的层次关系。我们开发了IDclust,这是一个框架,它使用诸如倍数变化、调整后的 值和表达细胞分数等生物学上有意义的阈值,在多个分辨率下识别具有显著生物学特征的聚类。通过迭代处理和聚类数据集的子集,IDclust保证找到的所有聚类都有显著不同的特征,并且只有在找不到更多可解释的聚类时才停止。它还创建了聚类层次结构,能够可视化不同聚类之间的层次关系。通过分析多个单细胞转录组参考数据集,与现有算法相比,IDclust实现了更高的聚类准确性。我们通过识别以前未注释的聚类和识别scATAC数据集中的分支模式来展示其效用。利用其无监督性质和分析不同组学的能力,我们在多组学配对标签数据集中比较了不同组蛋白标记的分辨率。总体而言,IDclust使单细胞探索自动化,便于细胞类型注释,并为聚类提供生物学上可解释的基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db0a/11655290/1299d8280219/lqae174fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db0a/11655290/90e5baaa15ba/lqae174figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db0a/11655290/7d1440341061/lqae174fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db0a/11655290/5eb8d68427b5/lqae174fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db0a/11655290/03676eeb8738/lqae174fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db0a/11655290/d08588b3f21c/lqae174fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db0a/11655290/1299d8280219/lqae174fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db0a/11655290/90e5baaa15ba/lqae174figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db0a/11655290/7d1440341061/lqae174fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db0a/11655290/5eb8d68427b5/lqae174fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db0a/11655290/03676eeb8738/lqae174fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db0a/11655290/d08588b3f21c/lqae174fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db0a/11655290/1299d8280219/lqae174fig5.jpg

相似文献

1
IDclust: Iterative clustering for unsupervised identification of cell types with single cell transcriptomics and epigenomics.IDclust:用于通过单细胞转录组学和表观基因组学无监督识别细胞类型的迭代聚类法
NAR Genom Bioinform. 2024 Dec 18;6(4):lqae174. doi: 10.1093/nargab/lqae174. eCollection 2024 Dec.
2
scMNMF: a novel method for single-cell multi-omics clustering based on matrix factorization.scMNMF:一种基于矩阵分解的单细胞多组学聚类新方法。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae228.
3
scMUSCL: multi-source transfer learning for clustering scRNA-seq data.scMUSCL:用于单细胞RNA测序数据聚类的多源迁移学习
Bioinformatics. 2025 May 6;41(5). doi: 10.1093/bioinformatics/btaf137.
4
Interpretable multi-omics integration with UMAP embeddings and density-based clustering.基于UMAP嵌入和密度聚类的可解释多组学整合
bioRxiv. 2024 Oct 11:2024.10.07.617035. doi: 10.1101/2024.10.07.617035.
5
CHOIR improves significance-based detection of cell types and states from single-cell data.CHOIR可提高基于显著性的单细胞数据中细胞类型和状态的检测能力。
bioRxiv. 2025 Feb 19:2024.01.18.576317. doi: 10.1101/2024.01.18.576317.
6
An interpretable framework for clustering single-cell RNA-Seq datasets.用于聚类单细胞 RNA-Seq 数据集的可解释框架。
BMC Bioinformatics. 2018 Mar 9;19(1):93. doi: 10.1186/s12859-018-2092-7.
7
scDRMAE: integrating masked autoencoder with residual attention networks to leverage omics feature dependencies for accurate cell clustering.scDRMAE:集成掩蔽自动编码器和残差注意力网络,利用组学特征依赖性进行准确的细胞聚类。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae599.
8
scSemiAAE: a semi-supervised clustering model for single-cell RNA-seq data.scSemiAAE:一种用于单细胞 RNA-seq 数据的半监督聚类模型。
BMC Bioinformatics. 2023 May 26;24(1):217. doi: 10.1186/s12859-023-05339-4.
9
Improving replicability in single-cell RNA-Seq cell type discovery with Dune.利用 Dune 提高单细胞 RNA-Seq 细胞类型发现的可重复性。
BMC Bioinformatics. 2024 May 24;25(1):198. doi: 10.1186/s12859-024-05814-6.
10
clusterMaker2: a major update to clusterMaker, a multi-algorithm clustering app for Cytoscape.clusterMaker2:clusterMaker 的一个主要更新,clusterMaker 是 Cytoscape 的一个多算法聚类应用程序。
BMC Bioinformatics. 2023 Apr 5;24(1):134. doi: 10.1186/s12859-023-05225-z.

本文引用的文献

1
A benchmark of computational pipelines for single-cell histone modification data.单细胞组蛋白修饰数据计算流程基准测试。
Genome Biol. 2023 Jun 20;24(1):143. doi: 10.1186/s13059-023-02981-2.
2
SEACells infers transcriptional and epigenomic cellular states from single-cell genomics data.SEACells 从单细胞基因组学数据推断转录和表观基因组细胞状态。
Nat Biotechnol. 2023 Dec;41(12):1746-1757. doi: 10.1038/s41587-023-01716-9. Epub 2023 Mar 27.
3
The Tabula Sapiens: A multiple-organ, single-cell transcriptomic atlas of humans.智慧人图谱:人类多器官单细胞转录组图谱。
Science. 2022 May 13;376(6594):eabl4896. doi: 10.1126/science.abl4896.
4
Metacell-2: a divide-and-conquer metacell algorithm for scalable scRNA-seq analysis.Metacell-2:一种用于可扩展 scRNA-seq 分析的分而治之的元细胞算法。
Genome Biol. 2022 Apr 19;23(1):100. doi: 10.1186/s13059-022-02667-1.
5
Benchmarking clustering algorithms on estimating the number of cell types from single-cell RNA-sequencing data.基于单细胞 RNA 测序数据评估细胞类型数量的聚类算法基准测试。
Genome Biol. 2022 Feb 8;23(1):49. doi: 10.1186/s13059-022-02622-0.
6
Confronting false discoveries in single-cell differential expression.单细胞差异表达中虚假发现的应对策略。
Nat Commun. 2021 Sep 28;12(1):5692. doi: 10.1038/s41467-021-25960-2.
7
MultiK: an automated tool to determine optimal cluster numbers in single-cell RNA sequencing data.MultiK:一种用于确定单细胞 RNA 测序数据中最佳聚类数目的自动化工具。
Genome Biol. 2021 Aug 19;22(1):232. doi: 10.1186/s13059-021-02445-5.
8
Integrated analysis of multimodal single-cell data.多模态单细胞数据的综合分析。
Cell. 2021 Jun 24;184(13):3573-3587.e29. doi: 10.1016/j.cell.2021.04.048. Epub 2021 May 31.
9
Joint profiling of histone modifications and transcriptome in single cells from mouse brain.单细胞中小鼠脑内组蛋白修饰与转录组的联合分析。
Nat Methods. 2021 Mar;18(3):283-292. doi: 10.1038/s41592-021-01060-3. Epub 2021 Feb 15.
10
Interactive analysis of single-cell epigenomic landscapes with ChromSCape.使用 ChromSCape 进行单细胞表观基因组景观的交互式分析。
Nat Commun. 2020 Nov 11;11(1):5702. doi: 10.1038/s41467-020-19542-x.