• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 Dune 提高单细胞 RNA-Seq 细胞类型发现的可重复性。

Improving replicability in single-cell RNA-Seq cell type discovery with Dune.

机构信息

Division of Biostatistics, School of Public Health, University of California, Berkeley, CA, USA.

Center for Computational Biology, University of California, Berkeley, CA, USA.

出版信息

BMC Bioinformatics. 2024 May 24;25(1):198. doi: 10.1186/s12859-024-05814-6.

DOI:10.1186/s12859-024-05814-6
PMID:38789920
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11127396/
Abstract

BACKGROUND

Single-cell transcriptome sequencing (scRNA-Seq) has allowed new types of investigations at unprecedented levels of resolution. Among the primary goals of scRNA-Seq is the classification of cells into distinct types. Many approaches build on existing clustering literature to develop tools specific to single-cell. However, almost all of these methods rely on heuristics or user-supplied parameters to control the number of clusters. This affects both the resolution of the clusters within the original dataset as well as their replicability across datasets. While many recommendations exist, in general, there is little assurance that any given set of parameters will represent an optimal choice in the trade-off between cluster resolution and replicability. For instance, another set of parameters may result in more clusters that are also more replicable.

RESULTS

Here, we propose Dune, a new method for optimizing the trade-off between the resolution of the clusters and their replicability. Our method takes as input a set of clustering results-or partitions-on a single dataset and iteratively merges clusters within each partitions in order to maximize their concordance between partitions. As demonstrated on multiple datasets from different platforms, Dune outperforms existing techniques, that rely on hierarchical merging for reducing the number of clusters, in terms of replicability of the resultant merged clusters as well as concordance with ground truth. Dune is available as an R package on Bioconductor: https://www.bioconductor.org/packages/release/bioc/html/Dune.html .

CONCLUSIONS

Cluster refinement by Dune helps improve the robustness of any clustering analysis and reduces the reliance on tuning parameters. This method provides an objective approach for borrowing information across multiple clusterings to generate replicable clusters most likely to represent common biological features across multiple datasets.

摘要

背景

单细胞转录组测序(scRNA-Seq)允许在前所未有的分辨率水平上进行新类型的研究。scRNA-Seq 的主要目标之一是将细胞分类为不同的类型。许多方法基于现有的聚类文献来开发专门针对单细胞的工具。然而,几乎所有这些方法都依赖于启发式或用户提供的参数来控制聚类的数量。这会影响原始数据集中聚类的分辨率以及它们在数据集之间的可重复性。虽然有很多建议,但通常不能保证任何给定的参数集在聚类分辨率和可重复性之间的权衡中代表最佳选择。例如,另一组参数可能会导致更多的聚类,并且这些聚类也更具有可重复性。

结果

在这里,我们提出了 Dune,一种用于优化聚类分辨率与其可重复性之间权衡的新方法。我们的方法以单个数据集上的一组聚类结果或分区作为输入,并在每个分区内迭代地合并聚类,以最大化分区之间的一致性。在来自不同平台的多个数据集上的演示表明,Dune 在可重复性和与真实情况的一致性方面,优于依赖层次聚类来减少聚类数量的现有技术。Dune 可作为 Bioconductor 上的 R 包使用:https://www.bioconductor.org/packages/release/bioc/html/Dune.html。

结论

通过 Dune 进行聚类细化有助于提高任何聚类分析的稳健性,并减少对调整参数的依赖。该方法提供了一种跨多个聚类借用信息的客观方法,以生成最有可能代表多个数据集之间常见生物学特征的可重复聚类。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a63e/11127396/7d92f1e343fe/12859_2024_5814_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a63e/11127396/5f1f1b7f6fa2/12859_2024_5814_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a63e/11127396/ec914881aabb/12859_2024_5814_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a63e/11127396/1cdb6d4269fe/12859_2024_5814_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a63e/11127396/ce16b02df7ca/12859_2024_5814_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a63e/11127396/7d92f1e343fe/12859_2024_5814_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a63e/11127396/5f1f1b7f6fa2/12859_2024_5814_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a63e/11127396/ec914881aabb/12859_2024_5814_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a63e/11127396/1cdb6d4269fe/12859_2024_5814_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a63e/11127396/ce16b02df7ca/12859_2024_5814_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a63e/11127396/7d92f1e343fe/12859_2024_5814_Fig5_HTML.jpg

相似文献

1
Improving replicability in single-cell RNA-Seq cell type discovery with Dune.利用 Dune 提高单细胞 RNA-Seq 细胞类型发现的可重复性。
BMC Bioinformatics. 2024 May 24;25(1):198. doi: 10.1186/s12859-024-05814-6.
2
Identifying cell states in single-cell RNA-seq data at statistically maximal resolution.以统计学上最大分辨率识别单细胞 RNA-seq 数据中的细胞状态。
PLoS Comput Biol. 2024 Jul 12;20(7):e1012224. doi: 10.1371/journal.pcbi.1012224. eCollection 2024 Jul.
3
scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering.scBGEDA:基于双分图集成分聚类的对偶去噪自动编码器的单细胞聚类分析。
Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad075.
4
DIMM-SC: a Dirichlet mixture model for clustering droplet-based single cell transcriptomic data.DIMM-SC:一种基于 Dirichlet 混合模型的用于聚类基于液滴的单细胞转录组学数据的方法。
Bioinformatics. 2018 Jan 1;34(1):139-146. doi: 10.1093/bioinformatics/btx490.
5
A parameter-free deep embedded clustering method for single-cell RNA-seq data.一种无参数深度嵌入聚类方法,用于单细胞 RNA-seq 数据。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac172.
6
A hybrid deep clustering approach for robust cell type profiling using single-cell RNA-seq data.基于单细胞 RNA-seq 数据的混合深度聚类方法进行稳健的细胞类型分析。
RNA. 2020 Oct;26(10):1303-1319. doi: 10.1261/rna.074427.119. Epub 2020 Jun 12.
7
scBubbletree: computational approach for visualization of single cell RNA-seq data.scBubbletree:单细胞 RNA-seq 数据可视化的计算方法。
BMC Bioinformatics. 2024 Sep 13;25(1):302. doi: 10.1186/s12859-024-05927-y.
8
Deep enhanced constraint clustering based on contrastive learning for scRNA-seq data.基于对比学习的深度增强约束聚类算法在单细胞 RNA-seq 数据分析中的应用。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad222.
9
Multi-Objective Optimized Fuzzy Clustering for Detecting Cell Clusters from Single-Cell Expression Profiles.基于单细胞表达谱的多目标优化模糊聚类检测细胞簇。
Genes (Basel). 2019 Aug 13;10(8):611. doi: 10.3390/genes10080611.
10
Clustering scRNA-seq data with the cross-view collaborative information fusion strategy.使用跨视图协同信息融合策略对 scRNA-seq 数据进行聚类。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae511.

引用本文的文献

1
Interpretable unsupervised learning enables accurate clustering with high-throughput imaging flow cytometry.可解释的无监督学习可实现高通量成像流式细胞术的精确聚类。
Sci Rep. 2023 Nov 23;13(1):20533. doi: 10.1038/s41598-023-46782-w.

本文引用的文献

1
scConsensus: combining supervised and unsupervised clustering for cell type identification in single-cell RNA sequencing data.scConsensus:在单细胞 RNA 测序数据中结合监督和无监督聚类进行细胞类型识别。
BMC Bioinformatics. 2021 Apr 12;22(1):186. doi: 10.1186/s12859-021-04028-4.
2
A curated database reveals trends in single-cell transcriptomics.一个经过精心策划的数据库揭示了单细胞转录组学的发展趋势。
Database (Oxford). 2020 Nov 28;2020. doi: 10.1093/database/baaa073.
3
Probabilistic cell-type assignment of single-cell RNA-seq for tumor microenvironment profiling.
单细胞 RNA-seq 数据中肿瘤微环境细胞类型的概率分配。
Nat Methods. 2019 Oct;16(10):1007-1015. doi: 10.1038/s41592-019-0529-1. Epub 2019 Sep 9.
4
Supervised classification enables rapid annotation of cell atlases.监督分类可实现细胞图谱的快速标注。
Nat Methods. 2019 Oct;16(10):983-986. doi: 10.1038/s41592-019-0535-3. Epub 2019 Sep 9.
5
SCINA: A Semi-Supervised Subtyping Algorithm of Single Cells and Bulk Samples.SCINA:一种单细胞和批量样本的半监督亚群算法。
Genes (Basel). 2019 Jul 12;10(7):531. doi: 10.3390/genes10070531.
6
Polled Digital Cell Sorter (p-DCS): Automatic identification of hematological cell types from single cell RNA-sequencing clusters.流式数字细胞分选仪(p-DCS):从单细胞 RNA 测序簇中自动识别血细胞类型。
BMC Bioinformatics. 2019 Jul 1;20(1):369. doi: 10.1186/s12859-019-2951-x.
7
Comprehensive Integration of Single-Cell Data.单细胞数据的综合整合。
Cell. 2019 Jun 13;177(7):1888-1902.e21. doi: 10.1016/j.cell.2019.05.031. Epub 2019 Jun 6.
8
From Louvain to Leiden: guaranteeing well-connected communities.从鲁汶到莱顿:保障互联互通的社区。
Sci Rep. 2019 Mar 26;9(1):5233. doi: 10.1038/s41598-019-41695-z.
9
The single-cell transcriptional landscape of mammalian organogenesis.哺乳动物器官发生的单细胞转录组图谱。
Nature. 2019 Feb;566(7745):496-502. doi: 10.1038/s41586-019-0969-x. Epub 2019 Feb 20.
10
Challenges in unsupervised clustering of single-cell RNA-seq data.无监督单细胞 RNA-seq 数据聚类的挑战。
Nat Rev Genet. 2019 May;20(5):273-282. doi: 10.1038/s41576-018-0088-9.