• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Shared Differential Expression-Based Distance Reflects Global Cell Type Relationships in Single-Cell RNA Sequencing Data.基于共享差异表达的距离反映了单细胞 RNA 测序数据中的全局细胞类型关系。
J Comput Biol. 2022 Aug;29(8):867-879. doi: 10.1089/cmb.2021.0652. Epub 2022 Jul 6.
2
A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa.一种用于隐性营养不良型大疱性表皮松解症的单细胞 RNA-seq 分析的多任务聚类方法。
PLoS Comput Biol. 2018 Apr 9;14(4):e1006053. doi: 10.1371/journal.pcbi.1006053. eCollection 2018 Apr.
3
Single-cell data clustering based on sparse optimization and low-rank matrix factorization.基于稀疏优化和低秩矩阵分解的单细胞数据聚类。
G3 (Bethesda). 2021 Jun 17;11(6). doi: 10.1093/g3journal/jkab098.
4
Spectral clustering based on learning similarity matrix.基于学习相似性矩阵的谱聚类。
Bioinformatics. 2018 Jun 15;34(12):2069-2076. doi: 10.1093/bioinformatics/bty050.
5
Identifying cell states in single-cell RNA-seq data at statistically maximal resolution.以统计学上最大分辨率识别单细胞 RNA-seq 数据中的细胞状态。
PLoS Comput Biol. 2024 Jul 12;20(7):e1012224. doi: 10.1371/journal.pcbi.1012224. eCollection 2024 Jul.
6
A Personalized Low-Rank Subspace Clustering Method Based on Locality and Similarity Constraints for scRNA-seq Data Analysis.基于局部性和相似性约束的个性化低秩子空间聚类方法在 scRNA-seq 数据分析中的应用。
IEEE J Biomed Health Inform. 2023 May;27(5):2575-2584. doi: 10.1109/JBHI.2023.3247723. Epub 2023 May 4.
7
SIDEseq: A Cell Similarity Measure Defined by Shared Identified Differentially Expressed Genes for Single-Cell RNA sequencing Data.SIDEseq:一种由单细胞RNA测序数据中共享的已识别差异表达基因定义的细胞相似性度量方法。
Stat Biosci. 2017 Jun;9(1):200-216. doi: 10.1007/s12561-017-9194-z. Epub 2017 May 17.
8
A clustering method for small scRNA-seq data based on subspace and weighted distance.基于子空间和加权距离的小 scRNA-seq 数据聚类方法。
PeerJ. 2023 Jan 23;11:e14706. doi: 10.7717/peerj.14706. eCollection 2023.
9
scNAME: neighborhood contrastive clustering with ancillary mask estimation for scRNA-seq data.scNAME:基于辅助掩模估计的 scRNA-seq 数据邻域对比聚类。
Bioinformatics. 2022 Mar 4;38(6):1575-1583. doi: 10.1093/bioinformatics/btac011.
10
Machine learning and statistical methods for clustering single-cell RNA-sequencing data.机器学习和统计方法在单细胞 RNA 测序数据分析中的应用。
Brief Bioinform. 2020 Jul 15;21(4):1209-1223. doi: 10.1093/bib/bbz063.

本文引用的文献

1
Integrated analysis of multimodal single-cell data.多模态单细胞数据的综合分析。
Cell. 2021 Jun 24;184(13):3573-3587.e29. doi: 10.1016/j.cell.2021.04.048. Epub 2021 May 31.
2
Model-based deep embedding for constrained clustering analysis of single cell RNA-seq data.基于模型的深度学习嵌入方法用于单细胞 RNA-seq 数据的约束聚类分析。
Nat Commun. 2021 Mar 25;12(1):1873. doi: 10.1038/s41467-021-22008-3.
3
SSRE: Cell Type Detection Based on Sparse Subspace Representation and Similarity Enhancement.SSRE:基于稀疏子空间表示和相似度增强的细胞类型检测。
Genomics Proteomics Bioinformatics. 2021 Apr;19(2):282-291. doi: 10.1016/j.gpb.2020.09.004. Epub 2021 Feb 27.
4
Initialization is critical for preserving global data structure in both t-SNE and UMAP.初始化对于在t-SNE和UMAP中保存全局数据结构至关重要。
Nat Biotechnol. 2021 Feb;39(2):156-157. doi: 10.1038/s41587-020-00809-z. Epub 2021 Feb 1.
5
ScGSLC: An unsupervised graph similarity learning framework for single-cell RNA-seq data clustering.ScGSLC:一种用于单细胞 RNA-seq 数据聚类的无监督图相似性学习框架。
Comput Biol Chem. 2021 Feb;90:107415. doi: 10.1016/j.compbiolchem.2020.107415. Epub 2020 Nov 18.
6
Benchmarking single-cell RNA-sequencing protocols for cell atlas projects.单细胞 RNA 测序技术在细胞图谱项目中的基准测试。
Nat Biotechnol. 2020 Jun;38(6):747-755. doi: 10.1038/s41587-020-0469-4. Epub 2020 Apr 6.
7
Systematic comparison of single-cell and single-nucleus RNA-sequencing methods.单细胞和单细胞核 RNA 测序方法的系统比较。
Nat Biotechnol. 2020 Jun;38(6):737-746. doi: 10.1038/s41587-020-0465-8. Epub 2020 Apr 6.
8
Diversity, Mechanisms, and Significance of Macrophage Plasticity.巨噬细胞可塑性的多样性、机制及意义。
Annu Rev Pathol. 2020 Jan 24;15:123-147. doi: 10.1146/annurev-pathmechdis-012418-012718. Epub 2019 Sep 17.
9
GeneFishing to reconstruct context specific portraits of biological processes.基因钓取重建生物过程的上下文特异图谱。
Proc Natl Acad Sci U S A. 2019 Sep 17;116(38):18943-18950. doi: 10.1073/pnas.1820340116. Epub 2019 Sep 4.
10
The single-cell sequencing: new developments and medical applications.单细胞测序:新进展与医学应用
Cell Biosci. 2019 Jun 26;9:53. doi: 10.1186/s13578-019-0314-y. eCollection 2019.

基于共享差异表达的距离反映了单细胞 RNA 测序数据中的全局细胞类型关系。

Shared Differential Expression-Based Distance Reflects Global Cell Type Relationships in Single-Cell RNA Sequencing Data.

机构信息

Division of Biostatistics and Department of Statistics,Berkeley, Berkeley, California, USA.

Division of Biostatistics University of California, Berkeley, Berkeley, California, USA.

出版信息

J Comput Biol. 2022 Aug;29(8):867-879. doi: 10.1089/cmb.2021.0652. Epub 2022 Jul 6.

DOI:10.1089/cmb.2021.0652
PMID:35793527
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9419948/
Abstract

Unsupervised cell clustering on the basis of meaningful biological variation in single-cell RNA sequencing (scRNA seq) data has received significant attention, as it assists with ontological subpopulation identification among the data. A key step in the clustering process is to compute distances between the cells under a specified distance measure. Although particular distance measures may successfully separate cells into biologically relevant clusters, they may fail to retain global structure of the data, such as relative similarity between the cell clusters. In this article, we modify a biologically motivated distance measure, SIDEseq, for use of aggregate comparisons of cell types in large single-cell assays, and demonstrate that, across simulated and real scRNA seq data, the distance matrix more consistently retains global cell type relationships than commonly used distance measures for scRNA seq clustering. We call the modified distance measure "SIDEREF." We explore spectral dimension reduction of the SIDEREF distance matrix as a means of noise filtering, similar to principal components analysis applied directly to expression data. We utilize a summary measure of relative cell type distances to better display the cell group relationships. SIDEREF visualizations more consistently reflect global structures in the data than other commonly considered distance measures. We utilize relative cell type distances and the SIDEREF distance measure to uncover compositional differences between annotated leukocyte cell groups in a compendium of scRNA seq assays comprising 12 tissues. SIDEREF and associated analysis is openly available on GitHub.

摘要

基于单细胞 RNA 测序 (scRNA seq) 数据中有意义的生物学变异对细胞进行无监督聚类受到了广泛关注,因为它有助于在数据中识别生物学亚群。聚类过程的关键步骤是在指定的距离度量下计算细胞之间的距离。虽然特定的距离度量可以成功地将细胞分为生物学上相关的簇,但它们可能无法保留数据的全局结构,例如细胞簇之间的相对相似性。在本文中,我们修改了一种基于生物学的距离度量 SIDEseq,用于在大型单细胞测定中对细胞类型进行综合比较,并证明在模拟和真实的 scRNA seq 数据中,距离矩阵比常用于 scRNA seq 聚类的距离度量更能一致地保留全局细胞类型关系。我们将修改后的距离度量称为“SIDEREF”。我们探索了 SIDEREF 距离矩阵的谱维数约简作为一种降噪方法,类似于直接应用于表达数据的主成分分析。我们利用相对细胞类型距离的综合度量来更好地显示细胞群关系。SIDEREF 可视化比其他常用的距离度量更能一致地反映数据中的全局结构。我们利用相对细胞类型距离和 SIDEREF 距离度量来揭示注释白细胞细胞群在包含 12 种组织的 scRNA seq 测定综合集中的组成差异。SIDEREF 及其相关分析在 GitHub 上公开可用。