• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过非负矩阵分解检测单细胞RNA测序数据中的异质性。

Detecting heterogeneity in single-cell RNA-Seq data by non-negative matrix factorization.

作者信息

Zhu Xun, Ching Travers, Pan Xinghua, Weissman Sherman M, Garmire Lana

机构信息

Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI, United States; Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, United States.

Department of Genetics, Yale University , New Haven , CT , United States.

出版信息

PeerJ. 2017 Jan 19;5:e2888. doi: 10.7717/peerj.2888. eCollection 2017.

DOI:10.7717/peerj.2888
PMID:28133571
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5251935/
Abstract

Single-cell RNA-Sequencing (scRNA-Seq) is a fast-evolving technology that enables the understanding of biological processes at an unprecedentedly high resolution. However, well-suited bioinformatics tools to analyze the data generated from this new technology are still lacking. Here we investigate the performance of non-negative matrix factorization (NMF) method to analyze a wide variety of scRNA-Seq datasets, ranging from mouse hematopoietic stem cells to human glioblastoma data. In comparison to other unsupervised clustering methods including K-means and hierarchical clustering, NMF has higher accuracy in separating similar groups in various datasets. We ranked genes by their importance scores (-scores) in separating these groups, and discovered that NMF uniquely identifies genes expressed at intermediate levels as top-ranked genes. Finally, we show that in conjugation with the modularity detection method FEM, NMF reveals meaningful protein-protein interaction modules. In summary, we propose that NMF is a desirable method to analyze heterogeneous single-cell RNA-Seq data. The NMF based subpopulation detection package is available at: https://github.com/lanagarmire/NMFEM.

摘要

单细胞RNA测序(scRNA-Seq)是一项快速发展的技术,能够以前所未有的高分辨率理解生物过程。然而,仍缺乏适用于分析这项新技术所产生数据的生物信息学工具。在此,我们研究非负矩阵分解(NMF)方法在分析各种scRNA-Seq数据集时的性能,这些数据集涵盖从小鼠造血干细胞到人类胶质母细胞瘤数据等。与包括K均值和层次聚类在内的其他无监督聚类方法相比,NMF在分离各种数据集中的相似组时具有更高的准确性。我们根据基因在分离这些组中的重要性得分(-得分)对基因进行排序,发现NMF独特地将中等水平表达的基因识别为排名靠前的基因。最后,我们表明,与模块性检测方法FEM结合时,NMF能揭示有意义的蛋白质-蛋白质相互作用模块。总之,我们提出NMF是分析异质单细胞RNA-Seq数据的理想方法。基于NMF的亚群检测软件包可在以下网址获取:https://github.com/lanagarmire/NMFEM 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de93/5251935/c11ddffc4ca9/peerj-05-2888-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de93/5251935/48786d975deb/peerj-05-2888-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de93/5251935/4c70b19b49be/peerj-05-2888-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de93/5251935/21353a8da706/peerj-05-2888-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de93/5251935/c11ddffc4ca9/peerj-05-2888-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de93/5251935/48786d975deb/peerj-05-2888-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de93/5251935/4c70b19b49be/peerj-05-2888-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de93/5251935/21353a8da706/peerj-05-2888-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de93/5251935/c11ddffc4ca9/peerj-05-2888-g004.jpg

相似文献

1
Detecting heterogeneity in single-cell RNA-Seq data by non-negative matrix factorization.通过非负矩阵分解检测单细胞RNA测序数据中的异质性。
PeerJ. 2017 Jan 19;5:e2888. doi: 10.7717/peerj.2888. eCollection 2017.
2
A robust semi-supervised NMF model for single cell RNA-seq data.一种用于单细胞RNA测序数据的强大半监督非负矩阵分解模型。
PeerJ. 2020 Oct 16;8:e10091. doi: 10.7717/peerj.10091. eCollection 2020.
3
Unsupervised Cluster Analysis and Gene Marker Extraction of scRNA-seq Data Based On Non-Negative Matrix Factorization.基于非负矩阵分解的 scRNA-seq 数据无监督聚类分析和基因标志物提取。
IEEE J Biomed Health Inform. 2022 Jan;26(1):458-467. doi: 10.1109/JBHI.2021.3091506. Epub 2022 Jan 17.
4
Joint learning dimension reduction and clustering of single-cell RNA-sequencing data.单细胞 RNA 测序数据的联合降维和聚类学习。
Bioinformatics. 2020 Jun 1;36(12):3825-3832. doi: 10.1093/bioinformatics/btaa231.
5
NMF Clustering: Accessible NMF-based Clustering Utilizing GPU Acceleration.非负矩阵分解聚类:利用GPU加速的基于非负矩阵分解的可访问聚类。
J Bioinform Syst Biol. 2023;6(4):379-383. doi: 10.26502/jbsb.5107072. Epub 2023 Dec 21.
6
Robust classification of single-cell transcriptome data by nonnegative matrix factorization.通过非负矩阵分解对单细胞转录组数据进行稳健分类。
Bioinformatics. 2017 Jan 15;33(2):235-242. doi: 10.1093/bioinformatics/btw607. Epub 2016 Sep 23.
7
SC-JNMF: single-cell clustering integrating multiple quantification methods based on joint non-negative matrix factorization.SC-JNMF:基于联合非负矩阵分解整合多种量化方法的单细胞聚类
PeerJ. 2021 Aug 27;9:e12087. doi: 10.7717/peerj.12087. eCollection 2021.
8
SinNLRR: a robust subspace clustering method for cell type detection by non-negative and low-rank representation.SinNLRR:一种基于非负低秩表示的稳健子空间聚类方法,用于细胞类型检测。
Bioinformatics. 2019 Oct 1;35(19):3642-3650. doi: 10.1093/bioinformatics/btz139.
9
Imputation for Single-cell RNA-seq Data with Non-negative Matrix Factorization and Transfer Learning.基于非负矩阵分解和迁移学习的单细胞 RNA-seq 数据插补。
J Bioinform Comput Biol. 2023 Dec;21(6):2350029. doi: 10.1142/S0219720023500294. Epub 2024 Jan 23.
10
NMFClustering: Accessible NMF-based clustering utilizing GPU acceleration.非负矩阵分解聚类:利用GPU加速的基于非负矩阵分解的可访问聚类。
bioRxiv. 2023 Jun 27:2023.06.16.545370. doi: 10.1101/2023.06.16.545370.

引用本文的文献

1
Identifying cancer cells from calling single-nucleotide variants in scRNA-seq data.从 scRNA-seq 数据中识别单核苷酸变异的癌细胞。
Bioinformatics. 2024 Sep 2;40(9). doi: 10.1093/bioinformatics/btae512.
2
Interface-guided phenotyping of coding variants in the transcription factor RUNX1.转录因子 RUNX1 中编码变异的接口引导表型分析。
Cell Rep. 2024 Jul 23;43(7):114436. doi: 10.1016/j.celrep.2024.114436. Epub 2024 Jul 4.
3
Interpretable online network dictionary learning for inferring long-range chromatin interactions.

本文引用的文献

1
Batch effects and the effective design of single-cell gene expression studies.批次效应与单细胞基因表达研究的有效设计。
Sci Rep. 2017 Jan 3;7:39921. doi: 10.1038/srep39921.
2
Single-Cell Transcriptomics Bioinformatics and Computational Challenges.单细胞转录组学的生物信息学与计算挑战
Front Genet. 2016 Sep 21;7:163. doi: 10.3389/fgene.2016.00163. eCollection 2016.
3
Comprehensive analysis of differentially expressed profiles of lncRNAs and circRNAs with associated co-expression and ceRNA networks in bladder carcinoma.
可解释的在线网络字典学习用于推断长程染色质相互作用。
PLoS Comput Biol. 2024 May 16;20(5):e1012095. doi: 10.1371/journal.pcbi.1012095. eCollection 2024 May.
4
Multiscale differential geometry learning of networks with applications to single-cell RNA sequencing data.基于网络的多尺度微分几何学习及其在单细胞 RNA 测序数据分析中的应用。
Comput Biol Med. 2024 Mar;171:108211. doi: 10.1016/j.compbiomed.2024.108211. Epub 2024 Feb 28.
5
Inferring cellular and molecular processes in single-cell data with non-negative matrix factorization using Python, R and GenePattern Notebook implementations of CoGAPS.使用 Python、R 和 GenePattern Notebook 实现的 CoGAPS 中的非负矩阵分解,推断单细胞数据中的细胞和分子过程。
Nat Protoc. 2023 Dec;18(12):3690-3731. doi: 10.1038/s41596-023-00892-x. Epub 2023 Nov 21.
6
A new Bayesian factor analysis method improves detection of genes and biological processes affected by perturbations in single-cell CRISPR screening.一种新的贝叶斯因子分析方法提高了单细胞 CRISPR 筛选中扰动影响的基因和生物过程的检测能力。
Nat Methods. 2023 Nov;20(11):1693-1703. doi: 10.1038/s41592-023-02017-4. Epub 2023 Sep 28.
7
scGEM: Unveiling the Nested Tree-Structured Gene Co-Expressing Modules in Single Cell Transcriptome Data.scGEM:揭示单细胞转录组数据中嵌套树状结构的基因共表达模块
Cancers (Basel). 2023 Aug 26;15(17):4277. doi: 10.3390/cancers15174277.
8
Learning Individual Survival Models from PanCancer Whole Transcriptome Data.从泛癌全转录组数据中学习个体生存模型。
Clin Cancer Res. 2023 Oct 2;29(19):3924-3936. doi: 10.1158/1078-0432.CCR-22-3493.
9
ASGARD is A Single-cell Guided Pipeline to Aid Repurposing of Drugs.ASGARD 是一个单细胞指导管道,用于帮助药物的再利用。
Nat Commun. 2023 Feb 22;14(1):993. doi: 10.1038/s41467-023-36637-3.
10
Improved downstream functional analysis of single-cell RNA-sequence data using DGAN.基于深度生成对抗网络(DGAN)提高单细胞 RNA-seq 数据下游功能分析。
Sci Rep. 2023 Jan 28;13(1):1618. doi: 10.1038/s41598-023-28952-y.
膀胱癌中lncRNAs和circRNAs差异表达谱及其相关共表达和ceRNA网络的综合分析
Oncotarget. 2016 Jul 26;7(30):47186-47200. doi: 10.18632/oncotarget.9706.
4
Identifications of novel mechanisms in breast cancer cells involving duct-like multicellular spheroid formation after exposure to the Random Positioning Machine.在接触随机定位器后,乳腺癌细胞中涉及管样多细胞球体形成的新机制的鉴定。
Sci Rep. 2016 May 27;6:26887. doi: 10.1038/srep26887.
5
Characterizing transcriptional heterogeneity through pathway and gene set overdispersion analysis.通过通路和基因集过度分散分析来表征转录异质性。
Nat Methods. 2016 Mar;13(3):241-4. doi: 10.1038/nmeth.3734. Epub 2016 Jan 18.
6
The combination of four molecular markers improves thyroid cancer cytologic diagnosis and patient management.四种分子标志物的联合应用可改善甲状腺癌的细胞学诊断及患者管理。
BMC Cancer. 2015 Nov 19;15:918. doi: 10.1186/s12885-015-1917-2.
7
Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization.基于判别式非负矩阵分解的RNA测序数据基因排序
PLoS One. 2015 Sep 8;10(9):e0137782. doi: 10.1371/journal.pone.0137782. eCollection 2015.
8
Malat1 as an evolutionarily conserved lncRNA, plays a positive role in regulating proliferation and maintaining undifferentiated status of early-stage hematopoietic cells.Malat1作为一种进化上保守的长链非编码RNA,在调节早期造血细胞的增殖和维持其未分化状态方面发挥着积极作用。
BMC Genomics. 2015 Sep 3;16(1):676. doi: 10.1186/s12864-015-1881-x.
9
Identification of cDC1- and cDC2-committed DC progenitors reveals early lineage priming at the common DC progenitor stage in the bone marrow.鉴定 cDC1 和 cDC2 定向的 DC 祖细胞揭示了骨髓中共同 DC 祖细胞阶段的早期谱系启动。
Nat Immunol. 2015 Jul;16(7):718-28. doi: 10.1038/ni.3200. Epub 2015 Jun 8.
10
Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets.利用纳升液滴对单个细胞进行高度并行的全基因组表达谱分析。
Cell. 2015 May 21;161(5):1202-1214. doi: 10.1016/j.cell.2015.05.002.