
Inferring cellular and molecular processes in single-cell data with non-negative matrix factorization using Python, R and GenePattern Notebook implementations of CoGAPS.

Affiliations

Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University, Baltimore, MD, USA.

Convergence Institute, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University, Baltimore, MD, USA.

Publication information

Nat Protoc. 2023 Dec;18(12):3690-3731. doi: 10.1038/s41596-023-00892-x. Epub 2023 Nov 21.

DOI: 10.1038/s41596-023-00892-x
PMID: 37989764
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10961825/
Abstract

Non-negative matrix factorization (NMF) is an unsupervised learning method well suited to high-throughput biology. However, inferring biological processes from an NMF result still requires additional post hoc statistics and annotation to interpret the learned features. Here, we introduce a suite of computational tools that implement NMF and provide methods for accurate and clear biological interpretation and analysis. A generalized discussion of NMF covering its benefits, limitations and open questions is followed by four procedures for the Bayesian NMF algorithm Coordinated Gene Activity across Pattern Subsets (CoGAPS). Each procedure demonstrates NMF analysis to quantify cell state transitions in a public domain single-cell RNA-sequencing dataset. The first demonstrates PyCoGAPS, our new Python implementation that improves runtime for large datasets, and the second allows its deployment in Docker. The third procedure steps through the same single-cell NMF analysis using our R CoGAPS interface. The fourth introduces a beginner-friendly CoGAPS platform using GenePattern Notebook, aimed at users with a working conceptual knowledge of data analysis but without basic proficiency in the R or Python programming languages. We also constructed a user-facing website to serve as a central repository for information and instructional materials about CoGAPS and its application programming interfaces. Setting up the packages and conducting a test run is expected to take around 15 min, with an additional 30 min to conduct analyses on a precomputed result. The expected runtime on the user's own dataset can vary from hours to days depending on factors such as dataset size and input parameters.
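To make the idea of NMF concrete, the sketch below factors a small non-negative matrix into two non-negative factors using the classic multiplicative-update rule. This is a generic, non-Bayesian NMF written with NumPy only; it is not the CoGAPS algorithm and does not use the PyCoGAPS or R CoGAPS interfaces. The function name `nmf_multiplicative`, the variable names `A` and `P`, and the toy data are illustrative assumptions, not taken from the protocol.

```python
import numpy as np


def nmf_multiplicative(X, n_patterns, n_iter=500, seed=0, eps=1e-10):
    """Approximate a non-negative matrix X (features x samples) as A @ P.

    Generic multiplicative-update NMF minimizing the Frobenius norm.
    Illustrative only: this is NOT the Bayesian CoGAPS algorithm.
    """
    rng = np.random.default_rng(seed)
    n_features, n_samples = X.shape
    # Random strictly positive starting points for both factors.
    A = rng.random((n_features, n_patterns)) + eps
    P = rng.random((n_patterns, n_samples)) + eps
    for _ in range(n_iter):
        # Each update multiplies by a non-negative ratio, so the
        # factors stay non-negative throughout.
        P *= (A.T @ X) / (A.T @ A @ P + eps)
        A *= (X @ P.T) / (A @ P @ P.T + eps)
    return A, P


# Toy data (hypothetical): 30 "genes" x 40 "cells" built from 2 hidden patterns.
toy_rng = np.random.default_rng(1)
X = toy_rng.random((30, 2)) @ toy_rng.random((2, 40))

A, P = nmf_multiplicative(X, n_patterns=2)
rel_error = np.linalg.norm(X - A @ P) / np.linalg.norm(X)
print(f"relative reconstruction error: {rel_error:.4f}")
```

In this toy setting, each row of `P` can be read as a pattern's activity across samples and each column of `A` as the feature weights for that pattern. Interpreting such factors biologically is the step that, as the abstract notes, still requires additional post hoc statistics and annotation.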


