• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用约束鲁棒非负矩阵分解对单细胞RNA测序数据进行降维

Dimensionality reduction for single cell RNA sequencing data using constrained robust non-negative matrix factorization.

作者信息

Zhang Shuqin, Yang Liu, Yang Jinwen, Lin Zhixiang, Ng Michael K

机构信息

School of Mathematical Sciences, Fudan University, Shanghai 200433, China.

College of Intelligence and Computing, Tianjin University, Tianjin 300350, China.

出版信息

NAR Genom Bioinform. 2020 Aug 28;2(3):lqaa064. doi: 10.1093/nargab/lqaa064. eCollection 2020 Sep.

DOI:10.1093/nargab/lqaa064
PMID:33575614
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7671375/
Abstract

Single cell RNA-sequencing (scRNA-seq) technology, a powerful tool for analyzing the entire transcriptome at single cell level, is receiving increasing research attention. The presence of dropouts is an important characteristic of scRNA-seq data that may affect the performance of downstream analyses, such as dimensionality reduction and clustering. Cells sequenced to lower depths tend to have more dropouts than those sequenced to greater depths. In this study, we aimed to develop a dimensionality reduction method to address both dropouts and the non-negativity constraints in scRNA-seq data. The developed method simultaneously performs dimensionality reduction and dropout imputation under the non-negative matrix factorization (NMF) framework. The dropouts were modeled as a non-negative sparse matrix. Summation of the observed data matrix and dropout matrix was approximated by NMF. To ensure the sparsity pattern was maintained, a weighted ℓ penalty that took into account the dependency of dropouts on the sequencing depth in each cell was imposed. An efficient algorithm was developed to solve the proposed optimization problem. Experiments using both synthetic data and real data showed that dimensionality reduction via the proposed method afforded more robust clustering results compared with those obtained from the existing methods, and that dropout imputation improved the differential expression analysis.

摘要

单细胞RNA测序(scRNA-seq)技术作为一种在单细胞水平分析整个转录组的强大工具,正受到越来越多的研究关注。缺失值的存在是scRNA-seq数据的一个重要特征,可能会影响下游分析的性能,如降维和聚类。测序深度较低的细胞往往比测序深度较高的细胞有更多的缺失值。在本研究中,我们旨在开发一种降维方法,以解决scRNA-seq数据中的缺失值和非负约束问题。所开发的方法在非负矩阵分解(NMF)框架下同时进行降维和缺失值插补。将缺失值建模为一个非负稀疏矩阵。通过NMF对观测数据矩阵和缺失值矩阵的和进行近似。为了确保稀疏模式得以维持,施加了一种加权ℓ惩罚,该惩罚考虑了每个细胞中缺失值对测序深度的依赖性。开发了一种高效算法来解决所提出的优化问题。使用合成数据和真实数据进行的实验表明,与现有方法相比,通过所提出的方法进行降维能得到更稳健的聚类结果,并且缺失值插补改善了差异表达分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/a33ebd4324e7/lqaa064fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/18ec70fe50e3/lqaa064fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/575aa2831f7e/lqaa064fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/4b15abf2b701/lqaa064fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/900f7849756d/lqaa064fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/724a14e87f1a/lqaa064fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/7313918dd27d/lqaa064fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/f40f655104be/lqaa064fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/dac61037d651/lqaa064fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/569feb2301ad/lqaa064fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/a33ebd4324e7/lqaa064fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/18ec70fe50e3/lqaa064fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/575aa2831f7e/lqaa064fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/4b15abf2b701/lqaa064fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/900f7849756d/lqaa064fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/724a14e87f1a/lqaa064fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/7313918dd27d/lqaa064fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/f40f655104be/lqaa064fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/dac61037d651/lqaa064fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/569feb2301ad/lqaa064fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d55b/7671375/a33ebd4324e7/lqaa064fig10.jpg

相似文献

1
Dimensionality reduction for single cell RNA sequencing data using constrained robust non-negative matrix factorization.使用约束鲁棒非负矩阵分解对单细胞RNA测序数据进行降维
NAR Genom Bioinform. 2020 Aug 28;2(3):lqaa064. doi: 10.1093/nargab/lqaa064. eCollection 2020 Sep.
2
SSNMDI: a novel joint learning model of semi-supervised non-negative matrix factorization and data imputation for clustering of single-cell RNA-seq data.SSNMDI:一种用于单细胞 RNA-seq 数据聚类的半监督非负矩阵分解和数据插补的新型联合学习模型。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad149.
3
NISC: Neural Network-Imputation for Single-Cell RNA Sequencing and Cell Type Clustering.NISC:用于单细胞RNA测序和细胞类型聚类的神经网络插补法
Front Genet. 2022 May 3;13:847112. doi: 10.3389/fgene.2022.847112. eCollection 2022.
4
Imputation for Single-cell RNA-seq Data with Non-negative Matrix Factorization and Transfer Learning.基于非负矩阵分解和迁移学习的单细胞 RNA-seq 数据插补。
J Bioinform Comput Biol. 2023 Dec;21(6):2350029. doi: 10.1142/S0219720023500294. Epub 2024 Jan 23.
5
Joint learning dimension reduction and clustering of single-cell RNA-sequencing data.单细胞 RNA 测序数据的联合降维和聚类学习。
Bioinformatics. 2020 Jun 1;36(12):3825-3832. doi: 10.1093/bioinformatics/btaa231.
6
scRMD: imputation for single cell RNA-seq data via robust matrix decomposition.scRMD:基于稳健矩阵分解的单细胞 RNA-seq 数据插补。
Bioinformatics. 2020 May 1;36(10):3156-3161. doi: 10.1093/bioinformatics/btaa139.
7
Bubble: a fast single-cell RNA-seq imputation using an autoencoder constrained by bulk RNA-seq data.Bubble:一种利用受批量RNA测序数据约束的自动编码器进行的快速单细胞RNA测序插补方法。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac580.
8
A robust semi-supervised NMF model for single cell RNA-seq data.一种用于单细胞RNA测序数据的强大半监督非负矩阵分解模型。
PeerJ. 2020 Oct 16;8:e10091. doi: 10.7717/peerj.10091. eCollection 2020.
9
Robust Graph Regularized NMF with Dissimilarity and Similarity Constraints for ScRNA-seq Data Clustering.具有差异和相似性约束的鲁棒图正则化非负矩阵分解用于单细胞RNA测序数据聚类
J Chem Inf Model. 2022 Dec 12;62(23):6271-6286. doi: 10.1021/acs.jcim.2c01305. Epub 2022 Dec 2.
10
Matrix factorization for biomedical link prediction and scRNA-seq data imputation: an empirical survey.矩阵分解在生物医学链接预测和 scRNA-seq 数据插补中的应用:一项实证调查。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab479.

引用本文的文献

1
PLNMFG: Pseudo-label guided non-negative matrix factorization model with graph constraint for single-cell multi-omics data clustering.PLNMFG:用于单细胞多组学数据聚类的具有图约束的伪标签引导非负矩阵分解模型。
PLoS Comput Biol. 2025 Aug 18;21(8):e1013375. doi: 10.1371/journal.pcbi.1013375. eCollection 2025 Aug.
2
Opportunities and challenges of single-cell and spatially resolved genomics methods for neuroscience discovery.用于神经科学发现的单细胞和空间分辨基因组学方法的机遇与挑战。
Nat Neurosci. 2024 Dec;27(12):2292-2309. doi: 10.1038/s41593-024-01806-0. Epub 2024 Dec 3.
3
Integrative Single Cell Atlas Revealed Intratumoral Heterogeneity Generation from an Adaptive Epigenetic Cell State in Human Bladder Urothelial Carcinoma.

本文引用的文献

1
scRMD: imputation for single cell RNA-seq data via robust matrix decomposition.scRMD:基于稳健矩阵分解的单细胞 RNA-seq 数据插补。
Bioinformatics. 2020 May 1;36(10):3156-3161. doi: 10.1093/bioinformatics/btaa139.
2
Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression.使用正则化负二项式回归进行单细胞 RNA-seq 数据的归一化和方差稳定化。
Genome Biol. 2019 Dec 23;20(1):296. doi: 10.1186/s13059-019-1874-1.
3
DeepImpute: an accurate, fast, and scalable deep neural network method to impute single-cell RNA-seq data.
整合单细胞图谱揭示了人类膀胱癌中适应性表观遗传细胞状态引起的肿瘤内异质性产生。
Adv Sci (Weinh). 2024 Jun;11(24):e2308438. doi: 10.1002/advs.202308438. Epub 2024 Apr 6.
4
GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership.GoM DE:利用允许成员等级的差异表达分析来解释序列计数数据中的结构。
Genome Biol. 2023 Oct 19;24(1):236. doi: 10.1186/s13059-023-03067-9.
5
scPNMF: sparse gene encoding of single cells to facilitate gene selection for targeted gene profiling.scPNMF:稀疏的单细胞基因编码,以方便选择用于靶向基因分析的基因。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i358-i366. doi: 10.1093/bioinformatics/btab273.
DeepImpute:一种准确、快速且可扩展的深度学习神经网络方法,用于填补单细胞 RNA-seq 数据。
Genome Biol. 2019 Oct 18;20(1):211. doi: 10.1186/s13059-019-1837-6.
4
False signals induced by single-cell imputation.单细胞插补诱导的假信号。
F1000Res. 2018 Nov 2;7:1740. doi: 10.12688/f1000research.16613.2. eCollection 2018.
5
Dimensionality reduction for visualizing single-cell data using UMAP.使用UMAP进行单细胞数据可视化的降维方法。
Nat Biotechnol. 2018 Dec 3. doi: 10.1038/nbt.4314.
6
VIPER: variability-preserving imputation for accurate gene expression recovery in single-cell RNA sequencing studies.VIPER:单细胞 RNA 测序研究中用于准确基因表达恢复的保留变异性的插补。
Genome Biol. 2018 Nov 12;19(1):196. doi: 10.1186/s13059-018-1575-1.
7
Single-cell transcriptomics of 20 mouse organs creates a Tabula Muris.单细胞转录组学分析 20 种小鼠器官构建小鼠多器官单细胞图谱。
Nature. 2018 Oct;562(7727):367-372. doi: 10.1038/s41586-018-0590-4. Epub 2018 Oct 3.
8
A UNIFIED STATISTICAL FRAMEWORK FOR SINGLE CELL AND BULK RNA SEQUENCING DATA.用于单细胞和批量RNA测序数据的统一统计框架
Ann Appl Stat. 2018 Mar;12(1):609-632. doi: 10.1214/17-AOAS1110. Epub 2018 Mar 9.
9
Comparison of Computational Methods for Imputing Single-Cell RNA-Sequencing Data.比较单细胞 RNA 测序数据插补的计算方法。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Mar-Apr;17(2):376-389. doi: 10.1109/TCBB.2018.2848633. Epub 2018 Jun 19.
10
Recovering Gene Interactions from Single-Cell Data Using Data Diffusion.利用数据扩散从单细胞数据中恢复基因相互作用。
Cell. 2018 Jul 26;174(3):716-729.e27. doi: 10.1016/j.cell.2018.05.061. Epub 2018 Jun 28.