• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多目标优化的单细胞RNA测序缺失值插补

Imputing dropouts for single-cell RNA sequencing based on multi-objective optimization.

作者信息

Jin Ke, Li Bo, Yan Hong, Zhang Xiao-Fei

机构信息

Department of Statistics, School of Mathematics and Statistics, Central China Normal University, Wuhan 430079, China.

Hubei Key Laboratory of Mathematical Sciences, Central China Normal University, Wuhan 430079, China.

出版信息

Bioinformatics. 2022 Jun 13;38(12):3222-3230. doi: 10.1093/bioinformatics/btac300.

DOI:10.1093/bioinformatics/btac300
PMID:35485740
Abstract

MOTIVATION

Single-cell RNA sequencing (scRNA-seq) technologies have been testified revolutionary for their promotion on the profiling of single-cell transcriptomes at single-cell resolution. Excess zeros due to various technical noises, called dropouts, will mislead downstream analyses. Therefore, it is crucial to have accurate imputation methods to address the dropout problem.

RESULTS

In this article, we develop a new dropout imputation method for scRNA-seq data based on multi-objective optimization. Our method is different from existing ones, which assume that the underlying data has a preconceived structure and impute the dropouts according to the information learned from such structure. We assume that the data combines three types of latent structures, including the horizontal structure (genes are similar to each other), the vertical structure (cells are similar to each other) and the low-rank structure. The combination weights and latent structures are learned using multi-objective optimization. And, the weighted average of the observed data and the imputation results learned from the three types of structures are considered as the final result. Comprehensive downstream experiments show the superiority of our method in terms of recovery of true gene expression profiles, differential expression analysis, cell clustering and cell trajectory inference.

AVAILABILITY AND IMPLEMENTATION

The R package is available at https://github.com/Zhangxf-ccnu/scMOO and https://zenodo.org/record/5785195. The codes to reproduce the downstream analyses in this article can be found at https://github.com/Zhangxf-ccnu/scMOO_experiments_codes and https://zenodo.org/record/5786211. The detailed list of data sets used in the present study is represented in Supplementary Table S1 in the Supplementary materials.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

单细胞RNA测序(scRNA-seq)技术因其在单细胞分辨率下促进单细胞转录组分析而具有革命性。由于各种技术噪声导致的过多零值(称为缺失值)会误导下游分析。因此,拥有准确的插补方法来解决缺失值问题至关重要。

结果

在本文中,我们基于多目标优化开发了一种用于scRNA-seq数据的新型缺失值插补方法。我们的方法与现有方法不同,现有方法假设基础数据具有预先设定的结构,并根据从该结构中学到的信息来插补缺失值。我们假设数据结合了三种类型的潜在结构,包括水平结构(基因彼此相似)、垂直结构(细胞彼此相似)和低秩结构。使用多目标优化来学习组合权重和潜在结构。并且,将观测数据与从三种结构中学到的插补结果的加权平均值视为最终结果。全面的下游实验表明,我们的方法在恢复真实基因表达谱、差异表达分析、细胞聚类和细胞轨迹推断方面具有优势。

可用性和实现方式

R包可在https://github.com/Zhangxf-ccnu/scMOO和https://zenodo.org/record/5785195获取。重现本文下游分析的代码可在https://github.com/Zhangxf-ccnu/scMOO_experiments_codes和https://zenodo.org/record/5786211找到。本研究中使用的数据集详细列表见补充材料中的补充表S1。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
Imputing dropouts for single-cell RNA sequencing based on multi-objective optimization.基于多目标优化的单细胞RNA测序缺失值插补
Bioinformatics. 2022 Jun 13;38(12):3222-3230. doi: 10.1093/bioinformatics/btac300.
2
EnImpute: imputing dropout events in single-cell RNA-sequencing data via ensemble learning.EnImpute:通过集成学习在单细胞 RNA 测序数据中推断缺失事件。
Bioinformatics. 2019 Nov 1;35(22):4827-4829. doi: 10.1093/bioinformatics/btz435.
3
TsImpute: an accurate two-step imputation method for single-cell RNA-seq data.TsImpute:一种用于单细胞 RNA-seq 数据的准确两步插补方法。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad731.
4
scWMC: weighted matrix completion-based imputation of scRNA-seq data via prior subspace information.scWMC:基于加权矩阵补全的 scRNA-seq 数据插补,通过先验子空间信息。
Bioinformatics. 2022 Sep 30;38(19):4537-4545. doi: 10.1093/bioinformatics/btac570.
5
scTSSR: gene expression recovery for single-cell RNA sequencing using two-side sparse self-representation.scTSSR:使用双边稀疏自表示进行单细胞 RNA 测序的基因表达恢复。
Bioinformatics. 2020 May 1;36(10):3131-3138. doi: 10.1093/bioinformatics/btaa108.
6
scRMD: imputation for single cell RNA-seq data via robust matrix decomposition.scRMD:基于稳健矩阵分解的单细胞 RNA-seq 数据插补。
Bioinformatics. 2020 May 1;36(10):3156-3161. doi: 10.1093/bioinformatics/btaa139.
7
CMF-Impute: an accurate imputation tool for single-cell RNA-seq data.CMF-Impute:一种用于单细胞 RNA-seq 数据的精确插补工具。
Bioinformatics. 2020 May 1;36(10):3139-3147. doi: 10.1093/bioinformatics/btaa109.
8
2DImpute: imputation in single-cell RNA-seq data from correlations in two dimensions.2DImpute:基于二维相关性的单细胞 RNA-seq 数据插补。
Bioinformatics. 2020 Jun 1;36(11):3588-3589. doi: 10.1093/bioinformatics/btaa148.
9
CDSImpute: An ensemble similarity imputation method for single-cell RNA sequence dropouts.CDSImpute:一种用于单细胞 RNA 序列缺失的集成相似性插补方法。
Comput Biol Med. 2022 Jul;146:105658. doi: 10.1016/j.compbiomed.2022.105658. Epub 2022 May 21.
10
CL-Impute: A contrastive learning-based imputation for dropout single-cell RNA-seq data.CL-Impute:基于对比学习的 dropout 单细胞 RNA-seq 数据插补方法。
Comput Biol Med. 2023 Sep;164:107263. doi: 10.1016/j.compbiomed.2023.107263. Epub 2023 Jul 23.

引用本文的文献

1
PbImpute: Precise Zero Discrimination and Balanced Imputation in Single-Cell RNA Sequencing Data.PbImpute:单细胞RNA测序数据中的精确零判别与平衡插补
J Chem Inf Model. 2025 Mar 10;65(5):2670-2684. doi: 10.1021/acs.jcim.4c02125. Epub 2025 Feb 17.
2
scTCA: a hybrid Transformer-CNN architecture for imputation and denoising of scDNA-seq data.scTCA:一种用于 scDNA-seq 数据插补和去噪的混合 Transformer-CNN 架构。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae577.
3
BANMF-S: a blockwise accelerated non-negative matrix factorization framework with structural network constraints for single cell imputation.
BANMF-S:一种基于结构网络约束的分块加速非负矩阵分解框架,用于单细胞插补。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae432.
4
scRNMF: An imputation method for single-cell RNA-seq data by robust and non-negative matrix factorization.scRNMF:一种基于鲁棒非负矩阵分解的单细胞 RNA-seq 数据插补方法。
PLoS Comput Biol. 2024 Aug 8;20(8):e1012339. doi: 10.1371/journal.pcbi.1012339. eCollection 2024 Aug.
5
Single-cell omics: experimental workflow, data analyses and applications.单细胞组学:实验工作流程、数据分析及应用
Sci China Life Sci. 2025 Jan;68(1):5-102. doi: 10.1007/s11427-023-2561-0. Epub 2024 Jul 23.
6
TsImpute: an accurate two-step imputation method for single-cell RNA-seq data.TsImpute:一种用于单细胞 RNA-seq 数据的准确两步插补方法。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad731.
7
[Imputation method for dropout in single-cell transcriptome data].[单细胞转录组数据中缺失值的插补方法]
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2023 Aug 25;40(4):778-783. doi: 10.7507/1001-5515.202301009.
8
Multi-Objective Genetic Algorithm for Cluster Analysis of Single-Cell Transcriptomes.用于单细胞转录组聚类分析的多目标遗传算法
J Pers Med. 2023 Jan 20;13(2):183. doi: 10.3390/jpm13020183.