[单细胞转录组数据中缺失值的插补方法]

[Imputation method for dropout in single-cell transcriptome data].

作者信息

Jiang Chao, Hu Longfei, Xu Chunxiang, Ge Qinyu, Zhao Xiangwei

机构信息

State Key Laboratory of Bioelectronics, School of Biological Sciences and Medical Engineering, Southeast University, Nanjing 210096, P. R. China.

School of Medicine & Holistic Integrative Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, P. R. China.

出版信息

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2023 Aug 25;40(4):778-783. doi: 10.7507/1001-5515.202301009.

Abstract

Single-cell transcriptome sequencing (scRNA-seq) can resolve the expression characteristics of cells in tissues with single-cell precision, enabling researchers to quantify cellular heterogeneity within populations with higher resolution, revealing potentially heterogeneous cell populations and the dynamics of complex tissues. However, the presence of a large number of technical zeros in scRNA-seq data will have an impact on downstream analysis of cell clustering, differential genes, cell annotation, and pseudotime, hindering the discovery of meaningful biological signals. The main idea to solve this problem is to make use of the potential correlation between cells and genes, and to impute the technical zeros through the observed data. Based on this, this paper reviewed the basic methods of imputing technical zeros in the scRNA-seq data and discussed the advantages and disadvantages of the existing methods. Finally, recommendations and perspectives on the use and development of the method were provided.

摘要

单细胞转录组测序(scRNA-seq)能够以单细胞精度解析组织中细胞的表达特征,使研究人员能够以更高分辨率量化群体内的细胞异质性,揭示潜在的异质细胞群体和复杂组织的动态变化。然而,scRNA-seq数据中存在大量技术零值会对细胞聚类、差异基因、细胞注释和拟时间等下游分析产生影响,阻碍有意义生物信号的发现。解决这一问题的主要思路是利用细胞与基因之间的潜在相关性,并通过观测数据对技术零值进行插补。基于此,本文综述了scRNA-seq数据中技术零值插补的基本方法,并讨论了现有方法的优缺点。最后,对该方法的使用和发展提供了建议和展望。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索