• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

I-Impute:一种用于单细胞 RNA 测序数据插补的自洽方法。

I-Impute: a self-consistent method to impute single cell RNA sequencing data.

机构信息

School of Software, Northwestern Polytechnical University, Xi'an, Shaanxi, 710072, China.

Department of Computer Science, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China.

出版信息

BMC Genomics. 2020 Nov 18;21(Suppl 10):618. doi: 10.1186/s12864-020-07007-w.

DOI:10.1186/s12864-020-07007-w
PMID:33208097
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7677776/
Abstract

BACKGROUND

Single-cell RNA-sequencing (scRNA-seq) is becoming indispensable in the study of cell-specific transcriptomes. However, in scRNA-seq techniques, only a small fraction of the genes are captured due to "dropout" events. These dropout events require intensive treatment when analyzing scRNA-seq data. For example, imputation tools have been proposed to estimate dropout events and de-noise data. The performance of these imputation tools are often evaluated, or fine-tuned, using various clustering criteria based on ground-truth cell subgroup labels. This limits their effectiveness in the cases where we lack cell subgroup knowledge. We consider an alternative strategy which requires the imputation to follow a "self-consistency" principle; that is, the imputation process is to refine its results until there is no internal inconsistency or dropouts from the data.

RESULTS

We propose the use of "self-consistency" as a main criteria in performing imputation. To demonstrate this principle we devised I-Impute, a "self-consistent" method, to impute scRNA-seq data. I-Impute optimizes continuous similarities and dropout probabilities, in iterative refinements until a self-consistent imputation is reached. On the in silico data sets, I-Impute exhibited the highest Pearson correlations for different dropout rates consistently compared with the state-of-art methods SAVER and scImpute. Furthermore, we collected three wetlab datasets, mouse bladder cells dataset, embryonic stem cells dataset, and aortic leukocyte cells dataset, to evaluate the tools. I-Impute exhibited feasible cell subpopulation discovery efficacy on all the three datasets. It achieves the highest clustering accuracy compared with SAVER and scImpute.

CONCLUSIONS

A strategy based on "self-consistency", captured through our method, I-Impute, gave imputation results better than the state-of-the-art tools. Source code of I-Impute can be accessed at https://github.com/xikanfeng2/I-Impute .

摘要

背景

单细胞 RNA 测序(scRNA-seq)在研究细胞特异性转录组方面变得不可或缺。然而,在 scRNA-seq 技术中,由于“缺失”事件,只有一小部分基因被捕获。这些缺失事件在分析 scRNA-seq 数据时需要进行密集处理。例如,已经提出了插补工具来估计缺失事件并对数据进行去噪。这些插补工具的性能通常使用基于真实细胞亚群标签的各种聚类标准来评估或微调。这限制了它们在缺乏细胞亚群知识的情况下的有效性。我们考虑了一种替代策略,该策略要求插补遵循“自洽性”原则;也就是说,插补过程是要细化其结果,直到数据中没有内部不一致或缺失。

结果

我们提出将“自洽性”用作执行插补的主要标准。为了证明这一原理,我们设计了 I-Impute,一种“自洽”的方法,用于插补 scRNA-seq 数据。I-Impute 通过连续的相似性和缺失概率的迭代细化来优化,直到达到自洽的插补。在模拟数据集中,与最先进的方法 SAVER 和 scImpute 相比,I-Impute 在不同缺失率下表现出了最高的 Pearson 相关性。此外,我们收集了三个湿实验室数据集,即小鼠膀胱细胞数据集、胚胎干细胞数据集和主动脉白细胞细胞数据集,以评估这些工具。I-Impute 在所有三个数据集上都表现出了可行的细胞亚群发现效果。与 SAVER 和 scImpute 相比,它实现了最高的聚类准确性。

结论

通过我们的方法 I-Impute 捕获的基于“自洽性”的策略,给出的插补结果优于最先进的工具。I-Impute 的源代码可以在 https://github.com/xikanfeng2/I-Impute 上访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de88/7677776/943eabd8e403/12864_2020_7007_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de88/7677776/0d431794ccd6/12864_2020_7007_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de88/7677776/1e0e45549532/12864_2020_7007_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de88/7677776/3ef44309eea5/12864_2020_7007_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de88/7677776/943eabd8e403/12864_2020_7007_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de88/7677776/0d431794ccd6/12864_2020_7007_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de88/7677776/1e0e45549532/12864_2020_7007_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de88/7677776/3ef44309eea5/12864_2020_7007_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de88/7677776/943eabd8e403/12864_2020_7007_Fig4_HTML.jpg

相似文献

1
I-Impute: a self-consistent method to impute single cell RNA sequencing data.I-Impute:一种用于单细胞 RNA 测序数据插补的自洽方法。
BMC Genomics. 2020 Nov 18;21(Suppl 10):618. doi: 10.1186/s12864-020-07007-w.
2
CL-Impute: A contrastive learning-based imputation for dropout single-cell RNA-seq data.CL-Impute:基于对比学习的 dropout 单细胞 RNA-seq 数据插补方法。
Comput Biol Med. 2023 Sep;164:107263. doi: 10.1016/j.compbiomed.2023.107263. Epub 2023 Jul 23.
3
GE-Impute: graph embedding-based imputation for single-cell RNA-seq data.GE-Impute:基于图嵌入的单细胞 RNA-seq 数据插补。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac313.
4
ScLRTC: imputation for single-cell RNA-seq data via low-rank tensor completion.ScLRTC:基于低秩张量补全的单细胞 RNA-seq 数据插补。
BMC Genomics. 2021 Nov 29;22(1):860. doi: 10.1186/s12864-021-08101-3.
5
CMF-Impute: an accurate imputation tool for single-cell RNA-seq data.CMF-Impute:一种用于单细胞 RNA-seq 数据的精确插补工具。
Bioinformatics. 2020 May 1;36(10):3139-3147. doi: 10.1093/bioinformatics/btaa109.
6
An efficient scRNA-seq dropout imputation method using graph attention network.基于图注意力网络的高效 scRNA-seq 缺失值插补方法。
BMC Bioinformatics. 2021 Dec 7;22(1):582. doi: 10.1186/s12859-021-04493-x.
7
AGImpute: imputation of scRNA-seq data based on a hybrid GAN with dropouts identification.AGImpute:基于带有缺失值识别的混合生成对抗网络对单细胞RNA测序数据进行插补
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae068.
8
SAE-Impute: imputation for single-cell data via subspace regression and auto-encoders.SAE-Impute:通过子空间回归和自动编码器对单细胞数据进行插补
BMC Bioinformatics. 2024 Oct 1;25(1):317. doi: 10.1186/s12859-024-05944-x.
9
Epi-Impute: Single-Cell RNA-seq Imputation via Integration with Single-Cell ATAC-seq.Epi-Impute:通过与单细胞 ATAC-seq 整合进行单细胞 RNA-seq 插补。
Int J Mol Sci. 2023 Mar 25;24(7):6229. doi: 10.3390/ijms24076229.
10
CDSImpute: An ensemble similarity imputation method for single-cell RNA sequence dropouts.CDSImpute:一种用于单细胞 RNA 序列缺失的集成相似性插补方法。
Comput Biol Med. 2022 Jul;146:105658. doi: 10.1016/j.compbiomed.2022.105658. Epub 2022 May 21.

引用本文的文献

1
Inferring Alzheimer's Disease Pathologic Traits from Clinical Measures in Living Adults.从成年活体的临床指标推断阿尔茨海默病的病理特征。
J Alzheimers Dis. 2024;98(1):95-107. doi: 10.3233/JAD-230639.
2
Inferring Alzheimer's disease pathologic traits from clinical measures in living adults.从成年活体的临床指标推断阿尔茨海默病的病理特征。
medRxiv. 2024 Jan 20:2023.05.08.23289668. doi: 10.1101/2023.05.08.23289668.
3
Advancements in Genomic and Behavioral Neuroscience Analysis for the Study of Normal and Pathological Brain Function.

本文引用的文献

1
Subpopulation Detection and Their Comparative Analysis across Single-Cell Experiments with scPopCorn.利用 scPopCorn 进行单细胞实验中的亚群检测及其比较分析。
Cell Syst. 2019 Jun 26;8(6):506-513.e5. doi: 10.1016/j.cels.2019.05.007. Epub 2019 Jun 19.
2
An imputation-regularized optimization algorithm for high dimensional missing data problems and beyond.一种用于高维缺失数据问题及其他问题的插补正则化优化算法。
J R Stat Soc Series B Stat Methodol. 2018 Nov;80(5):899-926. doi: 10.1111/rssb.12279. Epub 2018 Jun 25.
3
PanglaoDB: a web server for exploration of mouse and human single-cell RNA sequencing data.
用于研究正常和病理脑功能的基因组学与行为神经科学分析进展
Front Mol Neurosci. 2022 Jun 23;15:905328. doi: 10.3389/fnmol.2022.905328. eCollection 2022.
PanglaoDB:一个用于探索小鼠和人类单细胞 RNA 测序数据的网络服务器。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz046.
4
Scalable analysis of cell-type composition from single-cell transcriptomics using deep recurrent learning.利用深度循环神经网络对单细胞转录组学数据进行可扩展的细胞类型组成分析。
Nat Methods. 2019 Apr;16(4):311-314. doi: 10.1038/s41592-019-0353-7. Epub 2019 Mar 18.
5
Single-cell RNA-seq denoising using a deep count autoencoder.基于深度计数自编码器的单细胞 RNA-seq 去噪。
Nat Commun. 2019 Jan 23;10(1):390. doi: 10.1038/s41467-018-07931-2.
6
Spatially and functionally distinct subclasses of breast cancer-associated fibroblasts revealed by single cell RNA sequencing.单细胞 RNA 测序揭示乳腺癌相关成纤维细胞的空间和功能上不同的亚群。
Nat Commun. 2018 Dec 4;9(1):5150. doi: 10.1038/s41467-018-07582-3.
7
Deep generative modeling for single-cell transcriptomics.单细胞转录组学的深度生成模型。
Nat Methods. 2018 Dec;15(12):1053-1058. doi: 10.1038/s41592-018-0229-2. Epub 2018 Nov 30.
8
Unravelling subclonal heterogeneity and aggressive disease states in TNBC through single-cell RNA-seq.通过单细胞 RNA 测序揭示三阴性乳腺癌中的亚克隆异质性和侵袭性疾病状态。
Nat Commun. 2018 Sep 4;9(1):3588. doi: 10.1038/s41467-018-06052-0.
9
Global characterization of T cells in non-small-cell lung cancer by single-cell sequencing.单细胞测序对非小细胞肺癌 T 细胞的全面刻画。
Nat Med. 2018 Jul;24(7):978-985. doi: 10.1038/s41591-018-0045-3. Epub 2018 Jun 25.
10
SAVER: gene expression recovery for single-cell RNA sequencing.SAVER:单细胞 RNA 测序的基因表达恢复。
Nat Methods. 2018 Jul;15(7):539-542. doi: 10.1038/s41592-018-0033-z. Epub 2018 Jun 25.