• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于单细胞 RNA-seq 数据的细胞丰度预测新方法。

A novel method for predicting cell abundance based on single-cell RNA-seq data.

机构信息

School of Computer Science, Northwestern Polytechnical University, Chang'an Ave, Changan Qu, Xi'an City, Shaanxi Province, China.

出版信息

BMC Bioinformatics. 2021 Aug 25;22(Suppl 9):281. doi: 10.1186/s12859-021-04187-4.

DOI:10.1186/s12859-021-04187-4
PMID:34433409
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8386079/
Abstract

BACKGROUND

It is important to understand the composition of cell type and its proportion in intact tissues, as changes in certain cell types are the underlying cause of disease in humans. Although compositions of cell type and ratios can be obtained by single-cell sequencing, single-cell sequencing is currently expensive and cannot be applied in clinical studies involving a large number of subjects. Therefore, it is useful to apply the bulk RNA-Seq dataset and the single-cell RNA dataset to deconvolute and obtain the cell type composition in the tissue.

RESULTS

By analyzing the existing cell population prediction methods, we found that most of the existing methods need the cell-type-specific gene expression profile as the input of the signature matrix. However, in real applications, it is not always possible to find an available signature matrix. To solve this problem, we proposed a novel method, named DCap, to predict cell abundance. DCap is a deconvolution method based on non-negative least squares. DCap considers the weight resulting from measurement noise of bulk RNA-seq and calculation error of single-cell RNA-seq data, during the calculation process of non-negative least squares and performs the weighted iterative calculation based on least squares. By weighting the bulk tissue gene expression matrix and single-cell gene expression matrix, DCap minimizes the measurement error of bulk RNA-Seq and also reduces errors resulting from differences in the number of expressed genes in the same type of cells in different samples. Evaluation test shows that DCap performs better in cell type abundance prediction than existing methods.

CONCLUSION

DCap solves the deconvolution problem using weighted non-negative least squares to predict cell type abundance in tissues. DCap has better prediction results and does not need to prepare a signature matrix that gives the cell-type-specific gene expression profile in advance. By using DCap, we can better study the changes in cell proportion in diseased tissues and provide more information on the follow-up treatment of diseases.

摘要

背景

了解完整组织中细胞类型的组成及其比例非常重要,因为某些细胞类型的变化是人类疾病的根本原因。虽然可以通过单细胞测序获得细胞类型组成和比例,但单细胞测序目前较为昂贵,并且无法应用于涉及大量研究对象的临床研究中。因此,应用批量 RNA-Seq 数据集和单细胞 RNA 数据集进行去卷积以获得组织中的细胞类型组成是很有用的。

结果

通过分析现有的细胞群体预测方法,我们发现大多数现有的方法需要细胞类型特异性基因表达谱作为特征矩阵的输入。然而,在实际应用中,并不总是能够找到可用的特征矩阵。为了解决这个问题,我们提出了一种新的方法,名为 DCap,用于预测细胞丰度。DCap 是一种基于非负最小二乘法的去卷积方法。DCap 在非负最小二乘的计算过程中考虑了批量 RNA-seq 的测量噪声和单细胞 RNA-seq 数据的计算误差产生的权重,并基于最小二乘法进行加权迭代计算。通过对批量组织基因表达矩阵和单细胞基因表达矩阵进行加权,DCap 最小化了批量 RNA-Seq 的测量误差,同时减少了由于不同样本中相同类型细胞的表达基因数量不同而产生的误差。评估测试表明,DCap 在细胞类型丰度预测方面的性能优于现有的方法。

结论

DCap 使用加权非负最小二乘法解决去卷积问题,以预测组织中的细胞类型丰度。DCap 具有更好的预测结果,并且不需要预先准备给出细胞类型特异性基因表达谱的特征矩阵。通过使用 DCap,我们可以更好地研究疾病组织中细胞比例的变化,并为疾病的后续治疗提供更多信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/000cad084d68/12859_2021_4187_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/bae992b7d914/12859_2021_4187_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/117985225496/12859_2021_4187_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/7bb2bdb2a855/12859_2021_4187_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/19d084afba54/12859_2021_4187_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/2a68c85e04c3/12859_2021_4187_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/8088bcc3b9d9/12859_2021_4187_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/4147193988aa/12859_2021_4187_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/000cad084d68/12859_2021_4187_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/bae992b7d914/12859_2021_4187_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/117985225496/12859_2021_4187_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/7bb2bdb2a855/12859_2021_4187_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/19d084afba54/12859_2021_4187_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/2a68c85e04c3/12859_2021_4187_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/8088bcc3b9d9/12859_2021_4187_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/4147193988aa/12859_2021_4187_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8df/8386079/000cad084d68/12859_2021_4187_Fig8_HTML.jpg

相似文献

1
A novel method for predicting cell abundance based on single-cell RNA-seq data.基于单细胞 RNA-seq 数据的细胞丰度预测新方法。
BMC Bioinformatics. 2021 Aug 25;22(Suppl 9):281. doi: 10.1186/s12859-021-04187-4.
2
MuSiC2: cell-type deconvolution for multi-condition bulk RNA-seq data.MuSiC2:用于多条件批量 RNA-seq 数据的细胞类型去卷积。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac430.
3
Bulk brain tissue cell-type deconvolution with bias correction for single-nuclei RNA sequencing data using DeTREM.使用 DeTREM 对基于单细胞核 RNA 测序数据的偏倚校正进行批量脑组织细胞类型去卷积。
BMC Bioinformatics. 2023 Sep 19;24(1):349. doi: 10.1186/s12859-023-05476-w.
4
Deconvolution from bulk gene expression by leveraging sample-wise and gene-wise similarities and single-cell RNA-Seq data.通过利用样本和基因之间的相似性以及单细胞 RNA-Seq 数据进行批量基因表达的反卷积。
BMC Genomics. 2024 Sep 18;25(1):875. doi: 10.1186/s12864-024-10728-x.
5
Effective methods for bulk RNA-seq deconvolution using scnRNA-seq transcriptomes.使用 scnRNA-seq 转录组进行批量 RNA-seq 去卷积的有效方法。
Genome Biol. 2023 Aug 1;24(1):177. doi: 10.1186/s13059-023-03016-6.
6
Inverse weighting method with jackknife variance estimator for differential expression analysis of single-cell RNA sequencing data.基于刀切方差估计的逆加权法用于单细胞 RNA 测序数据分析中的差异表达分析。
Comput Biol Chem. 2022 Oct;100:107733. doi: 10.1016/j.compbiolchem.2022.107733. Epub 2022 Jul 18.
7
scDeconv: an R package to deconvolve bulk DNA methylation data with scRNA-seq data and paired bulk RNA-DNA methylation data.scDeconv:一个用于对 scRNA-seq 数据和配对的 bulk RNA-DNA 甲基化数据进行去卷积的 R 包。
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac150.
8
SimBu: bias-aware simulation of bulk RNA-seq data with variable cell-type composition.SimBu:具有可变细胞类型组成的批量 RNA-seq 数据的偏差感知模拟。
Bioinformatics. 2022 Sep 16;38(Suppl_2):ii141-ii147. doi: 10.1093/bioinformatics/btac499.
9
Approximate estimation of cell-type resolution transcriptome in bulk tissue through matrix completion.通过矩阵补全对批量组织中的细胞类型分辨率转录组进行近似估计。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad273.
10
dCaP: detecting differential binding events in multiple conditions and proteins.dCaP:检测多种条件和蛋白质中的差异结合事件
BMC Genomics. 2014;15 Suppl 9(Suppl 9):S12. doi: 10.1186/1471-2164-15-S9-S12. Epub 2014 Dec 8.

引用本文的文献

1
Insights into the dynamics and evolution of Rummeliibacillus stabekisii prophages in extreme environments: from Antarctic soil to spacecraft floors.对极端环境中斯氏鲁梅利芽孢杆菌原噬菌体的动态变化及进化的见解:从南极土壤到航天器地板
Extremophiles. 2024 Dec 21;29(1):10. doi: 10.1007/s00792-024-01377-9.
2
Enabling Single-Cell Drug Response Annotations from Bulk RNA-Seq Using SCAD.利用 SCAD 从批量 RNA-Seq 中实现单细胞药物反应注释。
Adv Sci (Weinh). 2023 Apr;10(11):e2204113. doi: 10.1002/advs.202204113. Epub 2023 Feb 10.
3
Temporal progress of gene expression analysis with RNA-Seq data: A review on the relationship between computational methods.

本文引用的文献

1
An end-to-end heterogeneous graph representation learning-based framework for drug-target interaction prediction.基于端到端异质图表示学习的药物-靶标相互作用预测框架。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbaa430.
2
Integrating multi-network topology for gene function prediction using deep neural networks.使用深度神经网络整合多网络拓扑结构进行基因功能预测。
Brief Bioinform. 2021 Mar 22;22(2):2096-2105. doi: 10.1093/bib/bbaa036.
3
Combining gene ontology with deep neural networks to enhance the clustering of single cell RNA-Seq data.
基于RNA测序数据的基因表达分析的时间进展:计算方法之间关系的综述
Comput Struct Biotechnol J. 2022 Dec 1;21:86-98. doi: 10.1016/j.csbj.2022.11.051. eCollection 2023.
4
XDec-CHI reveals immunosuppressive interactions in pancreatic ductal adenocarcinoma.XDec-CHI揭示了胰腺导管腺癌中的免疫抑制相互作用。
iScience. 2022 Sep 29;25(10):105249. doi: 10.1016/j.isci.2022.105249. eCollection 2022 Oct 21.
5
Deciphering the endometrial immune landscape of RIF during the window of implantation from cellular senescence by integrated bioinformatics analysis and machine learning.通过整合生物信息学分析和机器学习,解析着床窗口期反复种植失败子宫内膜的细胞衰老免疫全景。
Front Immunol. 2022 Sep 5;13:952708. doi: 10.3389/fimmu.2022.952708. eCollection 2022.
6
Graph Embedding Based Novel Gene Discovery Associated With Diabetes Mellitus.基于图嵌入的与糖尿病相关的新型基因发现
Front Genet. 2021 Nov 25;12:779186. doi: 10.3389/fgene.2021.779186. eCollection 2021.
7
An MRI Study on Effects of Math Education on Brain Development Using Multi-Instance Contrastive Learning.一项使用多实例对比学习法对数学教育对大脑发育影响的磁共振成像研究。
Front Psychol. 2021 Nov 24;12:765754. doi: 10.3389/fpsyg.2021.765754. eCollection 2021.
将基因本体论与深度神经网络相结合,以增强单细胞 RNA-Seq 数据的聚类。
BMC Bioinformatics. 2019 Jun 10;20(Suppl 8):284. doi: 10.1186/s12859-019-2769-6.
4
A learning-based framework for miRNA-disease association identification using neural networks.基于神经网络的 miRNA-疾病关联识别学习框架。
Bioinformatics. 2019 Nov 1;35(21):4364-4371. doi: 10.1093/bioinformatics/btz254.
5
Cell composition analysis of bulk genomics using single-cell data.使用单细胞数据进行批量基因组学的细胞组成分析。
Nat Methods. 2019 Apr;16(4):327-332. doi: 10.1038/s41592-019-0355-5. Epub 2019 Mar 18.
6
Bulk tissue cell type deconvolution with multi-subject single-cell expression reference.基于多主体单细胞表达参考的组织细胞类型去卷积。
Nat Commun. 2019 Jan 22;10(1):380. doi: 10.1038/s41467-018-08023-x.
7
Single-cell transcriptomics of the mouse kidney reveals potential cellular targets of kidney disease.单细胞转录组学分析揭示了肾脏疾病的潜在细胞靶标。
Science. 2018 May 18;360(6390):758-763. doi: 10.1126/science.aar2131. Epub 2018 Apr 5.
8
Estimation of immune cell content in tumour tissue using single-cell RNA-seq data.利用单细胞 RNA 测序数据估算肿瘤组织中的免疫细胞含量。
Nat Commun. 2017 Dec 11;8(1):2032. doi: 10.1038/s41467-017-02289-3.
9
Single-Cell Transcriptome Profiling of Human Pancreatic Islets in Health and Type 2 Diabetes.健康与2型糖尿病状态下人类胰岛的单细胞转录组分析
Cell Metab. 2016 Oct 11;24(4):593-607. doi: 10.1016/j.cmet.2016.08.020. Epub 2016 Sep 22.
10
RNA Sequencing of Single Human Islet Cells Reveals Type 2 Diabetes Genes.单细胞人类胰岛 RNA 测序揭示 2 型糖尿病基因。
Cell Metab. 2016 Oct 11;24(4):608-615. doi: 10.1016/j.cmet.2016.08.018. Epub 2016 Sep 22.