• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Semi-CAM:一种具有部分标记基因信息的批量转录组数据的半监督解卷积方法。

Semi-CAM: A semi-supervised deconvolution method for bulk transcriptomic data with partial marker gene information.

机构信息

Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA.

Department of Pediatrics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA.

出版信息

Sci Rep. 2020 Mar 25;10(1):5434. doi: 10.1038/s41598-020-62330-2.

DOI:10.1038/s41598-020-62330-2
PMID:32214192
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7096458/
Abstract

Deconvolution of bulk transcriptomics data from mixed cell populations is vital to identify the cellular mechanism of complex diseases. Existing deconvolution approaches can be divided into two major groups: supervised and unsupervised methods. Supervised deconvolution methods use cell type-specific prior information including cell proportions, reference cell type-specific gene signatures, or marker genes for each cell type, which may not be available in practice. Unsupervised methods, such as non-negative matrix factorization (NMF) and Convex Analysis of Mixtures (CAM), in contrast, completely disregard prior information and thus are not efficient for data with partial cell type-specific information. In this paper, we propose a semi-supervised deconvolution method, semi-CAM, that extends CAM by utilizing marker information from partial cell types. Analysis of simulation and two benchmark data have demonstrated that semi-CAM outperforms CAM by yielding more accurate cell proportion estimations when markers from partial/all cell types are available. In addition, when markers from all cell types are available, semi-CAM achieves better or similar accuracy compared to the supervised method using signature genes, CIBERSORT, and the marker-based supervised methods semi-NMF and DSA. Furthermore, analysis of human chlamydia-infection data with bulk expression profiles from six cell types and prior marker information of only three cell types suggests that semi-CAM achieves more accurate cell proportion estimations than CAM.

摘要

从混合细胞群体中反卷积批量转录组学数据对于识别复杂疾病的细胞机制至关重要。现有的反卷积方法可分为两大类:有监督和无监督方法。有监督反卷积方法使用细胞类型特异性的先验信息,包括细胞比例、参考细胞类型特异性基因特征或每个细胞类型的标记基因,但在实践中可能无法获得。相比之下,无监督方法(如非负矩阵分解(NMF)和混合物凸分析(CAM))完全忽略了先验信息,因此对于具有部分细胞类型特异性信息的数据效率不高。在本文中,我们提出了一种半监督反卷积方法 semi-CAM,它通过利用部分细胞类型的标记信息扩展了 CAM。通过对模拟数据和两个基准数据集的分析,我们证明了当部分/所有细胞类型的标记可用时,semi-CAM 比 CAM 能够产生更准确的细胞比例估计。此外,当所有细胞类型的标记都可用时,与使用特征基因、CIBERSORT 的有监督方法以及基于标记的有监督方法 semi-NMF 和 DSA 相比,semi-CAM 具有更好或相似的准确性。此外,对来自六个细胞类型的批量表达谱和仅三个细胞类型的先验标记信息的人类衣原体感染数据进行分析表明,semi-CAM 比 CAM 能够实现更准确的细胞比例估计。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/049fddc812fc/41598_2020_62330_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/7bfe54566f15/41598_2020_62330_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/4e9f9d0b15a2/41598_2020_62330_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/a84aaa46b163/41598_2020_62330_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/d820e286f8be/41598_2020_62330_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/6b8156457121/41598_2020_62330_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/942f6858b3a0/41598_2020_62330_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/a3e2e7773567/41598_2020_62330_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/70767183a72e/41598_2020_62330_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/d09f29c0953a/41598_2020_62330_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/049fddc812fc/41598_2020_62330_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/7bfe54566f15/41598_2020_62330_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/4e9f9d0b15a2/41598_2020_62330_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/a84aaa46b163/41598_2020_62330_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/d820e286f8be/41598_2020_62330_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/6b8156457121/41598_2020_62330_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/942f6858b3a0/41598_2020_62330_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/a3e2e7773567/41598_2020_62330_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/70767183a72e/41598_2020_62330_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/d09f29c0953a/41598_2020_62330_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77c/7096458/049fddc812fc/41598_2020_62330_Fig10_HTML.jpg

相似文献

1
Semi-CAM: A semi-supervised deconvolution method for bulk transcriptomic data with partial marker gene information.Semi-CAM:一种具有部分标记基因信息的批量转录组数据的半监督解卷积方法。
Sci Rep. 2020 Mar 25;10(1):5434. doi: 10.1038/s41598-020-62330-2.
2
Semi-supervised Nonnegative Matrix Factorization for gene expression deconvolution: a case study.半监督非负矩阵分解在基因表达解卷积中的应用:案例研究。
Infect Genet Evol. 2012 Jul;12(5):913-21. doi: 10.1016/j.meegid.2011.08.014. Epub 2011 Sep 10.
3
GLDADec: marker-gene guided LDA modeling for bulk gene expression deconvolution.GLDADec:基于标记基因引导的 LDA 建模的批量基因表达解卷积。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae315.
4
Comparative Analysis of Cell Mixtures Deconvolution and Gene Signatures Generated for Blood, Immune and Cancer Cells.细胞混合物去卷积与血液、免疫和癌细胞生成的基因特征的比较分析。
Int J Mol Sci. 2023 Jun 28;24(13):10765. doi: 10.3390/ijms241310765.
5
SSMD: a semi-supervised approach for a robust cell type identification and deconvolution of mouse transcriptomics data.SSMD:一种用于稳健的细胞类型鉴定和小鼠转录组数据分析去卷积的半监督方法。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa307.
6
Assessing transcriptomic heterogeneity of single-cell RNASeq data by bulk-level gene expression data.通过批量水平基因表达数据评估单细胞RNA测序数据的转录组异质性。
BMC Bioinformatics. 2024 Jun 12;25(1):209. doi: 10.1186/s12859-024-05825-3.
7
SpaDecon: cell-type deconvolution in spatial transcriptomics with semi-supervised learning.SpaDecon:基于半监督学习的空间转录组学中的细胞类型去卷积。
Commun Biol. 2023 Apr 7;6(1):378. doi: 10.1038/s42003-023-04761-x.
8
Benchmarking of cell type deconvolution pipelines for transcriptomics data.基于转录组数据的细胞类型去卷积分析流水线的基准测试
Nat Commun. 2020 Nov 6;11(1):5650. doi: 10.1038/s41467-020-19015-1.
9
Deblender: a semi-/unsupervised multi-operational computational method for complete deconvolution of expression data from heterogeneous samples.Deblender:一种半/无监督的多操作计算方法,用于从异质样本中完全反卷积表达数据。
BMC Bioinformatics. 2018 Nov 7;19(1):408. doi: 10.1186/s12859-018-2442-5.
10
BEDwARS: a robust Bayesian approach to bulk gene expression deconvolution with noisy reference signatures.BEDwARS:一种具有稳健性的贝叶斯方法,可用于对具有噪声参考特征的批量基因表达解卷积。
Genome Biol. 2023 Aug 3;24(1):178. doi: 10.1186/s13059-023-03007-7.

引用本文的文献

1
CAM3.0: determining cell type composition and expression from bulk tissues with fully unsupervised deconvolution.CAM3.0:通过完全无监督的去卷积从批量组织中确定细胞类型组成和表达。
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae107.
2
Bioinformatic Characterization of Whole Blood Neutrophils in Pelvic Inflammatory Disease: A Potential Prognostic Indicator for Transumbilical Single-Port Laparoscopic Pelvic Abscess Surgery.全血中性粒细胞的生物信息学特征在盆腔炎中的研究:经脐单孔腹腔镜盆腔脓肿手术的潜在预后指标。
Comput Math Methods Med. 2022 Apr 1;2022:2555603. doi: 10.1155/2022/2555603. eCollection 2022.