Complete deconvolution of DNA methylation signals from complex tissues: a geometric approach.

Author information

School of Science, East China University of Technology, Nanchang, Jiangxi 330013, China.

Department of Biostatistics and Bioinformatics, Emory University, Atlanta, GA 30322, USA.

Publication information

Bioinformatics. 2021 May 23;37(8):1052-1059. doi: 10.1093/bioinformatics/btaa930.

Abstract

MOTIVATION

It is common practice in epigenetics research to profile DNA methylation in tissue samples, which are usually mixtures of different cell types. To properly account for this mixture, estimating cell composition is recognized as an important first step. Many methods have been developed for quantifying cell composition from DNA methylation data, but most have limited applicability because the reference panels or prior information they rely on are often unavailable.
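To make the mixture idea concrete, the sketch below assumes the usual linear mixing model, in which the bulk methylation level at each CpG site is the proportion-weighted average of the pure cell-type levels. It covers only the simple reference-based case with two cell types, where the proportion has a closed-form least-squares solution. It is not the Tsisal algorithm, and the reference values and function names are hypothetical and chosen only for illustration.

```python
# Hypothetical reference methylation levels (beta values in [0, 1]) at five
# CpG sites for two pure cell types; these numbers are made up for illustration.
ref_type_a = [0.90, 0.10, 0.80, 0.20, 0.50]
ref_type_b = [0.10, 0.85, 0.30, 0.70, 0.50]


def mix(prop_a, ref_a, ref_b):
    """Bulk signal of a two-cell-type mixture under the linear mixing model."""
    return [prop_a * a + (1.0 - prop_a) * b for a, b in zip(ref_a, ref_b)]


def estimate_prop_a(bulk, ref_a, ref_b):
    """Least-squares estimate of the type-A proportion, clipped to [0, 1].

    From bulk = p * ref_a + (1 - p) * ref_b it follows that
    bulk - ref_b = p * (ref_a - ref_b), a one-parameter regression.
    """
    diffs = [a - b for a, b in zip(ref_a, ref_b)]
    num = sum((y - b) * d for y, b, d in zip(bulk, ref_b, diffs))
    den = sum(d * d for d in diffs)
    return min(1.0, max(0.0, num / den))


# Simulate a noise-free bulk sample with 30% type A, then recover the proportion.
bulk_sample = mix(0.3, ref_type_a, ref_type_b)
estimated = estimate_prop_a(bulk_sample, ref_type_a, ref_type_b)
print(round(estimated, 6))
```

The fifth site has the same level in both cell types, so it contributes nothing to the estimate. This is why sites that differ between cell types carry the information about composition. When no reference profiles are available, both the profiles and the proportions have to be estimated from the bulk data alone, which is the harder setting this paper addresses.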

RESULTS

We develop Tsisal, a novel complete deconvolution method that accurately estimates cell compositions from DNA methylation data without any prior knowledge of cell types or their proportions. Tsisal is a full pipeline that estimates the number of cell types, estimates cell compositions and identifies cell-type-specific CpG sites. It can also assign cell type labels when a full or partial reference panel is available. Extensive simulation studies and analyses of seven real datasets demonstrate the favorable performance of the proposed method compared with existing deconvolution methods serving a similar purpose.

AVAILABILITY AND IMPLEMENTATION

The proposed method Tsisal is implemented as part of the R/Bioconductor package TOAST at https://bioconductor.org/packages/TOAST.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
