从复杂组织中完全反卷积 DNA 甲基化信号：一种几何方法。

Complete deconvolution of DNA methylation signals from complex tissues: a geometric approach.

机构信息

School of Science, East China University of Technology, Nanchang, Jiangxi 330013, China.

Department of Biostatistics and Bioinformatics, Emory University, Atlanta, GA 30322, USA.

出版信息

Bioinformatics. 2021 May 23;37(8):1052-1059. doi: 10.1093/bioinformatics/btaa930.

DOI:10.1093/bioinformatics/btaa930

PMID:33135072

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8150138/

Abstract

MOTIVATION

It is a common practice in epigenetics research to profile DNA methylation on tissue samples, which is usually a mixture of different cell types. To properly account for the mixture, estimating cell compositions has been recognized as an important first step. Many methods were developed for quantifying cell compositions from DNA methylation data, but they mostly have limited applications due to lack of reference or prior information.

RESULTS

We develop Tsisal, a novel complete deconvolution method which accurately estimate cell compositions from DNA methylation data without any prior knowledge of cell types or their proportions. Tsisal is a full pipeline to estimate number of cell types, cell compositions and identify cell-type-specific CpG sites. It can also assign cell type labels when (full or part of) reference panel is available. Extensive simulation studies and analyses of seven real datasets demonstrate the favorable performance of our proposed method compared with existing deconvolution methods serving similar purpose.

AVAILABILITY AND IMPLEMENTATION

The proposed method Tsisal is implemented as part of the R/Bioconductor package TOAST at https://bioconductor.org/packages/TOAST.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

在表观遗传学研究中，对组织样本中的 DNA 甲基化进行分析是一种常见做法，而这些组织样本通常是不同细胞类型的混合物。为了正确解释这种混合物，估计细胞组成被认为是重要的第一步。已经开发出许多从 DNA 甲基化数据中定量细胞组成的方法，但由于缺乏参考或先验信息，它们大多应用有限。

结果

我们开发了 Tsisal，这是一种新颖的完整去卷积方法，它可以在没有任何关于细胞类型或其比例的先验知识的情况下，从 DNA 甲基化数据中准确估计细胞组成。Tsisal 是一个完整的管道，用于估计细胞类型的数量、细胞组成和识别细胞类型特异性 CpG 位点。当有（全部或部分）参考面板时，它还可以分配细胞类型标签。对七个真实数据集的广泛模拟研究和分析表明，与服务于类似目的的现有去卷积方法相比，我们提出的方法具有更好的性能。

可用性和实现

所提出的方法 Tsisal 作为 R/Bioconductor 包 TOAST 的一部分实现，可在 https://bioconductor.org/packages/TOAST 上获取。

补充信息

补充数据可在 Bioinformatics 在线获取。

相似文献

Complete deconvolution of DNA methylation signals from complex tissues: a geometric approach.从复杂组织中完全反卷积 DNA 甲基化信号：一种几何方法。

Bioinformatics. 2021 May 23;37(8):1052-1059. doi: 10.1093/bioinformatics/btaa930.

TOAST: improving reference-free cell composition estimation by cross-cell type differential analysis.TOAST：通过跨细胞类型差异分析改善无参考细胞成分估计。

Genome Biol. 2019 Sep 4;20(1):190. doi: 10.1186/s13059-019-1778-0.

Robust partial reference-free cell composition estimation from tissue expression.从组织表达中稳健的无参考局部细胞成分估计

Bioinformatics. 2020 Jun 1;36(11):3431-3438. doi: 10.1093/bioinformatics/btaa184.

Dissecting differential signals in high-throughput data from complex tissues.解析复杂组织高通量数据中的差异信号。

Bioinformatics. 2019 Oct 15;35(20):3898-3905. doi: 10.1093/bioinformatics/btz196.

debCAM: a bioconductor R package for fully unsupervised deconvolution of complex tissues.debCAM：一个用于复杂组织完全无监督去卷积的 Bioconductor R 包。

Bioinformatics. 2020 Jun 1;36(12):3927-3929. doi: 10.1093/bioinformatics/btaa205.

BPRMeth: a flexible Bioconductor package for modelling methylation profiles.BPRMeth：一个用于建模甲基化谱的灵活的 Bioconductor 包。

Bioinformatics. 2018 Jul 15;34(14):2485-2486. doi: 10.1093/bioinformatics/bty129.

Guidelines for cell-type heterogeneity quantification based on a comparative analysis of reference-free DNA methylation deconvolution software.基于无参 DNA 甲基化去卷积软件比较分析的细胞类型异质性定量指南。

BMC Bioinformatics. 2020 Jan 13;21(1):16. doi: 10.1186/s12859-019-3307-2.

Comparison of different cell type correction methods for genome-scale epigenetics studies.用于基因组规模表观遗传学研究的不同细胞类型校正方法的比较。

BMC Bioinformatics. 2017 Apr 14;18(1):216. doi: 10.1186/s12859-017-1611-2.

Reference-free cell mixture adjustments in analysis of DNA methylation data.无参考细胞混合物调整在 DNA 甲基化数据分析中的应用。

Bioinformatics. 2014 May 15;30(10):1431-9. doi: 10.1093/bioinformatics/btu029. Epub 2014 Jan 21.

Reference-free deconvolution, visualization and interpretation of complex DNA methylation data using DecompPipeline, MeDeCom and FactorViz.无参考解卷积、可视化和解释复杂 DNA 甲基化数据的方法：DecompPipeline、MeDeCom 和 FactorViz

Nat Protoc. 2020 Oct;15(10):3240-3263. doi: 10.1038/s41596-020-0369-6. Epub 2020 Sep 25.

引用本文的文献

Reference-free deconvolution of complex samples based on cross-cell-type differential analysis: Systematic evaluations with various feature selection options.基于跨细胞类型差异分析的复杂样本无参考去卷积：使用各种特征选择选项的系统评估

Front Genet. 2025 May 30;16:1570781. doi: 10.3389/fgene.2025.1570781. eCollection 2025.

Examining cellular heterogeneity in human DNA methylation studies: Overview and recommendations.人类DNA甲基化研究中的细胞异质性检测：综述与建议

STAR Protoc. 2025 Mar 21;6(1):103638. doi: 10.1016/j.xpro.2025.103638. Epub 2025 Feb 12.

A Multicellular In Vitro Model of the Human Intestine with Immunocompetent Features Highlights Host-Pathogen Interactions During Early Salmonella Typhimurium Infection.一种具有免疫活性特征的人肠道多细胞体外模型突出了鼠伤寒沙门氏菌早期感染期间的宿主-病原体相互作用。

Adv Sci (Weinh). 2025 Mar;12(9):e2411233. doi: 10.1002/advs.202411233. Epub 2025 Jan 14.

Scalable Screening of Ternary-Code DNA Methylation Dynamics Associated with Human Traits.与人类性状相关的三元编码DNA甲基化动力学的可扩展筛选

bioRxiv. 2025 Feb 11:2024.05.17.594606. doi: 10.1101/2024.05.17.594606.

Computational deconvolution of DNA methylation data from mixed DNA samples.混合 DNA 样本中 DNA 甲基化数据的计算去卷积。

Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae234.

Validating reference-based algorithms to determine cell-type heterogeneity in ovarian cancer DNA methylation studies.验证基于参考的算法以确定卵巢癌 DNA 甲基化研究中的细胞类型异质性。

Sci Rep. 2024 May 14;14(1):11048. doi: 10.1038/s41598-024-61857-y.

CAM3.0: determining cell type composition and expression from bulk tissues with fully unsupervised deconvolution.CAM3.0：通过完全无监督的去卷积从批量组织中确定细胞类型组成和表达。

Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae107.

Random field modeling of multi-trait multi-locus association for detecting methylation quantitative trait loci.多性状多位点关联的随机区域建模用于检测甲基化数量性状基因座。

Bioinformatics. 2022 Aug 10;38(16):3853-3862. doi: 10.1093/bioinformatics/btac443.

本文引用的文献

A probabilistic gene expression barcode for annotation of cell types from single-cell RNA-seq data.一种基于概率的基因表达条码，用于注释单细胞 RNA-seq 数据中的细胞类型。

Biostatistics. 2022 Oct 14;23(4):1150-1164. doi: 10.1093/biostatistics/kxac021.

Robust partial reference-free cell composition estimation from tissue expression.从组织表达中稳健的无参考局部细胞成分估计

Bioinformatics. 2020 Jun 1;36(11):3431-3438. doi: 10.1093/bioinformatics/btaa184.

TOAST: improving reference-free cell composition estimation by cross-cell type differential analysis.TOAST：通过跨细胞类型差异分析改善无参考细胞成分估计。

Genome Biol. 2019 Sep 4;20(1):190. doi: 10.1186/s13059-019-1778-0.

Role of coenzymes in cancer metabolism.辅酶在癌症代谢中的作用。

Semin Cell Dev Biol. 2020 Feb;98:44-53. doi: 10.1016/j.semcdb.2019.05.027. Epub 2019 Jun 19.

Complete deconvolution of cellular mixtures based on linearity of transcriptional signatures.基于转录特征的线性关系实现细胞混合物的完全去卷积。

Nat Commun. 2019 May 17;10(1):2209. doi: 10.1038/s41467-019-09990-5.

Immune infiltration in renal cell carcinoma.肾细胞癌中的免疫浸润。

Cancer Sci. 2019 May;110(5):1564-1572. doi: 10.1111/cas.13996. Epub 2019 Apr 7.

Bulk tissue cell type deconvolution with multi-subject single-cell expression reference.基于多主体单细胞表达参考的组织细胞类型去卷积。

Nat Commun. 2019 Jan 22;10(1):380. doi: 10.1038/s41467-018-08023-x.

BayesCCE: a Bayesian framework for estimating cell-type composition from DNA methylation without the need for methylation reference.BayesCCE：一种贝叶斯框架，用于在无需甲基化参考的情况下从 DNA 甲基化数据中估计细胞类型组成。

Genome Biol. 2018 Sep 21;19(1):141. doi: 10.1186/s13059-018-1513-2.

A comparison of reference-based algorithms for correcting cell-type heterogeneity in Epigenome-Wide Association Studies.表观基因组全关联研究中用于校正细胞类型异质性的基于参考的算法比较。

BMC Bioinformatics. 2017 Feb 13;18(1):105. doi: 10.1186/s12859-017-1511-5.

Estimating and accounting for tumor purity in the analysis of DNA methylation data from cancer studies.在癌症研究的DNA甲基化数据分析中估计并考量肿瘤纯度。

Genome Biol. 2017 Jan 25;18(1):17. doi: 10.1186/s13059-016-1143-5.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验