Department of Physical Chemistry, Tel Aviv University, Tel Aviv 6997801, Israel.
Edmond J. Safra Center for Bioinformatics, Tel Aviv University, Tel Aviv 6997801, Israel.
Bioinformatics. 2021 Jul 12;37(Suppl_1):i327-i333. doi: 10.1093/bioinformatics/btab306.
While promoter methylation is associated with reinforcing fundamental tissue identities, the methylation status of distant enhancers was shown by genome-wide association studies to be a powerful determinant of cell-state and cancer. With recent availability of long reads that report on the methylation status of enhancer-promoter pairs on the same molecule, we hypothesized that probing these pairs on the single-molecule level may serve the basis for detection of rare cancerous transformations in a given cell population. We explore various analysis approaches for deconvolving cell-type mixtures based on their genome-wide enhancer-promoter methylation profiles.
To evaluate our hypothesis we examine long-read optical methylome data for the GM12878 cell line and myoblast cell lines from two donors. We identified over 100 000 enhancer-promoter pairs that co-exist on at least 30 individual DNA molecules. We developed a detailed methodology for mixture deconvolution and applied it to estimate the proportional cell compositions in synthetic mixtures. Analysis of promoter methylation, as well as enhancer-promoter pairwise methylation, resulted in very accurate estimates. In addition, we show that pairwise methylation analysis can be generalized from deconvolving different cell types to subtle scenarios where one wishes to resolve different cell populations of the same cell-type.
The code used in this work to analyze single-molecule Bionano Genomics optical maps is available via the GitHub repository https://github.com/ebensteinLab/Single_molecule_methylation_in_EP.
虽然启动子甲基化与强化基本组织身份有关,但全基因组关联研究表明,远距离增强子的甲基化状态是细胞状态和癌症的一个强有力决定因素。最近,长读长技术的出现可以报告同一分子上增强子-启动子对的甲基化状态,我们假设在单细胞水平上探测这些对可能为检测给定细胞群体中的罕见癌变提供基础。我们探索了各种基于全基因组增强子-启动子甲基化谱来解析细胞混合物的分析方法。
为了评估我们的假设,我们检查了 GM12878 细胞系和来自两位供体的成肌细胞系的长读光学甲基组数据。我们鉴定了超过 100000 个至少在 30 个单个 DNA 分子上共存的增强子-启动子对。我们开发了一种详细的混合物反卷积方法,并将其应用于估计合成混合物中的比例细胞组成。启动子甲基化以及增强子-启动子对的甲基化分析都产生了非常准确的估计。此外,我们表明,对甲基化的成对分析可以从不同细胞类型的反卷积推广到希望解析同一细胞类型的不同细胞群体的微妙情况。
本工作中用于分析单分子 Bionano Genomics 光学图谱的代码可通过 GitHub 存储库 https://github.com/ebensteinLab/Single_molecule_methylation_in_EP 获得。