Yassi Maryam, Shams Davodly Ehsan, Hajebi Khaniki Saeedeh, Kerachian Mohammad Amin
Cancer Genetics Research Unit, Reza Radiotherapy and Oncology Center, Mashhad 9184156815, Iran.
Department of Mathematics and Statistics, University of Otago, Dunedin 9054, New Zealand.
J Pers Med. 2024 Mar 29;14(4):361. doi: 10.3390/jpm14040361.
DNA methylation is a key epigenetic modification involved in gene regulation, contributing to both physiological and pathological conditions. For a more profound comprehension, it is essential to conduct a precise comparison of DNA methylation patterns between sample groups that represent distinct statuses. Analysis of differentially methylated regions (DMRs) using computational approaches can help uncover the precise relationships between these phenomena. This paper describes a hybrid model that combines the beta-binomial Bayesian hierarchical model with a combination of ranking methods known as HBCR_DMR. During the initial phase, we model the actual methylation proportions of the CpG sites (CpGs) within the replicates. This modeling is achieved through beta-binomial distribution, with parameters set by a group mean and a dispersion parameter. During the second stage, we establish the selection of distinguishing CpG sites based on their methylation status, employing multiple ranking techniques. Finally, we combine the ranking lists of differentially methylated CpG sites through a voting system. Our analyses, encompassing simulations and real data, reveal outstanding performance metrics, including a sensitivity of 0.72, specificity of 0.89, and an F1 score of 0.76, yielding an overall accuracy of 0.82 and an AUC of 0.94. These findings underscore HBCR_DMR's robust capacity to distinguish methylated regions, confirming its utility as a valuable tool for DNA methylation analysis.
DNA甲基化是一种参与基因调控的关键表观遗传修饰,在生理和病理状况中均发挥作用。为了更深入地理解,有必要对代表不同状态的样本组之间的DNA甲基化模式进行精确比较。使用计算方法分析差异甲基化区域(DMR)有助于揭示这些现象之间的精确关系。本文描述了一种混合模型,该模型将贝塔二项式贝叶斯分层模型与一种称为HBCR_DMR的排序方法组合相结合。在初始阶段,我们对重复样本中CpG位点(CpGs)的实际甲基化比例进行建模。这种建模是通过贝塔二项式分布实现的,其参数由组均值和离散参数设置。在第二阶段,我们根据CpG位点的甲基化状态,采用多种排序技术来确定有区别的CpG位点。最后,我们通过投票系统将差异甲基化CpG位点的排序列表进行合并。我们涵盖模拟和真实数据的分析揭示了出色的性能指标,包括灵敏度为0.72、特异性为0.89、F1分数为0.76,总体准确率为0.82,曲线下面积(AUC)为0.94。这些发现强调了HBCR_DMR区分甲基化区域的强大能力,证实了其作为DNA甲基化分析的宝贵工具的实用性。