Sahoo Karishma, Sundararajan Vino
Integrative Multiomics Lab, School of Bio Sciences and Technology, Vellore Institute of Technology, Vellore, Tamil Nadu 632014, India.
Comput Struct Biotechnol J. 2024 May 17;23:2304-2325. doi: 10.1016/j.csbj.2024.05.015. eCollection 2024 Dec.
Understanding the intricate relationships between gene expression levels and epigenetic modifications in a genome is crucial to comprehending the pathogenic mechanisms of many diseases. With the advancement of DNA Methylome Profiling techniques, the emphasis on identifying Differentially Methylated Regions (DMRs/DMGs) has become crucial for biomarker discovery, offering new insights into the etiology of illnesses. This review surveys the current state of computational tools/algorithms for the analysis of microarray-based DNA methylation profiling datasets, focusing on key concepts underlying the diagnostic/prognostic CpG site extraction. It addresses methodological frameworks, algorithms, and pipelines employed by various authors, serving as a roadmap to address challenges and understand changing trends in the methodologies for analyzing array-based DNA methylation profiling datasets derived from diseased genomes. Additionally, it highlights the importance of integrating gene expression and methylation datasets for accurate biomarker identification, explores prognostic prediction models, and discusses molecular subtyping for disease classification. The review also emphasizes the contributions of machine learning, neural networks, and data mining to enhance diagnostic workflow development, thereby improving accuracy, precision, and robustness.
了解基因组中基因表达水平与表观遗传修饰之间的复杂关系对于理解许多疾病的致病机制至关重要。随着DNA甲基化组分析技术的进步,识别差异甲基化区域(DMRs/DMGs)对于生物标志物发现变得至关重要,为疾病的病因学提供了新的见解。本综述调查了用于分析基于微阵列的DNA甲基化分析数据集的计算工具/算法的现状,重点关注诊断/预后CpG位点提取的关键概念。它阐述了不同作者采用的方法框架、算法和流程,为应对挑战和理解分析来自患病基因组的基于阵列的DNA甲基化分析数据集的方法的变化趋势提供了路线图。此外,它强调了整合基因表达和甲基化数据集以准确识别生物标志物的重要性,探索了预后预测模型,并讨论了疾病分类的分子亚型。该综述还强调了机器学习、神经网络和数据挖掘对加强诊断工作流程开发的贡献,从而提高准确性、精确性和稳健性。