Genomic Medicine Laboratory-UILDM, Santa Lucia Foundation IRCCS, 00179 Rome, Italy.
Department of Biomedicine and Prevention, Tor Vergata University, 00133 Rome, Italy.
Cells. 2022 Dec 18;11(24):4114. doi: 10.3390/cells11244114.
The study describes a protocol for methylation analysis integrated with Machine Learning (ML) algorithms developed to classify Facio-Scapulo-Humeral Dystrophy (FSHD) subjects. The DNA methylation levels of two regions (DR1 and -PAS) were assessed by an in-house protocol based on bisulfite sequencing and capillary electrophoresis, followed by statistical and ML analyses. The study involved two independent cohorts, namely a training group of 133 patients with clinical signs of FSHD and 150 healthy controls (CTRL) and a testing set of 27 FSHD patients and 25 CTRL. As expected, FSHD patients showed significantly reduced methylation levels compared to CTRL. We utilized single CpG sites to develop a ML pipeline able to discriminate FSHD subjects. The model identified four CpGs sites as the most relevant for the discrimination of FSHD subjects and showed high metrics values (accuracy: 0.94, sensitivity: 0.93, specificity: 0.96). Two additional models were developed to differentiate patients with lower size and patients who might carry pathogenic variants in FSHD genes, respectively. Overall, the present model enables an accurate classification of FSHD patients, providing additional evidence for DNA methylation as a powerful disease biomarker that could be employed for prioritizing subjects to be tested for FSHD.
本研究描述了一种整合机器学习(ML)算法的甲基化分析方案,该方案旨在对 Facio-Scapulo-Humeral Dystrophy(FSHD)患者进行分类。通过基于亚硫酸氢盐测序和毛细管电泳的内部方案评估了两个区域(DR1 和-PAS)的 DNA 甲基化水平,随后进行了统计和 ML 分析。该研究涉及两个独立的队列,即 133 名具有 FSHD 临床体征的患者和 150 名健康对照(CTRL)的训练组,以及 27 名 FSHD 患者和 25 名 CTRL 的测试组。正如预期的那样,与 CTRL 相比,FSHD 患者的甲基化水平明显降低。我们利用单个 CpG 位点开发了一种能够区分 FSHD 患者的 ML 管道。该模型确定了四个 CpG 位点作为区分 FSHD 患者的最相关位点,并显示出较高的指标值(准确性:0.94、敏感性:0.93、特异性:0.96)。另外开发了两个模型,分别用于区分较小大小的患者和可能携带 FSHD 基因致病性变异的患者。总体而言,目前的模型能够对 FSHD 患者进行准确分类,为 DNA 甲基化作为一种强大的疾病生物标志物提供了额外的证据,该标志物可用于优先选择需要进行 FSHD 检测的患者。