Genetics and Genome Biology, Hospital for Sick Children, Toronto, ON M5G 1X8, Canada.
Genetics and Genome Biology, Hospital for Sick Children, Toronto, ON M5G 1X8, Canada; Division of Clinical and Metabolic Genetics, Hospital for Sick Children, Toronto, ON M5G 1X8, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON M5S, Canada.
Am J Hum Genet. 2021 Aug 5;108(8):1359-1366. doi: 10.1016/j.ajhg.2021.06.015. Epub 2021 Jul 22.
DNA methylation (DNAm) signatures are unique patterns of DNAm alterations defined for rare disorders caused by pathogenic variants in epigenetic regulatory genes. The potential of DNAm signatures (also known as "episignatures") is just beginning to emerge as there are >300 known epigenetic regulatory genes, ∼100 of which are linked to neurodevelopmental disorders. To date, approximately 50 signatures have been identified, which have proven unexpectedly successful as predictive tools for classifying variants of uncertain significance as pathogenic or benign. The molecular basis of these signatures is poorly understood. Furthermore, their relationships to primary disease pathophysiology have yet to be adequately investigated, despite clear demonstrations of potential connections. There are currently no published guidelines for signature development. As signatures are highly dependent on the samples and methods used to derive them, we propose a framework for consideration in signature development including sample size, statistical parameters, cell type of origin, and the value of detailed clinical and molecular information. We illustrate the relationship between signature output/efficacy and sample size by generating and testing 837 DNAm signatures of Kleefstra syndrome using downsampling analysis. Our findings highlight that no single DNAm signature encompasses all DNAm alterations present in a rare disorder, and that a substandard study design can generate a DNAm signature that misclassifies variants. Finally, we discuss the importance of further investigating DNAm signatures to inform disease pathophysiology and broaden their scope as a functional assay.
DNA 甲基化(DNAm)特征是由表观遗传调控基因中的致病性变异引起的罕见疾病的独特 DNAm 改变模式。DNAm 特征(也称为“表观遗传特征”)的潜力才刚刚开始显现,因为已知有超过 300 个表观遗传调控基因,其中约 100 个与神经发育障碍有关。迄今为止,已经鉴定了大约 50 个特征,这些特征作为预测工具,出乎意料地成功地将意义不明的变异分类为致病性或良性。这些特征的分子基础还了解甚少。此外,尽管已经清楚地证明了它们之间存在潜在的联系,但它们与主要疾病发病机制的关系尚未得到充分研究。目前还没有关于特征开发的指南。由于特征高度依赖于用于推导它们的样本和方法,我们提出了一个特征开发的框架,包括样本量、统计参数、起源细胞类型以及详细临床和分子信息的价值。我们通过使用降采样分析生成和测试 Kleefstra 综合征的 837 个 DNAm 特征来阐明特征输出/功效与样本量之间的关系。我们的研究结果表明,没有单个 DNAm 特征包含罕见疾病中存在的所有 DNAm 改变,并且不良的研究设计可以生成错误分类变异的 DNAm 特征。最后,我们讨论了进一步研究 DNAm 特征以了解疾病发病机制并拓宽其作为功能测定的范围的重要性。