Fazzari Melissa J, Greally John M
Division of Biostatistics, Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA.
Methods Mol Biol. 2010;620:243-65. doi: 10.1007/978-1-60761-580-4_7.
Epigenetics is the study of heritable change other than those encoded in DNA sequence. Cytosine methylation of DNA at CpG dinucleotides is the most well-studied epigenetic phenomenon, although epigenetic changes also encompass non-DNA methylation mechanisms, such as covalent histone modifications, micro-RNA interactions, and chromatin remodeling complexes. Methylation changes, both global and gene specific, have been observed to be associated with disease, particularly in cancer.This chapter begins with a general overview of epigenomics, and then focuses on understanding and analyzing genome-wide cytosine methylation data. There are many microarray-based techniques available to measure cytosine methylation across the genome, as well as gold-standard techniques based on sequencing bisulfite converted DNA, which is used to measure methylation in a smaller, more targeted set of loci. We have provided an overview of many of the current technologies - their advantages, limitations, and recent improvements. Regardless of which technology is used, the goal is to produce a set of methylation measurements that are highly consistent with true methylation levels of the corresponding set of CpG dinucleotides.Identifying all loci with aberrant methylation or hypomethylation in disease, or in natural processes such as aging, requires the comparison of methylation levels across many samples. In such studies, the development of methylation-based diagnostic tools may be of interest, potentially to be used as early disease detection strategies based on a set of sentinel loci. In addition, the identification of loci with potentially reversible methylation events may result in new therapeutic options. Given the vast number of measurable sites, prioritization of candidate loci is an important and complex issue and rests on a foundation of appropriate statistical testing and summarization. Coupled with statistical estimates of importance, the genomic context of each locus measured may offer important information about the mechanisms by which epigenetic changes impact disease and allows us further refinement of candidate loci. We will conclude this chapter by identifying issues in building methylation-based models for prediction and potential directions of further statistical research in epigenetics.
表观遗传学是对DNA序列编码以外的可遗传变化的研究。DNA在CpG二核苷酸处的胞嘧啶甲基化是研究最为深入的表观遗传现象,不过表观遗传变化还包括非DNA甲基化机制,如共价组蛋白修饰、微小RNA相互作用以及染色质重塑复合体。已观察到全局和基因特异性的甲基化变化都与疾病相关,尤其是在癌症中。本章首先对表观基因组学进行总体概述,然后着重于理解和分析全基因组胞嘧啶甲基化数据。有许多基于微阵列的技术可用于测量全基因组的胞嘧啶甲基化,还有基于对亚硫酸氢盐转化后的DNA进行测序的金标准技术,该技术用于测量较小的、更具针对性的一组位点中的甲基化情况。我们概述了许多当前技术——它们的优点、局限性以及近期的改进。无论使用哪种技术,目标都是生成一组与相应CpG二核苷酸的真实甲基化水平高度一致的甲基化测量值。要确定疾病或衰老等自然过程中所有存在异常甲基化或低甲基化的位点,需要比较多个样本的甲基化水平。在此类研究中,基于甲基化的诊断工具的开发可能会受到关注,有可能用作基于一组哨兵位点的早期疾病检测策略。此外,识别具有潜在可逆甲基化事件的位点可能会带来新的治疗选择。鉴于可测量位点数量众多,候选位点的优先级排序是一个重要且复杂的问题,其基础是适当的统计检验和汇总。结合重要性的统计估计,所测量每个位点的基因组背景可能会提供有关表观遗传变化影响疾病的机制的重要信息,并使我们能够进一步优化候选位点。本章将通过确定构建基于甲基化的预测模型中的问题以及表观遗传学进一步统计研究的潜在方向来结束。