McCallum Kenneth, Jiang Wenxin, Wang Ji-Ping
Department of Statistics, Northwestern University, Evanston, IL 60208, USA.
Int J Math Comput Sci. 2010;5(2):87-100.
DNA methylation is an important epigenetic phenomenon that is associated with a variety of diseases, particularly cancers. Recent development of high throughput sequencing technology has enabled researchers to investigate the methylation rate at a single nucleotide resolution for any given sample. Testing for methylation rate equality or difference between two samples, however, is challenged by the small sample size observed at many sites across the genome. Fisher's exact test is typically used in this situation; however, it is conservative and it cannot be used to test for specific difference in methylation rate between two samples. In this paper, we propose an empirical Bayes approach that utilizes the genome-wide data as prior information for methylation differentiation between two samples. We show that this new approach is more powerful than Fisher's exact test. In addition, it can be used to test for any specific methylation difference while controlling the false discovery rate (FDR). The new method is applied to a real data set from a colon tumor study.
DNA甲基化是一种重要的表观遗传现象,与多种疾病尤其是癌症相关。高通量测序技术的最新发展使研究人员能够以单核苷酸分辨率研究任何给定样本的甲基化率。然而,检测两个样本之间甲基化率的相等性或差异受到全基因组许多位点样本量小的挑战。在这种情况下通常使用Fisher精确检验;然而,它较为保守,不能用于检验两个样本之间甲基化率的特定差异。在本文中,我们提出一种经验贝叶斯方法,该方法利用全基因组数据作为两个样本之间甲基化差异的先验信息。我们表明这种新方法比Fisher精确检验更有效。此外,它可用于检验任何特定的甲基化差异,同时控制错误发现率(FDR)。新方法应用于一项结肠癌肿瘤研究的真实数据集。