Department of Molecular Epidemiology, Leiden University Medical Center, 2333 ZC Leiden, the Netherlands.
Bioinformatics. 2014 Dec 1;30(23):3435-7. doi: 10.1093/bioinformatics/btu566. Epub 2014 Aug 21.
The Illumina 450k array is a frequently used platform for large-scale genome-wide DNA methylation studies, i.e. epigenome-wide association studies. Currently, quality control of 450k data can be performed with Illumina's GenomeStudio and is part of a limited number of 450k analysis pipelines. However, GenomeStudio cannot handle large-scale studies, and existing pipelines provide limited options for quality control and do not support interactive exploration by the user. To make the detection of bad-quality samples in large-scale genome-wide DNA methylation studies as flexible and transparent as possible, we have developed MethylAid, a visual and interactive Web application built with RStudio's shiny package. Bad-quality samples are detected using sample-dependent and sample-independent quality control probes present on the array, together with user-adjustable thresholds. In-depth exploration of bad-quality samples can be performed using several interactive diagnostic plots. Furthermore, plots can be annotated with user-provided metadata, for example, to identify outlying batches. Our new tool makes quality assessment of 450k array data interactive, flexible and efficient and is, therefore, expected to be useful for both data analysts and core facilities.
MethylAid is implemented as an R/Bioconductor package (www.bioconductor.org/packages/3.0/bioc/html/MethylAid.html). A demo application is available from shiny.bioexp.nl/MethylAid.
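The threshold-based detection described in the abstract can be illustrated with a minimal sketch. This is not MethylAid's code or API: the metric names, the cutoff values and the `flag_bad_samples` helper are all hypothetical, chosen only to show how per-sample quality control summaries might be compared against user-adjustable thresholds.

```python
from typing import Dict, List

# Hypothetical QC metrics and cutoffs, for illustration only.
# They are NOT MethylAid's defaults; a user would adjust them per study.
THRESHOLDS: Dict[str, float] = {
    "median_intensity": 10.0,
    "bisulfite_conversion": 12.0,
    "detection_rate": 0.95,
}


def flag_bad_samples(
    samples: Dict[str, Dict[str, float]],
    thresholds: Dict[str, float],
) -> Dict[str, List[str]]:
    """Return, for each flagged sample, the metrics that fall below their cutoff.

    A metric missing from a sample is treated as a failure, so that
    incomplete QC summaries are not passed silently.
    """
    flagged: Dict[str, List[str]] = {}
    for sample_id, metrics in samples.items():
        failed = [
            name
            for name, cutoff in thresholds.items()
            if metrics.get(name, float("-inf")) < cutoff
        ]
        if failed:
            flagged[sample_id] = failed
    return flagged
```

Keeping the thresholds in a plain mapping passed as an argument mirrors the idea of user-adjustable cutoffs: tightening or relaxing one value changes which samples are flagged without touching the detection logic.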