National Genomics Data Center, Beijing 100101, China.
BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
Nucleic Acids Res. 2020 Jan 8;48(D1):D890-D895. doi: 10.1093/nar/gkz840.
The epigenome-wide association study (EWAS) has become an effective strategy for exploring the epigenetic basis of complex traits. Over the past decade, numerous EWAS projects have generated a large amount of epigenetic data, most of it from DNA methylation arrays. We present EWAS Data Hub (https://bigd.big.ac.cn/ewas/datahub), a resource for collecting and normalizing DNA methylation array data and for archiving the associated metadata. The current release integrates DNA methylation array data from 75,344 samples and applies a normalization method to remove batch effects among different datasets. Taking advantage of both this large body of high-quality DNA methylation data and its standardized metadata, EWAS Data Hub provides reference DNA methylation profiles under different contexts, covering 81 tissues/cell types (including 25 brain parts and 25 blood cell types), six ancestry categories, and 67 diseases (including 39 cancers). In summary, EWAS Data Hub holds great promise for aiding the retrieval and discovery of methylation-based biomarkers for phenotype characterization, clinical treatment, and health care.