Department of Medical Genetics, University of British Columbia, C201-4500 Oak Street, Vancouver, BC, V6H 3N1, Canada.
BC Children's Hospital Research Institute, 938 W 28th Ave, Vancouver, BC, V5Z 4H4, Canada.
Epigenetics Chromatin. 2019 Aug 9;12(1):51. doi: 10.1186/s13072-019-0296-3.
The influence of genetics on variation in DNA methylation (DNAme) is well documented. Yet confounding from population stratification is often unaccounted for in DNAme association studies. Existing approaches to address confounding by population stratification using DNAme data may not generalize to populations or tissues outside those in which they were developed. To aid future placental DNAme studies in assessing population stratification, we developed an ethnicity classifier, PlaNET (Placental DNAme Elastic Net Ethnicity Tool), using five cohorts with Infinium Human Methylation 450k BeadChip array (HM450k) data from placental samples that is also compatible with the newer EPIC platform.
Data from 509 placental samples were used to develop PlaNET and show that it accurately predicts (accuracy = 0.938, kappa = 0.823) major classes of self-reported ethnicity/race (African: n = 58, Asian: n = 53, Caucasian: n = 389), and produces ethnicity probabilities that are highly correlated with genetic ancestry inferred from genome-wide SNP arrays (> 2.5 million SNP) and ancestry informative markers (n = 50 SNPs). PlaNET's ethnicity classification relies on 1860 HM450K microarray sites, and over half of these were linked to nearby genetic polymorphisms (n = 955). Our placental-optimized method outperforms existing approaches in assessing population stratification in placental samples from individuals of Asian, African, and Caucasian ethnicities.
PlaNET provides an improved approach to address population stratification in placental DNAme association studies. The method can be applied to predict ethnicity as a discrete or continuous variable and will be especially useful when self-reported ethnicity information is missing and genotyping markers are unavailable.
遗传学对 DNA 甲基化(DNAme)变异的影响已有充分记录。然而,在 DNAme 关联研究中,人群分层造成的混杂因素通常未被考虑。现有的使用 DNAme 数据解决人群分层混杂问题的方法可能无法推广到开发这些方法以外的人群或组织。为了帮助未来的胎盘 DNAme 研究评估人群分层,我们开发了一种基于五个队列的胎盘样本中 Infinium Human Methylation 450k BeadChip 阵列(HM450k)数据的种族分类器 PlaNET(胎盘 DNAme 弹性网种族工具),该工具也与较新的 EPIC 平台兼容。
使用 509 个胎盘样本的数据来开发 PlaNET,并表明它可以准确预测(准确性=0.938,kappa=0.823)自我报告的种族/种族的主要类别(非洲:n=58,亚洲:n=53,白种人:n=389),并产生与从全基因组 SNP 阵列推断出的遗传祖先高度相关的种族概率(>250 万个 SNP)和遗传标记(n=50 SNP)。PlaNET 的种族分类依赖于 1860 个 HM450K 微阵列位点,其中一半以上与附近的遗传多态性(n=955)相关。我们的胎盘优化方法在评估亚洲、非洲和白种人个体的胎盘样本中的人群分层方面优于现有的方法。
PlaNET 为解决胎盘 DNAme 关联研究中的人群分层问题提供了一种改进的方法。该方法可用于预测种族作为离散或连续变量,当自我报告的种族信息缺失且基因分型标记不可用时,将特别有用。