Institute of Human Genetics, Polish Academy of Sciences, Strzeszynska 32, 60-479, Poznan, Poland.
Silesian University of Technology, Akademicka 16, 44-100, Gliwice, Poland.
BMC Genomics. 2020 Oct 12;21(1):706. doi: 10.1186/s12864-020-07092-x.
Epigenetics is one of the factors shaping natural variability observed among human populations. A small proportion of heritable inter-population differences are observed in the context of both the genome-wide methylation level and the methylation status of individual CpG sites. It has been demonstrated that a limited number of carefully selected differentially methylated sites may allow discrimination between main human populations. However, most of the few published results have been performed exclusively on B-lymphocyte cell lines.
The goal of our study was to identify a set of CpG sites sufficient to discriminate between populations of European and Chinese ancestry based on the difference in the DNA methylation profile not only in cell lines but also in primary cell samples. The preliminary selection of CpG sites differentially methylated in these two populations (pop-CpGs) was based on the analysis of two groups of commercially available ethnically-specific B-lymphocyte cell lines, performed using Illumina Infinium Human Methylation 450 BeadChip Array. A subset of 10 pop-CpGs characterized by the best differentiating criteria (|Mdiff| > 1, q < 0.05; lack of the confounding genomic features), and 10 additional CpGs in their immediate vicinity, were further tested using pyrosequencing technology in both B-lymphocyte cell lines and in the primary samples of the peripheral blood representing two analyzed populations. To assess the population-discriminating potential of the selected set of CpGs (further referred to as "composite pop (CEU-CHB)-CpG marker"), three classification methods were applied. The predictive ability of the composite 8-site pop (CEU-CHB)-CpG marker was assessed using 10-fold cross-validation method on two independent sets of samples.
Our results showed that less than 10 pop-CpG sites may distinguish populations of European and Chinese ancestry; importantly, this small composite pop-CpG marker performs well in both lymphoblastoid cell lines and in non-homogenous blood samples regardless of a gender.
表观遗传学是影响人类群体间自然变异的因素之一。在全基因组甲基化水平和个体 CpG 位点甲基化状态方面,都观察到一小部分可遗传的种群间差异。已经证明,少数经过精心挑选的差异甲基化位点可以区分主要人群。然而,大多数已发表的少数结果仅在 B 淋巴细胞系上进行。
我们的研究目的是基于不仅在细胞系中,而且在原代细胞样本中 DNA 甲基化谱的差异,确定一组 CpG 位点,以区分欧洲和中国血统的人群。这两个群体(pop-CpG)中差异甲基化的 CpG 位点的初步选择是基于对两组商业上可用的种族特异性 B 淋巴细胞系的分析,使用 Illumina Infinium Human Methylation 450 BeadChip Array 进行分析。选择具有最佳区分标准的 10 个 pop-CpG(|Mdiff|>1,q<0.05;缺乏混杂的基因组特征),以及它们附近的 10 个额外 CpG,进一步使用焦磷酸测序技术在 B 淋巴细胞系和外周血原代样本中进行测试,代表了两个分析的人群。为了评估所选 CpG 集(进一步称为“复合人群(CEU-CHB)-CpG 标记”)的人群区分潜力,应用了三种分类方法。在两个独立的样本集上,使用 10 倍交叉验证方法评估了复合 8 个位点的人群(CEU-CHB)-CpG 标记的预测能力。
我们的结果表明,少于 10 个 pop-CpG 位点可能区分欧洲和中国血统的人群;重要的是,这个小的复合人群(CEU-CHB)-CpG 标记在淋巴母细胞系和非同质血液样本中表现良好,无论性别如何。