Mostafavi Hakhamanesh, Berisa Tomaz, Day Felix R, Perry John R B, Przeworski Molly, Pickrell Joseph K
Department of Biological Sciences, Columbia University, New York, New York, United States of America.
New York Genome Center, New York, New York, United States of America.
PLoS Biol. 2017 Sep 5;15(9):e2002458. doi: 10.1371/journal.pbio.2002458. eCollection 2017 Sep.
A number of open questions in human evolutionary genetics would become tractable if we were able to directly measure evolutionary fitness. As a step towards this goal, we developed a method to examine whether individual genetic variants, or sets of genetic variants, currently influence viability. The approach consists in testing whether the frequency of an allele varies across ages, accounting for variation in ancestry. We applied it to the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort and to the parents of participants in the UK Biobank. Across the genome, we found only a few common variants with large effects on age-specific mortality: tagging the APOE ε4 allele and near CHRNA3. These results suggest that when large, even late-onset effects are kept at low frequency by purifying selection. Testing viability effects of sets of genetic variants that jointly influence 1 of 42 traits, we detected a number of strong signals. In participants of the UK Biobank of British ancestry, we found that variants that delay puberty timing are associated with a longer parental life span (P6.2 × 10-6 for fathers and P2.0 × 10-3 for mothers), consistent with epidemiological studies. Similarly, variants associated with later age at first birth are associated with a longer maternal life span (P~1.4 × 10-3). Signals are also observed for variants influencing cholesterol levels, risk of coronary artery disease (CAD), body mass index, as well as risk of asthma. These signals exhibit consistent effects in the GERA cohort and among participants of the UK Biobank of non-British ancestry. We also found marked differences between males and females, most notably at the CHRNA3 locus, and variants associated with risk of CAD and cholesterol levels. Beyond our findings, the analysis serves as a proof of principle for how upcoming biomedical data sets can be used to learn about selection effects in contemporary humans.
如果我们能够直接测量进化适应性,人类进化遗传学中的许多开放性问题将变得易于处理。作为朝着这个目标迈出的一步,我们开发了一种方法来检验单个基因变异或基因变异集目前是否影响生存能力。该方法包括测试等位基因频率是否随年龄变化,并考虑祖先的差异。我们将其应用于成人健康与衰老遗传流行病学研究(GERA)队列以及英国生物银行参与者的父母。在整个基因组中,我们仅发现少数对特定年龄死亡率有重大影响的常见变异:标记APOE ε4等位基因以及靠近CHRNA3的区域。这些结果表明,即使是迟发性的大效应,在纯化选择作用下也会保持低频。在测试共同影响42个性状之一的基因变异集的生存能力效应时,我们检测到了许多强信号。在具有英国血统的英国生物银行参与者中,我们发现延迟青春期时间的变异与父母更长的寿命相关(父亲P6.2×10⁻⁶,母亲P2.0×10⁻³),这与流行病学研究一致。同样,与首次生育年龄较晚相关的变异与母亲更长的寿命相关(P~1.4×10⁻³)。对于影响胆固醇水平、冠状动脉疾病(CAD)风险、体重指数以及哮喘风险的变异也观察到了信号。这些信号在GERA队列以及非英国血统的英国生物银行参与者中表现出一致的效应。我们还发现男性和女性之间存在显著差异,最明显的是在CHRNA3基因座,以及与CAD风险和胆固醇水平相关的变异。除了我们的研究结果之外,该分析还为如何利用即将到来的生物医学数据集了解当代人类的选择效应提供了原理证明。