Maher M Cyrus, Uricchio Lawrence H, Torgerson Dara G, Hernandez Ryan D
Department of Epidemiology and Biostatistics, University of California, San Francisco, CA 94158, USA.
Hum Hered. 2012;74(3-4):118-28. doi: 10.1159/000346826. Epub 2013 Apr 11.
Identifying drivers of complex traits from the noisy signals of genetic variation obtained from high-throughput genome sequencing technologies is a central challenge faced by human geneticists today. We hypothesize that the variants involved in complex diseases are likely to exhibit non-neutral evolutionary signatures. Uncovering the evolutionary history of all variants is therefore of intrinsic interest for complex disease research. However, doing so necessitates the simultaneous elucidation of the targets of natural selection and population-specific demographic history.
Here we characterize the action of natural selection operating across complex disease categories, and use population genetic simulations to evaluate the expected patterns of genetic variation in large samples. We focus on populations that have experienced historical bottlenecks followed by explosive growth (consistent with many human populations), and describe the differences between evolutionarily deleterious mutations and those that are neutral.
Genes associated with several complex disease categories exhibit stronger signatures of purifying selection than non-disease genes. In addition, loci identified through genome-wide association studies of complex traits also exhibit signatures consistent with being in regions recurrently targeted by purifying selection. Through simulations, we show that population bottlenecks and rapid growth enable deleterious rare variants to persist at low frequencies just as long as neutral variants, but low-frequency and common variants tend to be much younger than neutral variants. This has resulted in a large proportion of modern-day rare alleles that have a deleterious effect on function and that potentially contribute to disease susceptibility.
The key question for sequencing-based association studies of complex traits is how to distinguish between deleterious and benign genetic variation. We used population genetic simulations to uncover patterns of genetic variation that distinguish these two categories, especially derived allele age, thereby providing inroads into novel methods for characterizing rare genetic variation driving complex diseases.
从高通量基因组测序技术获得的嘈杂遗传变异信号中识别复杂性状的驱动因素,是当今人类遗传学家面临的核心挑战。我们假设,参与复杂疾病的变异可能会表现出非中性的进化特征。因此,揭示所有变异的进化历史对于复杂疾病研究具有内在意义。然而,要做到这一点,就必须同时阐明自然选择的目标和特定人群的人口统计学历史。
在这里,我们描述了自然选择在复杂疾病类别中的作用,并使用群体遗传模拟来评估大样本中遗传变异的预期模式。我们关注那些经历过历史瓶颈后又出现爆发式增长的人群(这与许多人类群体一致),并描述了进化上有害的突变与中性突变之间的差异。
与几种复杂疾病类别相关的基因比非疾病基因表现出更强的纯化选择特征。此外,通过复杂性状的全基因组关联研究确定的位点也表现出与处于纯化选择反复靶向区域一致的特征。通过模拟,我们表明群体瓶颈和快速增长使有害的稀有变异能够与中性变异一样在低频下持续存在,但低频和常见变异往往比中性变异年轻得多。这导致了很大一部分对功能有有害影响且可能导致疾病易感性的现代稀有等位基因。
基于测序的复杂性状关联研究的关键问题是如何区分有害和良性遗传变异。我们使用群体遗传模拟来揭示区分这两类变异的遗传变异模式,特别是衍生等位基因年龄,从而为表征驱动复杂疾病的稀有遗传变异的新方法提供了途径。