Xue Cheng, Raveendran Muthuswamy, Harris R Alan, Fawcett Gloria L, Liu Xiaoming, White Simon, Dahdouli Mahmoud, Rio Deiros David, Below Jennifer E, Salerno William, Cox Laura, Fan Guoping, Ferguson Betsy, Horvath Julie, Johnson Zach, Kanthaswamy Sree, Kubisch H Michael, Liu Dahai, Platt Michael, Smith David G, Sun Binghua, Vallender Eric J, Wang Feng, Wiseman Roger W, Chen Rui, Muzny Donna M, Gibbs Richard A, Yu Fuli, Rogers Jeffrey
Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA.
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA.
Genome Res. 2016 Dec;26(12):1651-1662. doi: 10.1101/gr.204255.116. Epub 2016 Oct 17.
Rhesus macaques (Macaca mulatta) are the most widely used nonhuman primate in biomedical research, have the largest natural geographic distribution of any nonhuman primate, and have been the focus of much evolutionary and behavioral investigation. Consequently, rhesus macaques are one of the most thoroughly studied nonhuman primate species. However, little is known about genome-wide genetic variation in this species. A detailed understanding of extant genomic variation among rhesus macaques has implications for the use of this species as a model for studies of human health and disease, as well as for evolutionary population genomics. Whole-genome sequencing analysis of 133 rhesus macaques revealed more than 43.7 million single-nucleotide variants, including thousands predicted to alter protein sequences, transcript splicing, and transcription factor binding sites. Rhesus macaques exhibit 2.5-fold higher overall nucleotide diversity and slightly elevated putative functional variation compared with humans. This functional variation in macaques provides opportunities for analyses of coding and noncoding variation, and its cellular consequences. Despite modestly higher levels of nonsynonymous variation in the macaques, the estimated distribution of fitness effects and the ratio of nonsynonymous to synonymous variants suggest that purifying selection has had stronger effects in rhesus macaques than in humans. Demographic reconstructions indicate this species has experienced a consistently large but fluctuating population size. Overall, the results presented here provide new insights into the population genomics of nonhuman primates and expand genomic information directly relevant to primate models of human disease.
恒河猴(猕猴属)是生物医学研究中使用最广泛的非人类灵长类动物,其自然地理分布范围在所有非人类灵长类动物中最大,并且一直是许多进化和行为研究的重点。因此,恒河猴是研究最深入的非人类灵长类物种之一。然而,对于该物种全基因组的遗传变异却知之甚少。详细了解恒河猴现存的基因组变异,对于将该物种用作人类健康和疾病研究的模型以及进化群体基因组学都具有重要意义。对133只恒河猴进行的全基因组测序分析揭示了超过4370万个单核苷酸变异,其中包括数千个预计会改变蛋白质序列、转录剪接和转录因子结合位点的变异。与人类相比,恒河猴的总体核苷酸多样性高出2.5倍,推定的功能变异也略有增加。恒河猴的这种功能变异为分析编码和非编码变异及其细胞后果提供了机会。尽管恒河猴的非同义变异水平略高,但估计的适应性效应分布以及非同义变异与同义变异的比率表明,净化选择在恒河猴中比在人类中具有更强的作用。种群统计学重建表明,该物种经历了持续庞大但波动的种群规模。总体而言,本文给出的结果为非人类灵长类动物的群体基因组学提供了新的见解,并扩展了与人类疾病灵长类动物模型直接相关的基因组信息。