Keel B N, Nonneman D J, Rohrer G A
USDA, ARS, U.S. Meat Animal Research Center, Clay Center, NE, 68933, USA.
Anim Genet. 2017 Aug;48(4):404-411. doi: 10.1111/age.12557. Epub 2017 May 8.
Genetic variants detected from sequence have been used to successfully identify causal variants and map complex traits in several organisms. High and moderate impact variants, those expected to alter or disrupt the protein coded by a gene and those that regulate protein production, likely have a more significant effect on phenotypic variation than do other types of genetic variants. Hence, a comprehensive list of these functional variants would be of considerable interest in swine genomic studies, particularly those targeting fertility and production traits. Whole-genome sequence was obtained from 72 of the founders of an intensely phenotyped experimental swine herd at the U.S. Meat Animal Research Center (USMARC). These animals included all 24 of the founding boars (12 Duroc and 12 Landrace) and 48 Yorkshire-Landrace composite sows. Sequence reads were mapped to the Sscrofa10.2 genome build, resulting in a mean of 6.1 fold (×) coverage per genome. A total of 22 342 915 high confidence SNPs were identified from the sequenced genomes. These included 21 million previously reported SNPs and 79% of the 62 163 SNPs on the PorcineSNP60 BeadChip assay. Variation was detected in the coding sequence or untranslated regions (UTRs) of 87.8% of the genes in the porcine genome: loss-of-function variants were predicted in 504 genes, 10 202 genes contained nonsynonymous variants, 10 773 had variation in UTRs and 13 010 genes contained synonymous variants. Approximately 139 000 SNPs were classified as loss-of-function, nonsynonymous or regulatory, which suggests that over 99% of the variation detected in our pigs could potentially be ignored, allowing us to focus on a much smaller number of functional SNPs during future analyses.
从序列中检测到的遗传变异已被成功用于识别因果变异,并在多种生物体中定位复杂性状。高影响和中等影响变异,即那些预期会改变或破坏基因基因编码蛋白质编码的变异以及那些调节蛋白质产生的变异,可能比其他类型的遗传变异对表型变异具有更显著的影响。因此,这些功能变异的完整列表在猪基因组研究中,尤其是针对繁殖力和生产性状的研究中,将具有相当大的研究价值。全基因组序列取自美国肉类动物研究中心(USMARC)一个经过深入表型分析的实验猪群的72头奠基猪。这些动物包括所有24头奠基公猪(12头杜洛克猪和12头长白猪)以及48头约克夏 - 长白猪杂交母猪。序列读数被映射到Sscrofa10.2基因组版本,每个基因组平均覆盖度为6.1倍(×)。从测序基因组中总共鉴定出22342915个高可信度单核苷酸多态性(SNP)。其中包括2100万个先前报道的SNP以及猪SNP60基因分型芯片检测中62163个SNP的79%。在猪基因组中87.8%的基因的编码序列或非翻译区(UTR)检测到变异:在504个基因中预测到功能丧失变异,10202个基因包含非同义变异,10773个基因在UTR中有变异,13010个基因包含同义变异。大约139000个SNP被归类为功能丧失、非同义或调控型,这表明在我们检测的猪中超过99%的变异可能可以忽略不计,这使我们在未来分析中能够专注于数量少得多的功能SNP。