ReproGen - Animal Bioscience Group, Faculty of Veterinary Science, University of Sydney, 425 Werombi Road, Camden NSW 2570, Australia.
BMC Genet. 2014 Mar 17;15:34. doi: 10.1186/1471-2156-15-34.
Discerning the traits evolving under neutral conditions from those traits evolving rapidly because of various selection pressures is a great challenge. We propose a new method, composite selection signals (CSS), which unifies the multiple pieces of selection evidence from the rank distribution of its diverse constituent tests. The extreme CSS scores capture highly differentiated loci and underlying common variants hauling excess haplotype homozygosity in the samples of a target population.
The data on high-density genotypes were analyzed for evidence of an association with either polledness or double muscling in various cohorts of cattle and sheep. In cattle, extreme CSS scores were found in the candidate regions on autosome BTA-1 and BTA-2, flanking the POLL locus and MSTN gene, for polledness and double muscling, respectively. In sheep, the regions with extreme scores were localized on autosome OAR-2 harbouring the MSTN gene for double muscling and on OAR-10 harbouring the RXFP2 gene for polledness. In comparison to the constituent tests, there was a partial agreement between the signals at the four candidate loci; however, they consistently identified additional genomic regions harbouring no known genes. Persuasively, our list of all the additional significant CSS regions contains genes that have been successfully implicated to secondary phenotypic diversity among several subpopulations in our data. For example, the method identified a strong selection signature for stature in cattle capturing selective sweeps harbouring UQCC-GDF5 and PLAG1-CHCHD7 gene regions on BTA-13 and BTA-14, respectively. Both gene pairs have been previously associated with height in humans, while PLAG1-CHCHD7 has also been reported for stature in cattle. In the additional analysis, CSS identified significant regions harbouring multiple genes for various traits under selection in European cattle including polledness, adaptation, metabolism, growth rate, stature, immunity, reproduction traits and some other candidate genes for dairy and beef production.
CSS successfully localized the candidate regions in validation datasets as well as identified previously known and novel regions for various traits experiencing selection pressure. Together, the results demonstrate the utility of CSS by its improved power, reduced false positives and high-resolution of selection signals as compared to individual constituent tests.
辨别在中性条件下进化的特征与由于各种选择压力而快速进化的特征是一项巨大的挑战。我们提出了一种新方法,即综合选择信号(CSS),它将来自其多种组成测试的等级分布的多种选择证据统一起来。极端 CSS 得分捕获了高度分化的基因座和潜在的常见变异,这些变异在目标群体的样本中携带了过多的单倍型纯合性。
对高密度基因型数据进行了分析,以寻找与牛和羊的各种群体中的无角或双肌性状相关的关联证据。在牛中,在 BTA-1 和 BTA-2 染色体上的候选区域发现了极端 CSS 得分,这些区域分别位于 POLL 基因座和 MSTN 基因的侧翼,与无角和双肌性状相关。在绵羊中,极端得分的区域位于 OAR-2 染色体上,该染色体上携带有 MSTN 基因,与双肌性状相关,而位于 OAR-10 染色体上,该染色体上携带有 RXFP2 基因,与无角性状相关。与组成测试相比,四个候选基因座之间的信号存在部分一致性;然而,它们始终可以识别出包含未知基因的其他基因组区域。令人信服的是,我们所有额外显著 CSS 区域的列表包含了已成功被牵连到我们数据中的几个亚群的次要表型多样性中的基因。例如,该方法在牛中识别出了一个与身高相关的强烈选择信号,该信号捕获了 BTA-13 和 BTA-14 染色体上分别含有 UQCC-GDF5 和 PLAG1-CHCHD7 基因区域的选择清扫。这两个基因对都曾与人类的身高有关,而 PLAG1-CHCHD7 基因也被报道与牛的身高有关。在额外的分析中,CSS 确定了欧洲牛中经历选择压力的各种性状的重要区域,这些性状包括无角、适应、代谢、生长速度、身高、免疫、繁殖性状以及其他一些奶牛和肉牛生产的候选基因。
CSS 成功地将候选区域定位在验证数据集中,同时还确定了以前已知和新的与经历选择压力的各种性状相关的区域。总的来说,与单个组成测试相比,CSS 通过提高功效、降低假阳性和提高选择信号的分辨率,证明了其有效性。