Zhou Jinghang, Liu Liyuan, Reynolds Edwardo, Huang Xixia, Garrick Dorian, Shi Yuangang
School of Agriculture, Ningxia University, Yinchuan, China.
AL Rae Centre for Genetics and Breeding, Massey University, Hamilton, New Zealand.
Front Genet. 2022 Feb 11;12:747431. doi: 10.3389/fgene.2021.747431. eCollection 2021.
Copy number variants (CNVs), which are a class of structural variant, can be important in relating genomic variation to phenotype. The primary aims of this study were to discover the common CNV regions (CNVRs) in the dual-purpose XinJiang-Brown cattle population and to detect differences between CNVs inferred using the ARS-UCD 1.2 (ARS) or the UMD 3.1 (UMD) genome assemblies based on the 150K SNP (Single Nucleotide Polymorphisms) Chip. PennCNV and CNVPartition methods were applied to calculate the deviation of the standardized signal intensity of SNPs markers to detect CNV status. Following the discovery of CNVs, we used the R package HandyCNV to generate and visualize CNVRs, compare CNVs and CNVRs between genome assemblies, and identify consensus genes using annotation resources. We identified 38 consensus CNVRs using the ARS assembly with 1.95% whole genome coverage, and 33 consensus CNVRs using the UMD assembly with 1.46% whole genome coverage using PennCNV and CNVPartition. We identified 37 genes that intersected 13 common CNVs (>5% frequency), these included functionally interesting genes such as for which an increased copy number has been negatively associated with cattle stature, and the gene family which has been linked to the immune response and adaption of cattle. The ARS map file of the GGP Bovine 150K Bead Chip maps the genomic position of more SNPs with increased accuracy compared to the UMD map file. Comparison of the CNVRs identified between the two reference assemblies suggests the newly released ARS reference assembly is better for CNV detection. In spite of this, different CNV detection methods can complement each other to generate a larger number of CNVRs than using a single approach and can highlight more genes of interest.
拷贝数变异(CNV)作为一类结构变异,在将基因组变异与表型联系起来方面可能具有重要意义。本研究的主要目的是在兼用型新疆褐牛群体中发现常见的拷贝数变异区域(CNVR),并检测基于150K单核苷酸多态性(SNP)芯片,使用ARS-UCD 1.2(ARS)或UMD 3.1(UMD)基因组组装推断出的CNV之间的差异。应用PennCNV和CNVPartition方法计算SNP标记标准化信号强度的偏差,以检测CNV状态。在发现CNV后,我们使用R包HandyCNV生成并可视化CNVR,比较基因组组装之间的CNV和CNVR,并使用注释资源鉴定共有基因。使用PennCNV和CNVPartition,我们使用ARS组装鉴定出38个共有CNVR,全基因组覆盖率为1.95%,使用UMD组装鉴定出33个共有CNVR,全基因组覆盖率为1.46%。我们鉴定出37个基因与13个常见CNV(频率>5%)相交,这些基因包括功能上有趣的基因,如拷贝数增加与牛的体型呈负相关的基因,以及与牛的免疫反应和适应性相关的基因家族。与UMD图谱文件相比,GGP牛150K芯片的ARS图谱文件更准确地映射了更多SNP的基因组位置。两个参考组装之间鉴定出的CNVR比较表明,新发布的ARS参考组装在CNV检测方面更好。尽管如此,不同的CNV检测方法可以相互补充,以产生比使用单一方法更多的CNVR,并可以突出更多感兴趣的基因。