Department of Biological Sciences, Faculty of Science, Kuwait University, Sh. Sabah Al-Salem campus, Kuwait City, Kuwait.
Environment and Life Sciences Research Center, Kuwait Institute for Scientific Research, Kuwait City, Kuwait.
Sci Rep. 2024 Oct 26;14(1):25573. doi: 10.1038/s41598-024-77773-0.
Copy number variants (CNVs) are structural variants within the eukaryotic genome that vary among individuals of a species. These variants have been associated with different phenotypic traits, making them a valuable consideration as markers for designing breeding programmes. In this study, whole genome sequence data of 60 dromedary camel samples originating from the Arabian Peninsula were analyzed to construct a comprehensive dromedary CNV map. Utilizing four CNV callers employing read-depth, split-read and paired-end mapping approaches, a total of 37,519 CNV events (17,847 deletions and 19,672 duplications) were called on the dromedary autosomes. These CNV events were merged into 2,557 regions, categorized as 1,322 losses, 122 gains, and 1,113 "mixed regions" comprising both types. The cumulative size of the CNV regions amounted to 22.5 Mb, covering roughly 1.16% of the dromedary autosomes. Approximately 32% of the defined CNV regions (comprising 60% losses, 18% gains, and 0.27% mixed regions) were found in ≥ 90% of the dromedary samples, classifying them as prevalent regions. Genes with biological functions related to the different adaptive physiologies of dromedary camels, such as fertility, heat stress, musculoskeletal development, and fat metabolism, were overlapping with or in close proximity to ~ 68% of the defined CNV regions, demonstrating their potential role in dromedaries' physiology. This study presents the first comprehensive CNV map of dromedary camels and builds on the present knowledge in understanding the genetic structure of this species.
拷贝数变异 (CNVs) 是真核生物基因组内的结构变异,在同一物种的个体间存在差异。这些变异与不同的表型特征有关,因此它们是设计繁殖计划的标记的一个有价值的考虑因素。在这项研究中,分析了来自阿拉伯半岛的 60 只单峰驼样本的全基因组序列数据,以构建一个全面的单峰驼 CNV 图谱。利用四个 CNV 调用者,采用读深度、分裂读和双端映射方法,总共在单峰驼常染色体上调用了 37519 个 CNV 事件(17847 个缺失和 19672 个重复)。这些 CNV 事件被合并成 2557 个区域,分为 1322 个缺失、122 个增益和 1113 个“混合区域”,包括两种类型。CNV 区域的总大小为 22.5 Mb,约占单峰驼常染色体的 1.16%。约 32%的定义 CNV 区域(包括 60%的缺失、18%的增益和 0.27%的混合区域)在 90%以上的单峰驼样本中被发现,将其归类为常见区域。具有与单峰驼不同适应生理相关的生物学功能的基因,如生育力、热应激、肌肉骨骼发育和脂肪代谢,与或靠近约 68%的定义 CNV 区域重叠,表明它们在单峰驼生理学中的潜在作用。本研究展示了单峰驼的第一个全面的 CNV 图谱,并在理解该物种的遗传结构方面建立了现有知识的基础。