Key Laboratory of Livestock and Poultry Multi-Omics of MARA, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China.
Animal Genomics and Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA.
Genome Biol. 2024 May 7;25(1):116. doi: 10.1186/s13059-024-03253-3.
Structural variations (SVs) have significant impacts on complex phenotypes by rearranging large amounts of DNA sequence.
We present a comprehensive SV catalog based on the whole-genome sequence of 1060 pigs (Sus scrofa) representing 101 breeds, covering 9.6% of the pig genome. This catalog includes 42,487 deletions, 37,913 mobile element insertions, 3308 duplications, 1664 inversions, and 45,184 break ends. Estimates of breed ancestry and hybridization using genotyped SVs align well with those from single nucleotide polymorphisms. Geographically stratified deletions are observed, along with known duplications of the KIT gene, responsible for white coat color in European pigs. Additionally, we identify a recent SINE element insertion in MYO5A transcripts of European pigs, potentially influencing alternative splicing patterns and coat color alterations. Furthermore, a Yorkshire-specific copy number gain within ABCG2 is found, impacting chromatin interactions and gene expression across multiple tissues over a stretch of genomic region of ~200 kb. Preliminary investigations into SV's impact on gene expression and traits using the Pig Genotype-Tissue Expression (PigGTEx) data reveal SV associations with regulatory variants and gene-trait pairs. For instance, a 51-bp deletion is linked to the lead eQTL of the lipid metabolism regulating gene FADS3, whose expression in embryo may affect loin muscle area, as revealed by our transcriptome-wide association studies.
This SV catalog serves as a valuable resource for studying diversity, evolutionary history, and functional shaping of the pig genome by processes like domestication, trait-based breeding, and adaptive evolution.
结构变异(SV)通过重排大量 DNA 序列,对复杂表型具有重大影响。
我们基于 1060 头猪(Sus scrofa)的全基因组序列,代表 101 个品种,展示了一个全面的 SV 目录,涵盖了猪基因组的 9.6%。该目录包括 42487 个缺失、37913 个移动元件插入、3308 个重复、1664 个倒位和 45184 个断裂末端。利用基因分型 SV 对品种起源和杂交的估计与基于单核苷酸多态性的估计非常吻合。观察到地理分层缺失,以及 KIT 基因的已知重复,该基因负责欧洲猪的白色皮毛颜色。此外,我们在欧洲猪的 MYO5A 转录本中发现了一个最近的 SINE 元件插入,可能影响替代剪接模式和皮毛颜色改变。此外,在约克夏猪的 ABCG2 中发现了一个特定的拷贝数增益,影响了跨越多个组织的大片段基因组区域(~200kb)的染色质相互作用和基因表达。使用猪基因型组织表达(PigGTEx)数据对 SV 对基因表达和性状的影响进行初步研究表明,SV 与调节变异和基因-性状对相关联。例如,51bp 的缺失与脂质代谢调节基因 FADS3 的主要 eQTL 相关,其在胚胎中的表达可能通过我们的全转录组关联研究影响腰大肌面积。
该 SV 目录为研究猪基因组的多样性、进化历史和功能塑造提供了有价值的资源,这些过程包括驯化、基于性状的繁殖和适应性进化。