Suárez-Vega Aroa, Gutiérrez-Gil Beatriz, Klopp Christophe, Tosser-Klopp Gwenola, Arranz Juan José
Departamento de Producción Animal, Facultad de Veterinaria, Universidad de León, Campus de Vegazana s/n, León, 24071, Spain.
INRA, Plateforme bioinformatique Toulouse Midi-Pyrénées, UR875 Biométrie et Intelligence Artificielle, BP 52627, 31326, Castanet-Tolosan Cedex, France.
BMC Genomics. 2017 Feb 15;18(1):170. doi: 10.1186/s12864-017-3581-1.
The identification of genetic variation underlying desired phenotypes is one of the main challenges of current livestock genetic research. High-throughput transcriptome sequencing (RNA-Seq) offers new opportunities for the detection of transcriptome variants (SNPs and short indels) in different tissues and species. In this study, we used RNA-Seq on Milk Sheep Somatic Cells (MSCs) with the goal of characterizing the genetic variation within the coding regions of the milk transcriptome in Churra and Assaf sheep, two common dairy sheep breeds farmed in Spain.
A total of 216,637 variants were detected in the MSCs transcriptome of the eight ewes analyzed. Among them, a total of 57,795 variants were detected in the regions harboring Quantitative Trait Loci (QTL) for milk yield, protein percentage and fat percentage, of which 21.44% were novel variants. Among the total variants detected, 561 (2.52%) and 1,649 (7.42%) were predicted to produce high or moderate impact changes in the corresponding transcriptional unit, respectively. In the functional enrichment analysis of the genes positioned within selected QTL regions harboring novel relevant functional variants (high and moderate impact), the KEGG pathway with the highest enrichment was "protein processing in endoplasmic reticulum". Additionally, a total of 504 and 1,063 variants were identified in the genes encoding principal milk proteins and molecules involved in the lipid metabolism, respectively. Of these variants, 20 mutations were found to have putative relevant effects on the encoded proteins.
We present herein the first transcriptomic approach aimed at identifying genetic variants of the genes expressed in the lactating mammary gland of sheep. Through the transcriptome analysis of variability within regions harboring QTL for milk yield, protein percentage and fat percentage, we have found several pathways and genes that harbor mutations that could affect dairy production traits. Moreover, remarkable variants were also found in candidate genes coding for major milk proteins and proteins related to milk fat metabolism. Several of the SNPs found in this study could be included as suitable markers in genotyping platforms or custom SNP arrays to perform association analyses in commercial populations and apply genomic selection protocols in the dairy production industry.
识别期望表型背后的遗传变异是当前家畜遗传研究的主要挑战之一。高通量转录组测序(RNA测序)为检测不同组织和物种中的转录组变异(单核苷酸多态性和短插入缺失)提供了新机会。在本研究中,我们对奶羊体细胞(MSCs)进行了RNA测序,目的是表征西班牙养殖的两种常见奶羊品种——丘拉羊和阿萨夫羊——乳汁转录组编码区域内的遗传变异。
在分析的8只母羊的MSCs转录组中共检测到216,637个变异。其中,在产奶量、蛋白质百分比和脂肪百分比的数量性状位点(QTL)所在区域共检测到57,795个变异,其中21.44%是新变异。在检测到的所有变异中,分别有561个(2.52%)和1,649个(7.42%)预计会在相应的转录单元中产生高或中等影响的变化。在对位于具有新的相关功能变异(高影响和中等影响)的选定QTL区域内的基因进行功能富集分析时,富集程度最高的KEGG途径是“内质网中的蛋白质加工”。此外,在编码主要乳蛋白和参与脂质代谢的分子的基因中分别鉴定出504个和1,063个变异。在这些变异中,发现有20个突变对编码的蛋白质具有推定的相关影响。
我们在此展示了第一种旨在识别绵羊泌乳乳腺中表达的基因的遗传变异的转录组学方法。通过对产奶量、蛋白质百分比和脂肪百分比的QTL所在区域内的变异性进行转录组分析,我们发现了几个含有可能影响奶业生产性状的突变的途径和基因。此外,在编码主要乳蛋白和与乳脂肪代谢相关的蛋白质的候选基因中也发现了显著变异。本研究中发现的几个单核苷酸多态性(SNP)可作为合适的标记纳入基因分型平台或定制SNP芯片中,以在商业群体中进行关联分析,并在奶业生产行业中应用基因组选择方案。