Genomic variants identified from whole-genome resequencing of indicine cattle breeds from Pakistan.

Affiliations

National Institute for Biotechnology and Genetic Engineering (NIBGE), Faisalabad, Punjab, Pakistan.

Beijing Genomic Institute (BGI), Shenzhen, Guangdong, China.

Publication information

PLoS One. 2019 Apr 11;14(4):e0215065. doi: 10.1371/journal.pone.0215065. eCollection 2019.

Abstract

The primary goal of cattle genomics is the identification of genome-wide polymorphisms associated with economically important traits. The bovine genome sequencing project was completed in 2009. Since then, a large number of Bos taurus cattle breeds have been resequenced with massively parallel sequencing technologies and scanned for genome-wide polymorphisms. As a result, a substantial number of single nucleotide polymorphisms (SNPs) have been discovered across European Bos taurus genomes, whereas far fewer SNPs have been cataloged for Bos indicus breeds. In this study, we performed whole-genome resequencing, reference-based mapping, functional annotation and gene enrichment analysis of 20 sires representing eleven important Bos indicus (indicine) breeds of Pakistan. The breeds sequenced here are Sahiwal, Red Sindhi, Tharparkar and Cholistani (tropically adapted dairy and dual-purpose breeds); Achai, Bhagnari, Dajal and Lohani (high-altitude-adapted dual-purpose and draught breeds); and Dhanni, Hisar Haryana and Gabrali (dairy and light draught breeds). A total of 17.4 billion QC-passed reads were produced on the BGISEQ-500 next-generation sequencing platform, yielding 9- to 27-fold genome coverage (average ~16×) for each of the 20 sequenced sires. A total of 67,303,469 SNPs were identified, of which 3,850,365 were novel, and 1,083,842 insertions-deletions (InDels) were detected across the sequenced genomes, of which 491,247 were novel. Comparative analysis using coding-region SNPs revealed a close relationship between the best milking indicine breeds, Red Sindhi and Sahiwal. Bhagnari and Tharparkar, both known for their adaptation to dry and extremely hot climates, were found to share the highest number of SNPs. Functional annotation identified a total of 3,194 high-impact (disruptive) SNPs and 745 disruptive InDels (in 275 genes) that may affect economically important dairy and beef traits.
Functional enrichment analysis revealed that high- or moderate-impact variants in the wingless-related integration site (Wnt) and vascular smooth muscle contraction (VSMC) signaling pathways were significantly over-represented in tropically adapted, heat-tolerant Pakistani indicine breeds. In contrast, the vascular endothelial growth factor (VEGF) and hypoxia-inducible factor 1 (HIF-1) signaling pathways were over-represented in highland-adapted Pakistani indicine breeds. Similarly, the ECM-receptor interaction and Jak-STAT signaling pathways were significantly enriched in dairy- and beef-purpose Pakistani indicine cattle breeds. The Toll-like receptor signaling pathway was significantly enriched in most of the Pakistani indicine cattle. This study therefore provides baseline data for further research into the molecular mechanisms of major traits and for developing potential genomic markers associated with economically important breeding traits, particularly in indicine cattle.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27a5/6459497/cdcffb6463ee/pone.0215065.g001.jpg
