Suppr超能文献

组装全球牛的泛基因组揭示了缺失的序列和新的结构变异,为它们的多样性和进化历史提供了新的见解。

Assembly of a pangenome for global cattle reveals missing sequences and novel structural variations, providing new insights into their diversity and evolutionary history.

机构信息

Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China.

Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Beltsville, Maryland 20705, USA.

出版信息

Genome Res. 2022 Aug 25;32(8):1585-1601. doi: 10.1101/gr.276550.122.

Abstract

A cattle pangenome representation was created based on the genome sequences of 898 cattle representing 57 breeds. The pangenome identified 83 Mb of sequence not found in the cattle reference genome, representing 3.1% novel sequence compared with the 2.71-Gb reference. A catalog of structural variants developed from this cattle population identified 3.3 million deletions, 0.12 million inversions, and 0.18 million duplications. Estimates of breed ancestry and hybridization between cattle breeds using insertion/deletions as markers were similar to those produced by single nucleotide polymorphism-based analysis. Hundreds of deletions were observed to have stratification based on subspecies and breed. For example, an insertion of a Bov-tA1 repeat element was identified in the first intron of the gene and correlated with cattle breed geographic distribution. This insertion falls within a segment overlapping predicted enhancer and promoter regions of the gene, and could affect important traits such as immune response, olfactory functions, cell proliferation, and glucose metabolism in muscle. The results indicate that pangenomes are a valuable resource for studying diversity and evolutionary history, and help to delineate how domestication, trait-based breeding, and adaptive introgression have shaped the cattle genome.

摘要

基于代表 57 个品种的 898 头奶牛的基因组序列,创建了一个奶牛泛基因组图谱。泛基因组鉴定出了 8300 万个不在牛参考基因组中的序列,与 27.1 亿碱基对的参考基因组相比,这代表了 3.1%的新序列。从该牛群中开发的结构变异目录鉴定出了 330 万个缺失、0.12 万个倒位和 0.18 万个重复。使用插入/缺失作为标记来估计牛品种的祖先和品种间的杂交情况与基于单核苷酸多态性分析的结果相似。数百个缺失被观察到基于亚种和品种发生了分层。例如,在 基因的第一个内含子中鉴定出了 Bov-tA1 重复元件的插入,这与牛品种的地理分布有关。该插入位于与基因的预测增强子和启动子区域重叠的片段内,可能影响免疫反应、嗅觉功能、细胞增殖和肌肉葡萄糖代谢等重要性状。研究结果表明,泛基因组是研究多样性和进化历史的宝贵资源,有助于描绘驯化、基于性状的选育和适应性渗入如何塑造了牛的基因组。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40a7/9435747/d5101a180788/1585f01.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验