• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

荷斯坦牛大型结构变异组库及相关数据库的建立,用于变异的发现、鉴定和应用。

A large structural variant collection in Holstein cattle and associated database for variant discovery, characterization, and application.

机构信息

Agricultural, Food & Nutritional Science, University of Alberta, Edmonton, AB, T6G 2P5, Canada.

Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON, Canada.

出版信息

BMC Genomics. 2024 Sep 30;25(1):903. doi: 10.1186/s12864-024-10812-2.

DOI:10.1186/s12864-024-10812-2
PMID:39350025
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11440700/
Abstract

BACKGROUND

Structural variants (SVs) such as deletions, duplications, and insertions are known to contribute to phenotypic variation but remain challenging to identify and genotype. A more complete, accessible, and assessable collection of SVs will assist efforts to study SV function in cattle and to incorporate SV genotyping into animal evaluation.

RESULTS

In this work we produced a large and deeply characterized collection of SVs in Holstein cattle using two popular SV callers (Manta and Smoove) and publicly available Illumina whole-genome sequence (WGS) read sets from 310 samples (290 male, 20 female, mean 20X coverage). Manta and Smoove identified 31 K and 68 K SVs, respectively. In total the SVs cover 5% (Manta) and 6% (Smoove) of the reference genome, in contrast to the 1% impacted by SNPs and indels. SV genotypes from each caller were confirmed to accurately recapitulate animal relationships estimated using WGS SNP genotypes from the same dataset, with Manta genotypes outperforming Smoove, and deletions outperforming duplications. To support efforts to link the SVs to phenotypic variation, overlapping and tag SNPs were identified for each SV, using genotype sets extracted from the WGS results corresponding to two bovine SNP chips (BovineSNP50 and BovineHD). 9% (Manta) and 11% (Smoove) of the SVs were found to have overlapping BovineHD panel SNPs, while 21% (Manta) and 9% (Smoove) have BovineHD panel tag SNPs. A custom interactive database ( https://svdb-dc.pslab.ca ) containing the identified sequence variants with extensive annotations, gene feature information, and BAM file content for all SVs was created to enable the evaluation and prioritization of SVs for further study. Illustrative examples involving the genes POPDC3, ORM1, G2E3, FANCI, TFB1M, FOXC2, N4BP2, GSTA3, and COPA show how this resource can be used to find well-supported genic SVs, determine SV breakpoints, design genotyping approaches, and identify processed pseudogenes masquerading as deletions.

CONCLUSIONS

The resources developed through this study can be used to explore sequence variation in Holstein cattle and to develop strategies for studying SVs of interest. The lack of overlapping and tag SNPs from commonly used SNP chips for most of the SVs suggests that other genotyping approaches will be needed (for example direct genotyping) to understand their potential contributions to phenotype. The included SV genotype assessments point to challenges in characterizing SVs, especially duplications, using short-read data and support ongoing efforts to better characterize cattle genomes through long-read sequencing. Lastly, the identification of previously known functional SVs and additional CDS-overlapping SVs supports the phenotypic relevance of this dataset.

摘要

背景

结构变异(SVs),如缺失、重复和插入,已知会导致表型变异,但仍然难以识别和基因分型。更完整、可访问和可评估的 SV 集合将有助于研究牛中的 SV 功能,并将 SV 基因分型纳入动物评估。

结果

在这项工作中,我们使用两种流行的 SV 调用者(Manta 和 Smoove)和来自 310 个样本(290 个雄性,20 个雌性,平均 20X 覆盖)的公共可用 Illumina 全基因组序列(WGS)读取集,在荷斯坦奶牛中产生了大量深度特征化的 SV 集合。Manta 和 Smoove 分别识别了 31K 和 68K SVs。总的来说,SVs 覆盖参考基因组的 5%(Manta)和 6%(Smoove),而不是 SNP 和插入缺失影响的 1%。来自每个调用者的 SV 基因型被证实准确地再现了使用来自同一数据集的 WGS SNP 基因型估计的动物关系,其中 Manta 基因型优于 Smoove,缺失优于重复。为了支持将 SV 与表型变异联系起来的努力,为每个 SV 确定了重叠和标签 SNP,使用从对应于两个牛 SNP 芯片(BovineSNP50 和 BovineHD)的 WGS 结果中提取的基因型集。在 Manta(9%)和 Smoove(11%)中发现 11%的 SV 具有重叠的 BovineHD 面板 SNP,而 21%(Manta)和 9%(Smoove)具有 BovineHD 面板标签 SNP。创建了一个包含带有广泛注释、基因特征信息和所有 SV 的 BAM 文件内容的识别序列变体的定制交互式数据库(https://svdb-dc.pslab.ca),以支持对 SV 进行评估和优先级排序,以进一步研究。涉及基因 POPDC3、ORM1、G2E3、FANCI、TFB1M、FOXC2、N4BP2、GSTA3 和 COPA 的说明性示例展示了如何使用此资源找到支持良好的基因 SV、确定 SV 断点、设计基因分型方法以及识别伪装成缺失的加工假基因。

结论

通过这项研究开发的资源可用于探索荷斯坦奶牛中的序列变异,并制定研究感兴趣的 SVs 的策略。对于大多数 SVs,来自常用 SNP 芯片的重叠和标签 SNP 的缺乏表明需要其他基因分型方法(例如直接基因分型)来了解它们对表型的潜在贡献。包括的 SV 基因型评估表明,使用短读数据表征 SVs,特别是重复,具有挑战性,并支持通过长读测序更好地表征牛基因组的持续努力。最后,先前已知的功能性 SVs 和其他 CDS 重叠 SVs 的鉴定支持了该数据集的表型相关性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3867/11440700/45ff562350a8/12864_2024_10812_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3867/11440700/0ec0eed59b8f/12864_2024_10812_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3867/11440700/306d6e7cc75c/12864_2024_10812_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3867/11440700/56f22d29539b/12864_2024_10812_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3867/11440700/211083b3b0c1/12864_2024_10812_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3867/11440700/0505f7c4a946/12864_2024_10812_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3867/11440700/77d6767bc41b/12864_2024_10812_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3867/11440700/45ff562350a8/12864_2024_10812_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3867/11440700/0ec0eed59b8f/12864_2024_10812_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3867/11440700/306d6e7cc75c/12864_2024_10812_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3867/11440700/56f22d29539b/12864_2024_10812_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3867/11440700/211083b3b0c1/12864_2024_10812_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3867/11440700/0505f7c4a946/12864_2024_10812_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3867/11440700/77d6767bc41b/12864_2024_10812_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3867/11440700/45ff562350a8/12864_2024_10812_Fig7_HTML.jpg

相似文献

1
A large structural variant collection in Holstein cattle and associated database for variant discovery, characterization, and application.荷斯坦牛大型结构变异组库及相关数据库的建立,用于变异的发现、鉴定和应用。
BMC Genomics. 2024 Sep 30;25(1):903. doi: 10.1186/s12864-024-10812-2.
2
High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data.大规模全基因组测序牛科家系队列数据中的高分辨率结构变异目录。
BMC Genomics. 2023 May 1;24(1):225. doi: 10.1186/s12864-023-09259-8.
3
Whole genome resequencing of black Angus and Holstein cattle for SNP and CNV discovery.对黑安格斯牛和荷斯坦牛进行全基因组重测序以发现 SNP 和 CNV。
BMC Genomics. 2011 Nov 15;12:559. doi: 10.1186/1471-2164-12-559.
4
Comparison of structural variant callers for massive whole-genome sequence data.大规模全基因组序列数据结构变异调用器的比较。
BMC Genomics. 2024 Mar 28;25(1):318. doi: 10.1186/s12864-024-10239-9.
5
Assessment of linkage disequilibrium patterns between structural variants and single nucleotide polymorphisms in three commercial chicken populations.评估三个商业鸡群中结构变异与单核苷酸多态性之间的连锁不平衡模式。
BMC Genomics. 2022 Mar 9;23(1):193. doi: 10.1186/s12864-022-08418-7.
6
Comprehensive evaluation and guidance of structural variation detection tools in chicken whole genome sequence data.鸡全基因组序列数据中结构变异检测工具的综合评估和指导
BMC Genomics. 2024 Oct 16;25(1):970. doi: 10.1186/s12864-024-10875-1.
7
Accuracy of imputation to whole-genome sequence data in Holstein Friesian cattle.荷斯坦奶牛全基因组序列数据插补的准确性
Genet Sel Evol. 2014 Jul 15;46(1):41. doi: 10.1186/1297-9686-46-41.
8
Deep sequencing of Danish Holstein dairy cattle for variant detection and insight into potential loss-of-function variants in protein coding genes.对丹麦荷斯坦奶牛进行深度测序,以检测变异并深入了解蛋白质编码基因中潜在的功能丧失变异。
BMC Genomics. 2015 Dec 9;16:1043. doi: 10.1186/s12864-015-2249-y.
9
svclassify: a method to establish benchmark structural variant calls.svclassify:一种建立基准结构变异调用的方法。
BMC Genomics. 2016 Jan 16;17:64. doi: 10.1186/s12864-016-2366-2.
10
Comprehensive evaluation of structural variation detection algorithms for whole genome sequencing.全基因组测序结构变异检测算法的综合评估。
Genome Biol. 2019 Jun 3;20(1):117. doi: 10.1186/s13059-019-1720-5.

本文引用的文献

1
Genome-wide association analyses reveal copy number variant regions associated with reproduction and disease traits in Canadian Holstein cattle.全基因组关联分析揭示了与加拿大荷斯坦奶牛繁殖和疾病特征相关的拷贝数变异区域。
J Dairy Sci. 2024 Sep;107(9):7052-7063. doi: 10.3168/jds.2023-24295. Epub 2024 May 23.
2
Detection and characterization of copy number variation in three differentially-selected Nellore cattle populations.三个差异选择的内洛尔牛群体中拷贝数变异的检测与特征分析。
Front Genet. 2024 Apr 17;15:1377130. doi: 10.3389/fgene.2024.1377130. eCollection 2024.
3
Comparison of structural variant callers for massive whole-genome sequence data.
大规模全基因组序列数据结构变异调用器的比较。
BMC Genomics. 2024 Mar 28;25(1):318. doi: 10.1186/s12864-024-10239-9.
4
Structural variation and introgression from wild populations in East Asian cattle genomes confer adaptation to local environment.东亚牛种基因组中的结构变异和来自野生种群的基因渗入赋予了它们对当地环境的适应性。
Genome Biol. 2023 Sep 18;24(1):211. doi: 10.1186/s13059-023-03052-2.
5
Structural variants and short tandem repeats impact gene expression and splicing in bovine testis tissue.结构变异和短串联重复序列影响牛睾丸组织中的基因表达和剪接。
Genetics. 2023 Nov 1;225(3). doi: 10.1093/genetics/iyad161.
6
Discovery of non-reference processed pseudogenes in the Swedish population.瑞典人群中非参考加工假基因的发现。
Front Genet. 2023 May 30;14:1176626. doi: 10.3389/fgene.2023.1176626. eCollection 2023.
7
The Multifaceted Role of Glutathione S-Transferases in Health and Disease.谷胱甘肽 S-转移酶在健康和疾病中的多效性作用。
Biomolecules. 2023 Apr 18;13(4):688. doi: 10.3390/biom13040688.
8
High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data.大规模全基因组测序牛科家系队列数据中的高分辨率结构变异目录。
BMC Genomics. 2023 May 1;24(1):225. doi: 10.1186/s12864-023-09259-8.
9
JBrowse 2: a modular genome browser with views of synteny and structural variation.JBrowse 2:一个具有基因同线性和结构变异视图的模块化基因组浏览器。
Genome Biol. 2023 Apr 17;24(1):74. doi: 10.1186/s13059-023-02914-z.
10
In it for the long run: perspectives on exploiting long-read sequencing in livestock for population scale studies of structural variants.从长远考虑:利用长读测序技术在畜牧业中进行结构变异的群体规模研究的观点。
Genet Sel Evol. 2023 Jan 31;55(1):9. doi: 10.1186/s12711-023-00783-5.