Suppr超能文献

利用全基因组测序鉴定养殖虹鳟鱼中的高可信度结构变异

Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing.

作者信息

Liu Sixin, Gao Guangtu, Layer Ryan M, Thorgaard Gary H, Wiens Gregory D, Leeds Timothy D, Martin Kyle E, Palti Yniv

机构信息

National Center for Cool and Cold Water Aquaculture, Agricultural Research Service, United States Department of Agriculture, Kearneysville, WV, United States.

BioFrontiers Institute, University of Colorado Boulder, Boulder, CO, United States.

出版信息

Front Genet. 2021 Feb 25;12:639355. doi: 10.3389/fgene.2021.639355. eCollection 2021.

Abstract

Genomic structural variants (SVs) are a major source of genetic and phenotypic variation but have not been investigated systematically in rainbow trout (), an important aquaculture species of cold freshwater. The objectives of this study were 1) to identify and validate high-confidence SVs in rainbow trout using whole-genome re-sequencing; and 2) to examine the contribution of transposable elements (TEs) to SVs in rainbow trout. A total of 96 rainbow trout, including 11 homozygous lines and 85 outbred fish from three breeding populations, were whole-genome sequenced with an average genome coverage of 17.2×. Putative SVs were identified using the program Smoove which integrates LUMPY and other associated tools into one package. After rigorous filtering, 13,863 high-confidence SVs were identified. Pacific Biosciences long-reads of Arlee, one of the homozygous lines used for SV detection, validated 98% (3,948 of 4,030) of the high-confidence SVs identified in the Arlee homozygous line. Based on principal component analysis, the 85 outbred fish clustered into three groups consistent with their populations of origin, further indicating that the high-confidence SVs identified in this study are robust. The repetitive DNA content of the high-confidence SV sequences was 86.5%, which is much higher than the 57.1% repetitive DNA content of the reference genome, and is also higher than the repetitive DNA content of Atlantic salmon SVs reported previously. TEs thus contribute substantially to SVs in rainbow trout as TEs make up the majority of repetitive sequences. Hundreds of the high-confidence SVs were annotated as exon-loss or gene-fusion variants, and may have phenotypic effects. The high-confidence SVs reported in this study provide a foundation for further rainbow trout SV studies.

摘要

基因组结构变异(SVs)是遗传和表型变异的主要来源,但尚未在虹鳟(一种重要的冷淡水养殖鱼类)中进行系统研究。本研究的目的是:1)使用全基因组重测序在虹鳟中鉴定和验证高可信度的SVs;2)研究转座元件(TEs)对虹鳟SVs的贡献。总共96条虹鳟,包括11个纯合品系和来自三个育种群体的85条杂交鱼,进行了全基因组测序,平均基因组覆盖度为17.2倍。使用将LUMPY和其他相关工具整合到一个软件包中的Smoove程序鉴定推定的SVs。经过严格筛选,鉴定出13,863个高可信度的SVs。用于SV检测的纯合品系之一Arlee的太平洋生物科学公司长读长验证了在Arlee纯合品系中鉴定出的98%(4,030个中的3,948个)高可信度SVs。基于主成分分析,85条杂交鱼聚为三组,与它们的起源群体一致,进一步表明本研究中鉴定出的高可信度SVs是可靠的。高可信度SV序列的重复DNA含量为86.5%,远高于参考基因组57.1%的重复DNA含量,也高于先前报道的大西洋鲑鱼SVs的重复DNA含量。因此,TEs对虹鳟的SVs有很大贡献,因为TEs构成了重复序列的大部分。数百个高可信度SVs被注释为外显子缺失或基因融合变异,可能具有表型效应。本研究报道的高可信度SVs为进一步的虹鳟SV研究提供了基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/343c/7959816/7ea4f94df7b2/fgene-12-639355-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验