Unique features of the loblolly pine (Pinus taeda L.) megagenome revealed through sequence annotation.

Affiliation

Department of Plant Sciences, University of California, Davis, California 95616.

Publication information

Genetics. 2014 Mar;196(3):891-909. doi: 10.1534/genetics.113.159996.

Abstract

The largest genus in the conifer family Pinaceae is Pinus, with over 100 species. The size and complexity of their genomes (∼20-40 Gb, 2n = 24) have delayed the arrival of a well-annotated reference sequence. In this study, we present the annotation of the first whole-genome shotgun assembly of loblolly pine (Pinus taeda L.), which comprises 20.1 Gb of sequence. The MAKER-P annotation pipeline combined evidence-based alignments and ab initio predictions to generate 50,172 gene models, of which 15,653 are classified as high confidence. Clustering these gene models with 13 other plant species resulted in 20,646 gene families, of which 1,554 are predicted to be unique to conifers. Among the conifer gene families, 159 are composed exclusively of loblolly pine members. The gene models for loblolly pine have the highest median and mean intron lengths of 24 fully sequenced plant genomes. Conifer genomes are full of repetitive DNA, with the most significant contributions from long-terminal-repeat retrotransposons. In-depth analysis of the tandem and interspersed repetitive content yielded a combined estimate of 82%.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c07/3948814/a167afc7daff/891fig1.jpg
