Suppr超能文献

植物基因组中重复序列含量的大小比较分析。

Comparative analysis of repeat content in plant genomes, large and small.

作者信息

Argentin Joris, Bolser Dan, Kersey Paul J, Flicek Paul

机构信息

Institut de Biologie en Santé, Centre Hospitalier Universitaire (CHU) d'Angers, Angers, France.

European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, United Kingdom.

出版信息

Front Plant Sci. 2023 Jul 14;14:1103035. doi: 10.3389/fpls.2023.1103035. eCollection 2023.

Abstract

The DNA Features pipeline is the analysis pipeline at EMBL-EBI that annotates repeat elements, including transposable elements. With Ensembl's goal to stay at the cutting edge of genome annotation, we proved that this pipeline needed an update. We then created a new analysis that allowed the Ensembl database to store the repeat classification from the PGSB repeat classification (Recat). This new dataset was then fetched using Perl scripts and used to prove that the pipeline modification induced a gain in sensitivity. Finally, we performed a comparative analysis of transposable element distribution in all plant species available, raising new questions about transposable elements in certain branches of the taxonomic tree.

摘要

DNA特征管道是欧洲生物信息学研究所(EMBL-EBI)用于注释重复元件(包括转座元件)的分析管道。鉴于Ensembl致力于保持基因组注释的前沿地位,我们证明了该管道需要更新。然后,我们创建了一项新分析,使Ensembl数据库能够存储来自PGSB重复分类(Recat)的重复分类。接着,使用Perl脚本获取这个新数据集,并用于证明管道修改提高了敏感性。最后,我们对所有可用植物物种中的转座元件分布进行了比较分析,这引发了关于分类树某些分支中转座元件的新问题。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验