Flutre Timothée, Permal Emmanuelle, Quesneville Hadi
Unité de Recherche en Génomique-Info; UR 1164; INRA Centre de Versailles-Grignon; Versailles, France.
Mob Genet Elements. 2011 Jul;1(2):151-154. doi: 10.4161/mge.1.2.17094. Epub 2011 Jul 1.
Transposable elements (TEs) are DNA sequences that have the capacity to move and duplicate within genomes, and occasionally between them. They are present in almost all species and are especially prevalent in eukaryotes where they can account for most of the genomic content. As a result of their dynamics and their mere presence, TEs can profoundly shape genomes and gene expression. With the current pace of sequencing technology improvement, a rapidly increasing number of genomes, particularly from non-model species, are being sequenced. However, the complete annotation of these genomes and especially of the TEs they contain, still poses fundamental difficulties. In a recent article, we presented a combined method that automatically annotates TEs with accuracy and sensitivity, and takes their diversification dynamics into account in the de novo annotation process. Here, we further discuss several additional aspects of our results, notably in the light of our knowledge of TE dynamics and the conceptual model behind the TE detection algorithms currently in use. In addition, we propose a new approach that uses simulations to improve algorithm performance.
转座元件(TEs)是能够在基因组内移动和复制,偶尔也能在不同基因组之间移动和复制的DNA序列。它们几乎存在于所有物种中,在真核生物中尤为普遍,在真核生物中它们可占基因组内容的大部分。由于其动态特性及其存在,转座元件可深刻塑造基因组和基因表达。随着当前测序技术改进的步伐,越来越多的基因组,特别是来自非模式物种的基因组正在被测序。然而,对这些基因组尤其是其中所含转座元件的完整注释仍然存在根本困难。在最近的一篇文章中,我们提出了一种组合方法,该方法能够准确且灵敏地自动注释转座元件,并在从头注释过程中考虑其多样化动态。在此,我们进一步讨论我们结果的几个其他方面,特别是根据我们对转座元件动态的了解以及当前使用的转座元件检测算法背后的概念模型。此外,我们提出了一种使用模拟来提高算法性能的新方法。