Department of Plant Biology, University of Georgia, Athens, GA 30602, USA.
Nucleic Acids Res. 2010 Dec;38(22):e199. doi: 10.1093/nar/gkq862. Epub 2010 Sep 29.
Miniature inverted-repeat transposable elements (MITEs) are a special type of Class 2 non-autonomous transposable element (TE) that are abundant in the non-coding regions of the genes of many plant and animal species. The accurate identification of MITEs has been a challenge for existing programs because they lack coding sequences and, as such, evolve very rapidly. Because of their importance to gene and genome evolution, we developed MITE-Hunter, a program pipeline that can identify MITEs as well as other small Class 2 non-autonomous TEs from genomic DNA data sets. The output of MITE-Hunter is composed of consensus TE sequences grouped into families that can be used as a library file for homology-based TE detection programs such as RepeatMasker. MITE-Hunter was evaluated by searching the rice genomic database and comparing the output with known rice TEs. It discovered most of the previously reported rice MITEs (97.6%), and found sixteen new elements. MITE-Hunter was also compared with two other MITE discovery programs, FINDMITE and MUST. Unlike MITE-Hunter, neither of these programs can search large genomic data sets including whole genome sequences. More importantly, MITE-Hunter is significantly more accurate than either FINDMITE or MUST as the vast majority of their outputs are false-positives.
微型反向重复转座元件(MITEs)是一类特殊的 2 类非自主转座元件(TE),它们在许多动植物物种的基因非编码区中大量存在。由于它们缺乏编码序列,而且进化速度非常快,因此现有的程序很难准确识别 MITEs。由于它们对基因和基因组进化很重要,我们开发了 MITE-Hunter,这是一个程序管道,可以从基因组 DNA 数据集识别 MITEs 以及其他小型 2 类非自主 TEs。MITE-Hunter 的输出由分类为家族的共识 TE 序列组成,可作为同源性 TE 检测程序(如 RepeatMasker)的库文件使用。通过搜索水稻基因组数据库并将输出与已知的水稻 TEs 进行比较,对 MITE-Hunter 进行了评估。它发现了大多数以前报道的水稻 MITEs(97.6%),并发现了 16 个新元素。与另外两个 MITE 发现程序 FINDMITE 和 MUST 相比,MITE-Hunter 也有所不同,因为这两个程序都不能搜索包括整个基因组序列在内的大型基因组数据集。更重要的是,MITE-Hunter 比 FINDMITE 或 MUST 更准确,因为它们的绝大多数输出都是假阳性。