Suppr超能文献

RepeatAround:一种用于在环状基因组中查找和可视化重复序列的软件工具及其在人类线粒体DNA数据库中的应用。

RepeatAround: a software tool for finding and visualizing repeats in circular genomes and its application to a human mtDNA database.

作者信息

Goios Ana, Meirinhos José, Rocha Ricardo, Lopes Ricardo, Amorim António, Pereira Luísa

机构信息

IPATIMUP (Instituto de Patologia e Imunologia Molecular da Universidade do Porto), R. Dr. Roberto Frias, s/n, 4200-465 Porto, Portugal.

出版信息

Mitochondrion. 2006 Aug;6(4):218-24. doi: 10.1016/j.mito.2006.06.001. Epub 2006 Jun 14.

Abstract

RepeatAround is a Windows based software tool designed to find "direct repeats", "inverted repeats", "mirror repeats" and "complementary repeats", from 3 to 64 bp length, in circular genomes. It processes input files directly extracted from GenBank database, providing visualisation of the repeats location in the genomic structure, so that for instance, in most mtDNAs the user can check if the repeats are located in coding or non-coding region (and in the first case in which gene), and how far apart the repeat pair(s) are. Besides the visual tool, it provides other outputs in a spreadsheet containing information on the number and location of the repeats, facilitating graphic analyses. Several genomes can be inputed simultaneously, for phylogenetic comparison purposes. Other capabilities of the software are the generation of random circular genomes, for statistical evaluation of comparison between observed repeats distributions with their shuffled counterparts, as well as the search for specific motifs, allowing an easy confirmation of repeats flanking a newly detected rearrangement. As an example of the programme's applications we analysed the Direct Repeats distribution in a large human mtDNA database. Results showed that Direct Repeats, even the larger ones, are evenly distributed among the human mtDNA haplogroups, enabling us to state that, based only on the repetitive motifs, no haplogroup is particularly more or less prone to mtDNA macrodeletions.

摘要

RepeatAround是一款基于Windows的软件工具,旨在在环状基因组中查找长度为3至64个碱基对的“直接重复序列”“反向重复序列”“镜像重复序列”和“互补重复序列”。它直接处理从GenBank数据库中提取的输入文件,提供重复序列在基因组结构中位置的可视化,例如,在大多数线粒体DNA中,用户可以检查重复序列是位于编码区还是非编码区(如果是前者,位于哪个基因中),以及重复序列对之间的距离有多远。除了可视化工具外,它还在电子表格中提供其他输出,其中包含有关重复序列数量和位置的信息,便于进行图形分析。为了进行系统发育比较,可以同时输入多个基因组。该软件的其他功能包括生成随机环状基因组,用于对观察到的重复序列分布与其随机排列的对应物之间的比较进行统计评估,以及搜索特定基序,以便轻松确认新检测到的重排侧翼的重复序列。作为该程序应用的一个例子,我们分析了一个大型人类线粒体DNA数据库中的直接重复序列分布。结果表明,直接重复序列,即使是较大的重复序列,在人类线粒体DNA单倍群中分布均匀,这使我们能够得出结论,仅基于重复基序,没有哪个单倍群特别更容易或更不容易发生线粒体DNA大缺失。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验