Suppr超能文献

GMATo:一种用于在大型基因组中鉴定和分析微卫星的新型工具。

GMATo: A novel tool for the identification and analysis of microsatellites in large genomes.

作者信息

Wang Xuewen, Lu Peng, Luo Zhaopeng

机构信息

China Tobacco Gene Research Center, Zhengzhou Tobacco Research Institute, NO.2 Fengyang Street, Hi-tech zone, Zhengzhou 450001, China.

出版信息

Bioinformation. 2013 Jun 8;9(10):541-4. doi: 10.6026/97320630009541. Print 2013.

Abstract

UNLABELLED

Simple Sequence Repeats (SSR), also called microsatellite, is very useful for genetic marker development and genome application. The increasing whole sequences of more and more large genomes provide sources for SSR mining in silico. However currently existing SSR mining tools can't process large genomes efficiently and generate no or poor statistics. Genome-wide Microsatellite Analyzing Tool (GMATo) is a novel tool for SSR mining and statistics at genome aspects. It is faster and more accurate than existed tools SSR Locator and MISA. If a DNA sequence was too long, it was chunked to short segments at several Mb followed by motifs generation and searching using Perl powerful pattern match function. Matched loci data from each chunk were then merged to produce final SSR loci information. Only one input file is required which contains raw fasta DNA sequences and output files in tabular format list all SSR loci information and statistical distribution at four classifications. GMATo was programmed in Java and Perl with both graphic and command line interface, either executable alone in platform independent manner with full parameters control. Software GMATo is a powerful tool for complete SSR characterization in genomes at any size.

AVAILABILITY

The soft GMATo is freely available at http://sourceforge.net/projects/gmato/files/?source=navbar or on contact.

摘要

未标注

简单序列重复(SSR),也称为微卫星,在遗传标记开发和基因组应用中非常有用。越来越多大型基因组的全序列增加为电子克隆SSR提供了来源。然而,目前现有的SSR挖掘工具无法高效处理大型基因组,也无法生成统计数据或生成的统计数据质量不佳。全基因组微卫星分析工具(GMATo)是一种用于基因组层面SSR挖掘和统计的新型工具。它比现有的工具SSR Locator和MISA更快、更准确。如果DNA序列太长,它会被分成几个兆字节的短片段,然后使用Perl强大的模式匹配功能生成基序并进行搜索。然后将每个片段的匹配位点数据合并,以生成最终的SSR位点信息。只需要一个输入文件,其中包含原始的fasta DNA序列,输出文件以表格形式列出所有SSR位点信息以及四种分类下的统计分布。GMATo是用Java和Perl编写的,具有图形界面和命令行界面,可以以独立于平台的方式单独执行,并具有完整的参数控制。软件GMATo是一个强大的工具,可用于对任何大小的基因组进行完整的SSR表征。

可用性

软件GMATo可从http://sourceforge.net/projects/gmato/files/?source=navbar免费获取,也可通过联系获取。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验