Efficient implementation of a generalized pair hidden Markov model for comparative gene finding.

Author information

Majoros W H, Pertea M, Salzberg S L

Affiliation

Bioinformatics Department, The Institute for Genomic Research, Rockville, MD, USA.

Publication information

Bioinformatics. 2005 May 1;21(9):1782-8. doi: 10.1093/bioinformatics/bti297. Epub 2005 Feb 2.

Abstract

MOTIVATION

The increased availability of genome sequences of closely related organisms has generated much interest in utilizing homology to improve the accuracy of gene prediction programs. Generalized pair hidden Markov models (GPHMMs) have been proposed as one means to address this need. However, all GPHMM implementations currently available are either closed-source or have operational details that are not fully described in the literature, leaving a significant hurdle for others wishing to advance the state of the art in GPHMM design.

RESULTS

We have developed an open-source GPHMM gene finder, TWAIN, which performs very well on two related Aspergillus species, A. fumigatus and A. nidulans: in a test set of 147 conserved gene pairs, it finds 89% of the exons and predicts 74% of the gene models exactly. We describe the implementation of this GPHMM and explicitly address the assumptions and limitations of the system. We also suggest possible ways of relaxing those assumptions to improve the utility of the system without sacrificing efficiency beyond what is practical.

AVAILABILITY

Available at http://www.tigr.org/software/pirate/twain/twain.html under the open-source Artistic License.
