Suppr超能文献

元基因组序列中的基因和翻译起始位点预测。

Gene and translation initiation site prediction in metagenomic sequences.

机构信息

Computational Biology and Bioinformatics Group, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA.

出版信息

Bioinformatics. 2012 Sep 1;28(17):2223-30. doi: 10.1093/bioinformatics/bts429. Epub 2012 Jul 12.

Abstract

MOTIVATION

Gene prediction in metagenomic sequences remains a difficult problem. Current sequencing technologies do not achieve sufficient coverage to assemble the individual genomes in a typical sample; consequently, sequencing runs produce a large number of short sequences whose exact origin is unknown. Since these sequences are usually smaller than the average length of a gene, algorithms must make predictions based on very little data.

RESULTS

We present MetaProdigal, a metagenomic version of the gene prediction program Prodigal, that can identify genes in short, anonymous coding sequences with a high degree of accuracy. The novel value of the method consists of enhanced translation initiation site identification, ability to identify sequences that use alternate genetic codes and confidence values for each gene call. We compare the results of MetaProdigal with other methods and conclude with a discussion of future improvements.

AVAILABILITY

The Prodigal software is freely available under the General Public License from http://code.google.com/p/prodigal/.

摘要

动机

在宏基因组序列中进行基因预测仍然是一个难题。当前的测序技术无法实现足够的覆盖度来组装典型样本中的单个基因组;因此,测序运行会产生大量的短序列,其确切来源是未知的。由于这些序列通常比基因的平均长度小,因此算法必须基于非常少的数据进行预测。

结果

我们提出了 MetaProdigal,这是基因预测程序 Prodigal 的宏基因组版本,可以非常准确地识别短的、匿名的编码序列中的基因。该方法的新颖之处在于增强了翻译起始位点的识别能力、识别使用替代遗传密码的序列的能力以及每个基因调用的置信值。我们将 MetaProdigal 的结果与其他方法进行了比较,并讨论了未来的改进。

可用性

Prodigal 软件可根据通用公共许可证免费从 http://code.google.com/p/prodigal/ 获取。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验