• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过“逐帧”算法寻找原核生物基因:靶向基因起始位点和重叠基因。

Finding prokaryotic genes by the 'frame-by-frame' algorithm: targeting gene starts and overlapping genes.

作者信息

Shmatkov A M, Melikyan A A, Chernousko F L, Borodovsky M

机构信息

Russian Academy of Science, Institute for Problems in Mechanics, Moscow, Russia.

出版信息

Bioinformatics. 1999 Nov;15(11):874-86. doi: 10.1093/bioinformatics/15.11.874.

DOI:10.1093/bioinformatics/15.11.874
PMID:10743554
Abstract

MOTIVATION

Tightly packed prokaryotic genes frequently overlap with each other. This feature, rarely seen in eukaryotic DNA, makes detection of translation initiation sites and, therefore, exact predictions of prokaryotic genes notoriously difficult. Improving the accuracy of precise gene prediction in prokaryotic genomic DNA remains an important open problem.

RESULTS

A software program implementing a new algorithm utilizing a uniform Hidden Markov Model for prokaryotic gene prediction was developed. The algorithm analyzes a given DNA sequence in each of six possible global reading frames independently. Twelve complete prokaryotic genomes were analyzed using the new tool. The accuracy of gene finding, predicting locations of protein-coding ORFs, as well as the accuracy of precise gene prediction, and detecting the whole gene including translation initiation codon were assessed by comparison with existing annotation. It was shown that in terms of gene finding, the program performs at least as well as the previously developed tools, such as GeneMark and GLIMMER. In terms of precise gene prediction the new program was shown to be more accurate, by several percentage points, than earlier developed tools, such as GeneMark.hmm, ECOPARSE and ORPHEUS. The results of testing the program indicated the possibility of systematic bias in start codon annotation in several early sequenced prokaryotic genomes.

AVAILABILITY

The new gene-finding program can be accessed through the Web site: http:@dixie.biology.gatech.edu/GeneMark/fbf.cgi

CONTACT

mark@amber.gatech.edu.

摘要

动机

紧密排列的原核基因经常相互重叠。这种在真核DNA中罕见的特征使得翻译起始位点的检测变得困难,因此,原核基因的精确预测也非常困难。提高原核基因组DNA中精确基因预测的准确性仍然是一个重要的开放性问题。

结果

开发了一种软件程序,该程序实现了一种利用统一隐马尔可夫模型进行原核基因预测的新算法。该算法独立分析六个可能的全局阅读框中的每一个给定DNA序列。使用这个新工具分析了12个完整的原核基因组。通过与现有注释进行比较,评估了基因发现、蛋白质编码开放阅读框位置预测的准确性,以及精确基因预测和检测包括翻译起始密码子在内的整个基因的准确性。结果表明,在基因发现方面,该程序的表现至少与之前开发的工具(如GeneMark和GLIMMER)一样好。在精确基因预测方面,新程序比早期开发的工具(如GeneMark.hmm、ECOPARSE和ORPHEUS)更准确,高出几个百分点。对该程序的测试结果表明,在几个早期测序的原核基因组中,起始密码子注释可能存在系统偏差。

可用性

可以通过网站http:@dixie.biology.gatech.edu/GeneMark/fbf.cgi访问这个新的基因发现程序。

联系方式

mark@amber.gatech.edu。

相似文献

1
Finding prokaryotic genes by the 'frame-by-frame' algorithm: targeting gene starts and overlapping genes.通过“逐帧”算法寻找原核生物基因:靶向基因起始位点和重叠基因。
Bioinformatics. 1999 Nov;15(11):874-86. doi: 10.1093/bioinformatics/15.11.874.
2
GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.GeneMarkS:一种用于预测微生物基因组中基因起始位点的自训练方法。对在调控区域中寻找序列基序的启示。
Nucleic Acids Res. 2001 Jun 15;29(12):2607-18. doi: 10.1093/nar/29.12.2607.
3
Probabilistic methods of identifying genes in prokaryotic genomes: connections to the HMM theory.原核生物基因组中基因识别的概率方法:与隐马尔可夫模型理论的联系。
Brief Bioinform. 2004 Jun;5(2):118-30. doi: 10.1093/bib/5.2.118.
4
Accuracy improvement for identifying translation initiation sites in microbial genomes.提高微生物基因组中翻译起始位点识别的准确性。
Bioinformatics. 2004 Dec 12;20(18):3308-17. doi: 10.1093/bioinformatics/bth390. Epub 2004 Jul 9.
5
TICO: a tool for improving predictions of prokaryotic translation initiation sites.TICO:一种用于改进原核生物翻译起始位点预测的工具。
Bioinformatics. 2005 Sep 1;21(17):3568-9. doi: 10.1093/bioinformatics/bti563. Epub 2005 Jun 30.
6
Re-annotation of genome microbial coding-sequences: finding new genes and inaccurately annotated genes.基因组微生物编码序列的重新注释:发现新基因和注释不准确的基因。
BMC Bioinformatics. 2002;3:5. doi: 10.1186/1471-2105-3-5. Epub 2002 Feb 5.
7
How to interpret an anonymous bacterial genome: machine learning approach to gene identification.如何解读匿名细菌基因组:用于基因识别的机器学习方法
Genome Res. 1998 Nov;8(11):1154-71. doi: 10.1101/gr.8.11.1154.
8
A novel bacterial gene-finding system with improved accuracy in locating start codons.一种在定位起始密码子方面具有更高准确性的新型细菌基因发现系统。
DNA Res. 2001 Jun 30;8(3):97-106. doi: 10.1093/dnares/8.3.97.
9
GeneMark.hmm: new solutions for gene finding.基因标记隐马尔可夫模型:基因发现的新解决方案。
Nucleic Acids Res. 1998 Feb 15;26(4):1107-15. doi: 10.1093/nar/26.4.1107.
10
A comparative genomic method for computational identification of prokaryotic translation initiation sites.一种用于原核生物翻译起始位点计算识别的比较基因组方法。
Nucleic Acids Res. 2002 Jul 15;30(14):3181-91. doi: 10.1093/nar/gkf423.

引用本文的文献

1
Genome-Scale Transcription-Translation Mapping Reveals Features of Zymomonas mobilis Transcription Units and Promoters.全基因组转录-翻译图谱揭示了运动发酵单胞菌转录单元和启动子的特征。
mSystems. 2020 Jul 21;5(4):e00250-20. doi: 10.1128/mSystems.00250-20.
2
Modeling leaderless transcription and atypical genes results in more accurate gene prediction in prokaryotes.无领导转录和非典型基因的建模可提高原核生物中基因预测的准确性。
Genome Res. 2018 Jul;28(7):1079-1089. doi: 10.1101/gr.230615.117. Epub 2018 May 17.
3
Recognizing short coding sequences of prokaryotic genome using a novel iteratively adaptive sparse partial least squares algorithm.
利用一种新颖的迭代自适应稀疏偏最小二乘算法识别原核基因组的短编码序列。
Biol Direct. 2013 Sep 25;8:23. doi: 10.1186/1745-6150-8-23.
4
Evaluating bacterial gene-finding HMM structures as probabilistic logic programs.评估细菌基因发现 HMM 结构作为概率逻辑程序。
Bioinformatics. 2012 Mar 1;28(5):636-42. doi: 10.1093/bioinformatics/btr698. Epub 2012 Jan 3.
5
Complete genome sequence of the giant virus OBP and comparative genome analysis of the diverse ΦKZ-related phages.OBP 巨型病毒的全基因组序列和不同 ΦKZ 相关噬菌体的比较基因组分析。
J Virol. 2012 Feb;86(3):1844-52. doi: 10.1128/JVI.06330-11. Epub 2011 Nov 30.
6
Prokaryotic gene finding based on physicochemical characteristics of codons calculated from molecular dynamics simulations.基于分子动力学模拟计算出的密码子理化特性进行原核生物基因发现。
Biophys J. 2008 Jun;94(11):4173-83. doi: 10.1529/biophysj.107.116392. Epub 2008 Mar 7.
7
Insight into the haem d1 biosynthesis pathway in heliobacteria through bioinformatics analysis.通过生物信息学分析深入了解嗜盐菌中血红素d1生物合成途径。
Microbiology (Reading). 2007 Oct;153(Pt 10):3548-3562. doi: 10.1099/mic.0.2007/007930-0.
8
Complete genomic sequence and mass spectrometric analysis of highly diverse, atypical Bacillus thuringiensis phage 0305phi8-36.高度多样的非典型苏云金芽孢杆菌噬菌体0305phi8-36的全基因组序列及质谱分析
Virology. 2007 Nov 25;368(2):405-21. doi: 10.1016/j.virol.2007.06.043. Epub 2007 Jul 30.
9
MetaGene: prokaryotic gene finding from environmental genome shotgun sequences.MetaGene:从环境基因组鸟枪法测序中寻找原核生物基因
Nucleic Acids Res. 2006;34(19):5623-30. doi: 10.1093/nar/gkl723. Epub 2006 Oct 5.
10
Dual control of quorum sensing by two TraM-type antiactivators in Agrobacterium tumefaciens octopine strain A6.根癌土壤杆菌章鱼碱型菌株A6中两种TraM型抗激活剂对群体感应的双重控制。
J Bacteriol. 2006 Apr;188(7):2435-45. doi: 10.1128/JB.188.7.2435-2445.2006.