A powerful non-homology method for the prediction of operons in prokaryotes.

Authors

Moreno-Hagelsieb Gabriel, Collado-Vides Julio

Affiliation

Program of Computational Genomics, CIFN, UNAM, A.P. 565-A, Cuernavaca, Morelos 62100, Mexico.

Publication

Bioinformatics. 2002;18 Suppl 1:S329-36. doi: 10.1093/bioinformatics/18.suppl_1.s329.

Abstract

MOTIVATION

Predicting the transcription unit organization of genomes provides important clues for inferring functional relationships between genes, for interpreting and evaluating transcriptome experiments, and for inferring the regulatory networks that govern gene expression in response to the environment. Although several methods have been devised to predict operons, most require that the analysed genome be well characterized. Log-likelihoods derived from inter-genic distance distributions work surprisingly well to predict operons in Escherichia coli and are available for any genome as soon as the gene sets are predicted.
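To make the idea of a distance-based log-likelihood concrete, here is a minimal Python sketch. It assumes the score for a distance bin is the log of the ratio between the fraction of known same-operon gene pairs and the fraction of known transcription-unit border pairs falling in that bin. The function names, the 10 bp bin size, and the pseudocount are illustrative assumptions, not details taken from the abstract.

```python
import math
from collections import Counter

def distance_log_likelihoods(operon_distances, border_distances,
                             bin_size=10, pseudocount=1.0):
    """Return {bin_index: log-likelihood} from two lists of distances (bp).

    operon_distances: distances between adjacent genes known to share an operon.
    border_distances: distances between adjacent genes at transcription-unit borders.
    bin_size and pseudocount are assumptions made for this sketch.
    """
    # Floor division keeps negative distances (overlaps) in their own bins.
    op_counts = Counter(d // bin_size for d in operon_distances)
    bd_counts = Counter(d // bin_size for d in border_distances)
    bins = set(op_counts) | set(bd_counts)

    # Add-one style smoothing so that empty bins do not give log(0).
    op_total = len(operon_distances) + pseudocount * len(bins)
    bd_total = len(border_distances) + pseudocount * len(bins)

    scores = {}
    for b in bins:
        p_operon = (op_counts[b] + pseudocount) / op_total
        p_border = (bd_counts[b] + pseudocount) / bd_total
        scores[b] = math.log(p_operon / p_border)
    return scores

def score_pair(distance, scores, bin_size=10):
    """Score one adjacent gene pair; unseen bins get a neutral score of 0."""
    return scores.get(distance // bin_size, 0.0)

# Toy training data (invented for illustration only).
toy_scores = distance_log_likelihoods([-4, -1, 5, 8, 12], [150, 200, 90, 5])
```

Under this sketch, a positive score favours the two genes being in the same operon and a negative score favours a transcription-unit border between them.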

RESULTS

Here we provide evidence that the same method is applicable to any prokaryotic genome. First, the method has the same efficiency when evaluated using a collection of experimentally known operons of Bacillus subtilis. Second, operons in most, if not all, prokaryotes seem to show the same tendency to keep short distances between their genes, the most frequent distances being overlaps of four and one base pairs. The universality of this structural feature allows us to predict the organization of transcription units in all prokaryotes. Third, predicted operons contain a higher proportion of genes with related phylogenetic profiles and conservation of adjacency than predicted borders of transcription units.
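The inter-genic distances mentioned above can be computed directly from gene coordinates. The sketch below is one possible convention, assumed here for illustration: coordinates are 1-based and inclusive, only adjacent genes on the same strand are compared, and the distance is the number of bases between the two genes, so that overlaps of four and one base pairs appear as -4 and -1. The `Gene` type and the function name are hypothetical.

```python
from typing import List, NamedTuple, Tuple

class Gene(NamedTuple):
    name: str
    start: int   # leftmost coordinate, 1-based inclusive (assumed convention)
    end: int     # rightmost coordinate, 1-based inclusive
    strand: str  # "+" or "-"

def same_strand_distances(genes: List[Gene]) -> List[Tuple[str, str, int]]:
    """Return (upstream, downstream, distance) for adjacent same-strand genes.

    Distance is the count of bases between the genes; it is negative
    when the two genes overlap.
    """
    ordered = sorted(genes, key=lambda g: g.start)
    pairs = []
    for left, right in zip(ordered, ordered[1:]):
        if left.strand == right.strand:
            pairs.append((left.name, right.name, right.start - left.end - 1))
    return pairs

# Toy coordinates (invented for illustration only).
toy_genes = [
    Gene("geneA", 1, 100, "+"),
    Gene("geneB", 97, 300, "+"),   # shares 4 bases with geneA
    Gene("geneC", 300, 500, "+"),  # shares 1 base with geneB
    Gene("geneD", 650, 900, "-"),  # opposite strand, so not paired with geneC
]
toy_pairs = same_strand_distances(toy_genes)
```

Distances produced this way are the kind of input that a distance-based log-likelihood score would consume.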
