Moore Jonathan E, Lake James A
Molecular Biology Institute, University of California Los Angeles, Los Angeles, CA 90095, USA.
Nucleic Acids Res. 2003 Dec 15;31(24):7271-9. doi: 10.1093/nar/gkg905.
The accurate prediction of higher eukaryotic gene structures and regulatory elements directly from genomic sequences is an important early step in the understanding of newly assembled contigs and finished genomes. As more new genomes are sequenced, comparative approaches are becoming increasingly practical and valuable for predicting genes and regulatory elements. We demonstrate the effectiveness of a comparative method called pattern filtering; it utilizes synteny between two or more genomic segments for the annotation of genomic sequences. Pattern filtering optimally detects the signatures of conserved functional elements despite the stochastic noise inherent in evolutionary processes, allowing more accurate annotation of gene models. We anticipate that pattern filtering will facilitate sequence annotation and the discovery of new functional elements by the genetics and genomics communities.