Korf I, Flicek P, Duan D, Brent M R
Department of Computer Science, Washington University, Campus Box 1045, St. Louis, MO, 63130, USA.
Bioinformatics. 2001;17 Suppl 1:S140-8. doi: 10.1093/bioinformatics/17.suppl_1.s140.
TWINSCAN is a new gene-structure prediction system that directly extends the probability model of GENSCAN, allowing it to exploit homology between two related genomes. Separate probability models are used for conservation in exons, introns, splice sites, and UTRs, reflecting the differences among their patterns of evolutionary conservation. TWINSCAN is specifically designed for the analysis of high-throughput genomic sequences containing an unknown number of genes. In experiments on high-throughput mouse sequences, using homologous sequences from the human genome, TWINSCAN shows notable improvement over GENSCAN in exon sensitivity and specificity and dramatic improvement in exact gene sensitivity and specificity. This improvement can be attributed entirely to modeling the patterns of evolutionary conservation in genomic sequence.
TWINSCAN是一种新的基因结构预测系统,它直接扩展了GENSCAN的概率模型,使其能够利用两个相关基因组之间的同源性。针对外显子、内含子、剪接位点和非翻译区的保守性使用了单独的概率模型,反映了它们进化保守模式的差异。TWINSCAN专门设计用于分析包含未知数量基因的高通量基因组序列。在对高通量小鼠序列的实验中,使用来自人类基因组的同源序列,TWINSCAN在外显子敏感性和特异性方面比GENSCAN有显著提高,在精确基因敏感性和特异性方面有显著提升。这种改进完全归因于对基因组序列进化保守模式的建模。