Nachtweide Stefanie, Stanke Mario
Institute of Mathematics and Computer Science, University of Greifswald, Walther-Rathenau-Straße 47, 17487, Greifswald, Germany.
Methods Mol Biol. 2019;1962:139-160. doi: 10.1007/978-1-4939-9173-0_8.
Comparing multiple related genomes can help to improve their structural annotation. The accuracy and consistency of the predicted exon-intron structures of the protein-coding genes can be higher when considering all genomes at once rather than annotating one genome at a time. The comparative gene prediction algorithm of AUGUSTUS performs such a multi-genome annotation. A multiple alignment of genomes is used to exploit evolutionary clues to conservation and negative selection. Further, AUGUSTUS exploits the fact that orthologous genes typically have congruent exon-intron structures. Comparative AUGUSTUS simultaneously predicts the genes in all input genomes. In this chapter we walk the reader through a small example from eight vertebrate species, including the construction of an alignment of the input genomes and how to integrate RNA-Seq evidence from multiple species for gene finding.