Duret L, Mouchiroud D, Gouy M
Laboratoire de Biométrie, Génétique et Biologie des Populations, Université Claude Bernard, Lyon I, URA-CNRS 243, Villeurbanne, France.
Nucleic Acids Res. 1994 Jun 25;22(12):2360-5. doi: 10.1093/nar/22.12.2360.
Comparison of homologous genes is a major step for many studies related to genome structure, function or evolution. Similarity search programs easily find genes homologous to a given sequence. However, only very tedious manual procedures allow the retrieval of all sets of homologous genes sequenced for a given set of species. Moreover, this search often generates errors due to the complexity of data to be managed simultaneously: phylogenetic trees, alignments, taxonomy, sequences and related information. HOVERGEN helps to solve these problems by integrating all this information. HOVERGEN corresponds to GenBank sequences from all vertebrate species, with some data corrected, clarified, or completed, notably to address the problem of redundancy. Coding sequences have been classified in gene families. Protein multiple alignments and phylogenetic trees have been calculated for each family. Sequences and related information have been structured in an ACNUC database which permits complex selections. A graphical interface has been developed to visualize and edit trees. Genes are displayed in color, according to their taxonomy. Users have directly access to all information attached to sequences and to multiple alignments simply by clicking on genes. This graphical tool gives thus a rapid and simple access to all data necessary to interpret homology relationships between genes. HOVERGEN allows the user to easily select sets of homologous vertebrate genes, and thus is particularly useful for comparative sequence analysis, or molecular evolution studies.
同源基因的比较是许多与基因组结构、功能或进化相关研究的重要步骤。相似性搜索程序能够轻松找到与给定序列同源的基因。然而,只有非常繁琐的手动操作才能检索出针对给定物种集测序的所有同源基因集。此外,由于要同时管理的数据(系统发育树、比对、分类学、序列及相关信息)的复杂性,这种搜索经常会产生错误。HOVERGEN通过整合所有这些信息来帮助解决这些问题。HOVERGEN包含所有脊椎动物物种的GenBank序列,并对一些数据进行了校正、澄清或补充,特别是为了解决冗余问题。编码序列已被分类到基因家族中。已为每个家族计算了蛋白质多序列比对和系统发育树。序列及相关信息已构建在一个ACNUC数据库中,该数据库允许进行复杂的选择。已开发出一个图形界面来可视化和编辑树。基因根据其分类学以不同颜色显示。用户只需点击基因就能直接访问与序列及多序列比对相关的所有信息。因此,这个图形工具能让用户快速、简便地获取解释基因间同源关系所需的所有数据。HOVERGEN允许用户轻松选择同源脊椎动物基因集,因此对比较序列分析或分子进化研究特别有用。