Genome Resource and Analysis Unit, RIKEN Center for Developmental Biology, Kobe, Hyogo 650-0047, Japan.
Nucleic Acids Res. 2013 Jul;41(Web Server issue):W22-8. doi: 10.1093/nar/gkt389. Epub 2013 May 15.
We report a new web server, aLeaves (http://aleaves.cdb.riken.jp/), for collecting homologues from diverse animal genomes. In molecular comparative studies involving multiple species, orthology identification is the basis on which most subsequent biological analyses rely, and it is achieved most accurately by explicit phylogenetic inference. More and more species are being subjected to large-scale sequencing, but the resulting resources are scattered across separate web sites, some dedicated to individual projects and others covering multiple species. This complicates data access and is becoming a serious barrier to comprehensive molecular phylogenetic analysis. aLeaves, launched to overcome this difficulty, collects sequences similar to an input query sequence from various data sources. The collected sequences can be passed on to the MAFFT sequence alignment server (http://mafft.cbrc.jp/alignment/server/), whose interactivity has been significantly improved. This update enables users to switch between (i) sequence selection using the Archaeopteryx tree viewer, (ii) multiple sequence alignment and (iii) tree inference. These steps can be repeated as a loop until one reaches a sensible data set, one that covers the relevant taxa while minimizing redundancy for better visibility and handling in phylogenetic inference. The workflow achieved by the seamless link between aLeaves and MAFFT provides a convenient online platform for addressing various questions in zoology and evolutionary biology.