PhyloGena--a user-friendly system for automated phylogenetic annotation of unknown sequences.

Authors

Hanekamp Kristian, Bohnebeck Uta, Beszteri Bánk, Valentin Klaus

Affiliation

Center for Computing Technologies (TZI), P.O.B. 330440, D-28334 Bremen, Germany.

Publication information

Bioinformatics. 2007 Apr 1;23(7):793-801. doi: 10.1093/bioinformatics/btm016. Epub 2007 Mar 1.

Abstract

MOTIVATION

Phylogenomic approaches towards functional and evolutionary annotation of unknown sequences have been suggested to be superior to those based only on pairwise local alignments. User-friendly software tools making the advantages of phylogenetic annotation available for the ever widening range of bioinformatically uninitiated biologists involved in genome/EST annotation projects are, however, not available. We were particularly confronted with this issue in the annotation of sequences from different groups of complex algae originating from secondary endosymbioses, where the identification of the phylogenetic origin of genes is often more problematic than in taxa well represented in the databases (e.g. animals, plants or fungi).

RESULTS

We present a flexible pipeline with a user-friendly, interactive graphical user interface running on desktop computers that automatically performs a basic local alignment search tool (BLAST) search of query sequences, selects a representative subset of the hits, creates a multiple alignment from the selected sequences, and finally computes a phylogenetic tree. The pipeline, named PhyloGena, uses public domain software for all standard bioinformatics tasks (similarity search, multiple alignment, and phylogenetic reconstruction). As the major technological innovation, selection of a meaningful subset of BLAST hits was implemented using logic programming, mimicking the manual selection procedure of an expert. The results (BLAST tables, multiple alignments and phylogenetic trees) are displayed graphically, allowing the user to interact with the pipeline and deduce the function and phylogenetic origin of the query. PhyloGena thus makes phylogenomic annotation available also for those biologists without access to large computing facilities and with little informatics background. Although phylogenetic annotation is particularly useful when working with composite genomes (e.g. from complex algae), PhyloGena can be helpful in expressed sequence tag and genome annotation also in other organisms.
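The hit-selection step can be illustrated with a minimal sketch. It is not PhyloGena's rule set, which the abstract says is implemented with logic programming; it is a plain-Python stand-in that assumes three hypothetical rules (an E-value cutoff, one hit per taxon, and a cap on subset size). The `Hit` record, the function name, the thresholds and the example data are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    """One BLAST hit; the fields are a hypothetical, simplified record."""
    subject_id: str
    taxon: str
    evalue: float


def select_representative_hits(hits, max_evalue=1e-5, max_hits=3):
    """Keep the best hit per taxon, then the overall best `max_hits` of those."""
    best_per_taxon = {}
    for hit in hits:
        if hit.evalue > max_evalue:
            continue  # assumed rule 1: drop weak hits
        current = best_per_taxon.get(hit.taxon)
        if current is None or hit.evalue < current.evalue:
            best_per_taxon[hit.taxon] = hit  # assumed rule 2: one hit per taxon
    ranked = sorted(best_per_taxon.values(), key=lambda h: h.evalue)
    return ranked[:max_hits]  # assumed rule 3: cap the subset size


# Made-up example data, not results from the paper.
hits = [
    Hit("s1", "Viridiplantae", 1e-50),
    Hit("s2", "Viridiplantae", 1e-40),
    Hit("s3", "Rhodophyta", 1e-30),
    Hit("s4", "Cyanobacteria", 1e-20),
    Hit("s5", "Metazoa", 1e-2),
]
selected = select_representative_hits(hits)
print([h.subject_id for h in selected])
```

In a full pipeline of the kind described, the selected sequences would then be passed to a multiple alignment program and a tree-building program; those steps are omitted here because the abstract does not name the tools used.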

AVAILABILITY

PhyloGena (executables for LINUX and Windows 2000/XP as well as source code) is available by anonymous ftp from http://www.awi.de/en/phylogena.
