Hotz-Wagenblatt Agnes, Hankeln Thomas, Ernst Peter, Glatting Karl-Heinz, Schmidt Erwin R, Suhai Sándor
Department of Molecular Biophysics, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 580, D-69120 Heidelberg, Germany.
Nucleic Acids Res. 2003 Jul 1;31(13):3716-9. doi: 10.1093/nar/gkg566.
In high throughput sequence analysis, it is often necessary to combine the results of contemporary bioinformatics tools, because no individual tool alone computes all the requested information. ESTAnnotator is a tool for the high throughput annotation of expressed sequence tags (ESTs) by automatically running a collection of bioinformatics applications. In the first step, a quality check is performed and repeats, vector parts and low quality sequences are masked. Then successive steps of database searching and EST clustering are performed. Already known transcripts present within mRNA and genomic DNA reference databases are identified. Subsequently, tools for the clustering of anonymous ESTs, and for further database searches at the protein level, are applied. Finally, the outputs of each individual tool are gathered and the relevant results presented in a descriptive summary. ESTAnnotator was already successfully applied for the systematic identification and characterisation of novel human genes involved in cartilage/bone formation, growth, differentiation and homeostasis. ESTAnnotator is available at http://genome.dkfz-heidelberg.de, contact: genome@dkfz.de.
在高通量序列分析中,常常需要整合当代生物信息学工具的结果,因为没有任何一个单独的工具能够计算出所有所需信息。ESTAnnotator是一个通过自动运行一系列生物信息学应用程序对表达序列标签(EST)进行高通量注释的工具。第一步,进行质量检查并屏蔽重复序列、载体部分和低质量序列。然后依次进行数据库搜索和EST聚类。识别出mRNA和基因组DNA参考数据库中已有的转录本。随后,应用用于匿名EST聚类以及在蛋白质水平进行进一步数据库搜索的工具。最后,收集每个单独工具的输出结果,并以描述性总结的形式呈现相关结果。ESTAnnotator已成功应用于系统鉴定和表征参与软骨/骨形成、生长、分化和体内平衡的新型人类基因。可通过http://genome.dkfz-heidelberg.de获取ESTAnnotator,联系方式:genome@dkfz.de。