EST注释器：一种用于高通量EST注释的工具。

ESTAnnotator: A tool for high throughput EST annotation.

作者信息

Hotz-Wagenblatt Agnes, Hankeln Thomas, Ernst Peter, Glatting Karl-Heinz, Schmidt Erwin R, Suhai Sándor

机构信息

Department of Molecular Biophysics, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 580, D-69120 Heidelberg, Germany.

出版信息

Nucleic Acids Res. 2003 Jul 1;31(13):3716-9. doi: 10.1093/nar/gkg566.

DOI:10.1093/nar/gkg566

PMID:12824401

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC169160/

Abstract

In high throughput sequence analysis, it is often necessary to combine the results of contemporary bioinformatics tools, because no individual tool alone computes all the requested information. ESTAnnotator is a tool for the high throughput annotation of expressed sequence tags (ESTs) by automatically running a collection of bioinformatics applications. In the first step, a quality check is performed and repeats, vector parts and low quality sequences are masked. Then successive steps of database searching and EST clustering are performed. Already known transcripts present within mRNA and genomic DNA reference databases are identified. Subsequently, tools for the clustering of anonymous ESTs, and for further database searches at the protein level, are applied. Finally, the outputs of each individual tool are gathered and the relevant results presented in a descriptive summary. ESTAnnotator was already successfully applied for the systematic identification and characterisation of novel human genes involved in cartilage/bone formation, growth, differentiation and homeostasis. ESTAnnotator is available at http://genome.dkfz-heidelberg.de, contact: genome@dkfz.de.

摘要

在高通量序列分析中，常常需要整合当代生物信息学工具的结果，因为没有任何一个单独的工具能够计算出所有所需信息。ESTAnnotator是一个通过自动运行一系列生物信息学应用程序对表达序列标签（EST）进行高通量注释的工具。第一步，进行质量检查并屏蔽重复序列、载体部分和低质量序列。然后依次进行数据库搜索和EST聚类。识别出mRNA和基因组DNA参考数据库中已有的转录本。随后，应用用于匿名EST聚类以及在蛋白质水平进行进一步数据库搜索的工具。最后，收集每个单独工具的输出结果，并以描述性总结的形式呈现相关结果。ESTAnnotator已成功应用于系统鉴定和表征参与软骨/骨形成、生长、分化和体内平衡的新型人类基因。可通过http://genome.dkfz-heidelberg.de获取ESTAnnotator，联系方式：genome@dkfz.de。

相似文献

ESTAnnotator: A tool for high throughput EST annotation.EST注释器：一种用于高通量EST注释的工具。

Nucleic Acids Res. 2003 Jul 1;31(13):3716-9. doi: 10.1093/nar/gkg566.

ESAP plus: a web-based server for EST-SSR marker development.ESAP plus：一个用于EST-SSR标记开发的基于网络的服务器。

BMC Genomics. 2016 Dec 22;17(Suppl 13):1035. doi: 10.1186/s12864-016-3328-4.

EGassembler: online bioinformatics service for large-scale processing, clustering and assembling ESTs and genomic DNA fragments.EGassembler：用于大规模处理、聚类和组装EST及基因组DNA片段的在线生物信息学服务。

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W459-62. doi: 10.1093/nar/gkl066.

ESTMAP: a system for expressed sequence tags mapping on genomic sequences.ESTMAP：一种用于将表达序列标签映射到基因组序列上的系统。

IEEE Trans Nanobioscience. 2003 Jun;2(2):75-8. doi: 10.1109/tnb.2003.813928.

ESTIMA, a tool for EST management in a multi-project environment.ESTIMA，一种用于多项目环境中EST管理的工具。

BMC Bioinformatics. 2004 Nov 4;5:176. doi: 10.1186/1471-2105-5-176.

The TIGR Gene Indices: clustering and assembling EST and known genes and integration with eukaryotic genomes.TIGR基因索引：EST和已知基因的聚类与组装以及与真核生物基因组的整合

Nucleic Acids Res. 2005 Jan 1;33(Database issue):D71-4. doi: 10.1093/nar/gki064.

ESTExplorer: an expressed sequence tag (EST) assembly and annotation platform.ESTExplorer：一个表达序列标签（EST）组装与注释平台。

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W143-7. doi: 10.1093/nar/gkm378. Epub 2007 Jun 1.

GARSA: genomic analysis resources for sequence annotation.GARSA：用于序列注释的基因组分析资源。

Bioinformatics. 2005 Dec 1;21(23):4302-3. doi: 10.1093/bioinformatics/bti705. Epub 2005 Oct 6.

PESTAS: a web server for EST analysis and sequence mining.PESTAS：用于EST分析和序列挖掘的网络服务器。

Bioinformatics. 2009 Jul 15;25(14):1846-8. doi: 10.1093/bioinformatics/btp293. Epub 2009 May 4.

ESTree db: a tool for peach functional genomics.ESTree数据库：一种用于桃功能基因组学的工具。

BMC Bioinformatics. 2005 Dec 1;6 Suppl 4(Suppl 4):S16. doi: 10.1186/1471-2105-6-S4-S16.

引用本文的文献

Sma3s: a three-step modular annotator for large sequence datasets.Sma3s：一种用于大型序列数据集的三步模块化注释器。

DNA Res. 2014 Aug;21(4):341-53. doi: 10.1093/dnares/dsu001. Epub 2014 Feb 5.

TranSeqAnnotator: large-scale analysis of transcriptomic data.TranSeqAnnotator：大规模转录组数据分析。

BMC Bioinformatics. 2012;13 Suppl 17(Suppl 17):S24. doi: 10.1186/1471-2105-13-S17-S24. Epub 2012 Dec 13.

FastAnnotator--an efficient transcript annotation web tool.FastAnnotator——一个高效的转录本注释网络工具。

BMC Genomics. 2012;13 Suppl 7(Suppl 7):S9. doi: 10.1186/1471-2164-13-S7-S9. Epub 2012 Dec 13.

The mining of toxin-like polypeptides from EST database by single residue distribution analysis.通过单残基分布分析从 EST 数据库中挖掘毒素样多肽。

BMC Genomics. 2011 Jan 31;12:88. doi: 10.1186/1471-2164-12-88.

SeqTrim: a high-throughput pipeline for pre-processing any type of sequence read.SeqTrim：一种用于预处理任何类型序列读取的高通量管道。

BMC Bioinformatics. 2010 Jan 20;11:38. doi: 10.1186/1471-2105-11-38.

GarlicESTdb: an online database and mining tool for garlic EST sequences.大蒜EST数据库：一个用于大蒜EST序列的在线数据库和挖掘工具。

BMC Plant Biol. 2009 May 18;9:61. doi: 10.1186/1471-2229-9-61.

ESTPiper--a web-based analysis pipeline for expressed sequence tags.ESTPiper——一个用于表达序列标签的基于网络的分析流程。

BMC Genomics. 2009 Apr 21;10:174. doi: 10.1186/1471-2164-10-174.

OREST: the online resource for EST analysis.OREST：EST分析的在线资源。

Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W140-4. doi: 10.1093/nar/gkn253. Epub 2008 May 7.

In silico analysis of expressed sequence tags from Trichostrongylus vitrinus (Nematoda): comparison of the automated ESTExplorer workflow platform with conventional database searches.玻璃细颈线虫（线虫纲）表达序列标签的电子分析：自动化ESTExplorer工作流程平台与传统数据库搜索的比较

BMC Bioinformatics. 2008;9 Suppl 1(Suppl 1):S10. doi: 10.1186/1471-2105-9-S1-S10.

ESTExplorer: an expressed sequence tag (EST) assembly and annotation platform.ESTExplorer：一个表达序列标签（EST）组装与注释平台。

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W143-7. doi: 10.1093/nar/gkm378. Epub 2007 Jun 1.

本文引用的文献

A task framework for the web interface W2H.用于网络界面W2H的任务框架。

Bioinformatics. 2003 Jan 22;19(2):278-82. doi: 10.1093/bioinformatics/19.2.278.

PipeOnline 2.0: automated EST processing and functional data sorting.PipeOnline 2.0：EST自动化处理与功能数据分类

Nucleic Acids Res. 2002 Nov 1;30(21):4761-9. doi: 10.1093/nar/gkf585.

Assembly, annotation, and integration of UNIGENE clusters into the human genome draft.将UNIGENE簇组装、注释并整合到人类基因组草图中。

Genome Res. 2001 May;11(5):904-18. doi: 10.1101/gr.gr-1645r.

STACK: Sequence Tag Alignment and Consensus Knowledgebase.STACK：序列标签比对与一致性知识库。

Nucleic Acids Res. 2001 Jan 1;29(1):234-8. doi: 10.1093/nar/29.1.234.

An optimized protocol for analysis of EST sequences.一种用于表达序列标签（EST）序列分析的优化方案。

Nucleic Acids Res. 2000 Sep 15;28(18):3657-65. doi: 10.1093/nar/28.18.3657.

W2H: WWW interface to the GCG sequence analysis package.W2H：GCG序列分析软件包的万维网接口。

Bioinformatics. 1998 Jun;14(5):452-7. doi: 10.1093/bioinformatics/14.5.452.

Base-calling of automated sequencer traces using phred. I. Accuracy assessment.使用Phred对自动测序仪轨迹进行碱基识别。I. 准确性评估。

Genome Res. 1998 Mar;8(3):175-85. doi: 10.1101/gr.8.3.175.

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.空位BLAST和位置特异性迭代BLAST：新一代蛋白质数据库搜索程序。

Nucleic Acids Res. 1997 Sep 1;25(17):3389-402. doi: 10.1093/nar/25.17.3389.

Complementary DNA sequencing: expressed sequence tags and human genome project.互补DNA测序：表达序列标签与人类基因组计划

Science. 1991 Jun 21;252(5013):1651-6. doi: 10.1126/science.2047873.

A contig assembly program based on sensitive detection of fragment overlaps.一种基于片段重叠灵敏检测的重叠群组装程序。

Genomics. 1992 Sep;14(1):18-25. doi: 10.1016/s0888-7543(05)80277-0.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验