Suppr超能文献

ESTExplorer:一个表达序列标签(EST)组装与注释平台。

ESTExplorer: an expressed sequence tag (EST) assembly and annotation platform.

作者信息

Nagaraj Shivashankar H, Deshpande Nandan, Gasser Robin B, Ranganathan Shoba

机构信息

Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, NSW 2109, Australia.

出版信息

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W143-7. doi: 10.1093/nar/gkm378. Epub 2007 Jun 1.

Abstract

The analysis of expressed sequence tag (EST) datasets offers a rapid and cost-effective approach to elucidate the transcriptome of an organism, but requiring several computational methods for assembly and annotation. ESTExplorer is a comprehensive workflow system for EST data management and analysis. The pipeline uses a 'distributed control approach' in which the most appropriate bioinformatics tools are implemented over different dedicated processors. Species-specific repeat masking and conceptual translation are in-built. ESTExplorer accepts a set of ESTs in FASTA format which can be analysed using programs selected by the user. After pre-processing and assembly, the dataset is annotated at the nucleotide and protein levels, following conceptual translation. Users may optionally provide ESTExplorer with assembled contigs for annotation purposes. Functionally annotated contigs/ESTs can be analysed individually. The overall outputs are gene ontologies, protein functional identifications in terms of mapping to protein domains and metabolic pathways. ESTExplorer has been applied successfully to annotate large EST datasets from parasitic nematodes and to identify novel genes as potential targets for parasite intervention. ESTExplorer runs on a Linux cluster and is freely available for the academic community at http://estexplorer.biolinfo.org.

摘要

对表达序列标签(EST)数据集进行分析,为阐明生物体的转录组提供了一种快速且经济高效的方法,但需要多种计算方法进行组装和注释。ESTExplorer是一个用于EST数据管理和分析的综合工作流程系统。该流程采用“分布式控制方法”,在不同的专用处理器上实施最合适的生物信息学工具。内置了物种特异性重复序列屏蔽和概念性翻译功能。ESTExplorer接受一组FASTA格式的EST,用户可以使用所选程序对其进行分析。经过预处理和组装后,数据集在概念性翻译后在核苷酸和蛋白质水平上进行注释。用户可以选择为注释目的向ESTExplorer提供组装好的重叠群。功能注释的重叠群/EST可以单独进行分析。总体输出包括基因本体论、根据映射到蛋白质结构域和代谢途径的蛋白质功能鉴定。ESTExplorer已成功应用于注释来自寄生线虫的大型EST数据集,并识别作为寄生虫干预潜在靶点的新基因。ESTExplorer在Linux集群上运行,可通过http://estexplorer.biolinfo.org免费提供给学术界使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/072e/1933243/4daefcee642a/gkm378f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验