Masoudi-Nejad Ali, Tonomura Koichiro, Kawashima Shuichi, Moriya Yuki, Suzuki Masanori, Itoh Masumi, Kanehisa Minoru, Endo Takashi, Goto Susumu
Laboratory of Bioknowledge Systems, Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho Uji, Kyoto 611-0011, Japan.
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W459-62. doi: 10.1093/nar/gkl066.
Expressed sequence tag (EST) sequencing has proven to be an economically feasible alternative for gene discovery in species lacking a draft genome sequence. Ongoing large-scale EST sequencing projects feel the need for bioinformatics tools to facilitate uniform EST handling. This brings about a renewed importance for a universal tool for processing and functional annotation of large sets of ESTs. EGassembler (http://egassembler.hgc.jp/) is a web server, which provides an automated as well as a user-customized analysis tool for cleaning, repeat masking, vector trimming, organelle masking, clustering and assembling of ESTs and genomic fragments. The web server is publicly available and provides the community a unique all-in-one online application web service for large-scale ESTs and genomic DNA clustering and assembling. Running on a Sun Fire 15K supercomputer, a significantly large volume of data can be processed in a short period of time. The results can be used to functionally annotate genes, to facilitate splice alignment analysis, to link the transcripts to genetic and physical maps, design microarray chips, to perform transcriptome analysis and to map to KEGG metabolic pathways. The service provides an excellent bioinformatics tool to research groups in wet-lab as well as an all-in-one-tool for sequence handling to bioinformatics researchers.
表达序列标签(EST)测序已被证明是在缺乏基因组草图序列的物种中进行基因发现的一种经济可行的替代方法。正在进行的大规模EST测序项目感到需要生物信息学工具来促进对EST的统一处理。这使得用于处理大量EST并进行功能注释的通用工具再次变得重要。EGassembler(http://egassembler.hgc.jp/)是一个网络服务器,它为EST和基因组片段的清理、重复序列屏蔽、载体修剪、细胞器屏蔽、聚类和组装提供了一个自动化的以及用户定制的分析工具。该网络服务器是公开可用的,为社区提供了一个独特的一体化在线应用网络服务,用于大规模EST和基因组DNA的聚类和组装。它运行在一台Sun Fire 15K超级计算机上,能够在短时间内处理大量数据。结果可用于对基因进行功能注释、促进剪接比对分析、将转录本与遗传图谱和物理图谱进行关联、设计微阵列芯片、进行转录组分析以及映射到KEGG代谢途径。该服务为湿实验室的研究团队提供了一个出色的生物信息学工具,也为生物信息学研究人员提供了一个用于序列处理的一体化工具。