Gemünd C, Ramu C, Altenberg-Greulich B, Gibson T J
European Molecular Biology Laboratory, Postfach 10.2209, 69012 Heidelberg, Germany.
Nucleic Acids Res. 2001 Mar 15;29(6):1272-7. doi: 10.1093/nar/29.6.1272.
Expressed sequence tags (ESTs) are randomly sequenced cDNA clones. Currently, nearly 3 million human and 2 million mouse ESTs provide valuable resources that enable researchers to investigate the products of gene expression. The EST databases have proven to be useful tools for detecting homologous genes, for exon mapping, revealing differential splicing, etc. With the increasing availability of large amounts of poorly characterised eukaryotic (notably human) genomic sequence, ESTs have now become a vital tool for gene identification, sometimes yielding the only unambiguous evidence for the existence of a gene expression product. However, BLAST-based Web servers available to the general user have not kept pace with these developments and do not provide appropriate tools for querying EST databases with large highly spliced genes, often spanning 50 000-100 000 bases or more. Here we describe Gene2EST (http://woody.embl-heidelberg.de/gene2est/), a server that brings together a set of tools enabling efficient retrieval of ESTs matching large DNA queries and their subsequent analysis. RepeatMasker is used to mask dispersed repetitive sequences (such as Alu elements) in the query, BLAST2 for searching EST databases and Artemis for graphical display of the findings. Gene2EST combines these components into a Web resource targeted at the researcher who wishes to study one or a few genes to a high level of detail.
表达序列标签(ESTs)是随机测序的cDNA克隆。目前,近300万个人类EST和200万小鼠EST提供了宝贵的资源,使研究人员能够研究基因表达的产物。EST数据库已被证明是检测同源基因、外显子定位、揭示可变剪接等的有用工具。随着大量特征不明的真核生物(尤其是人类)基因组序列的日益可得,EST现在已成为基因识别的重要工具,有时能提供基因表达产物存在的唯一明确证据。然而,普通用户可用的基于BLAST的网络服务器并未跟上这些发展,且没有提供用于查询含有高度可变剪接的大基因(通常跨度为50000 - 100000个碱基或更多)的EST数据库的适当工具。在此,我们描述了Gene2EST(http://woody.embl - heidelberg.de/gene2est/),这是一个整合了一组工具的服务器,能够高效检索与大型DNA查询匹配的EST并进行后续分析。RepeatMasker用于屏蔽查询中的分散重复序列(如Alu元件),BLAST2用于搜索EST数据库,Artemis用于以图形方式展示结果。Gene2EST将这些组件整合为一个网络资源,面向希望深入研究一个或几个基因的研究人员。