Mironov A A, Pevzner P A
Laboratory of Mathematical Methods, National Center for Biotechnology NIIGENETIKA, Moscow, Russia.
Microb Comp Genomics. 1999;4(3):167-72. doi: 10.1089/omi.1.1999.4.167.
The expressed sequence tag (EST) data provide a powerful tool for identification of transcribed DNA sequences. However, as EST are relatively short, many exons are poorly covered by EST, thus reducing the utility of EST data. Recently, signature sequence tag (SST) fingerprints were proposed as an alternative to EST fingerprints. Given a fingerprint set of probes, SST of a clone is a subset of probes from the fingerprint set that hybridize with the clone. We demonstrate that besides being a powerful technique for screening cDNA libraries, SST technology provides for very accurate gene predictions. Even with a small fingerprint set (600-800 probes), SST-based gene recognition outperforms many conventional and EST-based methods. The increase in the size of the fingerprint set to 1500 probes provides almost perfect gene recognition. Even more importantly, SST-based gene predictions miss very few exons and, therefore, provide an opportunity to bypass the cDNA sequencing step on the way from finished genomic sequence to mutation detection in gene-hunting projects. Because SST data can be obtained in a highly parallel and inexpensive way, SST technology has a potential of complementing EST technology for gene hunting.
表达序列标签(EST)数据为鉴定转录的DNA序列提供了一个强大的工具。然而,由于EST相对较短,许多外显子被EST覆盖的程度较差,从而降低了EST数据的实用性。最近,特征序列标签(SST)指纹图谱被提议作为EST指纹图谱的替代方法。给定一组探针指纹,一个克隆的SST是来自该指纹组且能与该克隆杂交的探针子集。我们证明,SST技术除了是筛选cDNA文库的强大技术外,还能提供非常准确的基因预测。即使使用较小的指纹组(600 - 800个探针),基于SST的基因识别也优于许多传统的和基于EST的方法。将指纹组大小增加到1500个探针时,能提供几乎完美的基因识别。更重要的是,基于SST的基因预测遗漏的外显子极少,因此,在从完成的基因组序列到基因搜寻项目中的突变检测过程中,提供了一个绕过cDNA测序步骤的机会。由于SST数据可以以高度并行且廉价的方式获得,SST技术在基因搜寻方面有补充EST技术的潜力。