Huang X
Department of Computer Science, Michigan Technological University, Houghton, USA.
Microb Comp Genomics. 1996;1(4):281-91. doi: 10.1089/mcg.1996.1.281.
We describe a computer program, named DNA-Protein Search (DPS), for comparing a megabase DNA sequence with a protein sequence database. The DPS program addresses the problems of frameshifts and introns in the DNA sequence. The DPS program was used to compare each of the following sequences with the Swiss-Prot database: the 1.8-megabase sequence of the Haemophilus influenzae Rd genome, the 0.58-megabase sequence of the Mycoplasma genitalium genome, and the 0.56-megabase sequence of Saccharomyces cerevisiae chromosome VIII. The comparisons found new regions that are similar to protein sequences. The sensitivity of DPS was evaluated using as test data the known coding regions of the three DNA sequences. The results demonstrate that the DPS program is a useful tool for finding the coding regions of the DNA sequence. The DPS program uses an order of magnitude less computer memory and is several times faster than the BLASTX program.
我们描述了一个名为DNA-蛋白质搜索(DPS)的计算机程序,用于将兆碱基DNA序列与蛋白质序列数据库进行比较。DPS程序解决了DNA序列中的移码和内含子问题。DPS程序被用于将以下每个序列与瑞士蛋白质数据库进行比较:流感嗜血杆菌Rd基因组的1.8兆碱基序列、生殖支原体基因组的0.58兆碱基序列以及酿酒酵母八号染色体的0.56兆碱基序列。这些比较发现了与蛋白质序列相似的新区域。使用这三个DNA序列的已知编码区域作为测试数据来评估DPS的灵敏度。结果表明,DPS程序是寻找DNA序列编码区域的有用工具。DPS程序使用的计算机内存少一个数量级,并且比BLASTX程序快几倍。