Reneker Jeff, Shyu Chi-Ren, Zeng Peiyu, Polacco Joseph C, Gassmann Walter
Department of Computer Science, University of Missouri, Columbia, MO 65211, USA.
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W649-53. doi: 10.1093/nar/gkh455.
We have developed a web server for the life sciences community to use to search for short repeats of DNA sequence of length between 3 and 10,000 bases within multiple species. This search employs a unique and fast hash function approach. Our system also applies information retrieval algorithms to discover knowledge of cross-species conservation of repeat sequences. Furthermore, we have incorporated a part of the Gene Ontology database into our information retrieval algorithms to broaden the coverage of the search. Our web server and tutorial can be found at http://acmes.rnet.missouri.edu.
我们为生命科学领域的科研人员开发了一个网络服务器,用于在多个物种中搜索长度在3至10,000个碱基之间的短DNA重复序列。该搜索采用了一种独特且快速的哈希函数方法。我们的系统还应用信息检索算法来发现重复序列的跨物种保守性知识。此外,我们已将基因本体数据库的一部分纳入信息检索算法,以扩大搜索范围。我们的网络服务器及使用指南可在http://acmes.rnet.missouri.edu上找到。