Tullo A, Liuni S, Attimonelli M
Dipartimento di Biochimica e Biologia Molecolare, University of Bari, Italy.
Protein Seq Data Anal. 1990 Sep;3(4):327-34.
The EMBL and GenBank keyword indexes have no hierarchical structure. In this paper we present a method for merging them and reorganizing them into a tree structure whose primary roots are the keywords 'protein', 'DNA', 'RNA', and 'unclassified'. Synonymous keywords have been grouped together and erroneous keywords have been corrected. This taxonomic organization of keywords makes retrieval both more extensive and more efficient, and retrieval is further aided by a "synonyms declaration". The tree was produced using the computer programs GENPOINT and CREANET.
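The abstract does not describe how GENPOINT and CREANET work internally, so the following is only a minimal illustrative sketch of the general idea: a keyword tree with the four primary roots named in the abstract, plus a synonym table that maps variant spellings to one canonical keyword before insertion and lookup. The class names, method names, example keywords, and entry identifiers are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass, field

# The four primary roots named in the abstract.
ROOTS = ("protein", "DNA", "RNA", "unclassified")


@dataclass
class KeywordNode:
    name: str
    children: dict = field(default_factory=dict)  # keyword -> KeywordNode
    entries: set = field(default_factory=set)     # database entry identifiers


class KeywordTree:
    """Hypothetical keyword tree with a synonym table (not the paper's code)."""

    def __init__(self):
        self.roots = {name: KeywordNode(name) for name in ROOTS}
        self.synonyms = {}  # lowercased variant -> canonical keyword

    def declare_synonym(self, variant, canonical):
        self.synonyms[variant.lower()] = canonical

    def canonical(self, keyword):
        # Unknown keywords are returned unchanged.
        return self.synonyms.get(keyword.lower(), keyword)

    def add(self, path, entry_id):
        """Attach entry_id at the end of a root-to-leaf keyword path."""
        path = [self.canonical(k) for k in path]
        if path[0] not in self.roots:
            # Anything without a known root is filed under 'unclassified'.
            path = ["unclassified"] + path
        node = self.roots[path[0]]
        for keyword in path[1:]:
            node = node.children.setdefault(keyword, KeywordNode(keyword))
        node.entries.add(entry_id)

    def find(self, keyword):
        """Return entries filed under keyword or any of its descendants."""
        target = self.canonical(keyword)
        hits = set()
        stack = list(self.roots.values())
        while stack:
            node = stack.pop()
            if node.name == target:
                hits |= self._collect(node)
            else:
                stack.extend(node.children.values())
        return hits

    def _collect(self, node):
        found = set(node.entries)
        for child in node.children.values():
            found |= self._collect(child)
        return found


# Made-up example data; the entry identifiers are placeholders.
tree = KeywordTree()
tree.declare_synonym("haemoglobin", "hemoglobin")
tree.add(["protein", "globin", "hemoglobin"], "ENTRY_A")
tree.add(["protein", "globin", "haemoglobin"], "ENTRY_B")
tree.add(["protein", "globin", "myoglobin"], "ENTRY_C")
tree.add(["promoter"], "ENTRY_D")
```

In this sketch, a query on a broad keyword such as "globin" returns every entry filed beneath it, which is one way a hierarchy can make retrieval more extensive, while a query on either spelling of a declared synonym reaches the same node.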