Perrière G, Moszer I, Gojobori T
Laboratoire de Biométrie, Université Claude Bernard-Lyon, Villeurbanne, France.
Nucleic Acids Res. 1996 Jan 1;24(1):41-5. doi: 10.1093/nar/24.1.41.
In the context of the international project aimed at sequencing the whole genome of Bacillus subtilis, we have developed a non-redundant, fully annotated database of sequences from this organism. Starting from the B. subtilis sequences available in the EMBL, GenBank and DDBJ collections, we removed all duplications we encountered and then added extra annotations to the sequences (e.g. accession numbers for the genes, locations on the genetic map, codon usage). We also added cross-references to the EMBL, MEDLINE, SWISS-PROT and ENZYME data banks. The present system results from the merging of the NRSub and SubtiList databases, and the sequence contigs used in the two systems are identical. NRSub is distributed as a flat file in EMBL format (which is supported by most sequence analysis software packages) and as an ACNUC database, while SubtiList is distributed as a relational database under 4th Dimension. The data can be accessed through two dedicated World Wide Web servers located in France and Japan.
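Because the abstract notes that NRSub is distributed as a flat file in EMBL format, the following is a minimal sketch of how such a file could be read and how entries with identical sequences could be dropped. It is not the authors' procedure: the function names (`parse_embl`, `drop_identical_sequences`) and the sample entries, accession numbers and cross-reference are invented for illustration, and exact-sequence matching is only a stand-in for the duplication removal described above. The sketch relies only on the general EMBL flat-file layout (a two-letter line code, `SQ` introducing the sequence, `//` ending an entry).

```python
from typing import Dict, Iterable, Iterator, List


def parse_embl(lines: Iterable[str]) -> Iterator[Dict[str, List[str]]]:
    """Yield one dict per entry, mapping each two-letter line code to its values.

    Sequence residues are collected under the extra key "seq".
    """
    entry: Dict[str, List[str]] = {}
    in_sequence = False
    for raw in lines:
        line = raw.rstrip("\n")
        if line.startswith("//"):  # entry terminator
            if entry:
                yield entry
            entry, in_sequence = {}, False
            continue
        if in_sequence:
            # Sequence lines hold residues in blocks plus a trailing position count.
            residues = "".join(ch for ch in line if ch.isalpha())
            entry.setdefault("seq", []).append(residues.lower())
            continue
        code, value = line[:2], line[5:].strip()
        if code == "SQ":
            in_sequence = True
        elif code.strip() and code != "XX":  # skip blank and spacer lines
            entry.setdefault(code, []).append(value)
    if entry:  # tolerate a final entry without a terminator
        yield entry


def drop_identical_sequences(
    entries: Iterable[Dict[str, List[str]]]
) -> List[Dict[str, List[str]]]:
    """Keep the first entry for each distinct sequence (illustrative only)."""
    seen = set()
    kept = []
    for entry in entries:
        sequence = "".join(entry.get("seq", []))
        if sequence in seen:
            continue
        seen.add(sequence)
        kept.append(entry)
    return kept


# Hypothetical sample data: two invented entries carrying the same sequence.
SAMPLE = """\
ID   EXAMPLE1   standard; DNA; PRO; 12 BP.
XX
AC   ZZ00001;
XX
DR   SWISS-PROT; FAKE01; EXAMPLE_BACSU.
XX
SQ   Sequence 12 BP;
     atgaaacgtt aa                                                            12
//
ID   EXAMPLE2   standard; DNA; PRO; 12 BP.
XX
AC   ZZ00002;
XX
SQ   Sequence 12 BP;
     atgaaacgtt aa                                                            12
//
"""

entries = list(parse_embl(SAMPLE.splitlines()))
unique_entries = drop_identical_sequences(entries)
```

For real data, an established parser such as Biopython's `SeqIO.parse(handle, "embl")` is a better choice than a hand-written reader. Building a genuinely non-redundant collection also requires handling overlapping and partially duplicated sequences, which this sketch does not attempt.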