Gouy Manolo, Delmotte Stéphane
Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon, 69622 Villeurbanne Cedex, France.
Biochimie. 2008 Apr;90(4):555-62. doi: 10.1016/j.biochi.2007.07.003. Epub 2007 Jul 15.
The ACNUC biological sequence database system provides powerful and fast query and extraction capabilities to a variety of nucleotide and protein sequence databases. The collection of ACNUC databases served by the Pôle Bio-Informatique Lyonnais includes the EMBL, GenBank, RefSeq and UniProt nucleotide and protein sequence databases and a series of other sequence databases that support comparative genomics analyses: HOVERGEN and HOGENOM containing families of homologous protein-coding genes from vertebrate and prokaryotic genomes, respectively; Ensembl and Genome Reviews for analyses of prokaryotic and of selected eukaryotic genomes. This report describes the main features of the ACNUC system and the access to ACNUC databases from any internet-connected computer. Such access was made possible by the definition of a remote ACNUC access protocol and the implementation of Application Programming Interfaces between the C, Python and R languages and this communication protocol. Two retrieval programs for ACNUC databases, Query_win, with a graphical user interface and raa_query, with a command line interface, are also described. Altogether, these bioinformatics tools provide users with either ready-to-use means of querying remote sequence databases through a variety of selection criteria, or a simple way to endow application programs with an extensive access to these databases. Remote access to ACNUC databases is open to all and fully documented (http://pbil.univ-lyon1.fr/databases/acnuc/acnuc.html).
ACNUC生物序列数据库系统为各种核苷酸和蛋白质序列数据库提供了强大且快速的查询与提取功能。里昂生物信息中心提供服务的ACNUC数据库集合包括EMBL、GenBank、RefSeq和UniProt核苷酸与蛋白质序列数据库,以及一系列支持比较基因组学分析的其他序列数据库:HOVERGEN和HOGENOM分别包含来自脊椎动物和原核生物基因组的同源蛋白质编码基因家族;Ensembl和Genome Reviews用于原核生物及选定真核生物基因组的分析。本报告描述了ACNUC系统的主要特性,以及如何从任何联网计算机访问ACNUC数据库。通过定义远程ACNUC访问协议以及在C、Python和R语言与该通信协议之间实现应用程序接口,实现了这种访问。还介绍了用于ACNUC数据库的两个检索程序,具有图形用户界面的Query_win和具有命令行界面的raa_query。总之,这些生物信息学工具为用户提供了通过各种选择标准查询远程序列数据库的现成方法,或者为应用程序提供广泛访问这些数据库的简单途径。对ACNUC数据库的远程访问向所有人开放且有完整文档说明(http://pbil.univ-lyon1.fr/databases/acnuc/acnuc.html)。