Barré Aurélien, de Daruvar Antoine, Blanchard Alain
INRA-Université de Bordeaux 2, IBVM, Bordeaux, France.
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D307-10. doi: 10.1093/nar/gkh114.
Bacteria belonging to the class Mollicutes were among the first to be selected for complete genome sequencing because of the minimal size of their genomes and their pathogenicity for humans and a broad range of animals and plants. At this time, six genome sequences have been publicly released (Mycoplasma genitalium, Mycoplasma pneumoniae, Ureaplasma urealyticum-parvum, Mycoplasma pulmonis, Mycoplasma penetrans and Mycoplasma gallisepticum), and as the number of available mollicute genomes increases, comparative genomic analysis within this model group of organisms becomes increasingly instructive. However, such an analysis is difficult to carry out without a suitable platform that gathers not only the original annotations but also relevant information available in public databases or obtained by applying common bioinformatics methods. To address this difficulty, we have developed a web-accessible database named MolliGen (http://cbi.labri.fr/outils/molligen/). After selecting a set of genomes, the user can launch various types of search based on annotation, position on the chromosomes or sequence similarity. In addition, relationships of putative orthology have been precomputed to allow differential genome queries. The results are presented in table format with multiple links to public databases and to bioinformatic analyses such as multiple alignments or BLAST searches. Specific tools were also developed for the graphical visualization of the results, including a multi-genome browser for displaying dynamic pictures with clickable objects and for viewing relationships of precomputed similarity. MolliGen is designed to integrate all the complete genomes of mollicutes as they become available.
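As an illustration of what a differential genome query over precomputed orthology relationships might look like, the following minimal Python sketch may help. It is not MolliGen's implementation: the table layout, the group identifiers (`OG1` to `OG4`), the group memberships and the function name `differential_query` are all hypothetical, invented here for illustration only.

```python
from typing import Dict, List, Set

# Hypothetical precomputed orthology table (toy data, not from MolliGen):
# each putative ortholog group maps to the genomes that contain a member.
ORTHOLOG_GROUPS: Dict[str, Set[str]] = {
    "OG1": {"Mycoplasma genitalium", "Mycoplasma pneumoniae", "Mycoplasma pulmonis"},
    "OG2": {"Mycoplasma genitalium", "Mycoplasma pneumoniae"},
    "OG3": {"Mycoplasma pulmonis", "Mycoplasma penetrans"},
    "OG4": {"Mycoplasma pneumoniae"},
}


def differential_query(
    groups: Dict[str, Set[str]],
    present_in: Set[str],
    absent_from: Set[str],
) -> List[str]:
    """Return the sorted ids of groups that have a member in every genome
    of `present_in` and no member in any genome of `absent_from`."""
    return sorted(
        group_id
        for group_id, genomes in groups.items()
        if present_in <= genomes and not (absent_from & genomes)
    )


if __name__ == "__main__":
    # Groups shared by M. genitalium and M. pneumoniae but missing in M. pulmonis.
    hits = differential_query(
        ORTHOLOG_GROUPS,
        present_in={"Mycoplasma genitalium", "Mycoplasma pneumoniae"},
        absent_from={"Mycoplasma pulmonis"},
    )
    print(hits)
```

Because the orthology relationships are computed ahead of time, a query of this kind reduces to set comparisons over a lookup table and needs no sequence comparison at query time.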