Barbe Valérie, Cruveiller Stéphane, Kunst Frank, Lenoble Patricia, Meurice Guillaume, Sekowska Agnieszka, Vallenet David, Wang Tingzhang, Moszer Ivan, Médigue Claudine, Danchin Antoine
CEA, Institut de Génomique, Génoscope, 2 rue Gaston Crémieux, 91057 Évry, France.
CEA, Institut de Génomique, Laboratoire de Génomique Comparative/CNRS UMR8030, Génoscope, 2 rue Gaston Crémieux, 91057 Évry, France.
Microbiology (Reading). 2009 Jun;155(Pt 6):1758-1775. doi: 10.1099/mic.0.027839-0. Epub 2009 Apr 21.
Comparative genomics is the cornerstone of identification of gene functions. The immense number of living organisms precludes experimental identification of functions except in a handful of model organisms. The bacterial domain is split into large branches, among which the Firmicutes occupy a considerable space. Bacillus subtilis has been the model of Firmicutes for decades and its genome has been a reference for more than 10 years. Sequencing the genome involved more than 30 laboratories, with different areas of expertise, in an attempt to make the most of the experimental information that could be associated with the sequence. This had the expected drawback that sequencing expertise varied considerably among the groups involved, especially at a time when sequencing genomes was extremely hard work. The recent development of very efficient, fast and accurate sequencing techniques, in parallel with the development of high-level annotation platforms, motivated the present resequencing work. The updated sequence has been reannotated in agreement with the UniProt protein knowledge base, keeping in perspective the split between the paleome (genes necessary for sustaining and perpetuating life) and the cenome (genes required for occupation of a niche, suggesting here that B. subtilis is an epiphyte). This should permit investigators to make reliable inferences to prepare validation experiments in a variety of domains of bacterial growth and development, as well as to build up accurate phylogenies.