Department of Phytomedicine, Humboldt-Universität zu Berlin, Lentzeallee 55-57, 14195, Berlin, Germany.
Hôpital de la Pitié-Salpêtrière, Institute of Cardiometabolism and Nutrition, 47 Boulevard de l'Hôpital, 75013, Paris, France.
Microb Biotechnol. 2018 Jan;11(1):3-17. doi: 10.1111/1751-7915.13043.
Genome annotation is now performed via automatic pipelines that cannot discriminate between right and wrong annotations. Because the annotations of model organisms are used to increase the accuracy of the genome annotations of other organisms, it is critical that they reflect the current annotation gold standard. The genome of Bacillus subtilis strain 168 was sequenced twenty years ago. Using a combination of inductive, deductive and abductive reasoning, we present a unique, manually curated annotation, essentially based on experimental data. This annotation reveals how this bacterium lives in a plant niche, while carrying a paleome operating system common to Firmicutes and Tenericutes. Dozens of new genomic objects and an extensive literature survey have been included for the sequence available at the INSDC (accession number AL009126.3). We also propose an extension to Demerec's nomenclature rules that will help investigators connect to this type of curated annotation via the use of common gene names.