Federal Research and Clinical Center of Physical and Chemical Medicine, Federal Medical and Biological Agency of Russia, Moscow, Russian Federation.
PLoS Comput Biol. 2020 Oct 9;16(10):e1008222. doi: 10.1371/journal.pcbi.1008222. eCollection 2020 Oct.
Comparative genomics studies may be used to acquire new knowledge regarding genome architecture, which defines the rules for combining sets of genes in the genome of living organisms. Hundreds of thousands of prokaryotic genomes have been sequenced and assembled. However, computational tools capable of simultaneously comparing large numbers of genomes are lacking. We developed the Genome Complexity Browser, a tool that allows the visualization of gene contexts, in a graph-based format, and the quantification of variability for different segments of a genome. The graph-based visualization allows the inspection of changes in gene contents and neighborhoods across hundreds of genomes, simultaneously, which may facilitate the identification of conserved and variable segments of operons or the estimation of the overall variability associated with a particular genome locus. We introduced a measure called complexity, to quantify genome variability. Intraspecies and interspecies comparisons revealed that regions with high complexity values tended to be located in areas that are conserved across different strains and species.
比较基因组学研究可以用来获取关于基因组结构的新知识,基因组结构定义了生物体基因组中基因组合的规则。已经测序和组装了数十万种原核生物基因组。然而,缺乏能够同时比较大量基因组的计算工具。我们开发了基因组复杂性浏览器,这是一种工具,允许以基于图形的格式可视化基因上下文,并量化基因组不同部分的可变性。基于图形的可视化允许同时检查数百个基因组中基因含量和基因邻居的变化,这可能有助于识别操纵子的保守和可变部分,或估计与特定基因组位点相关的整体可变性。我们引入了一个称为复杂性的度量来量化基因组的可变性。种内和种间比较表明,复杂性值高的区域往往位于不同菌株和物种之间保守的区域。