Durand Dannie, Sankoff David
Department of Biological Sciences, Carnegie Mellon University, 4400 Fifth Avenue, Pittsburgh, PA 15213, USA.
J Comput Biol. 2003;10(3-4):453-82. doi: 10.1089/10665270360688129.
Comparing chromosomal gene order in two or more related species is an important approach to studying the forces that guide genome organization and evolution. Linked clusters of similar genes found in related genomes are often used to support arguments of evolutionary relatedness or functional selection. However, as the gene order and the gene complement of sister genomes diverge progressively due to large scale rearrangements, horizontal gene transfer, gene duplication and gene loss, it becomes increasingly difficult to determine whether observed similarities in local genomic structure are indeed remnants of common ancestral gene order, or are merely coincidences. A rigorous comparative genomics requires principled methods for distinguishing chance commonalities, within or between genomes, from genuine historical or functional relationships. In this paper, we construct tests for significant groupings against null hypotheses of random gene order, taking incomplete clusters, multiple genomes, and gene families into account. We consider both the significance of individual clusters of prespecified genes and the overall degree of clustering in whole genomes.
比较两个或多个相关物种的染色体基因顺序是研究引导基因组组织和进化的力量的重要方法。在相关基因组中发现的相似基因的连锁簇常被用来支持进化相关性或功能选择的论点。然而,由于大规模重排、水平基因转移、基因复制和基因丢失,姐妹基因组的基因顺序和基因组成逐渐分化,确定观察到的局部基因组结构相似性究竟是共同祖先基因顺序的残余,还是仅仅是巧合变得越来越困难。严格的比较基因组学需要有原则的方法来区分基因组内部或之间的偶然共性与真正的历史或功能关系。在本文中,我们构建了针对随机基因顺序零假设的显著分组检验,同时考虑了不完整的簇、多个基因组和基因家族。我们既考虑了预先指定基因的单个簇的显著性,也考虑了整个基因组的聚类总体程度。