Graduate School of Science and Technology, Hirosaki University, Hirosaki, Aomori 036-8561, Japan.
Genomics Proteomics Bioinformatics. 2009 Dec;7(4):163-74. doi: 10.1016/S1672-0229(08)60046-7.
We explored the possibilities of whole-genome duplication (WGD) in prokaryotic species, where we performed statistical analyses of the configurations of the central angles between homologous tandem repeats (TRs) on the circular chromosomes. At first, we detected TRs on their chromosomes and identified equivalent tandem repeat pairs (ETRPs); here, an ETRP is defined as a pair of tandem repeats sequentially similar to each other. Then we carried out statistical analyses of the central angle distributions of the detected ETRPs on each circular chromosome by way of comparisons between the detected distributions and those generated by null models. In the analyses, we estimated a P value by a simulation using the Kullback-Leibler divergence as a distance measure between two distributions. As a result, the central angle distributions for 8 out of the 203 prokaryotic species showed statistically significant deviations (P<0.05). In particular, we found out the characteristic feature of one round of WGD in Photorhabdus luminescens genome and that of two rounds of WGD in Escherichia coli K12.
我们探索了原核生物中全基因组复制(WGD)的可能性,对环状染色体上同源串联重复(TR)的中心角配置进行了统计分析。首先,我们在染色体上检测到 TR,并确定了等效串联重复对(ETRP);这里,ETRP 定义为彼此顺序相似的一对串联重复。然后,我们通过比较检测到的分布和通过 null 模型生成的分布,对每个环状染色体上检测到的 ETRP 的中心角分布进行了统计分析。在分析中,我们使用 Kullback-Leibler 散度作为两个分布之间的距离度量,通过模拟来估计 P 值。结果,203 种原核生物中有 8 种的中心角分布表现出统计学上的显著偏差(P<0.05)。特别是,我们发现了 Photorhabdus luminescens 基因组中一轮 WGD 和 Escherichia coli K12 中两轮 WGD 的特征。