Departamento de Bioquímica, Genética e Inmunología, Facultad de Biología, Universidade de Vigo, Vigo, Spain.
Mol Biol Evol. 2010 May;27(5):1139-51. doi: 10.1093/molbev/msq001. Epub 2010 Jan 7.
We carried out an in silico analysis of the complete genome sequences of 14 species, including eukaryotes, prokaryotes, and archaea, to investigate the proportion of amplified fragment length polymorphism bands that are homoplasious for the different species, as well as the distribution of fragment lengths. We investigated several possible reasons for the disagreement, previously observed in Arabidopsis thaliana, between the observed fragment length distribution and the null random sequence distribution, which occurs in the direction of a deficit of fragments of small length and an excess of those of large length with respect to the null distribution. We made the following findings: 1) The positive relationship previously found between the percentage of homoplasy and genome size is a direct consequence of the number of observed bands and the GC content. For the same number of observed bands, the percentage of homoplasy is independent of the genome size of the species. 2) The disagreement between the observed fragment length distribution and the null random sequence distribution observed in A. thaliana is a phenomenon that also occurs in other species. 3) This disagreement is due neither to the structure of the genomes in isochores nor the possible impact of indels in reducing the number of restriction sites, two hypotheses discussed in the literature. 4) Nor is the disagreement eliminated by using restriction enzymes with balanced motifs. 5) The discrepancy seems to be caused, rather, by the nonrandom distribution of restriction enzyme motifs.
我们对包括真核生物、原核生物和古菌在内的 14 个物种的全基因组序列进行了计算机分析,以研究不同物种中同源扩增片段长度多态性(AFLP)带的比例,以及片段长度的分布。我们研究了几个可能的原因,这些原因之前在拟南芥中观察到,与观察到的片段长度分布和零假设随机序列分布之间的不一致,这在片段长度的方向上表现为小片段的缺失和大片段的过剩,相对于零假设分布。我们有以下发现:1)先前发现的同态性百分比与基因组大小之间的正相关关系是观察到的带数和 GC 含量的直接结果。对于相同数量的观察带,同态性百分比与物种的基因组大小无关。2)拟南芥中观察到的片段长度分布与零假设随机序列分布之间的不一致是在其他物种中也存在的现象。3)这种不一致既不是由于同调区的基因组结构,也不是由于插入缺失(indels)可能对减少限制酶位点数量的影响,这两个假设在文献中都有讨论。4)使用具有平衡基序的限制酶也不能消除这种不一致。5)这种差异似乎是由限制酶基序的非随机分布引起的。