Wolfe K H, Sharp P M
Department of Genetics, Trinity College, Dublin, Ireland.
Gene. 1988 Jun 30;66(2):215-22. doi: 10.1016/0378-1119(88)90358-7.
We have used a rapid computer dot-matrix comparison method to identify all DNA regions which have been evolutionarily conserved between the completely sequenced chloroplast genomes of tobacco and a liverwort. Analysis of these regions reveals 74 homologous open reading frames (ORFs) which have been conserved as to length and amino acid sequence; these ORFs also have an excess of nucleotide substitutions at silent sites of codons. Since the nonfunctional parts of these genomes have become saturated with mutations and show no sequence similarity whatsoever, the homologous ORFs are almost certainly functional. A further four pairs of ORFs show homology limited to only a short part of their putative gene products. Amino acid sequence identities range between 50 and 99%; some chloroplast proteins are seen to be among the most slowly evolving of all known proteins. A search of the nucleotide and amino acid sequence databanks has revealed several previously unidentified genes in chloroplast sequences from other species, but no new homologies to prokaryotic genes.
我们采用了一种快速的计算机点阵比较方法,来识别在烟草和一种叶苔完全测序的叶绿体基因组之间进化上保守的所有DNA区域。对这些区域的分析揭示了74个同源开放阅读框(ORF),其长度和氨基酸序列均得以保守;这些ORF在密码子的沉默位点也有过量的核苷酸替换。由于这些基因组的非功能部分已被突变饱和且没有任何序列相似性,所以这些同源ORF几乎肯定是有功能的。另外四对ORF显示出的同源性仅限于其假定基因产物的一小部分。氨基酸序列同一性在50%至99%之间;一些叶绿体蛋白被认为是所有已知蛋白中进化最慢的。对核苷酸和氨基酸序列数据库的搜索揭示了其他物种叶绿体序列中几个先前未鉴定的基因,但没有发现与原核基因的新同源性。