Day G R, Blake R D
Nucleic Acids Res. 1982 Dec 20;10(24):8323-39. doi: 10.1093/nar/10.24.8323.
Methods of computer analysis for the recurrence of symmetrical and repetitive elements in large numbers of DNA sequences are described, together with derivations of appropriate quantitative criteria for the evaluation of the statistical significance of these elements in DNAs of different base composition. Examples of some extraordinary variations in the occurrence of symmetrical and repetitive elements are provided, many of which are new. Special consideration is devoted to a determination of the statistical significance of a two-fold palindrome at the origin of replication. A computer search of 14 independently determined DNA sequences containing an origin of replication locus indicates each contains a large two-fold palindrome. The average length of this palindrome is 28 +/- 6 base pairs, of which 22 contribute to the palindromic symmetry. The probability of occurrence of such a palindrome is only 1/26000, while the probability of occurrence in all 14 different species is (1/26000).
本文描述了用于分析大量DNA序列中对称和重复元件复发情况的计算机分析方法,以及推导用于评估这些元件在不同碱基组成的DNA中统计显著性的适当定量标准。文中提供了一些对称和重复元件出现情况的异常变异实例,其中许多是新发现的。特别关注了复制起点处双折回文序列统计显著性的确定。对14个独立确定的包含复制起点位点的DNA序列进行计算机搜索表明,每个序列都包含一个大的双折回文序列。该回文序列的平均长度为28 +/- 6个碱基对,其中22个碱基对构成回文对称性。这种回文序列出现的概率仅为1/26000,而在所有14个不同物种中出现的概率为(1/26000)¹⁴ 。 (注:原文最后一处(1/26000) 表述有误,应为(1/26000)¹⁴ ,译文按正确理解翻译)