Arkin A P, Youvan D C
Department of Chemistry, Massachusetts Institute of Technology, Cambridge 02139.
Biotechnology (N Y). 1992 Mar;10(3):297-300. doi: 10.1038/nbt0392-297.
In random mutagenesis, synthesis of an NNN triplet (i.e. equiprobable A, C, G, and T at each of the three positions in the codon) could be considered an optimal nucleotide mixture because all 20 amino acids are encoded. NN(G,C) might be considered a slightly more intelligent "dope" because the entire set of amino acids is still encoded using only half as many codons. Using a general algorithm described herein, it is possible to formulate more complex doping schemes which encode specific subsets of the twenty amino acids, excluding others from the mix. Maximizing the equiprobability of amino acid residues contributing to such a subset is suggested as an optimal basis for performing semi-random mutagenesis. This is important for reducing the nucleotide complexity of combinatorial cassettes so that "sequence space" can be searched more efficiently. Computer programs have been developed to provide tables of optimized dopes compatible with automated DNA synthesizers.
在随机诱变中,合成NNN三联体(即密码子的三个位置上A、C、G和T出现的概率相等)可被视为一种最佳核苷酸混合物,因为它能编码所有20种氨基酸。NN(G,C) 可能被认为是一种略具智能的“掺杂”,因为整套氨基酸仍仅用一半数量的密码子编码。使用本文所述的通用算法,可以制定更复杂的掺杂方案,该方案编码20种氨基酸的特定子集,而将其他氨基酸排除在混合物之外。建议将构成此类子集的氨基酸残基的等概率最大化作为进行半随机诱变的最佳基础。这对于降低组合盒的核苷酸复杂性很重要,以便能更有效地搜索“序列空间”。已开发出计算机程序来提供与自动化DNA合成仪兼容的优化掺杂表。