Black J A, Harkins R N
J Mol Evol. 1975 Jun 9;5(1):47-55. doi: 10.1007/BF01732013.
We have compiled the dipeptide frequencies in 100 known protein sequences. We suggest that dipeptides which occur with low frequencies can be used to locate proteins where partial gene duplication may have taken place. The 48 residue sequence of posterior pituitary peptide contains two Cys Trp pairs. The adjacent portions of the sequence are compatible with a partial gene duplication in the evolutionary history of posterior pituitary peptide.
我们已汇编了100个已知蛋白质序列中的二肽频率。我们认为,出现频率较低的二肽可用于定位可能发生部分基因复制的蛋白质。垂体后叶肽的48个残基序列包含两对半胱氨酸-色氨酸。该序列的相邻部分与垂体后叶肽进化史上的部分基因复制情况相符。