Wu T T, Reid-Miller M, Perry H M, Kabat E A
EMBO J. 1984 Sep;3(9):2033-40. doi: 10.1002/j.1460-2075.1984.tb02088.x.
A computer sorting method was used to construct a dictionary by which long identical repeats of nucleotides could be identified and all available sequences of immunoglobulin switch regions were examined. The genomic mouse gamma 2b switch region contains two sets of four long identical repeats comprising 102, 72, 98 and 109 nucleotides respectively. The first is separated from the second by 347 nucleotides which contain the first 46 nucleotides of the 98 nucleotide set as a third partial repeat. These sets overlap the 49-bp separation between identical five nucleotide repeats GAGCT, GGGGT, ACCAG and CGAGC. Switches from S mu to S gamma 2b in and between these sets involve deletions of all or part of a set. Frequencies and locations of short repeats show differences between S mu, S epsilon, and S alpha and the other switch regions; these could determine specificity and locations of switches.
采用一种计算机排序方法构建了一个字典,通过该字典可以识别核苷酸的长重复序列,并对免疫球蛋白转换区的所有可用序列进行了检查。基因组小鼠γ2b转换区包含两组四个长重复序列,分别由102、72、98和109个核苷酸组成。第一组与第二组之间被347个核苷酸隔开,其中包含98个核苷酸组的前46个核苷酸作为第三个部分重复序列。这些组与相同的五个核苷酸重复序列GAGCT、GGGGT、ACCAG和CGAGC之间的49bp间隔重叠。在这些组内和组间从Sμ到Sγ2b的转换涉及一组全部或部分的缺失。短重复序列的频率和位置在Sμ、Sε和Sα与其他转换区之间存在差异;这些差异可能决定转换的特异性和位置。