Soares M B, Schon E, Henderson A, Karathanasis S K, Cate R, Zeitlin S, Chirgwin J, Efstratiadis A
Mol Cell Biol. 1985 Aug;5(8):2090-103. doi: 10.1128/mcb.5.8.2090-2103.1985.
Rats and mice have two, equally expressed, nonallelic genes encoding preproinsulin (genes I and II). Cytological hybridization with metaphase chromosomes indicated that both genes reside on rat chromosome I but are approximately 100,000 kilobases apart. In mice the two genes reside on two different chromosomes. DNA sequence comparisons of the gene-flanking regions in rats and mice indicated that the preproinsulin gene I has lost one of the two introns present in gene II, is flanked by a long (41-base) direct repeat, and has a remnant of a polydeoxyadenylate acid tract preceding the downstream direct repeat. These structural features indicated that gene I was generated by an RNA-mediated duplication-transposition event involving a transcript of gene II which was initiated upstream from the normal capping site. Sequence divergence analysis indicated that the pair of the original gene and its retroposed, but functional, counterpart (which appeared about 35 million years ago) is maintained by strong negative selection operating primarily on the segments encoding the chains of the mature hormone, whereas the segments encoding the parts of the polypeptide that are eliminated during processing and also the introns and the flanking regions are evolving neutrally.
大鼠和小鼠有两个编码前胰岛素原的非等位基因(基因I和基因II),二者表达水平相当。与中期染色体的细胞杂交表明,这两个基因都位于大鼠的1号染色体上,但相距约100,000千碱基对。在小鼠中,这两个基因位于两条不同的染色体上。大鼠和小鼠基因侧翼区域的DNA序列比较表明,前胰岛素原基因I已失去了基因II中存在的两个内含子之一,其侧翼有一个长的(41个碱基)直接重复序列,并且在下游直接重复序列之前有一段多聚脱氧腺苷酸序列的残余。这些结构特征表明,基因I是由RNA介导的复制-转座事件产生的,该事件涉及基因II的转录本,该转录本在正常加帽位点上游起始。序列分歧分析表明,原始基因及其反转但仍具功能的对应基因对(约在3500万年前出现)主要通过对编码成熟激素链的片段进行强负选择来维持,而编码在加工过程中被去除的多肽部分的片段以及内含子和侧翼区域则在中性进化。