Paulsson G, Höög C, Bernholm K, Wieslander L
Department of Molecular Genetics, Medical Nobel Institute, Karolinska Institutet, Stockholm, Sweden.
J Mol Biol. 1992 May 20;225(2):349-61. doi: 10.1016/0022-2836(92)90926-b.
Balbiani ring (BR) genes in diptera encode large secretory proteins and are classical model systems for studies of gene expression. In Chironomus tentans, four closely related BR genes, BR 1, BR 2.1, BR 2.2 and BR 6 form a gene family. The BR genes have been partially characterized and are known to contain long arrays of tandemly arranged repeat units with an hierarchical repeat organization. Here, we report the sequence organization of the complete transcribed part of the BR 1 gene in C. tentans. The gene contains five exons and four introns. Three of the introns are located at the 5' end and the fourth at the 3' end of the gene. Exon 4 is approximately 35,000 bases long and is built completely from tandemly organized repeats. We show that this long repeat block contains two types of related repeat units, beta and gamma. Each type forms a large uninterrupted array, a 5' beta array and a 3' gamma array with a sharp border between them. In the hierarchical repeat structure in each repeat array, all repeats are virtually identical at one level of repetition, but shown differences at the next level. The whole repeat block in the BR 1 gene fluctuates in size between different alleles, but not by more than 10%. In contrast, within the block, the beta and gamma arrays vary in length between 8000 and 29,000 bases in an inverse fashion, together keeping the overall length requirement. We propose that the length of exon 4 is conserved by selection of cross-over products of a given length, and that the internal hierarchical sequence organization in the BR 1 gene is a consequence of the combined action of several different sequence turnover mechanisms, all dependent on the unequal pairing of homologous sequences at different, competing levels of repetition.
双翅目昆虫中的巴尔比亚尼环(BR)基因编码大型分泌蛋白,是研究基因表达的经典模型系统。在摇蚊中,四个密切相关的BR基因,BR 1、BR 2.1、BR 2.2和BR 6形成一个基因家族。BR基因已得到部分表征,已知其包含具有层级重复组织的串联排列重复单元的长阵列。在此,我们报告了摇蚊中BR 1基因完整转录部分的序列组织。该基因包含五个外显子和四个内含子。其中三个内含子位于基因的5'端,第四个位于基因的3'端。外显子4约35000个碱基长,完全由串联组织的重复序列构成。我们表明,这个长重复块包含两种相关的重复单元,β和γ。每种类型形成一个大的不间断阵列,一个5'β阵列和一个3'γ阵列,它们之间有明显的边界。在每个重复阵列的层级重复结构中,所有重复序列在一个重复水平上几乎相同,但在下一个水平上显示出差异。BR 1基因中的整个重复块在不同等位基因之间大小有所波动,但不超过10%。相比之下,在该块内,β和γ阵列的长度以相反的方式在8000至29000个碱基之间变化,共同保持总体长度要求。我们提出,外显子4的长度通过选择给定长度的交叉产物得以保守,并且BR 1基因中的内部层级序列组织是几种不同序列周转机制共同作用的结果,所有这些机制都依赖于同源序列在不同竞争重复水平上的不等配对。