Brisson N, Verma D P
Proc Natl Acad Sci U S A. 1982 Jul;79(13):4055-9. doi: 10.1073/pnas.79.13.4055.
Leghemoglobin (Lb) genes in soybean represent a small family of closely related genes. Three Lb sequences isolated from a genomic library were analyzed at the nucleotide sequence level. A Lb gene present on an 11.5-kilobase (kb) EcoRI genomic fragment spans approximately 1,200 nucleotides and is interrupted at amino acid positions 32 to 33, 68 to 69, and 103 to 104. The intervening sequences, as well as the 5' and 3' flanking regions of this gene, contain the consensus sequences found in other eukaryotic genes. The length of the 5'-untranslated region is 49 bases as determined by nuclease S1 mapping. R-loop analysis of the DNA from the recombinant phage containing the 11.5-kb EcoRI genomic fragment showed that another Lb gene is located 2.5 kb away. The nucleotide sequence of the second gene showed that this gene is incomplete, containing only exons 3 and 4. The deduced amino acid sequence of this gene, although showing 76% homology with the corresponding region of the other Lb gene, is not represented in any of the known Lb proteins. Both genes are oriented in the same direction with respect to the coding strand. Analysis of the sequence present on a second genomic clone containing a 4.2-kb EcoRI fragment revealed a truncated Lb gene showing homology with the last exon and the noncoding region at the 3' end of the two other Lb genes.
大豆中的豆血红蛋白(Lb)基因代表了一个由密切相关的基因组成的小家族。从基因组文库中分离出的三个Lb序列在核苷酸序列水平上进行了分析。存在于一个11.5千碱基(kb)的EcoRI基因组片段上的一个Lb基因跨度约为1200个核苷酸,在氨基酸位置32至33、68至69和103至104处被打断。这些间隔序列以及该基因的5'和3'侧翼区域包含在其他真核基因中发现的共有序列。通过核酸酶S1图谱分析确定5'-非翻译区的长度为49个碱基。对含有11.5-kb EcoRI基因组片段的重组噬菌体的DNA进行R环分析表明,另一个Lb基因位于2.5 kb之外。第二个基因的核苷酸序列表明该基因是不完整的,仅包含外显子3和4。该基因推导的氨基酸序列虽然与另一个Lb基因的相应区域显示出76%的同源性,但在任何已知的Lb蛋白中都未出现。这两个基因相对于编码链的方向相同。对包含一个4.2-kb EcoRI片段的第二个基因组克隆上存在的序列进行分析,发现了一个截短的Lb基因,它与另外两个Lb基因的最后一个外显子和3'端的非编码区域具有同源性。