Lycett G W, Croy R R, Shirsat A H, Boulter D
Nucleic Acids Res. 1984 Jun 11;12(11):4493-506. doi: 10.1093/nar/12.11.4493.
One of several genes coding for the major pea storage protein, legumin, has been completely sequenced. The sequence covers the whole of the transcribed region, plus 5' and 3' untranscribed sequences. The predicted protein sequence starts with a signal peptide and is followed by the legumin alpha polypeptide sequence of 36. 44kd and the beta polypeptide sequence of 20. 19kd . Compared to other legume storage proteins, the alpha and beta polypeptide sequences encoded by this legumin gene, which contain 3 met and 5 cys residues, are relatively rich in the sulphur amino acids. The coding sequence is interrupted by three introns which show boundary sequences typical of higher plant genes. The 5' end of the gene sequence contains a 'TATA box', a ' CAAT box' and a sequence showing some homology to an ' AGGA box'. An extra sequence, identical to the normal polyadenylation signal of the legumin message is seen in the 3' untranscribed region. The structure of the gene and the possible significance of secondary structures in the nascent RNA transcript in affecting the choice of polyadenylation site is discussed.
编码豌豆主要贮藏蛋白豆球蛋白的几个基因之一已被完全测序。该序列涵盖了整个转录区域以及5'和3'非转录序列。预测的蛋白质序列起始于一个信号肽,随后是36.44kd的豆球蛋白α多肽序列和20.19kd的β多肽序列。与其他豆科植物贮藏蛋白相比,该豆球蛋白基因编码的α和β多肽序列含有3个甲硫氨酸残基和5个半胱氨酸残基,硫氨基酸相对丰富。编码序列被三个内含子中断,这些内含子显示出高等植物基因典型的边界序列。基因序列的5'端包含一个“TATA盒”、一个“CAAT盒”以及一个与“AGGA盒”有一定同源性的序列。在3'非转录区域发现了一个额外的序列,它与豆球蛋白信使RNA的正常聚腺苷酸化信号相同。本文讨论了该基因的结构以及新生RNA转录本中二级结构在影响聚腺苷酸化位点选择方面可能具有的意义。