Kalinski A, Weisemann J M, Matthews B F, Herman E M
Plant Molecular Biology Laboratory, United States Department of Agriculture, Beltsville, Maryland 20705.
J Biol Chem. 1990 Aug 15;265(23):13843-8.
A 34,000-Da protein (P34) is one of the four major soybean oil body proteins observed by sodium dodecyl sulfate-polyacrylamide gel electrophoresis of isolated organic solvent-extracted oil bodies from mature seeds. P34 is processed during seedling growth to a 32,000-Da polypeptide (P32) by the removal of an amino-terminal decapeptide (Herman, E.M., Melroy, D.L., and Buckhout, T.J. (1990) Plant Physiol, in press). A soybean lambda ZAP II cDNA library constructed from RNA isolated from midmaturation seeds was screened with monoclonal antibodies directed against two different epitopes of P34. The isolated cDNA clone encoding P34 contains 1,350 base pairs terminating in a poly(A)+ tail and an open reading frame 1,137 base pairs in length. The open reading frame includes a deduced amino acid sequence which matches 23 of 25 amino-terminal amino acids determined by automated Edman degradation of P34 and P32. The cDNA predicts a mature protein of 257 amino acids and of 28,641 Da. The open reading frame extends 5' from the known amino terminus of P34 encoding a possible precursor and signal sequence segments with a combined additional 122 amino acids. Prepro-P34 is deduced to be a polypeptide of 42,714 Da, indicating that the cDNA clone apparently encodes a polypeptide of 379 amino acids. A comparison of the nucleotide and deduced amino acid sequences in the GenBank Data Bank with the sequence of P34 has shown considerable sequence similarity to the thiol proteases of the papain family. Southern blot analysis of genomic DNA indicated that the P34 gene has a low copy number.
一种34,000道尔顿的蛋白质(P34)是通过对成熟种子中分离出的有机溶剂提取油体进行十二烷基硫酸钠-聚丙烯酰胺凝胶电泳观察到的四种主要大豆油体蛋白之一。在幼苗生长过程中,P34通过去除氨基末端的十肽被加工成32,000道尔顿的多肽(P32)(赫尔曼,E.M.,梅尔罗伊,D.L.,和巴克霍特,T.J.(1990)《植物生理学》,即将发表)。用针对P34两个不同表位的单克隆抗体筛选了由中成熟种子分离的RNA构建的大豆λZAP II cDNA文库。分离出的编码P34的cDNA克隆包含1350个碱基对,末端为聚腺苷酸加尾,开放阅读框长度为1137个碱基对。该开放阅读框包括一个推导的氨基酸序列,该序列与通过对P34和P32进行自动埃德曼降解确定的25个氨基末端氨基酸中的23个相匹配。该cDNA预测一个由257个氨基酸组成、分子量为28,641道尔顿的成熟蛋白。开放阅读框从P34已知的氨基末端向5'端延伸,编码一个可能的前体和信号序列片段,总共还有122个氨基酸。推导前体蛋白P34是一个分子量为42,714道尔顿的多肽,表明该cDNA克隆显然编码一个由379个氨基酸组成的多肽。将GenBank数据库中的核苷酸和推导氨基酸序列与P34的序列进行比较,发现与木瓜蛋白酶家族的巯基蛋白酶有相当大的序列相似性。基因组DNA的Southern印迹分析表明,P34基因的拷贝数较低。