Hirano H
Yokohama City University, Kihara Institute for Biological Research, Japan.
J Protein Chem. 1997 Jul;16(5):533-6. doi: 10.1023/a:1026377931210.
The partial amino acid sequences of 121 rice proteins separated by two-dimensional gel electrophoresis (2D-PAGE), were determined for a protein sequence data file. In the Rice Genome Research Program (RGP), more than 20,000 cDNA clones randomly selected from rice cDNA libraries have been sequenced to construct a cDNA catalog. Complimentary DNAs encoding about 30% of proteins in the protein sequence data file could be identified in the catalog by computer search. It was deduced that 20,000-40,000 genes are present in the rice genome. Only half of about 20,000 cDNAs sequenced in the RGP, corresponding to 1/4-1/2 of genes present in the entire rice genome, should have unique sequences after considering gene redundancy. This is consistent with the fact that the cDNAs encoding about 30% of the sequenced proteins could be identified in the catalog. If the size of the cDNA catalog is enlarged further, cDNAs encoding all proteins separated by 2D-PAGE could be easily identified from the catalog by using the protein sequence data.
通过二维凝胶电泳(2D-PAGE)分离出121种水稻蛋白质的部分氨基酸序列,用于构建蛋白质序列数据文件。在水稻基因组研究计划(RGP)中,已对从水稻cDNA文库中随机选取的20000多个cDNA克隆进行了测序,以构建一个cDNA目录。通过计算机搜索,在该目录中可以鉴定出蛋白质序列数据文件中约30%的蛋白质的互补DNA。据推断,水稻基因组中存在20000 - 40000个基因。在考虑基因冗余后,RGP中测序的约20000个cDNA中只有一半,相当于整个水稻基因组中基因的1/4 - 1/2,应该具有独特的序列。这与在目录中能够鉴定出约30%的已测序蛋白质的互补DNA这一事实是一致的。如果进一步扩大cDNA目录的规模,利用蛋白质序列数据就可以很容易地从目录中鉴定出通过二维凝胶电泳分离出的所有蛋白质的cDNA。