Cooke R, Raynal M, Laudié M, Grellet F, Delseny M, Morris P C, Guerrier D, Giraudat J, Quigley F, Clabault G, Li Y F, Mache R, Krivitzky M, Gy I J, Kreis M, Lecharny A, Parmentier Y, Marbach J, Fleck J, Clément B, Philipps G, Hervé C, Bardet C, Tremousaygue D, Höfte H
Laboratoire de Physiologie et Biologie Moléculaires Végétales, URA565 du CNRS, Université de Perpignan, France.
Plant J. 1996 Jan;9(1):101-24. doi: 10.1046/j.1365-313x.1996.09010101.x.
Nearly 7000 Arabidopsis thaliana expressed sequence tags (ESTs) from 10 cDNA libraries have been sequenced, of which almost 5000 non-redundant tags have been submitted to the EMBL data bank. The quality of the cDNA libraries used is analysed. Similarity searches in international protein data banks have detected significant similarities to a wide range of proteins from many organisms. Alignment with ESTs from the rice systematic sequencing project has revealed amino acid motifs that are conserved between the two organisms, thus identifying tags to genes encoding highly conserved proteins. These genes are candidates for a common framework in genome mapping projects in different plants.