Nagase T, Kikuno R, Ohara O
Kazusa DNA Research Institute, Kisarazu, Chiba, Japan.
DNA Res. 2001 Dec 31;8(6):319-27. doi: 10.1093/dnares/8.6.319.
As an extension of human cDNA projects for accumulating sequence information on the coding sequences of unidentified genes, we herein present the entire sequences of 50 cDNA clones, named KIAA1939-KIAA1988. cDNA clones to be entirely sequenced were selected by two approaches based on their protein-coding potentialities prior to sequencing: 10 cDNA clones were chosen because their encoding proteins had a molecular mass larger than 50 kDa in an in vitro transcription/translation system; the remaining 40 cDNA clones were selected because their putative proteins-as determined by analysis of the genomic sequences flanked by both the terminal sequences of cDNAs using the GENSCAN gene prediction program-were larger than 400 amino acid residues. According to the sequence data, the average sizes of the inserts and corresponding open reading frames of cDNA clones analyzed here were 4.6 kb and 1.9 kb (643 amino acid residues), respectively. From the results of homology and motif searches against the public databases, the functional categories of the 31 predicted gene products could be assigned; 25 of these predicted gene products (81%) were classified into proteins relating to cell signaling/communication, nucleic acid management, and cell structure/motility. The expression profiles of the genes were also studied in 10 human tissues, 8 brain regions, spinal cord, fetal brain and fetal liver by reverse transcription-coupled polymerase chain reaction, the products of which were quantified by enzyme-linked immunosorbent assay.
作为人类cDNA项目的延伸,旨在积累未知基因编码序列的信息,我们在此展示了50个cDNA克隆(命名为KIAA1939 - KIAA1988)的完整序列。在测序之前,通过两种基于其蛋白质编码潜力的方法选择了要进行全序列测定的cDNA克隆:10个cDNA克隆是因为它们在体外转录/翻译系统中编码的蛋白质分子量大于50 kDa而被选中;其余40个cDNA克隆是因为使用GENSCAN基因预测程序分析cDNA末端序列两侧的基因组序列所确定的推定蛋白质大于400个氨基酸残基而被选中。根据序列数据,此处分析的cDNA克隆的插入片段和相应开放阅读框的平均大小分别为4.6 kb和1.9 kb(643个氨基酸残基)。通过对公共数据库进行同源性和基序搜索的结果,可以确定31个预测基因产物的功能类别;其中25个预测基因产物(81%)被归类为与细胞信号传导/通讯、核酸管理以及细胞结构/运动相关的蛋白质。还通过逆转录偶联聚合酶链反应研究了这些基因在10种人类组织、8个脑区、脊髓、胎儿脑和胎儿肝脏中的表达谱,其产物通过酶联免疫吸附测定进行定量。