Suppr超能文献

未鉴定的人类基因编码序列的预测。IV. 通过对人类细胞系KG-1的cDNA克隆进行分析推导得到的40个新基因(KIAA0121-KIAA0160)的编码序列

Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1.

作者信息

Nagase T, Seki N, Tanaka A, Ishikawa K, Nomura N

机构信息

Kazusa DNA Research Institute, Chiba, Japan.

出版信息

DNA Res. 1995 Aug 31;2(4):167-74, 199-210. doi: 10.1093/dnares/2.4.167.

Abstract

In this series of projects regarding the accumulation of sequence information of unidentified human genes, we newly deduced the sequences of 40 full-length cDNA clones of human cell line KG-1, and predicted the coding sequences of the corresponding genes, named KIAA0121 to 0160. The results of a computer search of public databases indicated that the sequences of 13 genes were unrelated to any reported genes, while the remaining 27 genes carried sequences which showed some similarities to known genes. Obvious unique sequences noted were as follows. A stretch of triplet repeats was contained in each of three genes: These were GAG(Glu) in KIAA0122 and KIAA0147, and TCC(Ser) in KIAA0150. A stretch of 10 amino acid-residues was repeated 21 times in KIAA0139, and a homologous sequence of 76-78 nucleotides was found repeated 6 times in the untranslated region of KIAA0125. Northern hybridization analysis demonstrated that 13 genes were expressed in a cell- or tissue-specific manner. Although a vast number of expressed sequence tags (ESTs) have been registered for comprehensive analysis of cDNA clones, our sequence data indicated that their distribution is very unbalanced: e.g. while no EST hit 7 genes, 85 ESTs fell in a single gene.

摘要

在这一系列关于未鉴定人类基因序列信息积累的项目中,我们新推导了人类细胞系KG-1的40个全长cDNA克隆的序列,并预测了相应基因的编码序列,命名为KIAA0121至0160。对公共数据库进行计算机检索的结果表明,13个基因的序列与任何已报道的基因均无关联,而其余27个基因的序列与已知基因有一些相似之处。明显的独特序列如下。三个基因中的每一个都包含一段三联体重复序列:KIAA0122和KIAA0147中的是GAG(Glu),KIAA0150中的是TCC(Ser)。一段10个氨基酸残基的序列在KIAA0139中重复了21次,在KIAA0125的非翻译区发现一段76 - 78个核苷酸的同源序列重复了6次。Northern杂交分析表明,13个基因以细胞或组织特异性方式表达。尽管已经登记了大量的表达序列标签(EST)用于cDNA克隆的综合分析,但我们的序列数据表明它们的分布非常不均衡:例如,7个基因没有EST匹配,而单个基因有85个EST匹配。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验