Suppr超能文献

人类转录单元总数的评估。

Assessment of the total number of human transcription units.

作者信息

Das M, Burge C B, Park E, Colinas J, Pelletier J

机构信息

Department of Biochemistry, McGill University, Rm 810, 3655 Drummond St., Montreal, Quebec, H3G 1Y6, Canada.

出版信息

Genomics. 2001 Sep;77(1-2):71-8. doi: 10.1006/geno.2001.6620.

Abstract

Variation in the estimates of the number of genes encoded by the human genome (28,000-120,000) attests to the difficulty of systematically identifying human genes. Sequencing of human chromosome 22 (Chr22) provided the first comprehensive, unbiased view of an entire human chromosome, and intensive analysis of this sequence identified 545 genes and 134 pseudogenes that had similarity or identity to known proteins and/or ESTs and which were listed in the gene annotation (http://www.sanger.ac.uk/HGP/Chr22). This analysis yielded an estimate of approximately 36,000 functional expressed genes in the human genome (and 9000 pseudogenes). However, a key uncertainty in this estimate was that hundreds of additional genes beyond those annotated in the Chr22 sequence are predicted by the gene prediction program Genscan, an unknown number of which might represent additional expressed genes. To determine what fraction of these "predicted novel genes" (PNGs) represents expressed human genes, we used a sensitive RT-PCR assay to detect predicted transcripts in 17 tissues and one cell line. Our results indicate that at least 5000-9000 additional human genes which lack similarity to known genes or proteins exist in the human genome, increasing baseline gene estimates to approximately 41,000-45,000.

摘要

人类基因组编码基因数量估计值的变化范围(28,000 - 120,000个)证明了系统鉴定人类基因的困难。人类22号染色体(Chr22)的测序首次提供了对整个人类染色体的全面、无偏差的视图,对该序列的深入分析鉴定出了545个基因和134个假基因,它们与已知蛋白质和/或EST具有相似性或同一性,并在基因注释中列出(http://www.sanger.ac.uk/HGP/Chr22)。该分析得出人类基因组中约有36,000个功能性表达基因(以及9000个假基因)的估计值。然而,该估计中的一个关键不确定性在于,基因预测程序Genscan预测了Chr22序列注释之外的数百个额外基因,其中未知数量的基因可能代表额外的表达基因。为了确定这些“预测的新基因”(PNG)中有多少比例代表人类表达基因,我们使用了灵敏的RT-PCR分析方法来检测17种组织和一种细胞系中的预测转录本。我们的结果表明,人类基因组中至少存在5000 - 9000个与已知基因或蛋白质缺乏相似性的额外人类基因,将基线基因估计值提高到约41,000 - 45,000个。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验