Yamashita Riu, Suzuki Yutaka, Nakai Kenta, Sugano Sumio
Laboratory of Genome Database, Human Genome Center, The Institute of Medical Science, The University of Tokyo, 4-6-1, Shirokane-dai, Minato-ku, Tokyo 108-8639, Japan.
C R Biol. 2003 Oct-Nov;326(10-11):987-91. doi: 10.1016/j.crvi.2003.09.028.
Using the 5'-end sequence data from 'oligo-capped' cDNAs, we generated a representative full-length cDNA dataset for 4870 RefSeq entries, and analyzed the 5' untranslated region (UTR) of these genes. To our surprise, about half of the 4870 genes had an upstream ATG before the ATG that starts the longest open reading frame (ORF), suggesting that about half of them have small ORFs in their 5' UTR of average length of 31 amino acids. They require attention for further analysis to identify their biological role.
利用来自“oligo-capped”cDNA的5'端序列数据,我们针对4870个RefSeq条目生成了一个具有代表性的全长cDNA数据集,并分析了这些基因的5'非翻译区(UTR)。令我们惊讶的是,在这4870个基因中,约有一半在起始最长开放阅读框(ORF)的ATG之前存在上游ATG,这表明其中约一半基因在其平均长度为31个氨基酸的5'UTR中具有小开放阅读框。它们需要进一步分析以确定其生物学作用。