Suppr超能文献

人类5'非翻译区序列的CART分类

CART classification of human 5' UTR sequences.

作者信息

Davuluri R V, Suzuki Y, Sugano S, Zhang M Q

机构信息

Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA.

出版信息

Genome Res. 2000 Nov;10(11):1807-16. doi: 10.1101/gr.gr-1460r.

Abstract

A nonredundant database of 2312 full-length human 5'-untranslated regions (UTRs) was carefully prepared using state-of-the-art experimental and computational technologies. A comprehensive computational analysis of this data was conducted for characterizing the 5' UTR features. Classification and regression tree (CART) analysis was used to classify the data into three distinct classes. Class I consists of mRNAs that are believed to be poorly translated with long 5' UTRs filled with potential inhibitory features. Class II consists of terminal oligopyrimidine tract (TOP) mRNAs that are regulated in a growth-dependent manner, and class III consists of mRNAs with favorable 5' UTR features that may help efficient translation. The most accurate tree we found has 92.5% classification accuracy as estimated by cross validation. The classification model included the presence of TOP, a secondary structure, 5' UTR length, and the presence of upstream AUGs (uAUGs) as the most relevant variables. The present classification and characterization of the 5' UTRs provide precious information for better understanding the translational regulation of human mRNAs. Furthermore, this database and classification can help people build better computational models for predicting the 5'-terminal exon and separating the 5' UTR from the coding region.

摘要

利用最先进的实验和计算技术,精心构建了一个包含2312个全长人类5'非翻译区(UTR)的非冗余数据库。对这些数据进行了全面的计算分析,以表征5'UTR的特征。使用分类与回归树(CART)分析将数据分为三个不同的类别。第一类由5'UTR较长且充满潜在抑制特征、翻译效率较低的mRNA组成。第二类由以生长依赖方式调控的末端寡嘧啶序列(TOP)mRNA组成,第三类由具有有利于高效翻译的5'UTR特征的mRNA组成。通过交叉验证估计,我们发现的最准确的树具有92.5%的分类准确率。分类模型将TOP的存在、二级结构、5'UTR长度以及上游AUG(uAUG)的存在作为最相关的变量。目前对5'UTR的分类和表征为更好地理解人类mRNA的翻译调控提供了宝贵信息。此外,该数据库和分类有助于人们构建更好的计算模型,用于预测5'末端外显子并将5'UTR与编码区区分开来。

相似文献

1
CART classification of human 5' UTR sequences.人类5'非翻译区序列的CART分类
Genome Res. 2000 Nov;10(11):1807-16. doi: 10.1101/gr.gr-1460r.
4
Deciphering the rules by which 5'-UTR sequences affect protein expression in yeast.解析影响酵母中 5'UTR 序列蛋白质表达的规则。
Proc Natl Acad Sci U S A. 2013 Jul 23;110(30):E2792-801. doi: 10.1073/pnas.1222534110. Epub 2013 Jul 5.

引用本文的文献

5
Investigating the NRAS 5' UTR as a target for small molecules.研究NRAS 5'UTR 作为小分子的靶标。
Cell Chem Biol. 2023 Jun 15;30(6):643-657.e8. doi: 10.1016/j.chembiol.2023.05.004. Epub 2023 May 30.

本文引用的文献

4
Prediction of eukaryotic mRNA translational properties.真核生物信使核糖核酸翻译特性的预测
Bioinformatics. 1999 Jul-Aug;15(7-8):704-12. doi: 10.1093/bioinformatics/15.7.704.
9
Translational control: the cancer connection.翻译控制:与癌症的关联
Int J Biochem Cell Biol. 1999 Jan;31(1):1-23. doi: 10.1016/s1357-2725(98)00127-7.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验