• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用深度学习架构分析RNA翻译为翻译控制提供了新的见解。

Analysis of RNA translation with a deep learning architecture provides new insight into translation control.

作者信息

Fan Xiaojuan, Chang Tiangen, Chen Chuyun, Hafner Markus, Wang Zefeng

机构信息

Bio-med Big Data Center, CAS Key Laboratory of Computational Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Nutrition and Health.

RNA Molecular Biology Laboratory, National Institute of Arthritis and Musculoskeletal and Skin Disease, Bethesda, MD, USA.

出版信息

bioRxiv. 2024 Jul 2:2023.07.08.548206. doi: 10.1101/2023.07.08.548206.

DOI:10.1101/2023.07.08.548206
PMID:39005319
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11244891/
Abstract

Accurate annotation of coding regions in RNAs is essential for understanding gene translation. We developed a deep neural network to directly predict and analyze translation initiation and termination sites from RNA sequences. Trained with human transcripts, our model learned hidden rules of translation control and achieved a near perfect prediction of canonical translation sites across entire human transcriptome. Surprisingly, this model revealed a new role of codon usage in regulating translation termination, which was experimentally validated. We also identified thousands of new open reading frames in mRNAs or lncRNAs, some of which were confirmed experimentally. The model trained with human mRNAs achieved high prediction accuracy of canonical translation sites in all eukaryotes and good prediction in polycistronic transcripts from prokaryotes or RNA viruses, suggesting a high degree of conservation in translation control. Collectively, we present a general and efficient deep learning model for RNA translation, generating new insights into the complexity of translation regulation.

摘要

准确注释RNA中的编码区域对于理解基因翻译至关重要。我们开发了一种深度神经网络,用于直接从RNA序列预测和分析翻译起始和终止位点。通过人类转录本进行训练,我们的模型学习到了翻译控制的隐藏规则,并在整个人类转录组中对规范翻译位点实现了近乎完美的预测。令人惊讶的是,该模型揭示了密码子使用在调节翻译终止中的新作用,这一点已通过实验得到验证。我们还在mRNA或lncRNA中鉴定出数千个新的开放阅读框,其中一些已通过实验得到证实。用人类mRNA训练的模型在所有真核生物中对规范翻译位点都具有很高的预测准确性,并且对来自原核生物或RNA病毒的多顺反子转录本也有良好的预测,这表明翻译控制具有高度的保守性。总体而言,我们提出了一种通用且高效的用于RNA翻译的深度学习模型,为翻译调控的复杂性提供了新的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f51d/11244891/42d389108368/nihpp-2023.07.08.548206v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f51d/11244891/ac8636eb8d3b/nihpp-2023.07.08.548206v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f51d/11244891/e275e1ec5432/nihpp-2023.07.08.548206v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f51d/11244891/dd44a6b85711/nihpp-2023.07.08.548206v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f51d/11244891/42d389108368/nihpp-2023.07.08.548206v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f51d/11244891/ac8636eb8d3b/nihpp-2023.07.08.548206v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f51d/11244891/e275e1ec5432/nihpp-2023.07.08.548206v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f51d/11244891/dd44a6b85711/nihpp-2023.07.08.548206v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f51d/11244891/42d389108368/nihpp-2023.07.08.548206v2-f0004.jpg

相似文献

1
Analysis of RNA translation with a deep learning architecture provides new insight into translation control.使用深度学习架构分析RNA翻译为翻译控制提供了新的见解。
bioRxiv. 2024 Jul 2:2023.07.08.548206. doi: 10.1101/2023.07.08.548206.
2
Analysis of RNA translation with a deep learning architecture provides new insight into translation control.使用深度学习架构对RNA翻译进行分析,为翻译控制提供了新的见解。
Nucleic Acids Res. 2025 Apr 10;53(7). doi: 10.1093/nar/gkaf277.
3
RNA G-quadruplexes at upstream open reading frames cause DHX36- and DHX9-dependent translation of human mRNAs.RNA G-四链体在上游开放阅读框处引发 DHX36 和 DHX9 依赖性的人类 mRNA 翻译。
Genome Biol. 2018 Dec 27;19(1):229. doi: 10.1186/s13059-018-1602-2.
4
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
5
Cellular mRNAs access second ORFs using a novel amino acid sequence-dependent coupled translation termination-reinitiation mechanism.细胞 mRNA 利用一种新的依赖于氨基酸序列的偶联翻译终止-再起始机制来获取第二个 ORF。
RNA. 2014 Mar;20(3):373-81. doi: 10.1261/rna.041574.113. Epub 2014 Jan 10.
6
Improving deep models of protein-coding potential with a Fourier-transform architecture and machine translation task.利用傅里叶变换架构和机器翻译任务改进蛋白质编码潜力的深度模型。
bioRxiv. 2023 Apr 19:2023.04.03.535488. doi: 10.1101/2023.04.03.535488.
7
Regulation of translation via mRNA structure in prokaryotes and eukaryotes.原核生物和真核生物中通过mRNA结构对翻译的调控。
Gene. 2005 Nov 21;361:13-37. doi: 10.1016/j.gene.2005.06.037. Epub 2005 Oct 5.
8
Translation of the downstream ORF from bicistronic mRNAs by human cells: Impact of codon usage and splicing in the upstream ORF.人类细胞对双顺反子mRNA下游开放阅读框的翻译:上游开放阅读框中密码子使用和剪接的影响。
Protein Sci. 2025 Feb;34(2):e70036. doi: 10.1002/pro.70036.
9
Global analysis of ribosome-associated noncoding RNAs unveils new modes of translational regulation.全球核糖体相关非编码 RNA 分析揭示了新的翻译调控模式。
Proc Natl Acad Sci U S A. 2017 Nov 14;114(46):E10018-E10027. doi: 10.1073/pnas.1708433114. Epub 2017 Oct 30.
10
Non-Canonical Translation Initiation Mechanisms Employed by Eukaryotic Viral mRNAs.真核病毒 mRNA 所采用的非经典翻译起始机制。
Biochemistry (Mosc). 2021 Sep;86(9):1060-1094. doi: 10.1134/S0006297921090042.

本文引用的文献

1
The new uORFdb: integrating literature, sequence, and variation data in a central hub for uORF research.新的 uORFdb:在 uORF 研究的中心枢纽中整合文献、序列和变异数据。
Nucleic Acids Res. 2023 Jan 6;51(D1):D328-D336. doi: 10.1093/nar/gkac899.
2
The Codon Statistics Database: A Database of Codon Usage Bias.密码子统计数据库:一个密码子使用偏性数据库。
Mol Biol Evol. 2022 Aug 3;39(8). doi: 10.1093/molbev/msac157.
3
Machine learning predicts translation initiation sites in neurologic diseases with nucleotide repeat expansions.机器学习预测核苷酸重复扩展导致的神经疾病中的翻译起始位点。
PLoS One. 2022 Jun 1;17(6):e0256411. doi: 10.1371/journal.pone.0256411. eCollection 2022.
4
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
5
The SARS-CoV-2 subgenome landscape and its novel regulatory features.SARS-CoV-2 亚基因组景观及其新型调控特征。
Mol Cell. 2021 May 20;81(10):2135-2147.e5. doi: 10.1016/j.molcel.2021.02.036. Epub 2021 Mar 3.
6
Utilizing Amino Acid Composition and Entropy of Potential Open Reading Frames to Identify Protein-Coding Genes.利用潜在开放阅读框的氨基酸组成和熵来识别蛋白质编码基因。
Microorganisms. 2021 Jan 8;9(1):129. doi: 10.3390/microorganisms9010129.
7
SARS-CoV-2 Disrupts Splicing, Translation, and Protein Trafficking to Suppress Host Defenses.SARS-CoV-2 通过干扰剪接、翻译和蛋白质运输来抑制宿主防御。
Cell. 2020 Nov 25;183(5):1325-1339.e21. doi: 10.1016/j.cell.2020.10.004. Epub 2020 Oct 8.
8
Global sequence features based translation initiation site prediction in human genomic sequences.基于全局序列特征的人类基因组序列翻译起始位点预测
Heliyon. 2020 Sep 14;6(9):e04825. doi: 10.1016/j.heliyon.2020.e04825. eCollection 2020 Sep.
9
Pervasive functional translation of noncanonical human open reading frames.广泛存在的非规范人类开放阅读框的功能翻译。
Science. 2020 Mar 6;367(6482):1140-1146. doi: 10.1126/science.aay0262.
10
Stop codon context influences genome-wide stimulation of termination codon readthrough by aminoglycosides.终止密码子上下文影响氨基糖苷类药物对终止密码子通读的全基因组刺激。
Elife. 2020 Jan 23;9:e52611. doi: 10.7554/eLife.52611.