Improving the prediction accuracy of protein abundance in Escherichia coli using mRNA accessibility.

Affiliation

Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Japan.

Publication information

Nucleic Acids Res. 2020 Aug 20;48(14):e81. doi: 10.1093/nar/gkaa481.

DOI:10.1093/nar/gkaa481

PMID:32504488

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7641306/

Abstract

RNA secondary structure around translation initiation sites strongly affects the abundance of expressed proteins in Escherichia coli. However, detailed secondary structural features governing protein abundance remain elusive. Recent advances in high-throughput DNA synthesis and experimental systems enable us to obtain large amounts of data. Here, we evaluated six types of structural features using two large-scale datasets. We found that accessibility, which is the probability that a given region around the start codon has no base-paired nucleotides, showed the highest correlation with protein abundance in both datasets. Accessibility showed a significantly higher correlation (Spearman's ρ = 0.709) than the widely used minimum free energy (0.554) in one of the datasets. Interestingly, accessibility showed the highest correlation only when it was calculated by a log-linear model, indicating that the RNA structural model and how to utilize it are important. Furthermore, by combining the accessibility and activity of the Shine-Dalgarno sequence, we devised a method for predicting protein abundance more accurately than existing methods. We inferred that the log-linear model has a broader probabilistic distribution than the widely used Turner energy model, which contributed to more accurate quantification of ribosome accessibility to translation initiation sites.
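The abstract's definition of accessibility (the probability that every nucleotide in a region is unpaired) and its use of Spearman's ρ can be illustrated with a minimal sketch. This is not the authors' method: the paper computes accessibility with a log-linear RNA structural model over the full structural ensemble, whereas the toy below assumes a tiny, hand-written ensemble of dot-bracket structures with made-up weights. The names `accessibility`, `ranks`, `spearman`, and `toy_ensemble` are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence, Tuple


def accessibility(ensemble: Sequence[Tuple[str, float]], start: int, end: int) -> float:
    """Probability that positions start..end-1 are all unpaired.

    `ensemble` holds (dot-bracket structure, unnormalized weight) pairs.
    Real tools sum over an exponentially large ensemble by dynamic
    programming; explicit enumeration here is a toy assumption.
    """
    total = sum(weight for _, weight in ensemble)
    open_weight = sum(
        weight for structure, weight in ensemble
        if set(structure[start:end]) <= {"."}  # region contains only unpaired bases
    )
    return open_weight / total


def ranks(values: Sequence[float]) -> List[float]:
    """1-based ranks; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman's rho: Pearson correlation of the rank vectors."""
    rx, ry = ranks(x), ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical 8-nt ensemble; weights are illustrative, not fitted energies.
toy_ensemble = [("((....))", 3.0), ("........", 1.0), ("..(...).", 1.0)]
region_accessibility = accessibility(toy_ensemble, 2, 6)
```

In this toy ensemble, two of the three structures leave positions 2 to 5 fully unpaired, so the region's accessibility is their combined share of the total weight. In a study like the one described, one such value per sequence would be rank-correlated against measured protein abundance with `spearman`.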

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6e6/7641306/a2d5b7f470c1/gkaa481fig1.jpg

Similar articles

Anatomy of Escherichia coli ribosome binding sites.

J Mol Biol. 2001 Oct 12;313(1):215-28. doi: 10.1006/jmbi.2001.5040.

Structured mRNAs regulate translation initiation by binding to the platform of the ribosome.

Cell. 2007 Sep 21;130(6):1019-31. doi: 10.1016/j.cell.2007.07.008.

Novel Translation Initiation Regulation Mechanism in Escherichia coli ptrB Mediated by a 5'-Terminal AUG.

J Bacteriol. 2017 Jun 27;199(14). doi: 10.1128/JB.00091-17. Print 2017 Jul 15.

Trapping the ribosome to control gene expression.

Cell. 2007 Sep 21;130(6):983-5. doi: 10.1016/j.cell.2007.09.002.

Temperature-dependent translation of leaderless and canonical mRNAs in Escherichia coli.

FEMS Microbiol Lett. 2002 Jun 4;211(2):161-7. doi: 10.1111/j.1574-6968.2002.tb11219.x.

Epsilon as an initiator of translation of CAT mRNA in Escherichia coli.

Biochem Biophys Res Commun. 2000 Jul 5;273(2):528-31. doi: 10.1006/bbrc.2000.2958.

Unfolding of mRNA secondary structure by the bacterial translation initiation complex.

Mol Cell. 2006 Apr 7;22(1):105-15. doi: 10.1016/j.molcel.2006.02.014.

Translation initiation of the replication initiator repB gene of promiscuous plasmid pMV158 is led by an extended non-SD sequence.

Plasmid. 2013 Jul;70(1):69-77. doi: 10.1016/j.plasmid.2013.01.011. Epub 2013 Feb 16.

The downstream box: an efficient and independent translation initiation signal in Escherichia coli.

EMBO J. 1996 Feb 1;15(3):665-74.

Cited by

Transfer learning for cross-context prediction of protein expression from 5'UTR sequence.

Nucleic Acids Res. 2024 Jul 22;52(13):e58. doi: 10.1093/nar/gkae491.

DeepRaccess: high-speed RNA accessibility prediction using deep learning.

Front Bioinform. 2023 Oct 10;3:1275787. doi: 10.3389/fbinf.2023.1275787. eCollection 2023.

PARROT: Prediction of enzyme abundances using protein-constrained metabolic models.

PLoS Comput Biol. 2023 Oct 19;19(10):e1011549. doi: 10.1371/journal.pcbi.1011549. eCollection 2023 Oct.

Ultradeep characterisation of translational sequence determinants refutes rare-codon hypothesis and unveils quadruplet base pairing of initiator tRNA and transcript.

Nucleic Acids Res. 2023 Mar 21;51(5):2377-2396. doi: 10.1093/nar/gkad040.

QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity.

Nucleic Acids Res. 2022 Jul 22;50(13):e73. doi: 10.1093/nar/gkac220.

Analysis of 11,430 recombinant protein production experiments reveals that protein yield is tunable by synonymous codon changes of translation initiation sites.

PLoS Comput Biol. 2021 Oct 5;17(10):e1009461. doi: 10.1371/journal.pcbi.1009461. eCollection 2021 Oct.

Learning the Regulatory Code of Gene Expression.

Front Mol Biosci. 2021 Jun 10;8:673363. doi: 10.3389/fmolb.2021.673363. eCollection 2021.

Universal Constraints on Protein Evolution in the Long-Term Evolution Experiment with Escherichia coli.

Genome Biol Evol. 2021 Jun 8;13(6). doi: 10.1093/gbe/evab070.

TISIGNER.com: web services for improving recombinant protein production.

Nucleic Acids Res. 2021 Jul 2;49(W1):W654-W661. doi: 10.1093/nar/gkab175.

Ligand-dependent tRNA processing by a rationally designed RNase P riboswitch.

Nucleic Acids Res. 2021 Feb 22;49(3):1784-1800. doi: 10.1093/nar/gkaa1282.
