• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

mRNA的自然选择与算法设计

Natural selection and algorithmic design of mRNA.

作者信息

Cohen Barry, Skiena Steven

机构信息

Department of Computer Science, University Heights, New Jersey Institute of Technology, Newark, NJ 07102, USA.

出版信息

J Comput Biol. 2003;10(3-4):419-32. doi: 10.1089/10665270360688101.

DOI:10.1089/10665270360688101
PMID:12935336
Abstract

Messenger RNA (mRNA) sequences serve as templates for proteins according to the triplet code, in which each of the 4(3) = 64 different codons (sequences of three consecutive nucleotide bases) in RNA either terminate transcription or map to one of the 20 different amino acids (or residues) which build up proteins. Because there are more codons than residues, there is inherent redundancy in the coding. Certain residues (e.g., tryptophan) have only a single corresponding codon, while other residues (e.g., arginine) have as many as six corresponding codons. This freedom implies that the number of possible RNA sequences coding for a given protein grows exponentially in the length of the protein. Thus nature has wide latitude to select among mRNA sequences which are informationally equivalent, but structurally and energetically divergent. In this paper, we explore how nature takes advantage of this freedom and how to algorithmically design structures more energetically favorable than have been built through natural selection. In particular: (1) Natural Selection--we perform the first large-scale computational experiment comparing the stability of mRNA sequences from a variety of organisms to random synonymous sequences which respect the codon preferences of the organism. This experiment was conducted on over 27,000 sequences from 34 microbial species with 36 genomic structures. We provide evidence that in all genomic structures highly stable sequences are disproportionately abundant, and in 19 of 36 cases highly unstable sequences are disproportionately abundant. This suggests that the stability of mRNA sequences is subject to natural selection. (2) Artificial Selection--motivated by these biological results, we examine the algorithmic problem of designing the most stable and unstable mRNA sequences which code for a target protein. We give a polynomial-time dynamic programming solution to the most stable sequence problem (MSSP), which is asymptotically no more complex than secondary structure prediction. We show that the corresponding least stable sequence problem (LSSP) is NP-complete, and develop two heuristics for the construction of such sequences. We have implemented these algorithms, and present experimental results placing the high/low stability sequences in context with both wildtype and random encodings. Our implementation has already been applied to the design of RNA "code-words" creating little or no secondary structure in RNA computing (Brenneman and Condon, 2001; Marathe et al., 2001), and we anticipate a variety of other applications of this work to sequence design problems (Skiena, 2001).

摘要

信使核糖核酸(mRNA)序列根据三联体密码作为蛋白质的模板,其中RNA中4³ = 64种不同密码子(三个连续核苷酸碱基的序列)中的每一个要么终止转录,要么映射到构成蛋白质的20种不同氨基酸(或残基)之一。由于密码子比残基多,编码中存在固有的冗余。某些残基(例如色氨酸)只有一个相应的密码子,而其他残基(例如精氨酸)有多达六个相应的密码子。这种自由度意味着编码给定蛋白质的可能RNA序列的数量随蛋白质长度呈指数增长。因此,自然界有很大的自由度在信息上等效但结构和能量上不同的mRNA序列中进行选择。在本文中,我们探讨了自然界如何利用这种自由度,以及如何通过算法设计出比通过自然选择构建的结构在能量上更有利的结构。具体而言:(1)自然选择——我们进行了首次大规模计算实验,比较了来自各种生物体的mRNA序列与遵循生物体密码子偏好的随机同义序列的稳定性。该实验对来自34种微生物物种的27000多个序列和36种基因组结构进行了研究。我们提供的证据表明,在所有基因组结构中,高度稳定的序列不成比例地丰富,在36种情况中的19种中,高度不稳定的序列不成比例地丰富。这表明mRNA序列的稳定性受到自然选择的影响。(2)人工选择——受这些生物学结果的启发,我们研究了设计编码目标蛋白质的最稳定和最不稳定mRNA序列的算法问题。我们给出了最稳定序列问题(MSSP)的多项式时间动态规划解决方案,其渐近复杂度不超过二级结构预测。我们表明相应的最不稳定序列问题(LSSP)是NP完全问题,并开发了两种构建此类序列的启发式方法。我们已经实现了这些算法,并展示了将高/低稳定性序列与野生型和随机编码相关联的实验结果。我们的实现已经应用于RNA“码字”的设计,这些“码字”在RNA计算中几乎不产生或不产生二级结构(Brenneman和Condon,2001;Marathe等人,2001),并且我们预计这项工作在序列设计问题上还有各种其他应用(Skiena,2001)。

相似文献

1
Natural selection and algorithmic design of mRNA.mRNA的自然选择与算法设计
J Comput Biol. 2003;10(3-4):419-32. doi: 10.1089/10665270360688101.
2
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
3
Statistical evidence for conserved, local secondary structure in the coding regions of eukaryotic mRNAs and pre-mRNAs.真核生物mRNA和前体mRNA编码区域中保守的局部二级结构的统计证据。
Nucleic Acids Res. 2005 Nov 7;33(19):6338-48. doi: 10.1093/nar/gki923. Print 2005.
4
Indications that "codon boundaries" are physico-chemically defined and that protein-folding information is contained in the redundant exon bases.有迹象表明“密码子边界”是由物理化学定义的,并且蛋白质折叠信息包含在冗余的外显子碱基中。
Theor Biol Med Model. 2006 Aug 7;3:28. doi: 10.1186/1742-4682-3-28.
5
An unbiased adaptive sampling algorithm for the exploration of RNA mutational landscapes under evolutionary pressure.一种用于在进化压力下探索RNA突变景观的无偏自适应采样算法。
J Comput Biol. 2011 Nov;18(11):1465-79. doi: 10.1089/cmb.2011.0181. Epub 2011 Oct 28.
6
Evolution of the genetic code through progressive symmetry breaking.遗传密码通过渐进对称破缺的演变。
J Theor Biol. 2014 Apr 21;347:95-108. doi: 10.1016/j.jtbi.2014.01.002. Epub 2014 Jan 14.
7
Predicting candidate genomic sequences that correspond to synthetic functional RNA motifs.预测与合成功能性RNA基序相对应的候选基因组序列。
Nucleic Acids Res. 2005 Oct 27;33(18):6057-69. doi: 10.1093/nar/gki911. Print 2005.
8
Correlation between nucleotide composition and folding energy of coding sequences with special attention to wobble bases.编码序列的核苷酸组成与折叠能之间的相关性,特别关注摆动碱基。
Theor Biol Med Model. 2008 Jul 29;5:14. doi: 10.1186/1742-4682-5-14.
9
Conflicting selection pressures on synonymous codon use in yeast suggest selection on mRNA secondary structures.酵母中同义密码子使用上相互冲突的选择压力表明对mRNA二级结构存在选择。
BMC Evol Biol. 2008 Jul 31;8:224. doi: 10.1186/1471-2148-8-224.
10
RNA secondary structure design.RNA二级结构设计
Phys Rev E Stat Nonlin Soft Matter Phys. 2007 Feb;75(2 Pt 1):021920. doi: 10.1103/PhysRevE.75.021920. Epub 2007 Feb 28.

引用本文的文献

1
Fine scale structural information substantially improves multivariate regression model for mRNA in-vial degradation prediction.精细尺度结构信息显著改进了用于mRNA瓶内降解预测的多元回归模型。
bioRxiv. 2025 Aug 23:2025.08.15.670605. doi: 10.1101/2025.08.15.670605.
2
mRNA folding algorithms for structure and codon optimization.用于结构和密码子优化的mRNA折叠算法。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf386.
3
EnsembleDesign: messenger RNA design minimizing ensemble free energy via probabilistic lattice parsing.
整体设计:通过概率晶格解析使整体自由能最小化的信使核糖核酸设计。
Bioinformatics. 2025 Jul 1;41(Supplement_1):i391-i400. doi: 10.1093/bioinformatics/btaf245.
4
Algorithm for optimized mRNA design improves stability and immunogenicity.优化 mRNA 设计的算法可提高稳定性和免疫原性。
Nature. 2023 Sep;621(7978):396-403. doi: 10.1038/s41586-023-06127-z. Epub 2023 May 2.
5
Combinatorial optimization of mRNA structure, stability, and translation for RNA-based therapeutics.基于 mRNA 的治疗方法中,mRNA 结构、稳定性和翻译的组合优化。
Nat Commun. 2022 Mar 22;13(1):1536. doi: 10.1038/s41467-022-28776-w.
6
Stability Modelling of mRNA Vaccine Quality Based on Temperature Monitoring throughout the Distribution Chain.基于整个分销链温度监测的mRNA疫苗质量稳定性建模
Pharmaceutics. 2022 Feb 17;14(2):430. doi: 10.3390/pharmaceutics14020430.
7
Theoretical basis for stabilizing messenger RNA through secondary structure design.通过二级结构设计稳定信使 RNA 的理论基础。
Nucleic Acids Res. 2021 Oct 11;49(18):10604-10617. doi: 10.1093/nar/gkab764.
8
Theoretical basis for stabilizing messenger RNA through secondary structure design.通过二级结构设计稳定信使核糖核酸的理论基础。
bioRxiv. 2021 Feb 19:2020.08.22.262931. doi: 10.1101/2020.08.22.262931.
9
Design of RNAs: comparing programs for inverse RNA folding.RNA 设计:比较反向 RNA 折叠程序。
Brief Bioinform. 2018 Mar 1;19(2):350-358. doi: 10.1093/bib/bbw120.
10
incaRNAfbinv: a web server for the fragment-based design of RNA sequences.incaRNAfbinv:一个用于基于片段设计RNA序列的网络服务器。
Nucleic Acids Res. 2016 Jul 8;44(W1):W308-14. doi: 10.1093/nar/gkw440. Epub 2016 May 16.