• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于马尔可夫链的采样方法在最近邻热力学模型下探索RNA二级结构及扩展应用

Markov Chain-Based Sampling for Exploring RNA Secondary Structure under the Nearest Neighbor Thermodynamic Model and Extended Applications.

作者信息

Kirkpatrick Anna, Patton Kalen, Tetali Prasad, Mitchell Cassie

机构信息

School of Mathematics, Georgia Institute of Technology, Atlanta, GA 30332, USA.

School of Computer Science, Georgia Institute of Technology, Atlanta, GA 30332, USA.

出版信息

Math Comput Appl. 2020 Dec;25(4). doi: 10.3390/mca25040067. Epub 2020 Oct 10.

DOI:10.3390/mca25040067
PMID:35924027
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9344895/
Abstract

Ribonucleic acid (RNA) secondary structures and branching properties are important for determining functional ramifications in biology. While energy minimization of the Nearest Neighbor Thermodynamic Model (NNTM) is commonly used to identify such properties (number of hairpins, maximum ladder distance, etc.), it is difficult to know whether the resultant values fall within expected dispersion thresholds for a given energy function. The goal of this study was to construct a Markov chain capable of examining the dispersion of RNA secondary structures and branching properties obtained from NNTM energy function minimization independent of a specific nucleotide sequence. Plane trees are studied as a model for RNA secondary structure, with energy assigned to each tree based on the NNTM, and a corresponding Gibbs distribution is defined on the trees. Through a bijection between plane trees and 2-Motzkin paths, a Markov chain converging to the Gibbs distribution is constructed, and fast mixing time is established by estimating the spectral gap of the chain. The spectral gap estimate is obtained through a series of decompositions of the chain and also by building on known mixing time results for other chains on Dyck paths. The resulting algorithm can be used as a tool for exploring the branching structure of RNA, especially for long sequences, and to examine branching structure dependence on energy model parameters. Full exposition is provided for the mathematical techniques used with the expectation that these techniques will prove useful in bioinformatics, computational biology, and additional extended applications.

摘要

核糖核酸(RNA)的二级结构和分支特性对于确定生物学中的功能分支很重要。虽然最近邻热力学模型(NNTM)的能量最小化通常用于识别此类特性(发夹数量、最大梯级距离等),但很难知道所得值是否落在给定能量函数的预期离散阈值范围内。本研究的目的是构建一个马尔可夫链,能够独立于特定核苷酸序列检查从NNTM能量函数最小化获得的RNA二级结构和分支特性的离散情况。平面树被作为RNA二级结构的模型进行研究,根据NNTM为每棵树分配能量,并在这些树上定义相应的吉布斯分布。通过平面树与2 - 莫兹金路径之间的双射,构建了一个收敛到吉布斯分布的马尔可夫链,并通过估计该链的谱隙建立了快速混合时间。谱隙估计是通过对链的一系列分解以及基于对戴克路径上其他链的已知混合时间结果得到的。所得算法可作为探索RNA分支结构的工具,特别是对于长序列,并用于检查分支结构对能量模型参数的依赖性。文中对所使用的数学技术进行了全面阐述,期望这些技术在生物信息学、计算生物学及其他扩展应用中被证明是有用的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66b4/9344895/b664e93d8b60/nihms-1820213-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66b4/9344895/14e3011e01ed/nihms-1820213-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66b4/9344895/5a814d861eed/nihms-1820213-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66b4/9344895/b664e93d8b60/nihms-1820213-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66b4/9344895/14e3011e01ed/nihms-1820213-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66b4/9344895/5a814d861eed/nihms-1820213-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66b4/9344895/b664e93d8b60/nihms-1820213-f0003.jpg

相似文献

1
Markov Chain-Based Sampling for Exploring RNA Secondary Structure under the Nearest Neighbor Thermodynamic Model and Extended Applications.基于马尔可夫链的采样方法在最近邻热力学模型下探索RNA二级结构及扩展应用
Math Comput Appl. 2020 Dec;25(4). doi: 10.3390/mca25040067. Epub 2020 Oct 10.
2
Effects of non-nearest neighbors on the thermodynamic stability of RNA GNRA hairpin tetraloops.非近邻效应对 RNA GNRA 发夹四环四联体热力学稳定性的影响。
Biochemistry. 2012 Mar 20;51(11):2192-8. doi: 10.1021/bi300008j. Epub 2012 Mar 8.
3
Improving RNA Branching Predictions: Advances and Limitations.提高 RNA 分支预测的准确性:进展与局限。
Genes (Basel). 2021 Mar 25;12(4):469. doi: 10.3390/genes12040469.
4
TurboFold: iterative probabilistic estimation of secondary structures for multiple RNA sequences.TurboFold:用于多个 RNA 序列的二级结构的迭代概率估计。
BMC Bioinformatics. 2011 Apr 20;12:108. doi: 10.1186/1471-2105-12-108.
5
RNA folding kinetics using Monte Carlo and Gillespie algorithms.使用蒙特卡洛算法和 Gillespie 算法的 RNA 折叠动力学
J Math Biol. 2018 Apr;76(5):1195-1227. doi: 10.1007/s00285-017-1169-7. Epub 2017 Aug 5.
6
Roadblocked monotonic paths and the enumeration of coalescent histories for non-matching caterpillar gene trees and species trees.受阻单调路径与不匹配毛毛虫基因树和物种树的合并历史枚举
Adv Appl Math. 2020 Feb;113. doi: 10.1016/j.aam.2019.101939. Epub 2019 Oct 31.
7
Energy parameters and novel algorithms for an extended nearest neighbor energy model of RNA.RNA 扩展最近邻能量模型的能量参数和新算法。
PLoS One. 2014 Feb 21;9(2):e85412. doi: 10.1371/journal.pone.0085412. eCollection 2014.
8
Thermodynamic characterization of RNA duplexes containing naturally occurring 1 x 2 nucleotide internal loops.含有天然存在的1×2核苷酸内环的RNA双链体的热力学表征。
Biochemistry. 2007 Dec 18;46(50):14715-24. doi: 10.1021/bi701024w. Epub 2007 Nov 20.
9
Estimating Energy Parameters for RNA Secondary Structure Predictions Using Both Experimental and Computational Data.利用实验和计算数据估算 RNA 二级结构预测的能量参数。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Sep-Oct;16(5):1645-1655. doi: 10.1109/TCBB.2018.2813388. Epub 2018 Mar 12.
10
Searching for convergence in phylogenetic Markov chain Monte Carlo.在系统发育马尔可夫链蒙特卡罗方法中寻找收敛性。
Syst Biol. 2006 Aug;55(4):553-65. doi: 10.1080/10635150600812544.

本文引用的文献

1
A biologist's guide to Bayesian phylogenetic analysis.生物学家贝叶斯系统发育分析指南。
Nat Ecol Evol. 2017 Oct;1(10):1446-1454. doi: 10.1038/s41559-017-0280-x. Epub 2017 Sep 21.
2
Sizes of Long RNA Molecules Are Determined by the Branching Patterns of Their Secondary Structures.长链RNA分子的大小由其二级结构的分支模式决定。
Biophys J. 2016 Nov 15;111(10):2077-2085. doi: 10.1016/j.bpj.2016.10.014.
3
A simple introduction to Markov Chain Monte-Carlo sampling.马尔可夫链蒙特卡罗采样简介。
Psychon Bull Rev. 2018 Feb;25(1):143-154. doi: 10.3758/s13423-016-1015-8.
4
Markov chain Monte Carlo: an introduction for epidemiologists.马尔可夫链蒙特卡罗法:流行病学研究人员入门。
Int J Epidemiol. 2013 Apr;42(2):627-34. doi: 10.1093/ije/dyt043.
5
Automated and fast building of three-dimensional RNA structures.自动化快速构建三维 RNA 结构。
Sci Rep. 2012;2:734. doi: 10.1038/srep00734. Epub 2012 Oct 15.
6
Evaluation of a sophisticated SCFG design for RNA secondary structure prediction.用于RNA二级结构预测的复杂SCFG设计评估。
Theory Biosci. 2011 Dec;130(4):313-36. doi: 10.1007/s12064-011-0139-7. Epub 2011 Dec 2.
7
ViennaRNA Package 2.0.维也纳RNA软件包2.0
Algorithms Mol Biol. 2011 Nov 24;6:26. doi: 10.1186/1748-7188-6-26.
8
Automated RNA tertiary structure prediction from secondary structure and low-resolution restraints.基于二级结构和低分辨率限制条件的自动化RNA三级结构预测
J Comput Chem. 2011 Jul 30;32(10):2232-44. doi: 10.1002/jcc.21806. Epub 2011 Apr 21.
9
Improvements of the hierarchical approach for predicting RNA tertiary structure.预测 RNA 三级结构的分层方法的改进。
J Biomol Struct Dyn. 2011 Apr;28(5):815-26. doi: 10.1080/07391102.2011.10508609.
10
Parametric analysis of RNA branching configurations.RNA 分支结构的参数分析。
Bull Math Biol. 2011 Apr;73(4):754-76. doi: 10.1007/s11538-010-9607-3. Epub 2011 Jan 5.