• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从结构中精确学习RNA能量参数。

Exact learning of RNA energy parameters from structure.

作者信息

Chitsaz Hamidreza, Aminisharifabad Mohammad

机构信息

1Department of Computer Science, Colorado State University, Fort Collins, Colorado.

2Department of Computer Science, Wayne State University, Detroit, Michigan.

出版信息

J Comput Biol. 2015 Jun;22(6):463-73. doi: 10.1089/cmb.2014.0164. Epub 2015 Mar 10.

DOI:10.1089/cmb.2014.0164
PMID:25756654
Abstract

We consider the problem of exact learning of parameters of a linear RNA energy model from secondary structure data. A necessary and sufficient condition for learnability of parameters is derived, which is based on computing the convex hull of union of translated Newton polytopes of input sequences. The set of learned energy parameters is characterized as the convex cone generated by the normal vectors to those facets of the resulting polytope that are incident to the origin. In practice, the sufficient condition may not be satisfied by the entire training data set; hence, computing a maximal subset of training data for which the sufficient condition is satisfied is often desired. We show that the problem is NP-hard in general for an arbitrary dimensional feature space. Using a randomized greedy algorithm, we select a subset of RNA STRAND v2.0 database that satisfies the sufficient condition for separate A-U, C-G, G-U base pair counting model. The set of learned energy parameters includes experimentally measured energies of A-U, C-G, and G-U pairs; hence, our parameter set is in agreement with the Turner parameters.

摘要

我们考虑从二级结构数据中精确学习线性RNA能量模型参数的问题。推导了参数可学习性的充要条件,该条件基于计算输入序列平移牛顿多胞体并集的凸包。学习到的能量参数集被表征为由所得多胞体中与原点相交的那些面的法向量生成的凸锥。在实际中,整个训练数据集可能不满足充分条件;因此,通常希望计算满足充分条件的训练数据的最大子集。我们表明,对于任意维特征空间,该问题一般是NP难的。使用随机贪心算法,我们从RNA STRAND v2.0数据库中选择了一个满足单独A-U、C-G、G-U碱基对计数模型充分条件的子集。学习到的能量参数集包括A-U、C-G和G-U对的实验测量能量;因此,我们的参数集与特纳参数一致。

相似文献

1
Exact learning of RNA energy parameters from structure.从结构中精确学习RNA能量参数。
J Comput Biol. 2015 Jun;22(6):463-73. doi: 10.1089/cmb.2014.0164. Epub 2015 Mar 10.
2
The RNA Newton polytope and learnability of energy parameters.RNA 牛顿多胞形与能量参数的可学习性。
Bioinformatics. 2013 Jul 1;29(13):i300-7. doi: 10.1093/bioinformatics/btt226.
3
Thermodynamic characterization of single mismatches found in naturally occurring RNA.天然存在的RNA中单个错配的热力学特征。
Biochemistry. 2007 Nov 20;46(46):13425-36. doi: 10.1021/bi701311c. Epub 2007 Oct 24.
4
Parametric analysis of RNA branching configurations.RNA 分支结构的参数分析。
Bull Math Biol. 2011 Apr;73(4):754-76. doi: 10.1007/s11538-010-9607-3. Epub 2011 Jan 5.
5
Extracting stacking interaction parameters for RNA from the data set of native structures.从天然结构数据集中提取RNA的堆积相互作用参数。
J Mol Biol. 2005 Mar 18;347(1):53-69. doi: 10.1016/j.jmb.2004.12.012. Epub 2005 Jan 12.
6
Statistical potentials for hairpin and internal loops improve the accuracy of the predicted RNA structure.发夹和内部环的统计势可提高预测 RNA 结构的准确性。
J Mol Biol. 2011 Oct 21;413(2):473-83. doi: 10.1016/j.jmb.2011.08.033. Epub 2011 Aug 23.
7
Moments of the Boltzmann distribution for RNA secondary structures.RNA二级结构的玻尔兹曼分布矩。
Bull Math Biol. 2005 Sep;67(5):1031-47. doi: 10.1016/j.bulm.2004.12.003.
8
Predicting RNA secondary structures with arbitrary pseudoknots by maximizing the number of stacking pairs.通过最大化堆叠对的数量来预测具有任意假结的RNA二级结构。
J Comput Biol. 2003;10(6):981-95. doi: 10.1089/106652703322756186.
9
Computing the partition function and sampling for saturated secondary structures of RNA, with respect to the Turner energy model.关于特纳能量模型,计算RNA饱和二级结构的配分函数并进行采样。
J Comput Biol. 2007 Mar;14(2):190-215. doi: 10.1089/cmb.2006.0012.
10
A sensitivity analysis of RNA folding nearest neighbor parameters identifies a subset of free energy parameters with the greatest impact on RNA secondary structure prediction.RNA折叠最近邻参数的敏感性分析确定了对RNA二级结构预测影响最大的自由能参数子集。
Nucleic Acids Res. 2017 Jun 2;45(10):6168-6176. doi: 10.1093/nar/gkx170.