• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质二级结构的贝叶斯分割

Bayesian segmentation of protein secondary structure.

作者信息

Schmidler S C, Liu J S, Brutlag D L

机构信息

Section on Medical Informatics, Stanford University School of Medicine, CA 94305, USA.

出版信息

J Comput Biol. 2000 Feb-Apr;7(1-2):233-48. doi: 10.1089/10665270050081496.

DOI:10.1089/10665270050081496
PMID:10890399
Abstract

We present a novel method for predicting the secondary structure of a protein from its amino acid sequence. Most existing methods predict each position in turn based on a local window of residues, sliding this window along the length of the sequence. In contrast, we develop a probabilistic model of protein sequence/structure relationships in terms of structural segments, and formulate secondary structure prediction as a general Bayesian inference problem. A distinctive feature of our approach is the ability to develop explicit probabilistic models for alpha-helices, beta-strands, and other classes of secondary structure, incorporating experimentally and empirically observed aspects of protein structure such as helical capping signals, side chain correlations, and segment length distributions. Our model is Markovian in the segments, permitting efficient exact calculation of the posterior probability distribution over all possible segmentations of the sequence using dynamic programming. The optimal segmentation is computed and compared to a predictor based on marginal posterior modes, and the latter is shown to provide significant improvement in predictive accuracy. The marginalization procedure provides exact secondary structure probabilities at each sequence position, which are shown to be reliable estimates of prediction uncertainty. We apply this model to a database of 452 nonhomologous structures, achieving accuracies as high as the best currently available methods. We conclude by discussing an extension of this framework to model nonlocal interactions in protein structures, providing a possible direction for future improvements in secondary structure prediction accuracy.

摘要

我们提出了一种从蛋白质氨基酸序列预测其二级结构的新方法。大多数现有方法基于残基的局部窗口依次预测每个位置,并沿序列长度滑动此窗口。相比之下,我们根据结构片段建立了蛋白质序列/结构关系的概率模型,并将二级结构预测表述为一个通用的贝叶斯推理问题。我们方法的一个显著特点是能够为α螺旋、β链和其他二级结构类别建立明确的概率模型,纳入诸如螺旋封端信号、侧链相关性和片段长度分布等蛋白质结构的实验和经验观察方面。我们的模型在片段上是马尔可夫的,允许使用动态规划对序列的所有可能分割高效精确地计算后验概率分布。计算出最优分割并与基于边际后验模式的预测器进行比较,结果表明后者在预测准确性上有显著提高。边缘化过程在每个序列位置提供精确的二级结构概率,这些概率被证明是预测不确定性的可靠估计。我们将此模型应用于一个包含452个非同源结构的数据库,获得了与目前最佳方法一样高的准确率。最后,我们讨论了将此框架扩展以对蛋白质结构中的非局部相互作用进行建模,为未来提高二级结构预测准确性提供了一个可能的方向。

相似文献

1
Bayesian segmentation of protein secondary structure.蛋白质二级结构的贝叶斯分割
J Comput Biol. 2000 Feb-Apr;7(1-2):233-48. doi: 10.1089/10665270050081496.
2
A Segmental Semi Markov Model for protein secondary structure prediction.一种蛋白质二级结构预测的分段半马尔可夫模型。
Math Biosci. 2009 Oct;221(2):130-5. doi: 10.1016/j.mbs.2009.07.004. Epub 2009 Jul 29.
3
A seqlet-based maximum entropy Markov approach for protein secondary structure prediction.一种基于序列片段的最大熵马尔可夫方法用于蛋白质二级结构预测。
Sci China C Life Sci. 2005 Aug;48(4):394-405. doi: 10.1360/062004-53.
4
Formulation of probabilistic models of protein structure in atomic detail using the reference ratio method.使用参考比率法构建原子细节层面的蛋白质结构概率模型。
Proteins. 2014 Feb;82(2):288-99. doi: 10.1002/prot.24386. Epub 2013 Oct 17.
5
Bayesian models and algorithms for protein β-sheet prediction.贝叶斯模型和算法在蛋白质 β-折叠预测中的应用。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Mar-Apr;8(2):395-409. doi: 10.1109/TCBB.2008.140.
6
Bayesian segmental models with multiple sequence alignment profiles for protein secondary structure and contact map prediction.用于蛋白质二级结构和接触图预测的具有多序列比对图谱的贝叶斯分段模型。
IEEE/ACM Trans Comput Biol Bioinform. 2006 Apr-Jun;3(2):98-113. doi: 10.1109/TCBB.2006.17.
7
Protein secondary structure: entropy, correlations and prediction.蛋白质二级结构:熵、相关性与预测
Bioinformatics. 2004 Jul 10;20(10):1603-11. doi: 10.1093/bioinformatics/bth132. Epub 2004 Feb 26.
8
A Bayesian approach to DNA sequence segmentation.一种用于DNA序列分割的贝叶斯方法。
Biometrics. 2004 Sep;60(3):573-81; discussion 581-8. doi: 10.1111/j.0006-341X.2004.00206.x.
9
Predicting protein secondary structure with probabilistic schemata of evolutionarily derived information.利用进化衍生信息的概率模式预测蛋白质二级结构。
Protein Sci. 1997 Sep;6(9):1963-75. doi: 10.1002/pro.5560060917.
10
Prediction of protein secondary structure content for the twilight zone sequences.预测处于模糊区域序列的蛋白质二级结构含量。
Proteins. 2007 Nov 15;69(3):486-98. doi: 10.1002/prot.21527.

引用本文的文献

1
Efficient enumeration and visualization of helix-coil ensembles.高效枚举和可视化的螺旋-卷曲集。
Biophys J. 2024 Feb 6;123(3):317-333. doi: 10.1016/j.bpj.2023.12.021. Epub 2023 Dec 29.
2
Deep Learning for Genomics: From Early Neural Nets to Modern Large Language Models.深度学习在基因组学中的应用:从早期神经网络到现代大型语言模型。
Int J Mol Sci. 2023 Nov 1;24(21):15858. doi: 10.3390/ijms242115858.
3
PSSNet-An Accurate Super-Secondary Structure for Protein Segmentation.PSSNet-一种用于蛋白质分割的精确超二级结构。
Int J Mol Sci. 2022 Nov 26;23(23):14813. doi: 10.3390/ijms232314813.
4
Deep Ensemble Learning with Atrous Spatial Pyramid Networks for Protein Secondary Structure Prediction.基于空洞空间金字塔网络的深度集成学习用于蛋白质二级结构预测
Biomolecules. 2022 Jun 2;12(6):774. doi: 10.3390/biom12060774.
5
A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes.基于贝叶斯模型的染色质中可及单链 DNA 与免疫球蛋白基因体细胞高频突变关系的计算分析。
PLoS Comput Biol. 2021 Sep 7;17(9):e1009323. doi: 10.1371/journal.pcbi.1009323. eCollection 2021 Sep.
6
DNSS2: Improved ab initio protein secondary structure prediction using advanced deep learning architectures.DNSS2:使用先进深度学习架构改进从头算蛋白质二级结构预测
Proteins. 2021 Feb;89(2):207-217. doi: 10.1002/prot.26007. Epub 2020 Sep 16.
7
Prediction of 8-state protein secondary structures by a novel deep learning architecture.一种新型深度学习架构预测 8 态蛋白质二级结构。
BMC Bioinformatics. 2018 Aug 3;19(1):293. doi: 10.1186/s12859-018-2280-5.
8
CNNH_PSS: protein 8-class secondary structure prediction by convolutional neural network with highway.CNN_H_PSS:基于卷积神经网络和高速公路的 8 类蛋白质二级结构预测。
BMC Bioinformatics. 2018 May 8;19(Suppl 4):60. doi: 10.1186/s12859-018-2067-8.
9
Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields.基于深度卷积神经场的蛋白质二级结构预测
Sci Rep. 2016 Jan 11;6:18962. doi: 10.1038/srep18962.
10
Bayesian model of protein primary sequence for secondary structure prediction.用于二级结构预测的蛋白质一级序列的贝叶斯模型。
PLoS One. 2014 Oct 14;9(10):e109832. doi: 10.1371/journal.pone.0109832. eCollection 2014.