• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于隐马尔可夫模型的蛋白质结构字母表:蛋白质局部形状的学习捕获序列特异性。

Hidden Markov model-derived structural alphabet for proteins: the learning of protein local shapes captures sequence specificity.

作者信息

Camproux A C, Tufféry P

机构信息

Equipe de Bioinformatique Génomique et Moléculaire, INSERM U726, Université Paris 7, case 7113, 2 place Jussieu, 75251 Paris, France.

出版信息

Biochim Biophys Acta. 2005 Aug 5;1724(3):394-403. doi: 10.1016/j.bbagen.2005.05.019.

DOI:10.1016/j.bbagen.2005.05.019
PMID:16040198
Abstract

Understanding and predicting protein structures depend on the complexity and the accuracy of the models used to represent them. We have recently set up a Hidden Markov Model to optimally compress protein three-dimensional conformations into a one-dimensional series of letters of a structural alphabet. Such a model learns simultaneously the shape of representative structural letters describing the local conformation and the logic of their connections, i.e. the transition matrix between the letters. Here, we move one step further and report some evidence that such a model of protein local architecture also captures some accurate amino acid features. All the letters have specific and distinct amino acid distributions. Moreover, we show that words of amino acids can have significant propensities for some letters. Perspectives point towards the prediction of the series of letters describing the structure of a protein from its amino acid sequence.

摘要

理解和预测蛋白质结构取决于用于表示它们的模型的复杂性和准确性。我们最近建立了一个隐马尔可夫模型,以将蛋白质三维构象最佳地压缩为结构字母表的一维字母序列。这样的模型同时学习描述局部构象的代表性结构字母的形状及其连接逻辑,即字母之间的转移矩阵。在此,我们更进一步,报告一些证据表明这种蛋白质局部结构模型也捕捉到了一些准确的氨基酸特征。所有字母都有特定且不同的氨基酸分布。此外,我们表明氨基酸的词对某些字母可能有显著的倾向。前景指向从蛋白质的氨基酸序列预测描述其结构的字母序列。

相似文献

1
Hidden Markov model-derived structural alphabet for proteins: the learning of protein local shapes captures sequence specificity.基于隐马尔可夫模型的蛋白质结构字母表:蛋白质局部形状的学习捕获序列特异性。
Biochim Biophys Acta. 2005 Aug 5;1724(3):394-403. doi: 10.1016/j.bbagen.2005.05.019.
2
A hidden markov model derived structural alphabet for proteins.一种用于蛋白质的隐马尔可夫模型衍生结构字母表。
J Mol Biol. 2004 Jun 4;339(3):591-605. doi: 10.1016/j.jmb.2004.04.005.
3
HMMSTR: a hidden Markov model for local sequence-structure correlations in proteins.HMMSTR:一种用于蛋白质局部序列-结构相关性的隐马尔可夫模型。
J Mol Biol. 2000 Aug 4;301(1):173-90. doi: 10.1006/jmbi.2000.3837.
4
SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information.SAFlex:一种结构字母扩展,用于整合蛋白质结构的灵活性和缺失数据信息。
PLoS One. 2018 Jul 5;13(7):e0198854. doi: 10.1371/journal.pone.0198854. eCollection 2018.
5
Methods for optimizing the structure alphabet sequences of proteins.优化蛋白质结构字母序列的方法。
Comput Biol Med. 2007 Nov;37(11):1610-6. doi: 10.1016/j.compbiomed.2007.03.002. Epub 2007 May 10.
6
Sequence-based protein structure prediction using a reduced state-space hidden Markov model.使用简化状态空间隐马尔可夫模型进行基于序列的蛋白质结构预测。
Comput Biol Med. 2007 Sep;37(9):1211-24. doi: 10.1016/j.compbiomed.2006.10.014. Epub 2006 Dec 11.
7
Enhanced protein fold recognition using a structural alphabet.使用结构字母表增强蛋白质折叠识别
Proteins. 2009 Jul;76(1):129-37. doi: 10.1002/prot.22324.
8
SA-Search: a web tool for protein structure mining based on a Structural Alphabet.SA-Search:一种基于结构字母表的蛋白质结构挖掘网络工具。
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W545-8. doi: 10.1093/nar/gkh467.
9
Hidden Markov model approach for identifying the modular framework of the protein backbone.用于识别蛋白质主链模块化框架的隐马尔可夫模型方法。
Protein Eng. 1999 Dec;12(12):1063-73. doi: 10.1093/protein/12.12.1063.
10
Protein design with L- and D-alpha-amino acid structures as the alphabet.以L型和D型α-氨基酸结构为字母表的蛋白质设计。
Acc Chem Res. 2008 Oct;41(10):1301-8. doi: 10.1021/ar700265t. Epub 2008 Jul 22.

引用本文的文献

1
Experimental Validation of MHC Class I and II Peptide-Based Potential Vaccine Candidates for Human Papilloma Virus Using Sprague-Dawly Models.利用 Sprague-Dawly 模型对人乳头瘤病毒的 MHC Ⅰ类和Ⅱ类肽基潜在疫苗候选物进行实验验证。
Molecules. 2023 Feb 10;28(4):1687. doi: 10.3390/molecules28041687.
2
Computational reconstruction of atomistic protein structures from coarse-grained models.基于粗粒度模型的蛋白质原子结构的计算重建
Comput Struct Biotechnol J. 2019 Dec 26;18:162-176. doi: 10.1016/j.csbj.2019.12.007. eCollection 2020.
3
SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information.
SAFlex:一种结构字母扩展,用于整合蛋白质结构的灵活性和缺失数据信息。
PLoS One. 2018 Jul 5;13(7):e0198854. doi: 10.1371/journal.pone.0198854. eCollection 2018.
4
Exploring the potential of a structural alphabet-based tool for mining multiple target conformations and target flexibility insight.探索一种基于结构字母表的工具挖掘多个目标构象及洞察目标灵活性的潜力。
PLoS One. 2017 Aug 17;12(8):e0182972. doi: 10.1371/journal.pone.0182972. eCollection 2017.
5
Insight into SNPs and epitopes of E protein of newly emerged genotype-I isolates of JEV from Midnapur, West Bengal, India.对来自印度西孟加拉邦米德纳布尔新出现的I型日本脑炎病毒分离株E蛋白单核苷酸多态性和表位的深入研究
BMC Immunol. 2017 Mar 6;18(1):13. doi: 10.1186/s12865-017-0197-9.
6
Tertiary alphabet for the observable protein structural universe.可观测蛋白质结构宇宙的三级字母表。
Proc Natl Acad Sci U S A. 2016 Nov 22;113(47):E7438-E7447. doi: 10.1073/pnas.1607178113. Epub 2016 Nov 3.
7
A New Hidden Markov Model for Protein Quality Assessment Using Compatibility Between Protein Sequence and Structure.一种利用蛋白质序列与结构兼容性进行蛋白质质量评估的新型隐马尔可夫模型
Tsinghua Sci Technol. 2015 Mar 25;19(6):559-567. doi: 10.1109/tst.2014.6961026.
8
Deciphering the shape and deformation of secondary structures through local conformation analysis.通过局部构象分析解析二级结构的形状和变形。
BMC Struct Biol. 2011 Feb 1;11:9. doi: 10.1186/1472-6807-11-9.
9
Mining protein loops using a structural alphabet and statistical exceptionality.使用结构字母表和统计异常挖掘蛋白质环。
BMC Bioinformatics. 2010 Feb 4;11:75. doi: 10.1186/1471-2105-11-75.
10
Reconstruction of protein backbones from the BriX collection of canonical protein fragments.从标准蛋白质片段的BriX集合中重建蛋白质骨架。
PLoS Comput Biol. 2008 May 23;4(5):e1000083. doi: 10.1371/journal.pcbi.1000083.