• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用明确表示空间坐标的3D隐马尔可夫模型来建模和比较蛋白质结构。

Using 3D Hidden Markov Models that explicitly represent spatial coordinates to model and compare protein structures.

作者信息

Alexandrov Vadim, Gerstein Mark

机构信息

Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Ave, New Haven, CT 06511, USA.

出版信息

BMC Bioinformatics. 2004 Jan 9;5:2. doi: 10.1186/1471-2105-5-2.

DOI:10.1186/1471-2105-5-2
PMID:14715091
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC344530/
Abstract

BACKGROUND

Hidden Markov Models (HMMs) have proven very useful in computational biology for such applications as sequence pattern matching, gene-finding, and structure prediction. Thus far, however, they have been confined to representing 1D sequence (or the aspects of structure that could be represented by character strings).

RESULTS

We develop an HMM formalism that explicitly uses 3D coordinates in its match states. The match states are modeled by 3D Gaussian distributions centered on the mean coordinate position of each alpha carbon in a large structural alignment. The transition probabilities depend on the spread of the neighboring match states and on the number of gaps found in the structural alignment. We also develop methods for aligning query structures against 3D HMMs and scoring the result probabilistically. For 1D HMMs these tasks are accomplished by the Viterbi and forward algorithms. However, these will not work in unmodified form for the 3D problem, due to non-local quality of structural alignment, so we develop extensions of these algorithms for the 3D case. Several applications of 3D HMMs for protein structure classification are reported. A good separation of scores for different fold families suggests that the described construct is quite useful for protein structure analysis.

CONCLUSION

We have created a rigorous 3D HMM representation for protein structures and implemented a complete set of routines for building 3D HMMs in C and Perl. The code is freely available from http://www.molmovdb.org/geometry/3dHMM, and at this site we also have a simple prototype server to demonstrate the features of the described approach.

摘要

背景

隐马尔可夫模型(HMM)在计算生物学中已被证明在序列模式匹配、基因查找和结构预测等应用中非常有用。然而,到目前为止,它们仅限于表示一维序列(或可以用字符串表示的结构方面)。

结果

我们开发了一种HMM形式体系,在其匹配状态中明确使用三维坐标。匹配状态由以大型结构比对中每个α碳原子的平均坐标位置为中心的三维高斯分布建模。转移概率取决于相邻匹配状态的分布以及在结构比对中发现的间隙数量。我们还开发了将查询结构与三维HMM进行比对并对结果进行概率评分的方法。对于一维HMM,这些任务由维特比算法和前向算法完成。然而,由于结构比对的非局部性质,这些算法未经修改不能用于三维问题,因此我们针对三维情况开发了这些算法的扩展。报告了三维HMM在蛋白质结构分类中的几个应用。不同折叠家族得分的良好分离表明所描述的构建体对蛋白质结构分析非常有用。

结论

我们为蛋白质结构创建了一种严格的三维HMM表示,并在C和Perl中实现了一套完整的构建三维HMM的例程。代码可从http://www.molmovdb.org/geometry/3dHMM免费获取,在该网站我们还有一个简单的原型服务器来展示所描述方法的功能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d9/344530/65ee09a7d4e7/1471-2105-5-2-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d9/344530/c7382ffed741/1471-2105-5-2-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d9/344530/dfdc4b957f7a/1471-2105-5-2-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d9/344530/65ee09a7d4e7/1471-2105-5-2-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d9/344530/c7382ffed741/1471-2105-5-2-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d9/344530/dfdc4b957f7a/1471-2105-5-2-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d9/344530/65ee09a7d4e7/1471-2105-5-2-7.jpg

相似文献

1
Using 3D Hidden Markov Models that explicitly represent spatial coordinates to model and compare protein structures.使用明确表示空间坐标的3D隐马尔可夫模型来建模和比较蛋白质结构。
BMC Bioinformatics. 2004 Jan 9;5:2. doi: 10.1186/1471-2105-5-2.
2
Algorithms for incorporating prior topological information in HMMs: application to transmembrane proteins.用于将先验拓扑信息纳入隐马尔可夫模型的算法:在跨膜蛋白中的应用。
BMC Bioinformatics. 2006 Apr 5;7:189. doi: 10.1186/1471-2105-7-189.
3
Hidden Markov models that use predicted local structure for fold recognition: alphabets of backbone geometry.利用预测的局部结构进行折叠识别的隐马尔可夫模型:主链几何结构字母表
Proteins. 2003 Jun 1;51(4):504-14. doi: 10.1002/prot.10369.
4
Applications of generalized pair hidden Markov models to alignment and gene finding problems.广义配对隐马尔可夫模型在序列比对和基因查找问题中的应用。
J Comput Biol. 2002;9(2):389-99. doi: 10.1089/10665270252935520.
5
HMMs in Protein Fold Classification.蛋白质折叠分类中的隐马尔可夫模型
Methods Mol Biol. 2017;1552:13-27. doi: 10.1007/978-1-4939-6753-7_2.
6
Calibrating E-values for hidden Markov models using reverse-sequence null models.使用反向序列空模型校准隐马尔可夫模型的E值。
Bioinformatics. 2005 Nov 15;21(22):4107-15. doi: 10.1093/bioinformatics/bti629. Epub 2005 Aug 25.
7
Accelerated Profile HMM Searches.加速轮廓隐马尔可夫模型搜索。
PLoS Comput Biol. 2011 Oct;7(10):e1002195. doi: 10.1371/journal.pcbi.1002195. Epub 2011 Oct 20.
8
HMM sampling and applications to gene finding and alternative splicing.隐马尔可夫模型采样及其在基因发现和可变剪接中的应用。
Bioinformatics. 2003 Oct;19 Suppl 2:ii36-41. doi: 10.1093/bioinformatics/btg1057.
9
MRFalign: protein homology detection through alignment of Markov random fields.MRFalign:通过马尔可夫随机场比对进行蛋白质同源性检测。
PLoS Comput Biol. 2014 Mar 27;10(3):e1003500. doi: 10.1371/journal.pcbi.1003500. eCollection 2014 Mar.
10
An evolutionary method for learning HMM structure: prediction of protein secondary structure.一种学习隐马尔可夫模型结构的进化方法:蛋白质二级结构预测
BMC Bioinformatics. 2007 Sep 21;8:357. doi: 10.1186/1471-2105-8-357.

引用本文的文献

1
Actin-interacting and flagellar proteins in Leishmania spp.: Bioinformatics predictions to functional assignments in phagosome formation.利什曼原虫属中的肌动蛋白相互作用蛋白和鞭毛蛋白:对吞噬体形成中功能分配的生物信息学预测。
Genet Mol Biol. 2009 Jul;32(3):652-65. doi: 10.1590/S1415-47572009000300033. Epub 2009 Sep 1.
2
A discriminative method for family-based protein remote homology detection that combines inductive logic programming and propositional models.基于归纳逻辑编程和命题模型的家族蛋白质远程同源检测的判别方法。
BMC Bioinformatics. 2011 Mar 23;12:83. doi: 10.1186/1471-2105-12-83.
3
Superimposition of protein structures with dynamically weighted RMSD.

本文引用的文献

1
Towards discovering structural signatures of protein folds based on logical hidden Markov models.基于逻辑隐马尔可夫模型探索蛋白质折叠的结构特征。
Pac Symp Biocomput. 2003:192-203. doi: 10.1142/9789812776303_0019.
2
HMMSTR: a hidden Markov model for local sequence-structure correlations in proteins.HMMSTR:一种用于蛋白质局部序列-结构相关性的隐马尔可夫模型。
J Mol Biol. 2000 Aug 4;301(1):173-90. doi: 10.1006/jmbi.2000.3837.
3
Genie--gene finding in Drosophila melanogaster.精灵——黑腹果蝇中的基因发现
蛋白质结构的叠加与动态加权 RMSD。
J Mol Model. 2010 Feb;16(2):211-22. doi: 10.1007/s00894-009-0538-6. Epub 2009 Jul 1.
4
Improving model construction of profile HMMs for remote homology detection through structural alignment.通过结构比对改进用于远程同源性检测的轮廓隐马尔可夫模型的模型构建。
BMC Bioinformatics. 2007 Nov 9;8:435. doi: 10.1186/1471-2105-8-435.
5
Gaussian-weighted RMSD superposition of proteins: a structural comparison for flexible proteins and predicted protein structures.蛋白质的高斯加权均方根偏差叠加:柔性蛋白质和预测蛋白质结构的结构比较。
Biophys J. 2006 Jun 15;90(12):4558-73. doi: 10.1529/biophysj.105.066654. Epub 2006 Mar 24.
6
Normal modes for predicting protein motions: a comprehensive database assessment and associated Web tool.预测蛋白质运动的正常模式:全面的数据库评估及相关网络工具
Protein Sci. 2005 Mar;14(3):633-43. doi: 10.1110/ps.04882105.
7
Recent applications of Hidden Markov Models in computational biology.隐马尔可夫模型在计算生物学中的最新应用。
Genomics Proteomics Bioinformatics. 2004 May;2(2):84-96. doi: 10.1016/s1672-0229(04)02014-5.
Genome Res. 2000 Apr;10(4):529-38. doi: 10.1101/gr.10.4.529.
4
Assessing annotation transfer for genomics: quantifying the relations between protein sequence, structure and function through traditional and probabilistic scores.评估基因组学中的注释转移:通过传统分数和概率分数量化蛋白质序列、结构与功能之间的关系。
J Mol Biol. 2000 Mar 17;297(1):233-49. doi: 10.1006/jmbi.2000.3550.
5
The Pfam protein families database.Pfam蛋白质家族数据库。
Nucleic Acids Res. 2000 Jan 1;28(1):263-6. doi: 10.1093/nar/28.1.263.
6
Structural assignments to the Mycoplasma genitalium proteins show extensive gene duplications and domain rearrangements.对生殖支原体蛋白质的结构分析表明存在广泛的基因重复和结构域重排。
Proc Natl Acad Sci U S A. 1998 Dec 8;95(25):14658-63. doi: 10.1073/pnas.95.25.14658.
7
A hidden Markov model for predicting transmembrane helices in protein sequences.一种用于预测蛋白质序列中跨膜螺旋的隐马尔可夫模型。
Proc Int Conf Intell Syst Mol Biol. 1998;6:175-82.
8
A unified statistical framework for sequence comparison and structure comparison.用于序列比较和结构比较的统一统计框架。
Proc Natl Acad Sci U S A. 1998 May 26;95(11):5913-20. doi: 10.1073/pnas.95.11.5913.
9
Comprehensive assessment of automatic structural alignment against a manual standard, the scop classification of proteins.针对手动标准(蛋白质的scop分类)对自动结构比对进行全面评估。
Protein Sci. 1998 Feb;7(2):445-56. doi: 10.1002/pro.5560070226.
10
Predicting protein structure using hidden Markov models.使用隐马尔可夫模型预测蛋白质结构。
Proteins. 1997;Suppl 1:134-9. doi: 10.1002/(sici)1097-0134(1997)1+<134::aid-prot18>3.3.co;2-q.