• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于利用结构信息对蛋白质家族进行建模的轮廓条件随机场。

Profile conditional random fields for modeling protein families with structural information.

作者信息

Kinjo Akira R

机构信息

Institute for Protein Research, Osaka University, Suita, Osaka, 565-0871, Japan.

出版信息

Biophysics (Nagoya-shi). 2009 May 30;5:37-44. doi: 10.2142/biophysics.5.37. eCollection 2009.

DOI:10.2142/biophysics.5.37
PMID:27857577
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5036637/
Abstract

A statistical model of protein families, called profile conditional random fields (CRFs), is proposed. This model may be regarded as an integration of the profile hidden Markov model (HMM) and the Finkelstein-Reva (FR) theory of protein folding. While the model structure of the profile CRF is almost identical to the profile HMM, it can incorporate arbitrary correlations in the sequences to be aligned to the model. In addition, like in the FR theory, the profile CRF can incorporate long-range pair-wise interactions between model states via mean-field-like approximations. We give the detailed formulation of the model, self-consistent approximations for treating long-range interactions, and algorithms for computing partition functions and marginal probabilities. We also outline the methods for the global optimization of model parameters as well as a Bayesian framework for parameter learning and selection of optimal alignments.

摘要

提出了一种称为轮廓条件随机场(CRF)的蛋白质家族统计模型。该模型可被视为轮廓隐马尔可夫模型(HMM)与芬克尔斯坦 - 雷瓦(FR)蛋白质折叠理论的整合。虽然轮廓CRF的模型结构与轮廓HMM几乎相同,但它可以纳入要与模型比对的序列中的任意相关性。此外,与FR理论一样,轮廓CRF可以通过类似平均场的近似纳入模型状态之间的长程成对相互作用。我们给出了模型的详细公式、处理长程相互作用的自洽近似以及计算配分函数和边缘概率的算法。我们还概述了模型参数全局优化的方法以及用于参数学习和最优比对选择的贝叶斯框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e0b/5036637/f80bfcd40cc7/5_37_f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e0b/5036637/f80bfcd40cc7/5_37_f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e0b/5036637/f80bfcd40cc7/5_37_f1.jpg

相似文献

1
Profile conditional random fields for modeling protein families with structural information.用于利用结构信息对蛋白质家族进行建模的轮廓条件随机场。
Biophysics (Nagoya-shi). 2009 May 30;5:37-44. doi: 10.2142/biophysics.5.37. eCollection 2009.
2
HMM-ModE--improved classification using profile hidden Markov models by optimising the discrimination threshold and modifying emission probabilities with negative training sequences.HMM-ModE——通过优化判别阈值并利用负训练序列修改发射概率,使用轮廓隐马尔可夫模型改进分类。
BMC Bioinformatics. 2007 Mar 27;8:104. doi: 10.1186/1471-2105-8-104.
3
The infinite-order conditional random field model for sequential data modeling.用于序列数据建模的无穷阶条件随机场模型。
IEEE Trans Pattern Anal Mach Intell. 2013 Jun;35(6):1523-34. doi: 10.1109/TPAMI.2012.208.
4
Protein fold recognition using segmentation conditional random fields (SCRFs).使用分割条件随机场(SCRF)进行蛋白质折叠识别。
J Comput Biol. 2006 Mar;13(2):394-406. doi: 10.1089/cmb.2006.13.394.
5
ProbPFP: a multiple sequence alignment algorithm combining hidden Markov model optimized by particle swarm optimization with partition function.ProbPFP:一种通过粒子群优化算法优化的隐马尔可夫模型与分区函数相结合的多序列比对算法。
BMC Bioinformatics. 2019 Nov 25;20(Suppl 18):573. doi: 10.1186/s12859-019-3132-7.
6
Stochastic motif extraction using hidden Markov model.使用隐马尔可夫模型的随机基序提取
Proc Int Conf Intell Syst Mol Biol. 1994;2:121-9.
7
Conditional random fields as classifiers for three-class motor-imagery brain-computer interfaces.条件随机场作为三分类运动想象脑-机接口的分类器。
J Neural Eng. 2011 Apr;8(2):025013. doi: 10.1088/1741-2560/8/2/025013. Epub 2011 Mar 24.
8
Hidden Markov models in computational biology. Applications to protein modeling.计算生物学中的隐马尔可夫模型。在蛋白质建模中的应用。
J Mol Biol. 1994 Feb 4;235(5):1501-31. doi: 10.1006/jmbi.1994.1104.
9
Discriminative learning for dynamic state prediction.用于动态状态预测的判别式学习
IEEE Trans Pattern Anal Mach Intell. 2009 Oct;31(10):1847-61. doi: 10.1109/TPAMI.2009.37.
10
Sequence alignments and pair hidden Markov models using evolutionary history.使用进化历史的序列比对和配对隐马尔可夫模型。
J Mol Biol. 2003 Oct 17;333(2):453-60. doi: 10.1016/j.jmb.2003.08.015.

引用本文的文献

1
A unified statistical model of protein multiple sequence alignment integrating direct coupling and insertions.整合直接耦合和插入的蛋白质多序列比对统一统计模型。
Biophys Physicobiol. 2016 Apr 22;13:45-62. doi: 10.2142/biophysico.13.0_45. eCollection 2016.

本文引用的文献

1
Discriminative learning for protein conformation sampling.用于蛋白质构象采样的判别式学习
Proteins. 2008 Oct;73(1):228-40. doi: 10.1002/prot.22057.
2
Nature of protein family signatures: insights from singular value analysis of position-specific scoring matrices.蛋白质家族特征的本质:来自位置特异性评分矩阵奇异值分析的见解。
PLoS One. 2008 Apr 9;3(4):e1963. doi: 10.1371/journal.pone.0001963.
3
Conrad: gene prediction using conditional random fields.康拉德:使用条件随机场进行基因预测。
Genome Res. 2007 Sep;17(9):1389-98. doi: 10.1101/gr.6558107. Epub 2007 Aug 9.
4
Probabilistic description of protein alignments for sequences and structures.序列和结构的蛋白质比对的概率描述。
Proteins. 2004 Jul 1;56(1):157-66. doi: 10.1002/prot.20067.
5
Identifying sequence-structure pairs undetected by sequence alignments.识别序列比对未检测到的序列-结构对。
Protein Eng. 2000 Jul;13(7):459-75. doi: 10.1093/protein/13.7.459.
6
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.空位BLAST和位置特异性迭代BLAST:新一代蛋白质数据库搜索程序。
Nucleic Acids Res. 1997 Sep 1;25(17):3389-402. doi: 10.1093/nar/25.17.3389.
7
Improvement of protein secondary structure prediction using binary word encoding.使用二元词编码改进蛋白质二级结构预测
Proteins. 1997 Jan;27(1):36-46. doi: 10.1002/(sici)1097-0134(199701)27:1<36::aid-prot5>3.0.co;2-l.
8
Search for the most stable folds of protein chains: II. Computation of stable architectures of beta-proteins using a self-consistent molecular field theory.寻找蛋白质链的最稳定折叠结构:II. 使用自洽分子场理论计算β-蛋白质的稳定结构
Protein Eng. 1996 May;9(5):399-411. doi: 10.1093/protein/9.5.399.
9
Search for the most stable folds of protein chains: I. Application of a self-consistent molecular field theory to a problem of protein three-dimensional structure prediction.寻找蛋白质链的最稳定折叠:I. 自洽分子场理论在蛋白质三维结构预测问题中的应用。
Protein Eng. 1996 May;9(5):387-97. doi: 10.1093/protein/9.5.387.
10
A reliable sequence alignment method based on probabilities of residue correspondences.一种基于残基对应概率的可靠序列比对方法。
Protein Eng. 1995 Oct;8(10):999-1009. doi: 10.1093/protein/8.10.999.