• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质隐马尔可夫模型的转移先验:迈向最大区分度的实证研究

transition priors for protein hidden Markov models: an empirical study towards maximum discrimination.

作者信息

Wistrand Markus, Sonnhammer Erik L L

机构信息

Center for Genomics and Bioinformatics, Karolinska Institutet, S-17177 Stockholm, Sweden.

出版信息

J Comput Biol. 2004;11(1):181-93. doi: 10.1089/106652704773416957.

DOI:10.1089/106652704773416957
PMID:15072695
Abstract

Insertions and deletions in a profile hidden Markov model (HMM) are modeled by transition probabilities between insert, delete and match states. These are estimated by combining observed data and prior probabilities. The transition prior probabilities can be defined either ad hoc or by maximum likelihood (ML) estimation. We show that the choice of transition prior greatly affects the HMM's ability to discriminate between true and false hits. HMM discrimination was measured using the HMMER 2.2 package applied to 373 families from Pfam. We measured the discrimination between true members and noise sequences employing various ML transition priors and also systematically scanned the parameter space of ad hoc transition priors. Our results indicate that ML priors produce far from optimal discrimination, and we present an empirically derived prior that considerably decreases the number of misclassifications compared to ML. Most of the difference stems from the probabilities for exiting a delete state. The ML prior, which is unaware of noise sequences, estimates a delete-to-delete probability that is relatively high and does not penalize noise sequences enough for optimal discrimination.

摘要

在隐马尔可夫模型(HMM)中,插入和缺失是通过插入、删除和匹配状态之间的转移概率来建模的。这些概率是通过结合观测数据和先验概率来估计的。转移先验概率既可以临时定义,也可以通过最大似然(ML)估计来定义。我们表明,转移先验的选择极大地影响了HMM区分真实命中和错误命中的能力。使用应用于Pfam中373个家族的HMMER 2.2软件包来测量HMM的区分能力。我们使用各种ML转移先验来测量真实成员与噪声序列之间的区分能力,并系统地扫描临时转移先验的参数空间。我们的结果表明,ML先验产生的区分能力远非最优,并且我们提出了一种根据经验得出的先验,与ML相比,它大大减少了错误分类的数量。大部分差异源于退出删除状态的概率。不考虑噪声序列的ML先验估计出的删除到删除概率相对较高,并且对噪声序列的惩罚不足以实现最优区分。

相似文献

1
transition priors for protein hidden Markov models: an empirical study towards maximum discrimination.蛋白质隐马尔可夫模型的转移先验:迈向最大区分度的实证研究
J Comput Biol. 2004;11(1):181-93. doi: 10.1089/106652704773416957.
2
Improving profile HMM discrimination by adapting transition probabilities.通过调整转移概率来提高轮廓隐马尔可夫模型的辨别能力。
J Mol Biol. 2004 May 7;338(4):847-54. doi: 10.1016/j.jmb.2004.03.023.
3
Using hidden Markov models to align multiple sequences.使用隐马尔可夫模型对多个序列进行比对。
Cold Spring Harb Protoc. 2009 Jul;2009(7):pdb.top41. doi: 10.1101/pdb.top41.
4
Improved profile HMM performance by assessment of critical algorithmic features in SAM and HMMER.通过评估SAM和HMMER中的关键算法特征提高轮廓隐马尔可夫模型性能。
BMC Bioinformatics. 2005 Apr 15;6:99. doi: 10.1186/1471-2105-6-99.
5
Calibrating E-values for hidden Markov models using reverse-sequence null models.使用反向序列空模型校准隐马尔可夫模型的E值。
Bioinformatics. 2005 Nov 15;21(22):4107-15. doi: 10.1093/bioinformatics/bti629. Epub 2005 Aug 25.
6
HMM-ModE--improved classification using profile hidden Markov models by optimising the discrimination threshold and modifying emission probabilities with negative training sequences.HMM-ModE——通过优化判别阈值并利用负训练序列修改发射概率,使用轮廓隐马尔可夫模型改进分类。
BMC Bioinformatics. 2007 Mar 27;8:104. doi: 10.1186/1471-2105-8-104.
7
HMMEditor: a visual editing tool for profile hidden Markov model.HMMEditor:一种用于轮廓隐马尔可夫模型的可视化编辑工具。
BMC Genomics. 2008;9 Suppl 1(Suppl 1):S8. doi: 10.1186/1471-2164-9-S1-S8.
8
Sequence alignments and pair hidden Markov models using evolutionary history.使用进化历史的序列比对和配对隐马尔可夫模型。
J Mol Biol. 2003 Oct 17;333(2):453-60. doi: 10.1016/j.jmb.2003.08.015.
9
Combined prediction of transmembrane topology and signal peptide of beta-barrel proteins: using a hidden Markov model and genetic algorithms.β-桶状蛋白跨膜拓扑结构和信号肽的联合预测:使用隐马尔可夫模型和遗传算法。
Comput Biol Med. 2010 Jul;40(7):621-8. doi: 10.1016/j.compbiomed.2010.04.006. Epub 2010 May 21.
10
Probabilistic models for biological sequences: selection and Maximum Likelihood estimation.生物序列的概率模型:选择与最大似然估计。
Int J Bioinform Res Appl. 2006;2(3):305-24. doi: 10.1504/IJBRA.2006.010607.

引用本文的文献

1
A jumping profile Hidden Markov Model and applications to recombination sites in HIV and HCV genomes.一种跳跃轮廓隐马尔可夫模型及其在HIV和HCV基因组重组位点中的应用。
BMC Bioinformatics. 2006 May 22;7:265. doi: 10.1186/1471-2105-7-265.
2
Improved profile HMM performance by assessment of critical algorithmic features in SAM and HMMER.通过评估SAM和HMMER中的关键算法特征提高轮廓隐马尔可夫模型性能。
BMC Bioinformatics. 2005 Apr 15;6:99. doi: 10.1186/1471-2105-6-99.