Decoding HMMs using the k best paths: algorithms and applications.

Affiliation

Cheriton School of Computer Science, University of Waterloo, 200 University Avenue W, Waterloo, Ontario, Canada N2L 3G1.

Publication information

BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S28. doi: 10.1186/1471-2105-11-S1-S28.

DOI:10.1186/1471-2105-11-S1-S28

PMID:20122200

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC3009499/

Abstract

BACKGROUND

Traditional algorithms for hidden Markov model decoding seek to maximize either the probability of a state path or the number of positions of a sequence assigned to the correct state. These algorithms provide only a single answer and in practice do not produce good results.

RESULTS

We explore an alternative approach in which we efficiently compute the k highest-probability paths that explain a sequence, and then use those paths either to explore alternative explanations for the sequence or to combine them into a single explanation. Our procedure uses an online pruning technique to reduce usage of primary memory.
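
The idea of computing the k most probable state paths can be illustrated with a minimal sketch. It assumes a fully specified HMM with strictly positive probabilities, and it is the naive variant: every back-pointer is kept for every position, so the online pruning mentioned in the abstract is not included. The function name `k_best_paths`, the two-state toy model (states `M` and `L`, symbols `h` and `p`), and all of its probabilities are hypothetical and chosen only for illustration.

```python
import heapq
import math


def k_best_paths(obs, states, log_start, log_trans, log_emit, k):
    """Return up to k (log_probability, path) pairs, most probable first.

    Naive k-best Viterbi: for every position and state, keep the k best
    partial paths ending there, each with a back-pointer
    (previous state, rank of the partial path within that state).
    """
    # layers[t][s] is a list of (log_prob, prev_state, prev_rank), best first.
    layers = [{s: [(log_start[s] + log_emit[s][obs[0]], None, None)]
               for s in states}]
    for t in range(1, len(obs)):
        prev = layers[-1]
        layer = {}
        for s in states:
            candidates = [
                (lp + log_trans[r][s] + log_emit[s][obs[t]], r, rank)
                for r in states
                for rank, (lp, _, _) in enumerate(prev[r])
            ]
            layer[s] = heapq.nlargest(k, candidates, key=lambda c: c[0])
        layers.append(layer)

    # Pick the k best complete paths over all final states.
    finals = [(lp, s, rank)
              for s in states
              for rank, (lp, _, _) in enumerate(layers[-1][s])]
    results = []
    for lp, s, rank in heapq.nlargest(k, finals, key=lambda c: c[0]):
        path = []
        # Follow back-pointers from the last position to the first.
        for t in range(len(obs) - 1, -1, -1):
            path.append(s)
            _, s, rank = layers[t][s][rank]
        results.append((lp, path[::-1]))
    return results


def to_log(table):
    """Convert a nested dict of probabilities to log-probabilities."""
    return {a: {b: math.log(p) for b, p in row.items()}
            for a, row in table.items()}


# Hypothetical toy model: M = membrane, L = loop; h = hydrophobic, p = polar.
STATES = ["M", "L"]
LOG_START = {"M": math.log(0.5), "L": math.log(0.5)}
LOG_TRANS = to_log({"M": {"M": 0.8, "L": 0.2}, "L": {"M": 0.3, "L": 0.7}})
LOG_EMIT = to_log({"M": {"h": 0.9, "p": 0.1}, "L": {"h": 0.2, "p": 0.8}})
OBS = "hhp"

if __name__ == "__main__":
    for log_p, path in k_best_paths(OBS, STATES, LOG_START, LOG_TRANS, LOG_EMIT, 3):
        print("".join(path), math.exp(log_p))
```

With k = 1 the sketch reduces to ordinary Viterbi decoding. Memory grows with k times the number of states times the sequence length, which is the cost a pruning scheme would aim to reduce.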

CONCLUSION

Our algorithm uses much less memory than the naive approach. For membrane proteins, even simple path combination algorithms give good explanations, and examining the paths being combined also gives a sense of confidence in the explanation. For proteins with two topologies, the k best paths can give insight into both correct explanations of a sequence, a feature lacking from traditional algorithms in this domain.
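
As an illustration of what a simple path combination might look like, the sketch below takes a position-wise majority vote over a set of equal-length state paths and reports, for each position, the fraction of paths that agree with the consensus. The abstract does not specify which combination rules the authors use, so the voting rule, the agreement score, and the name `combine_paths` are assumptions made for this example only.

```python
from collections import Counter


def combine_paths(paths):
    """Position-wise majority vote over equal-length state paths.

    Returns (consensus, agreement), where agreement[i] is the fraction of
    input paths that carry the consensus label at position i. Ties go to
    the label seen first, i.e. the one from the earliest path in the list.
    """
    if not paths or len({len(p) for p in paths}) != 1:
        raise ValueError("need at least one path, all of equal length")
    consensus, agreement = [], []
    for column in zip(*paths):
        label, count = Counter(column).most_common(1)[0]
        consensus.append(label)
        agreement.append(count / len(paths))
    return consensus, agreement


# Hypothetical input: three state paths over labels M (membrane) and L (loop).
EXAMPLE_PATHS = [list("MML"), list("MMM"), list("MLL")]

if __name__ == "__main__":
    labels, support = combine_paths(EXAMPLE_PATHS)
    print("".join(labels), support)
```

Positions with low agreement mark where the candidate paths disagree, which is one way to read a rough confidence off the set of paths. A position-wise vote can produce a labelling that is not itself a valid path in the model, so a practical combiner may need an extra consistency check.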


