Suppr超能文献

提高标准隐马尔可夫模型算法的技术教程。

A tutorial of techniques for improving standard Hidden Markov Model algorithms.

作者信息

Golod Daniil, Brown Daniel G

机构信息

David R. Cheriton School of Computer Science, University of Waterloo, 200 University Avenue West, Waterloo, Ontario, N2L 3G1, Canada.

出版信息

J Bioinform Comput Biol. 2009 Aug;7(4):737-54. doi: 10.1142/s0219720009004242.

Abstract

In this tutorial, we discuss two main algorithms for Hidden Markov Models or HMMs: the Viterbi algorithm and the expectation phase of the Baum-Welch algorithm, and we describe ways to improve their naïve implementations. For the Baum-Welch algorithm we first present an implementation of the expectation computations using constant space. We then discuss the classical implementation of this calculation and describe ways to reduce its space usage to logarithmic and O(square root n), with their respective CPU costs. We also note where each respective algorithm can be parallelized. For the Viterbi algorithm, we describe O(square root n) and logarithmic space algorithms which increase CPU use by a factor of two and by a logarithmic factor respectively. We also present two recent heuristics for decreasing space use, which in practice lead to logarithmic space use. Classical version of Viterbi cannot be parallelized by splitting sequence in several subsequences, but we show a parallelization that works if we are willing to pay a significant extra CPU cost. Finally we show a very simple parallelization trick which enables full usage of multiple CPUs/cores under the condition that they share memory.

摘要

在本教程中,我们将讨论隐马尔可夫模型(HMM)的两种主要算法:维特比算法和鲍姆-韦尔奇算法的期望阶段,并描述改进其朴素实现的方法。对于鲍姆-韦尔奇算法,我们首先给出一种使用常量空间进行期望计算的实现。然后我们讨论该计算的经典实现,并描述将其空间使用减少到对数级和O(根号n)的方法,以及它们各自的CPU成本。我们还指出了每种算法可以并行化的地方。对于维特比算法,我们描述了O(根号n)和对数空间算法,它们分别将CPU使用增加了两倍和对数因子。我们还介绍了两种最近用于减少空间使用的启发式方法,在实际应用中可实现对数空间使用。维特比算法的经典版本不能通过将序列拆分为几个子序列来并行化,但我们展示了一种并行化方法,前提是我们愿意支付显著的额外CPU成本。最后,我们展示了一个非常简单的并行化技巧,在多个CPU/核心共享内存的条件下能够充分利用它们。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验