New decoding algorithms for Hidden Markov Models using distance measures on labellings.

Affiliation

David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, ON N2L 3G1, Canada.

Publication information

BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S40. doi: 10.1186/1471-2105-11-S1-S40.

Abstract

BACKGROUND

Existing hidden Markov model decoding algorithms do not focus on approximately identifying the sequence feature boundaries.

RESULTS

We give a set of algorithms that compute the conditional probability of all labellings "near" a reference labelling lambda for a sequence y, under a variety of definitions of "near". In addition, we give optimization algorithms that find the best labelling for a sequence in the robust sense of having all of its feature boundaries nearly correct. Natural problems in this domain are NP-hard to optimize. For membrane proteins, our algorithms find the approximate topology with success comparable to existing programs, while being substantially more accurate in estimating the positions of transmembrane helix boundaries.

CONCLUSION

More robust HMM decoding may allow for better analysis of sequence features, in reasonable runtimes.
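
As a rough illustration of the first result, the following sketch computes the conditional probability that a labelling lies "near" a reference labelling. It is not the authors' algorithm. It assumes one particular definition of "near" (Hamming distance at most d, which the abstract does not name), and the function name `near_probability`, the two-state model, and all parameter values are hypothetical. The idea is a forward recursion that also tracks how many positions disagree with the reference so far.

```python
def near_probability(y, ref, d, init, trans, emit, label):
    """P(Hamming(labelling, ref) <= d | y) for a small HMM (illustrative sketch).

    y      : observed symbols, as integer indices into the emission rows
    ref    : reference labelling, one label per position of y
    d      : maximum number of positions allowed to differ from ref
    init   : init[s] = probability of starting in state s
    trans  : trans[r][s] = probability of moving from state r to state s
    emit   : emit[s][a] = probability that state s emits symbol a
    label  : label[s] = label attached to state s
    """
    if len(ref) != len(y):
        raise ValueError("ref and y must have the same length")
    n = len(init)

    def miss(s, t):
        # 1 if state s disagrees with the reference label at position t
        return 0 if label[s] == ref[t] else 1

    # near[s][k]: joint probability of the prefix, ending in state s,
    # with exactly k mismatches against ref (only k <= d is kept).
    near = [[0.0] * (d + 1) for _ in range(n)]
    total = [0.0] * n  # ordinary forward values, used to normalise by P(y)
    for s in range(n):
        p = init[s] * emit[s][y[0]]
        total[s] = p
        if miss(s, 0) <= d:
            near[s][miss(s, 0)] = p

    for t in range(1, len(y)):
        new_near = [[0.0] * (d + 1) for _ in range(n)]
        new_total = [0.0] * n
        for s in range(n):
            e = emit[s][y[t]]
            m = miss(s, t)
            new_total[s] = e * sum(total[r] * trans[r][s] for r in range(n))
            for k in range(m, d + 1):
                new_near[s][k] = e * sum(
                    near[r][k - m] * trans[r][s] for r in range(n)
                )
        near, total = new_near, new_total

    return sum(sum(row) for row in near) / sum(total)


# Hypothetical two-state model: state 0 labelled "H" (helix), state 1 "L" (loop).
init = [0.5, 0.5]
trans = [[0.9, 0.1], [0.2, 0.8]]
emit = [[0.8, 0.2], [0.3, 0.7]]
label = ["H", "L"]
y = [0, 0, 1, 1, 0]
ref = "HHLLH"

for d in range(len(y) + 1):
    print(d, near_probability(y, ref, d, init, trans, emit, label))
```

The recursion costs O(n · d · S²) time for a sequence of length n and S states. Setting d to the sequence length places no restriction on the labelling, so the result is 1, and d = 0 recovers the posterior probability of the reference labelling itself. Distances defined on feature boundaries, which are the focus of the abstract, would need a different bookkeeping state than the mismatch counter used here.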

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a0d/3009513/0e132db19c5c/1471-2105-11-S1-S40-1.jpg
