Suppr超能文献

基于轮廓隐马尔可夫模型的 HIV-1 序列分类。

Classification of HIV-1 sequences using profile Hidden Markov Models.

机构信息

School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, India.

出版信息

PLoS One. 2012;7(5):e36566. doi: 10.1371/journal.pone.0036566. Epub 2012 May 18.

Abstract

Accurate classification of HIV-1 subtypes is essential for studying the dynamic spatial distribution pattern of HIV-1 subtypes and also for developing effective methods of treatment that can be targeted to attack specific subtypes. We propose a classification method based on profile Hidden Markov Model that can accurately identify an unknown strain. We show that a standard method that relies on the construction of a positive training set only, to capture unique features associated with a particular subtype, can accurately classify sequences belonging to all subtypes except B and D. We point out the drawbacks of the standard method; namely, an arbitrary choice of threshold to distinguish between true positives and true negatives, and the inability to discriminate between closely related subtypes. We then propose an improved classification method based on construction of a positive as well as a negative training set to improve discriminating ability between closely related subtypes like B and D. Finally, we show how the improved method can be used to accurately determine the subtype composition of Common Recombinant Forms of the virus that are made up of two or more subtypes. Our method provides a simple and highly accurate alternative to other classification methods and will be useful in accurately annotating newly sequenced HIV-1 strains.

摘要

准确的 HIV-1 亚型分类对于研究 HIV-1 亚型的动态空间分布模式以及开发针对特定亚型的有效治疗方法至关重要。我们提出了一种基于轮廓隐马尔可夫模型的分类方法,可以准确识别未知株。我们表明,仅依靠构建阳性训练集来捕获与特定亚型相关的独特特征的标准方法可以准确地对除 B 和 D 以外的所有亚型的序列进行分类。我们指出了标准方法的缺点;即,任意选择阈值来区分真正的阳性和真正的阴性,以及无法区分密切相关的亚型。然后,我们提出了一种改进的分类方法,基于构建阳性和阴性训练集,以提高对 B 和 D 等密切相关亚型的区分能力。最后,我们展示了如何使用改进的方法来准确确定由两个或多个亚型组成的常见重组形式病毒的亚型组成。我们的方法为其他分类方法提供了一种简单且高度准确的替代方法,将有助于准确注释新测序的 HIV-1 株。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0746/3356369/5d03006de9f8/pone.0036566.g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验