COMET：用于超快速HIV-1亚型鉴定的基于上下文的自适应建模

COMET: adaptive context-based modeling for ultrafast HIV-1 subtype identification.

作者信息

Struck Daniel, Lawyer Glenn, Ternes Anne-Marie, Schmit Jean-Claude, Bercoff Danielle Perez

机构信息

Laboratory of Retrovirology, CRP-Santé, 84, Val Fleuri, L-1526, Luxembourg

Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Campus E1 4, 66123 Saarbrücken, Germany.

出版信息

Nucleic Acids Res. 2014 Oct;42(18):e144. doi: 10.1093/nar/gku739. Epub 2014 Aug 12.

DOI:10.1093/nar/gku739

PMID:25120265

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4191385/

Abstract

Viral sequence classification has wide applications in clinical, epidemiological, structural and functional categorization studies. Most existing approaches rely on an initial alignment step followed by classification based on phylogenetic or statistical algorithms. Here we present an ultrafast alignment-free subtyping tool for human immunodeficiency virus type one (HIV-1) adapted from Prediction by Partial Matching compression. This tool, named COMET, was compared to the widely used phylogeny-based REGA and SCUEAL tools using synthetic and clinical HIV data sets (1,090,698 and 10,625 sequences, respectively). COMET's sensitivity and specificity were comparable to or higher than the two other subtyping tools on both data sets for known subtypes. COMET also excelled in detecting and identifying new recombinant forms, a frequent feature of the HIV epidemic. Runtime comparisons showed that COMET was almost as fast as USEARCH. This study demonstrates the advantages of alignment-free classification of viral sequences, which feature high rates of variation, recombination and insertions/deletions. COMET is free to use via an online interface.

摘要

病毒序列分类在临床、流行病学、结构和功能分类研究中有着广泛应用。大多数现有方法依赖于初始比对步骤，然后基于系统发育或统计算法进行分类。在此，我们展示了一种超快速的、无需比对的人类免疫缺陷病毒1型（HIV-1）亚型分析工具，该工具改编自基于部分匹配压缩的预测方法。这个名为COMET的工具，使用合成和临床HIV数据集（分别为1,090,698和10,625个序列），与广泛使用的基于系统发育的REGA和SCUEAL工具进行了比较。对于已知亚型，在两个数据集上，COMET的灵敏度和特异性与其他两种亚型分析工具相当或更高。COMET在检测和识别新的重组形式方面也表现出色，而重组是HIV流行的一个常见特征。运行时比较表明，COMET几乎与USEARCH一样快。这项研究证明了对具有高变异率、重组率和插入/缺失率的病毒序列进行无需比对分类的优势。可通过在线界面免费使用COMET。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb6a/4191385/57fb396f13e5/gku739fig1.jpg

相似文献

COMET: adaptive context-based modeling for ultrafast HIV-1 subtype identification.

Nucleic Acids Res. 2014 Oct;42(18):e144. doi: 10.1093/nar/gku739. Epub 2014 Aug 12.

Automated subtyping of HIV-1 genetic sequences for clinical and surveillance purposes: performance evaluation of the new REGA version 3 and seven other tools.

Infect Genet Evol. 2013 Oct;19:337-48. doi: 10.1016/j.meegid.2013.04.032. Epub 2013 May 7.

Comparative Evaluation of Subtyping Tools for Surveillance of Newly Emerging HIV-1 Strains.

J Clin Microbiol. 2017 Sep;55(9):2827-2837. doi: 10.1128/JCM.00656-17. Epub 2017 Jul 12.

An evolutionary model-based algorithm for accurate phylogenetic breakpoint mapping and subtype prediction in HIV-1.

PLoS Comput Biol. 2009 Nov;5(11):e1000581. doi: 10.1371/journal.pcbi.1000581. Epub 2009 Nov 26.

A novel alignment-free method for HIV-1 subtype classification.

Infect Genet Evol. 2020 Jan;77:104080. doi: 10.1016/j.meegid.2019.104080. Epub 2019 Nov 1.

HIV-1 subtypes B and C unique recombinant forms (URFs) and transmitted drug resistance identified in the Western Cape Province, South Africa.

PLoS One. 2014 Mar 7;9(6):e90845. doi: 10.1371/journal.pone.0090845. eCollection 2014.

Reliability of rapid subtyping tools compared to that of phylogenetic analysis for characterization of human immunodeficiency virus type 1 non-B subtypes and recombinant forms.

J Clin Microbiol. 2008 Dec;46(12):3896-9. doi: 10.1128/JCM.00515-08. Epub 2008 Oct 8.

Comparative performance of the REGA subtyping tool version 2 versus version 1.

Infect Genet Evol. 2010 Apr;10(3):380-5. doi: 10.1016/j.meegid.2009.09.020. Epub 2009 Oct 12.

[HIV-1 subtype distribution determined by phylogenetic analysis of pol gene sequences and automated subtyping tools among HIV-1 isolates from the Aegian Region of Turkey].

Mikrobiyol Bul. 2014 Jul;48(3):420-8. doi: 10.5578/mb.7609.

The heterosexual human immunodeficiency virus type 1 epidemic in Thailand is caused by an intersubtype (A/E) recombinant of African origin.

J Virol. 1996 Oct;70(10):7013-29. doi: 10.1128/JVI.70.10.7013-7029.1996.

引用本文的文献

"Big Events" and HIV Transmission Dynamics: Estimating Time Since HIV Infection from Deep Sequencing Data Among Sex Workers and Their Clients in Dnipro, Ukraine.

Viruses. 2025 Aug 21;17(8):1148. doi: 10.3390/v17081148.

The Molecular Epidemiology of HIV-1 in Russia, 1987-2023: Subtypes, Transmission Networks and Phylogenetic Story.

Pathogens. 2025 Jul 26;14(8):738. doi: 10.3390/pathogens14080738.

Phylodynamics analysis of HIV epidemic history in Belarus in 1987-2022.

Front Epidemiol. 2025 Jul 21;5:1601976. doi: 10.3389/fepid.2025.1601976. eCollection 2025.

Variation in HIV-1 Tat and Vpr protein amino acid sequences and its association with vascular health measures in a South African cohort: an exploratory study.

Virol J. 2025 Aug 4;22(1):266. doi: 10.1186/s12985-025-02891-8.

Drug resistance mutations and phylogenetic analysis of HIV-1 subtypes B and F from mothers and children with vertical transmission.

BMC Infect Dis. 2025 Jul 1;25(1):811. doi: 10.1186/s12879-025-11230-9.

Changes of methylation at enhancers appear to be essential for HIV infection progression.

Clin Epigenetics. 2025 Jun 19;17(1):104. doi: 10.1186/s13148-025-01910-3.

Retrospective Observational Study of CSF-Derived HIV-1 Tat and Vpr Amino Acid Sequences in a South African Pediatric Cohort with HIV Subtype C.

Int J Mol Sci. 2025 May 22;26(11):5008. doi: 10.3390/ijms26115008.

HIV-1 drug resistance among people living with HIV receiving dolutegravir-based anti-retroviral regimens in Uganda: a national laboratory-based survey using remnant viral load samples, 2022.

J Antimicrob Chemother. 2025 Aug 1;80(8):2126-2134. doi: 10.1093/jac/dkaf180.

Distinct Molecular Epidemiology, Transmission Patterns, and Resistance Mutations of HIV-1 Subtypes A1, A6, and A7 in Bulgaria.

Microorganisms. 2025 May 12;13(5):1108. doi: 10.3390/microorganisms13051108.

Distribution of CCR5-Delta32, CCR2-64I, and SDF1-3'A host genetic factors in HIV-infected and uninfected individuals in Luanda, Angola.

AIDS Res Ther. 2025 May 24;22(1):54. doi: 10.1186/s12981-025-00751-7.

本文引用的文献

Expanded classification of hepatitis C virus into 7 genotypes and 67 subtypes: updated criteria and genotype assignment web resource.

Hepatology. 2014 Jan;59(1):318-27. doi: 10.1002/hep.26744.

Automated subtyping of HIV-1 genetic sequences for clinical and surveillance purposes: performance evaluation of the new REGA version 3 and seven other tools.

Infect Genet Evol. 2013 Oct;19:337-48. doi: 10.1016/j.meegid.2013.04.032. Epub 2013 May 7.

HIV-1 subtype distribution and its demographic determinants in newly diagnosed patients in Europe suggest highly compartmentalized epidemics.

Retrovirology. 2013 Jan 14;10:7. doi: 10.1186/1742-4690-10-7.

One size does not fit all: on how Markov model order dictates performance of genomic sequence analyses.

Nucleic Acids Res. 2013 Feb 1;41(3):1416-24. doi: 10.1093/nar/gks1285. Epub 2012 Dec 24.

HIV-1 subtype is an independent predictor of reverse transcriptase mutation K65R in HIV-1 patients treated with combination antiretroviral therapy including tenofovir.

Antimicrob Agents Chemother. 2013 Feb;57(2):1053-6. doi: 10.1128/AAC.01668-12. Epub 2012 Nov 26.

Classification of HIV-1 sequences using profile Hidden Markov Models.

PLoS One. 2012;7(5):e36566. doi: 10.1371/journal.pone.0036566. Epub 2012 May 18.

Alignment methods: strategies, challenges, benchmarking, and comparative overview.

Methods Mol Biol. 2012;855:203-35. doi: 10.1007/978-1-61779-582-4_7.

Global trends in molecular epidemiology of HIV-1 during 2000-2007.

AIDS. 2011 Mar 13;25(5):679-89. doi: 10.1097/QAD.0b013e328342ff93.

Search and clustering orders of magnitude faster than BLAST.

Bioinformatics. 2010 Oct 1;26(19):2460-1. doi: 10.1093/bioinformatics/btq461. Epub 2010 Aug 12.

MS4--Multi-Scale Selector of Sequence Signatures: an alignment-free method for classification of biological sequences.

BMC Bioinformatics. 2010 Jul 30;11:406. doi: 10.1186/1471-2105-11-406.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

COMET：用于超快速HIV-1亚型鉴定的基于上下文的自适应建模

COMET: adaptive context-based modeling for ultrafast HIV-1 subtype identification.

作者信息

Struck Daniel, Lawyer Glenn, Ternes Anne-Marie, Schmit Jean-Claude, Bercoff Danielle Perez

机构信息

Laboratory of Retrovirology, CRP-Santé, 84, Val Fleuri, L-1526, Luxembourg

Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Campus E1 4, 66123 Saarbrücken, Germany.

出版信息

Nucleic Acids Res. 2014 Oct;42(18):e144. doi: 10.1093/nar/gku739. Epub 2014 Aug 12.

DOI:10.1093/nar/gku739

PMID:25120265

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4191385/

Abstract

摘要

COMET：用于超快速HIV-1亚型鉴定的基于上下文的自适应建模

COMET: adaptive context-based modeling for ultrafast HIV-1 subtype identification.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

COMET：用于超快速HIV-1亚型鉴定的基于上下文的自适应建模

COMET: adaptive context-based modeling for ultrafast HIV-1 subtype identification.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献