• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过蒙特卡罗聚类重建病毒变体。

Reconstruction of Viral Variants via Monte Carlo Clustering.

机构信息

Department of Computer Science and Georgia State University, Atlanta, Georgia, USA.

Department of Mathematics and Statistics, Georgia State University, Atlanta, Georgia, USA.

出版信息

J Comput Biol. 2023 Sep;30(9):1009-1018. doi: 10.1089/cmb.2023.0154. Epub 2023 Sep 11.

DOI:10.1089/cmb.2023.0154
PMID:37695837
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10518690/
Abstract

Identifying viral variants through clustering is essential for understanding the composition and structure of viral populations within and between hosts, which play a crucial role in disease progression and epidemic spread. This article proposes and validates novel Monte Carlo (MC) methods for clustering aligned viral sequences by minimizing either entropy or Hamming distance from consensuses. We validate these methods on four benchmarks: two SARS-CoV-2 interhost data sets and two HIV intrahost data sets. A parallelized version of our tool is scalable to very large data sets. We show that both entropy and Hamming distance-based MC clusterings discern the meaningful information from sequencing data. The proposed clustering methods consistently converge to similar clusterings across different runs. Finally, we show that MC clustering improves reconstruction of intrahost viral population from sequencing data.

摘要

通过聚类来识别病毒变体对于理解宿主内和宿主间病毒群体的组成和结构至关重要,这些结构在疾病进展和疫情传播中起着关键作用。本文提出并验证了通过最小化共识的熵或汉明距离来对对齐的病毒序列进行聚类的新型蒙特卡罗 (MC) 方法。我们在四个基准上验证了这些方法:两个 SARS-CoV-2 宿主间数据集和两个 HIV 宿主内数据集。我们工具的并行版本可扩展到非常大的数据集。我们表明,基于熵和汉明距离的 MC 聚类都可以从测序数据中辨别出有意义的信息。所提出的聚类方法在不同的运行中始终收敛到相似的聚类。最后,我们表明 MC 聚类可改善从测序数据重建宿主内病毒群体。

相似文献

1
Reconstruction of Viral Variants via Monte Carlo Clustering.通过蒙特卡罗聚类重建病毒变体。
J Comput Biol. 2023 Sep;30(9):1009-1018. doi: 10.1089/cmb.2023.0154. Epub 2023 Sep 11.
2
From Alpha to Zeta: Identifying Variants and Subtypes of SARS-CoV-2 Via Clustering.从 Alpha 到 Zeta:通过聚类识别 SARS-CoV-2 的变体和亚型。
J Comput Biol. 2021 Nov;28(11):1113-1129. doi: 10.1089/cmb.2021.0302. Epub 2021 Oct 25.
3
Convex hulls in hamming space enable efficient search for similarity and clustering of genomic sequences.哈明空间中的凸包可实现对基因组序列的相似性和聚类的高效搜索。
BMC Bioinformatics. 2020 Dec 30;21(Suppl 18):482. doi: 10.1186/s12859-020-03811-z.
4
Identification of a High-Frequency Intrahost SARS-CoV-2 Spike Variant with Enhanced Cytopathic and Fusogenic Effects.鉴定具有增强致细胞病变和融合作用的 SARS-CoV-2 高频宿主内刺突变异株。
mBio. 2021 Jun 29;12(3):e0078821. doi: 10.1128/mBio.00788-21.
5
Improved SARS-CoV-2 sequencing surveillance allows the identification of new variants and signatures in infected patients.改进的 SARS-CoV-2 测序监测可识别感染患者中的新变体和特征。
Genome Med. 2022 Aug 12;14(1):90. doi: 10.1186/s13073-022-01098-8.
6
Reads2Vec: Efficient Embedding of Raw High-Throughput Sequencing Reads Data.Reads2Vec:高效嵌入原始高通量测序读取数据。
J Comput Biol. 2023 Apr;30(4):469-491. doi: 10.1089/cmb.2022.0424. Epub 2023 Feb 2.
7
An entropy-based study on the mutational landscape of SARS-CoV-2 in USA: Comparing different variants and revealing co-mutational behavior of proteins.基于熵的美国 SARS-CoV-2 突变景观研究:比较不同变体并揭示蛋白质的共突变行为。
Gene. 2024 Sep 5;922:148556. doi: 10.1016/j.gene.2024.148556. Epub 2024 May 14.
8
Deep-sequencing of viral genomes from a large and diverse cohort of treatment-naive HIV-infected persons shows associations between intrahost genetic diversity and viral load.对来自大量不同未经治疗的 HIV 感染者的病毒基因组进行深度测序,显示了宿主内遗传多样性与病毒载量之间的关联。
PLoS Comput Biol. 2023 Jan 3;19(1):e1010756. doi: 10.1371/journal.pcbi.1010756. eCollection 2023 Jan.
9
New Surveillance Metrics for Alerting Community-Acquired Outbreaks of Emerging SARS-CoV-2 Variants Using Imported Case Data: Bayesian Markov Chain Monte Carlo Approach.利用输入病例数据进行新型 SARS-CoV-2 变异株社区获得性暴发预警的新监测指标:贝叶斯马尔可夫链蒙特卡罗方法。
JMIR Public Health Surveill. 2022 Nov 25;8(11):e40866. doi: 10.2196/40866.
10
Benchmarking and Assessment of Eight Genome Assemblers on Viral Next-Generation Sequencing Data, Including the SARS-CoV-2.对包括 SARS-CoV-2 在内的病毒下一代测序数据的八种基因组组装器的基准测试和评估。
OMICS. 2022 Jul;26(7):372-381. doi: 10.1089/omi.2022.0042. Epub 2022 Jun 28.

本文引用的文献

1
GISAID's Role in Pandemic Response.全球流感共享数据库(GISAID)在大流行应对中的作用。
China CDC Wkly. 2021 Dec 3;3(49):1049-1051. doi: 10.46234/ccdcw2021.255.
2
From Alpha to Zeta: Identifying Variants and Subtypes of SARS-CoV-2 Via Clustering.从 Alpha 到 Zeta:通过聚类识别 SARS-CoV-2 的变体和亚型。
J Comput Biol. 2021 Nov;28(11):1113-1129. doi: 10.1089/cmb.2021.0302. Epub 2021 Oct 25.
3
Accurate assembly of minority viral haplotypes from next-generation sequencing through efficient noise reduction.通过有效降低噪声,实现下一代测序中少数病毒单倍型的精确组装。
Nucleic Acids Res. 2021 Sep 27;49(17):e102. doi: 10.1093/nar/gkab576.
4
SARS-CoV-2 Molecular Transmission Clusters and Containment Measures in Ten European Regions during the First Pandemic Wave.新冠疫情第一波期间欧洲十个地区的新冠病毒分子传播集群及防控措施
Life (Basel). 2021 Mar 9;11(3):219. doi: 10.3390/life11030219.
5
Using earth mover's distance for viral outbreak investigations.利用推土机距离进行病毒爆发调查。
BMC Genomics. 2020 Dec 16;21(Suppl 5):582. doi: 10.1186/s12864-020-06982-4.
6
QUENTIN: reconstruction of disease transmissions from viral quasispecies genomic data.从病毒准种基因组数据重建疾病传播。
Bioinformatics. 2018 Jan 1;34(1):163-170. doi: 10.1093/bioinformatics/btx402.
7
Inference of genetic relatedness between viral quasispecies from sequencing data.从测序数据推断病毒准种之间的遗传相关性。
BMC Genomics. 2017 Dec 6;18(Suppl 10):918. doi: 10.1186/s12864-017-4274-5.
8
Accurate Genetic Detection of Hepatitis C Virus Transmissions in Outbreak Settings.在疫情环境中对丙型肝炎病毒传播进行准确的基因检测。
J Infect Dis. 2016 Mar 15;213(6):957-65. doi: 10.1093/infdis/jiv542. Epub 2015 Nov 17.
9
Antigenic cooperation among intrahost HCV variants organized into a complex network of cross-immunoreactivity.宿主体内丙型肝炎病毒(HCV)变体之间的抗原协同作用构成了一个交叉免疫反应的复杂网络。
Proc Natl Acad Sci U S A. 2015 May 26;112(21):6653-8. doi: 10.1073/pnas.1422942112. Epub 2015 May 4.
10
Drug resistance of a viral population and its individual intrahost variants during the first 48 hours of therapy.病毒群体及其个体在治疗的头 48 小时内的耐药性。
Clin Pharmacol Ther. 2014 Jun;95(6):627-35. doi: 10.1038/clpt.2014.20. Epub 2014 Jan 31.