• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

VILOCA:用于短读长和长读长数据的测序质量感知病毒单倍型重建及突变检测

VILOCA: sequencing quality-aware viral haplotype reconstruction and mutation calling for short-read and long-read data.

作者信息

Fuhrmann Lara, Langer Benjamin, Topolsky Ivan, Beerenwinkel Niko

机构信息

Department of Biosystems Science and Engineering, ETH Zurich, Klingelbergstrasse 48, Basel 4056, Switzerland.

SIB Swiss Institute of Bioinformatics, Quartier Sorge - Bâtiment Amphipôle, Lausanne 1015, Switzerland.

出版信息

NAR Genom Bioinform. 2024 Nov 28;6(4):lqae152. doi: 10.1093/nargab/lqae152. eCollection 2024 Dec.

DOI:10.1093/nargab/lqae152
PMID:39633724
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11616694/
Abstract

RNA viruses exist as large heterogeneous populations within their host. The structure and diversity of virus populations affects disease progression and treatment outcomes. Next-generation sequencing allows detailed viral population analysis, but inferring diversity from error-prone reads is challenging. Here, we present VILOCA (VIral LOcal haplotype reconstruction and mutation CAlling for short and long read data), a method for mutation calling and reconstruction of local haplotypes from short- and long-read viral sequencing data. Local haplotypes refer to genomic regions that have approximately the length of the input reads. VILOCA recovers local haplotypes by using a Dirichlet process mixture model to cluster reads around their unobserved haplotypes and leveraging quality scores of the sequencing reads. We assessed the performance of VILOCA in terms of mutation calling and haplotype reconstruction accuracy on simulated and experimental Illumina, PacBio and Oxford Nanopore data. On simulated and experimental Illumina data, VILOCA performed better or similar to existing methods. On the simulated long-read data, VILOCA is able to recover on average [Formula: see text] of the ground truth mutations with perfect precision compared to only [Formula: see text] recall and [Formula: see text] precision of the second-best method. In summary, VILOCA provides significantly improved accuracy in mutation and haplotype calling, especially for long-read sequencing data, and therefore facilitates the comprehensive characterization of heterogeneous within-host viral populations.

摘要

RNA病毒在其宿主内以高度异质的群体形式存在。病毒群体的结构和多样性会影响疾病进展和治疗结果。新一代测序技术允许进行详细的病毒群体分析,但从容易出错的读数中推断多样性具有挑战性。在这里,我们提出了VILOCA(用于短读长和长读长数据的病毒局部单倍型重建和突变检测),一种从短读长和长读长病毒测序数据中进行突变检测和局部单倍型重建的方法。局部单倍型是指长度与输入读数大致相同的基因组区域。VILOCA通过使用狄利克雷过程混合模型将读数聚类在其未观察到的单倍型周围,并利用测序读数的质量得分来恢复局部单倍型。我们在模拟和实验的Illumina、PacBio和牛津纳米孔数据上评估了VILOCA在突变检测和单倍型重建准确性方面表现。在模拟和实验的Illumina数据上,VILOCA的表现优于或类似于现有方法。在模拟的长读长数据上,与第二好的方法仅70%的召回率和73%的精确率相比,VILOCA能够以完美的精确率平均恢复92%的真实突变。总之,VILOCA在突变和单倍型检测方面提供了显著提高的准确性,特别是对于长读长测序数据,因此有助于对宿主内异质病毒群体进行全面表征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6163/11616694/3da56543b565/lqae152fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6163/11616694/c6f2770a1184/lqae152figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6163/11616694/c18ff5df11be/lqae152fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6163/11616694/c9e9d861116d/lqae152fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6163/11616694/7d93c00dd49f/lqae152fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6163/11616694/0328868d5cc2/lqae152fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6163/11616694/ca77e24b6d4e/lqae152fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6163/11616694/3da56543b565/lqae152fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6163/11616694/c6f2770a1184/lqae152figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6163/11616694/c18ff5df11be/lqae152fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6163/11616694/c9e9d861116d/lqae152fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6163/11616694/7d93c00dd49f/lqae152fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6163/11616694/0328868d5cc2/lqae152fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6163/11616694/ca77e24b6d4e/lqae152fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6163/11616694/3da56543b565/lqae152fig6.jpg

相似文献

1
VILOCA: sequencing quality-aware viral haplotype reconstruction and mutation calling for short-read and long-read data.VILOCA:用于短读长和长读长数据的测序质量感知病毒单倍型重建及突变检测
NAR Genom Bioinform. 2024 Nov 28;6(4):lqae152. doi: 10.1093/nargab/lqae152. eCollection 2024 Dec.
2
Evaluation of haplotype callers for next-generation sequencing of viruses.病毒下一代测序中单体型caller 的评估。
Infect Genet Evol. 2020 Aug;82:104277. doi: 10.1016/j.meegid.2020.104277. Epub 2020 Mar 6.
3
Leveraging reads that span multiple single nucleotide polymorphisms for haplotype inference from sequencing data.利用跨越多个单核苷酸多态性的读取信息,从测序数据中推断单倍型。
Bioinformatics. 2013 Sep 15;29(18):2245-52. doi: 10.1093/bioinformatics/btt386. Epub 2013 Jul 3.
4
Haplotype-aware variant calling with PEPPER-Margin-DeepVariant enables high accuracy in nanopore long-reads.使用 PEPPER-Margin-DeepVariant 进行单体型感知变异调用可实现纳米孔长读段的高精度。
Nat Methods. 2021 Nov;18(11):1322-1332. doi: 10.1038/s41592-021-01299-w. Epub 2021 Nov 1.
5
Local read haplotagging enables accurate long-read small variant calling.局部读取单倍型标记可实现准确的长读长小变异检测。
bioRxiv. 2023 Sep 12:2023.09.07.556731. doi: 10.1101/2023.09.07.556731.
6
Long-Read Metagenomics Improves the Recovery of Viral Diversity from Complex Natural Marine Samples.长读宏基因组提高了从复杂自然海洋样本中病毒多样性的恢复。
mSystems. 2022 Jun 28;7(3):e0019222. doi: 10.1128/msystems.00192-22. Epub 2022 Jun 13.
7
Local read haplotagging enables accurate long-read small variant calling.局部读取标签化可实现长读长小型变异calling 的准确性。
Nat Commun. 2024 Jul 13;15(1):5907. doi: 10.1038/s41467-024-50079-5.
8
Deep sequencing of a genetically heterogeneous sample: local haplotype reconstruction and read error correction.基因异质样本的深度测序:局部单倍型重建与读段错误校正
J Comput Biol. 2010 Mar;17(3):417-28. doi: 10.1089/cmb.2009.0164.
9
Benchmarking reveals superiority of deep learning variant callers on bacterial nanopore sequence data.基准测试显示深度学习变异调用程序在细菌纳米孔测序数据上的优越性。
Elife. 2024 Oct 10;13:RP98300. doi: 10.7554/eLife.98300.
10
Adaptation of Oxford Nanopore technology for hepatitis C whole genome sequencing and identification of within-host viral variants.牛津纳米孔技术在丙型肝炎全基因组测序及宿主内病毒变异体鉴定中的应用。
BMC Genomics. 2021 Mar 2;22(1):148. doi: 10.1186/s12864-021-07460-1.

本文引用的文献

1
V-pipe 3.0: a sustainable pipeline for within-sample viral genetic diversity estimation.V-pipe 3.0:一种用于样本内病毒遗传多样性估计的可持续性管道。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae065.
2
Parallel evolution and enhanced virulence upon passage of an RNA virus in .一种RNA病毒传代后的平行进化与毒力增强
Virus Evol. 2023 Dec 15;9(2):vead074. doi: 10.1093/ve/vead074. eCollection 2023.
3
Surveillance of SARS-CoV-2, rotavirus, norovirus genogroup II, and human adenovirus in wastewater as an epidemiological tool to anticipate outbreaks of COVID-19 and acute gastroenteritis in a city without a wastewater treatment plant in the Peruvian Highlands.
在秘鲁高地没有污水处理厂的城市中,利用废水监测 SARS-CoV-2、轮状病毒、诺如病毒基因 II 组和人腺病毒,作为预测 COVID-19 和急性胃肠炎爆发的流行病学工具。
Sci Total Environ. 2023 Dec 20;905:167161. doi: 10.1016/j.scitotenv.2023.167161. Epub 2023 Sep 18.
4
Wastewater surveillance of SARS-CoV-2 genomic populations on a country-wide scale through targeted sequencing.通过靶向测序对全国范围内的 SARS-CoV-2 基因组种群进行污水监测。
PLoS One. 2023 Apr 21;18(4):e0284483. doi: 10.1371/journal.pone.0284483. eCollection 2023.
5
Hepatitis B virus haplotype number at baseline is a predictive marker of functional cure during antiviral therapy for patients with genotypes A and D HBeAg-positive chronic hepatitis B.基线时乙型肝炎病毒单倍型数量是基因型 A 和 D HBeAg 阳性慢性乙型肝炎患者抗病毒治疗中功能性治愈的预测标志物。
Aliment Pharmacol Ther. 2023 Mar;57(5):509-523. doi: 10.1111/apt.17299. Epub 2022 Nov 25.
6
HaploDMF: viral haplotype reconstruction from long reads via deep matrix factorization.HaploDMF:通过深度矩阵分解从长读中重建病毒单倍型。
Bioinformatics. 2022 Dec 13;38(24):5360-5367. doi: 10.1093/bioinformatics/btac708.
7
Municipal and neighbourhood level wastewater surveillance and subtyping of an influenza virus outbreak.城市和社区层面污水监测及流感病毒爆发的亚型分析。
Sci Rep. 2022 Sep 22;12(1):15777. doi: 10.1038/s41598-022-20076-z.
8
Early detection and surveillance of SARS-CoV-2 genomic variants in wastewater using COJAC.利用 COJAC 技术对污水中的 SARS-CoV-2 基因组变异进行早期检测和监测。
Nat Microbiol. 2022 Aug;7(8):1151-1160. doi: 10.1038/s41564-022-01185-x. Epub 2022 Jul 18.
9
Strainline: full-length de novo viral haplotype reconstruction from noisy long reads.Strainline:从嘈杂的长读段中全长从头重建病毒单倍型。
Genome Biol. 2022 Jan 20;23(1):29. doi: 10.1186/s13059-021-02587-6.
10
Norovirus detection in wastewater and its correlation with human gastroenteritis: a systematic review and meta-analysis.污水中诺如病毒的检测及其与人类胃肠炎的相关性:系统评价和荟萃分析。
Environ Sci Pollut Res Int. 2022 Apr;29(16):22829-22842. doi: 10.1007/s11356-021-18202-x. Epub 2022 Jan 19.