• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用长读长重建病毒单倍型。

Reconstructing viral haplotypes using long reads.

作者信息

Cai Dehan, Sun Yanni

机构信息

Department of Electrical Engineering, City University of Hong Kong, Kowloon, Hong Kong SAR, China.

出版信息

Bioinformatics. 2022 Apr 12;38(8):2127-2134. doi: 10.1093/bioinformatics/btac089.

DOI:10.1093/bioinformatics/btac089
PMID:35157018
Abstract

MOTIVATION

Most RNA viruses lack strict proofreading during replication. Coupled with a high replication rate, some RNA viruses can form a virus population containing a group of genetically related but different haplotypes. Characterizing the haplotype composition in a virus population is thus important to understand viruses' evolution. Many attempts have been made to reconstruct viral haplotypes using next-generation sequencing (NGS) reads. However, the short length of NGS reads cannot cover distant single-nucleotide variants, making it difficult to reconstruct complete or near-complete haplotypes. Given the fast developments of third-generation sequencing technologies, a new opportunity has arisen for reconstructing full-length haplotypes with long reads.

RESULTS

In this work, we developed a new tool, RVHaplo to reconstruct haplotypes for known viruses from long reads. We tested it rigorously on both simulated and real viral sequencing data and compared it against other popular haplotype reconstruction tools. The results demonstrated that RVHaplo outperforms the state-of-the-art tools for viral haplotype reconstruction from long reads. Especially, RVHaplo can reconstruct the rare (1% abundance) haplotypes that other tools usually missed.

AVAILABILITY AND IMPLEMENTATION

The source code and the documentation of RVHaplo are available at https://github.com/dhcai21/RVHaplo.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

大多数RNA病毒在复制过程中缺乏严格的校对机制。再加上高复制率,一些RNA病毒能够形成一个包含一组遗传相关但不同单倍型的病毒群体。因此,表征病毒群体中的单倍型组成对于理解病毒的进化很重要。人们已经进行了许多尝试,使用下一代测序(NGS)读数来重建病毒单倍型。然而,NGS读数的短长度无法覆盖远距离的单核苷酸变体,使得重建完整或接近完整的单倍型变得困难。鉴于第三代测序技术的快速发展,利用长读数重建全长单倍型出现了新的机遇。

结果

在这项工作中,我们开发了一种新工具RVHaplo,用于从长读数中为已知病毒重建单倍型。我们在模拟和真实病毒测序数据上对其进行了严格测试,并将其与其他流行的单倍型重建工具进行了比较。结果表明,RVHaplo在从长读数进行病毒单倍型重建方面优于现有工具。特别是,RVHaplo能够重建其他工具通常遗漏的罕见(丰度为1%)单倍型。

可用性和实现方式

RVHaplo的源代码和文档可在https://github.com/dhcai21/RVHaplo获取。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
Reconstructing viral haplotypes using long reads.使用长读长重建病毒单倍型。
Bioinformatics. 2022 Apr 12;38(8):2127-2134. doi: 10.1093/bioinformatics/btac089.
2
HaploDMF: viral haplotype reconstruction from long reads via deep matrix factorization.HaploDMF:通过深度矩阵分解从长读中重建病毒单倍型。
Bioinformatics. 2022 Dec 13;38(24):5360-5367. doi: 10.1093/bioinformatics/btac708.
3
De novo haplotype reconstruction in viral quasispecies using paired-end read guided path finding.利用配对末端读指导路径寻找技术对病毒准种进行从头单倍型重建。
Bioinformatics. 2018 Sep 1;34(17):2927-2935. doi: 10.1093/bioinformatics/bty202.
4
HAT: haplotype assembly tool using short and error-prone long reads.HAT:使用短读长和易错长读进行单体型组装的工具。
Bioinformatics. 2022 Dec 13;38(24):5352-5359. doi: 10.1093/bioinformatics/btac702.
5
A binning tool to reconstruct viral haplotypes from assembled contigs.一种从组装的连续序列中重建病毒单倍型的分箱工具。
BMC Bioinformatics. 2019 Nov 4;20(1):544. doi: 10.1186/s12859-019-3138-1.
6
Viral quasispecies reconstruction via tensor factorization with successive read removal.基于连续读段去除的张量分解进行病毒准种重建。
Bioinformatics. 2018 Jul 1;34(13):i23-i31. doi: 10.1093/bioinformatics/bty291.
7
Evaluation of haplotype callers for next-generation sequencing of viruses.病毒下一代测序中单体型caller 的评估。
Infect Genet Evol. 2020 Aug;82:104277. doi: 10.1016/j.meegid.2020.104277. Epub 2020 Mar 6.
8
Benchmarking of viral haplotype reconstruction programmes: an overview of the capacities and limitations of currently available programmes.病毒单倍型重建程序的基准测试:当前可用程序的能力与局限性概述
Brief Bioinform. 2014 May;15(3):431-42. doi: 10.1093/bib/bbs081. Epub 2012 Dec 19.
9
DCHap: A Divide-and-Conquer Haplotype Phasing Algorithm for Third-Generation Sequences.DCHap:一种用于第三代测序的分治单倍型相位算法。
IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1277-1284. doi: 10.1109/TCBB.2020.3005673. Epub 2022 Jun 3.
10
AccuVIR: an ACCUrate VIRal genome assembly tool for third-generation sequencing data.AccuVIR:一种用于第三代测序数据的 ACCUrate 病毒基因组组装工具。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac827.

引用本文的文献

1
BonoboFlow: viral genome assembly and haplotype reconstruction from nanopore reads.倭黑猩猩流程:基于纳米孔测序 reads 的病毒基因组组装与单倍型重建
Bioinform Adv. 2025 May 13;5(1):vbaf115. doi: 10.1093/bioadv/vbaf115. eCollection 2025.
2
A method for in-depth analysis of circular DNA virus populations by unambiguously profiling the low abundant virus variants and partial genomic components.一种通过明确分析低丰度病毒变体和部分基因组成分来深入分析环状DNA病毒群体的方法。
Nucleic Acids Res. 2025 Mar 20;53(6). doi: 10.1093/nar/gkaf221.
3
Deciphering mixed infections by plant RNA virus and reconstructing complete genomes simultaneously present within-host.
解析植物RNA病毒的混合感染并同时重建宿主内完整基因组。
PLoS One. 2025 Jan 14;20(1):e0311555. doi: 10.1371/journal.pone.0311555. eCollection 2025.
4
Easing genomic surveillance: A comprehensive performance evaluation of long-read assemblers across multi-strain mixture data of HIV-1 and Other pathogenic viruses for constructing a user-friendly bioinformatic pipeline.简化基因组监测:针对 HIV-1 和其他病原性病毒的多菌株混合数据,对长读长组装器进行全面性能评估,以构建用户友好的生物信息学管道。
F1000Res. 2024 May 31;13:556. doi: 10.12688/f1000research.149577.1. eCollection 2024.
5
Early detection of emerging viral variants through analysis of community structure of coordinated substitution networks.通过分析协调替代网络的社区结构,早期发现新兴病毒变体。
Nat Commun. 2024 Apr 2;15(1):2838. doi: 10.1038/s41467-024-47304-6.
6
HMMPolish: a coding region polishing tool for TGS-sequenced RNA viruses.HMMPolish:一种用于 TGS 测序 RNA 病毒的编码区修饰工具。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad264.
7
HaploDMF: viral haplotype reconstruction from long reads via deep matrix factorization.HaploDMF:通过深度矩阵分解从长读中重建病毒单倍型。
Bioinformatics. 2022 Dec 13;38(24):5360-5367. doi: 10.1093/bioinformatics/btac708.
8
Nanopore sequencing technology and its application in plant virus diagnostics.纳米孔测序技术及其在植物病毒诊断中的应用。
Front Microbiol. 2022 Jul 25;13:939666. doi: 10.3389/fmicb.2022.939666. eCollection 2022.