• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从 454 焦磷酸测序读取中推断病毒准种谱。

Inferring viral quasispecies spectra from 454 pyrosequencing reads.

机构信息

Department of Computer Science, Georgia State University, Atlanta, GA 30303, USA.

出版信息

BMC Bioinformatics. 2011;12 Suppl 6(Suppl 6):S1. doi: 10.1186/1471-2105-12-S6-S1. Epub 2011 Jul 28.

DOI:10.1186/1471-2105-12-S6-S1
PMID:21989211
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3194189/
Abstract

BACKGROUND

RNA viruses infecting a host usually exist as a set of closely related sequences, referred to as quasispecies. The genomic diversity of viral quasispecies is a subject of great interest, particularly for chronic infections, since it can lead to resistance to existing therapies. High-throughput sequencing is a promising approach to characterizing viral diversity, but unfortunately standard assembly software was originally designed for single genome assembly and cannot be used to simultaneously assemble and estimate the abundance of multiple closely related quasispecies sequences.

RESULTS

In this paper, we introduce a new Viral Spectrum Assembler (ViSpA) method for quasispecies spectrum reconstruction and compare it with the state-of-the-art ShoRAH tool on both simulated and real 454 pyrosequencing shotgun reads from HCV and HIV quasispecies. Experimental results show that ViSpA outperforms ShoRAH on simulated error-free reads, correctly assembling 10 out of 10 quasispecies and 29 sequences out of 40 quasispecies. While ShoRAH has a significant advantage over ViSpA on reads simulated with sequencing errors due to its advanced error correction algorithm, ViSpA is better at assembling the simulated reads after they have been corrected by ShoRAH. ViSpA also outperforms ShoRAH on real 454 reads. Indeed, 7 most frequent sequences reconstructed by ViSpA from a real HCV dataset are viable (do not contain internal stop codons), and the most frequent sequence was within 1% of the actual open reading frame obtained by cloning and Sanger sequencing. In contrast, only one of the sequences reconstructed by ShoRAH is viable. On a real HIV dataset, ShoRAH correctly inferred only 2 quasispecies sequences with at most 4 mismatches whereas ViSpA correctly reconstructed 5 quasispecies with at most 2 mismatches, and 2 out of 5 sequences were inferred without any mismatches. ViSpA source code is available at http://alla.cs.gsu.edu/~software/VISPA/vispa.html.

CONCLUSIONS

ViSpA enables accurate viral quasispecies spectrum reconstruction from 454 pyrosequencing reads. We are currently exploring extensions applicable to the analysis of high-throughput sequencing data from bacterial metagenomic samples and ecological samples of eukaryote populations.

摘要

背景

感染宿主的 RNA 病毒通常存在一组密切相关的序列,称为准种。病毒准种的基因组多样性是一个非常感兴趣的课题,特别是对于慢性感染,因为它可能导致对现有治疗方法的耐药性。高通量测序是一种很有前途的方法来描述病毒多样性,但不幸的是,标准的组装软件最初是为单基因组组装设计的,不能同时组装和估计多个密切相关的准种序列的丰度。

结果

本文介绍了一种新的病毒频谱组装(ViSpA)方法,用于准种谱重建,并将其与最先进的 ShoRAH 工具在 HCV 和 HIV 准种的模拟和真实 454 焦磷酸测序 shotgun 读取上进行了比较。实验结果表明,在无错误模拟读取中,ViSpA 比 ShoRAH 表现更好,正确组装了 10 个准种中的 10 个,以及 40 个准种中的 29 个序列。虽然 ShoRAH 由于其先进的纠错算法,在具有测序错误的模拟读取上优于 ViSpA,但在经过 ShoRAH 纠正后的模拟读取上,ViSpA 则更好。ViSpA 在真实的 454 读取上也优于 ShoRAH。事实上,从真实的 HCV 数据集重建的 ViSpA 中 7 个最常见的序列是可行的(不包含内部终止密码子),并且最常见的序列与通过克隆和 Sanger 测序获得的实际开放阅读框相差 1%以内。相比之下,只有 ShoRAH 重建的一个序列是可行的。在真实的 HIV 数据集上,ShoRAH 仅正确推断了最多有 4 个错配的 2 个准种序列,而 ViSpA 则正确重建了最多有 2 个错配的 5 个准种序列,其中 2 个序列没有错配。ViSpA 的源代码可在 http://alla.cs.gsu.edu/~software/VISPA/vispa.html 获得。

结论

ViSpA 能够从 454 焦磷酸测序读取中准确重建病毒准种谱。我们目前正在探索适用于细菌宏基因组样本和真核生物种群生态样本的高通量测序数据的分析扩展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c7f/3194189/a9a65a1b4cf8/1471-2105-12-S6-S1-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c7f/3194189/3b3180e13d6f/1471-2105-12-S6-S1-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c7f/3194189/248428a1bc09/1471-2105-12-S6-S1-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c7f/3194189/018b921d17fb/1471-2105-12-S6-S1-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c7f/3194189/677050c1b4de/1471-2105-12-S6-S1-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c7f/3194189/a9a65a1b4cf8/1471-2105-12-S6-S1-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c7f/3194189/3b3180e13d6f/1471-2105-12-S6-S1-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c7f/3194189/248428a1bc09/1471-2105-12-S6-S1-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c7f/3194189/018b921d17fb/1471-2105-12-S6-S1-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c7f/3194189/677050c1b4de/1471-2105-12-S6-S1-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c7f/3194189/a9a65a1b4cf8/1471-2105-12-S6-S1-5.jpg

相似文献

1
Inferring viral quasispecies spectra from 454 pyrosequencing reads.从 454 焦磷酸测序读取中推断病毒准种谱。
BMC Bioinformatics. 2011;12 Suppl 6(Suppl 6):S1. doi: 10.1186/1471-2105-12-S6-S1. Epub 2011 Jul 28.
2
Reconstruction of viral population structure from next-generation sequencing data using multicommodity flows.利用多商品流从下一代测序数据中重建病毒种群结构。
BMC Bioinformatics. 2013;14 Suppl 9(Suppl 9):S2. doi: 10.1186/1471-2105-14-S9-S2. Epub 2013 Jun 28.
3
aBayesQR: A Bayesian Method for Reconstruction of Viral Populations Characterized by Low Diversity.aBayesQR:一种用于重建低多样性特征病毒群体的贝叶斯方法。
J Comput Biol. 2018 Jul;25(7):637-648. doi: 10.1089/cmb.2017.0249. Epub 2018 Feb 26.
4
Streamlined Subpopulation, Subtype, and Recombination Analysis of HIV-1 Half-Genome Sequences Generated by High-Throughput Sequencing.高通量测序生成的 HIV-1 半基因组序列的简化亚群、亚型和重组分析。
mSphere. 2020 Oct 14;5(5):e00551-20. doi: 10.1128/mSphere.00551-20.
5
Viral quasispecies reconstruction via tensor factorization with successive read removal.基于连续读段去除的张量分解进行病毒准种重建。
Bioinformatics. 2018 Jul 1;34(13):i23-i31. doi: 10.1093/bioinformatics/bty291.
6
Reconstructing viral quasispecies from NGS amplicon reads.从二代测序扩增子读数中重建病毒准种
In Silico Biol. 2011;11(5-6):237-49. doi: 10.3233/ISB-2012-0458.
7
QUENTIN: reconstruction of disease transmissions from viral quasispecies genomic data.从病毒准种基因组数据重建疾病传播。
Bioinformatics. 2018 Jan 1;34(1):163-170. doi: 10.1093/bioinformatics/btx402.
8
QSdpR: Viral quasispecies reconstruction via correlation clustering.QSdpR:基于相关聚类的病毒准种重建。
Genomics. 2018 Nov;110(6):375-381. doi: 10.1016/j.ygeno.2017.12.007. Epub 2017 Dec 19.
9
Low-Bias RNA Sequencing of the HIV-2 Genome from Blood Plasma.从血浆中对 HIV-2 基因组进行低偏差 RNA 测序。
J Virol. 2018 Dec 10;93(1). doi: 10.1128/JVI.00677-18. Print 2019 Jan 1.
10
Inference of genetic relatedness between viral quasispecies from sequencing data.从测序数据推断病毒准种之间的遗传相关性。
BMC Genomics. 2017 Dec 6;18(Suppl 10):918. doi: 10.1186/s12864-017-4274-5.

引用本文的文献

1
Genetic diversity of hepatitis B virus quasispecies in different biological compartments reveals distinct genotypes.乙型肝炎病毒准种在不同生物隔室中的遗传多样性揭示了不同的基因型。
Sci Rep. 2023 Oct 9;13(1):17023. doi: 10.1038/s41598-023-43655-0.
2
Haploflow: strain-resolved de novo assembly of viral genomes.Haploflow:病毒基因组的菌株解析从头组装
Genome Biol. 2021 Jul 19;22(1):212. doi: 10.1186/s13059-021-02426-8.
3
Accurate assembly of minority viral haplotypes from next-generation sequencing through efficient noise reduction.

本文引用的文献

1
Combinatorial analysis and algorithms for quasispecies reconstruction using next-generation sequencing.基于下一代测序的准种重建的组合分析和算法。
BMC Bioinformatics. 2011 Jan 5;12:5. doi: 10.1186/1471-2105-12-5.
2
Characteristics of 454 pyrosequencing data--enabling realistic simulation with flowsim.454 焦磷酸测序数据的特征——使用 flowsim 进行现实模拟。
Bioinformatics. 2010 Sep 15;26(18):i420-5. doi: 10.1093/bioinformatics/btq365.
3
Error correction of next-generation sequencing data and reliable estimation of HIV quasispecies.
通过有效降低噪声,实现下一代测序中少数病毒单倍型的精确组装。
Nucleic Acids Res. 2021 Sep 27;49(17):e102. doi: 10.1093/nar/gkab576.
4
Binnacle: Using Scaffolds to Improve the Contiguity and Quality of Metagenomic Bins.罗盘箱:利用支架提高宏基因组分箱的连续性和质量
Front Microbiol. 2021 Feb 24;12:638561. doi: 10.3389/fmicb.2021.638561. eCollection 2021.
5
Haploflow: Strain-resolved assembly of viral genomes.Haploflow:病毒基因组的菌株解析组装
bioRxiv. 2021 Jan 26:2021.01.25.428049. doi: 10.1101/2021.01.25.428049.
6
Streamlined Subpopulation, Subtype, and Recombination Analysis of HIV-1 Half-Genome Sequences Generated by High-Throughput Sequencing.高通量测序生成的 HIV-1 半基因组序列的简化亚群、亚型和重组分析。
mSphere. 2020 Oct 14;5(5):e00551-20. doi: 10.1128/mSphere.00551-20.
7
Epidemiological data analysis of viral quasispecies in the next-generation sequencing era.下一代测序时代病毒准种的流行病学数据分析。
Brief Bioinform. 2021 Jan 18;22(1):96-108. doi: 10.1093/bib/bbaa101.
8
Evaluation of haplotype callers for next-generation sequencing of viruses.病毒下一代测序中单体型caller 的评估。
Infect Genet Evol. 2020 Aug;82:104277. doi: 10.1016/j.meegid.2020.104277. Epub 2020 Mar 6.
9
Metagenome SNP calling via read-colored de Bruijn graphs.通过读取颜色化的德布鲁因图进行宏基因组单核苷酸多态性(SNP)检测
Bioinformatics. 2021 Apr 1;36(22-23):5275-5281. doi: 10.1093/bioinformatics/btaa081.
10
BHap: a novel approach for bacterial haplotype reconstruction.BHap:一种用于细菌单倍型重建的新方法。
Bioinformatics. 2019 Nov 1;35(22):4624-4631. doi: 10.1093/bioinformatics/btz280.
下一代测序数据纠错和 HIV 准种可靠估计。
Nucleic Acids Res. 2010 Nov;38(21):7400-9. doi: 10.1093/nar/gkq655. Epub 2010 Jul 29.
4
Deep sequencing of a genetically heterogeneous sample: local haplotype reconstruction and read error correction.基因异质样本的深度测序:局部单倍型重建与读段错误校正
J Comput Biol. 2010 Mar;17(3):417-28. doi: 10.1089/cmb.2009.0164.
5
Fast mapping of short sequences with mismatches, insertions and deletions using index structures.使用索引结构对具有错配、插入和缺失的短序列进行快速映射。
PLoS Comput Biol. 2009 Sep;5(9):e1000502. doi: 10.1371/journal.pcbi.1000502. Epub 2009 Sep 11.
6
The quasispecies nature and biological implications of the hepatitis C virus.丙型肝炎病毒的准种性质及其生物学意义。
Infect Genet Evol. 2009 Dec;9(6):1158-67. doi: 10.1016/j.meegid.2009.07.011. Epub 2009 Aug 8.
7
HapCUT: an efficient and accurate algorithm for the haplotype assembly problem.HapCUT:一种用于单倍型组装问题的高效且准确的算法。
Bioinformatics. 2008 Aug 15;24(16):i153-9. doi: 10.1093/bioinformatics/btn298.
8
Viral population estimation using pyrosequencing.使用焦磷酸测序法进行病毒群体估计。
PLoS Comput Biol. 2008 May 9;4(4):e1000074. doi: 10.1371/journal.pcbi.1000074.
9
Quality scores and SNP detection in sequencing-by-synthesis systems.合成测序系统中的质量评分与单核苷酸多态性检测
Genome Res. 2008 May;18(5):763-70. doi: 10.1101/gr.070227.107. Epub 2008 Jan 22.
10
Pyrobayes: an improved base caller for SNP discovery in pyrosequences.Pyrobayes:用于焦磷酸测序中SNP发现的改进型碱基识别器。
Nat Methods. 2008 Feb;5(2):179-81. doi: 10.1038/nmeth.1172. Epub 2008 Jan 13.