Accurate assembly of full-length consensus for viral quasispecies.

Authors

Tian Jia, Gao Ziyu, Li Minghao, Bao Ergude, Zhao Jin

Affiliations

College of Computer Science and Technology, Qingdao University, Qingdao, China.

School of Software Engineering, Beijing Jiaotong University, Beijing, China.

Publication information

BMC Bioinformatics. 2025 Feb 1;26(1):36. doi: 10.1186/s12859-025-06045-z.

DOI:10.1186/s12859-025-06045-z
PMID:39893441
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11787740/
Abstract

BACKGROUND

Viruses can inhabit their hosts as an ensemble of mutant strains. Reconstructing a robust consensus representation of these diverse strains is essential for recognizing genetic variation among them and for studying virulence, pathogenesis, and therapy selection. Virus genomes are typically small, often comprising only a few thousand to several hundred thousand nucleotides. Although constructing a high-quality consensus of virus strains might therefore seem feasible, most current assemblers generate only fragmented contigs. Assembling a single full-length consensus contig matters because it is vital for accurately identifying genetic diversity and estimating strain abundance.

RESULTS

In this paper, we present FC-Virus, a de novo genome assembly strategy that specifically targets highly diverse viral populations. FC-Virus first identifies the k-mers that are common across most viral strains, then uses these k-mers as a backbone to build a full-length consensus sequence covering the entire genome. We benchmarked FC-Virus against state-of-the-art genome assemblers.
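The two-step idea described above (select widely shared k-mers, then chain them into one sequence) can be illustrated with a minimal sketch. This is not the FC-Virus implementation: the function names, the `min_fraction` cutoff rule, and the greedy extension are all assumptions made for illustration, and the sketch treats k-mer counts in the reads as a stand-in for "common across most strains". It also assumes error-free reads on a single strand and `k >= 2`.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every k-mer occurring in the reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def common_kmers(counts, min_fraction):
    """Keep k-mers whose count is at least min_fraction of the highest count.

    Hypothetical cutoff rule: high-count k-mers are assumed to be shared
    by most strains, low-count ones to be strain-specific variants.
    """
    if not counts:
        return {}
    cutoff = min_fraction * max(counts.values())
    return {kmer: c for kmer, c in counts.items() if c >= cutoff}


def extend(contig, kmers, k, used, rightwards):
    """Greedily extend the contig one base at a time in one direction."""
    while True:
        if rightwards:
            anchor = contig[-(k - 1):]
            candidates = [anchor + base for base in "ACGT"]
        else:
            anchor = contig[:k - 1]
            candidates = [base + anchor for base in "ACGT"]
        # Only retained k-mers not already placed may extend the contig;
        # the 'used' set also guarantees termination on cyclic inputs.
        candidates = [c for c in candidates if c in kmers and c not in used]
        if not candidates:
            return contig
        best = max(candidates, key=kmers.get)
        used.add(best)
        contig = contig + best[-1] if rightwards else best[0] + contig


def build_consensus(reads, k=5, min_fraction=0.5):
    """Build one consensus string from the backbone of common k-mers."""
    kmers = common_kmers(count_kmers(reads, k), min_fraction)
    if not kmers:
        return ""
    seed = max(kmers, key=kmers.get)  # start from a most abundant k-mer
    used = {seed}
    contig = extend(seed, kmers, k, used, rightwards=True)
    return extend(contig, kmers, k, used, rightwards=False)


# Toy data: two strains that differ by a single nucleotide,
# with the first strain three times as abundant as the second.
major = "ACGTTGCAAGGCTA"
minor = "ACGTTGCATGGCTA"
print(build_consensus([major] * 3 + [minor], k=5, min_fraction=0.5))
# prints ACGTTGCAAGGCTA
```

In this toy case the k-mers unique to the minor strain fall below the cutoff, so the retained k-mers form a single unbranched path and the extension returns one full-length sequence rather than several fragments. Real data would additionally need handling for sequencing errors, reverse complements, and repeats, which this sketch deliberately leaves out.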

CONCLUSION

Experimental results confirm that FC-Virus can construct a single, accurate full-length consensus, whereas other assemblers only manage to produce fragmented contigs. FC-Virus is freely available at https://github.com/qdu-bioinfo/FC-Virus.git.
