• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高通量测序数据组装的病毒基因组中单核苷酸多态性的检测:序列分析策略的大规模性能测试。

Detection of single nucleotide polymorphisms in virus genomes assembled from high-throughput sequencing data: large-scale performance testing of sequence analysis strategies.

机构信息

Laboratory of Plant Pathology-TERRA-Gembloux Agro-Bio Tech, University of Liège, Gembloux, Belgium.

Citrus Research International, Matieland, South Africa.

出版信息

PeerJ. 2023 Aug 16;11:e15816. doi: 10.7717/peerj.15816. eCollection 2023.

DOI:10.7717/peerj.15816
PMID:
37601254
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10439718/
Abstract

Recent developments in high-throughput sequencing (HTS) technologies and bioinformatics have drastically changed research in virology, especially for virus discovery. Indeed, proper monitoring of the viral population requires information on the different isolates circulating in the studied area. For this purpose, HTS has greatly facilitated the sequencing of new genomes of detected viruses and their comparison. However, bioinformatics analyses allowing reconstruction of genome sequences and detection of single nucleotide polymorphisms (SNPs) can potentially create bias and has not been widely addressed so far. Therefore, more knowledge is required on the limitations of predicting SNPs based on HTS-generated sequence samples. To address this issue, we compared the ability of 14 plant virology laboratories, each employing a different bioinformatics pipeline, to detect 21 variants of pepino mosaic virus (PepMV) in three samples through large-scale performance testing (PT) using three artificially designed datasets. To evaluate the impact of bioinformatics analyses, they were divided into three key steps: reads pre-processing, virus-isolate identification, and variant calling. Each step was evaluated independently through an original, PT design including discussion and validation between participants at each step. Overall, this work underlines key parameters influencing SNPs detection and proposes recommendations for reliable variant calling for plant viruses. The identification of the closest reference, mapping parameters and manual validation of the detection were recognized as the most impactful analysis steps for the success of the SNPs detections. Strategies to improve the prediction of SNPs are also discussed.

摘要

高通量测序(HTS)技术和生物信息学的最新进展极大地改变了病毒学研究,特别是在病毒发现方面。事实上,要对病毒群体进行适当监测,就需要了解在研究区域中循环的不同分离株的信息。为此,HTS 极大地促进了新检测到的病毒基因组的测序及其比较。然而,允许重建基因组序列和检测单核苷酸多态性(SNP)的生物信息学分析可能会产生偏差,到目前为止还没有得到广泛的解决。因此,需要更多地了解基于 HTS 生成的序列样本预测 SNP 的局限性。为了解决这个问题,我们通过使用三个人工设计的数据集进行大规模性能测试(PT),比较了 14 个植物病毒学实验室在三个样本中检测 21 种 Pepino mosaic virus(PepMV)变体的能力,每个实验室都使用不同的生物信息学管道。为了评估生物信息学分析的影响,我们将其分为三个关键步骤:读取预处理、病毒分离鉴定和变体调用。每个步骤都通过一个原始的 PT 设计进行独立评估,包括参与者在每个步骤的讨论和验证。总的来说,这项工作强调了影响 SNP 检测的关键参数,并为植物病毒可靠的变体调用提出了建议。鉴定最接近的参考序列、映射参数和手动验证检测结果被认为是 SNP 检测成功的最具影响力的分析步骤。还讨论了改进 SNP 预测的策略。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465f/10439718/74567bd02ee1/peerj-11-15816-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465f/10439718/19a4d1c7dcfd/peerj-11-15816-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465f/10439718/74567bd02ee1/peerj-11-15816-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465f/10439718/19a4d1c7dcfd/peerj-11-15816-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/465f/10439718/74567bd02ee1/peerj-11-15816-g002.jpg

相似文献

1
Detection of single nucleotide polymorphisms in virus genomes assembled from high-throughput sequencing data: large-scale performance testing of sequence analysis strategies.高通量测序数据组装的病毒基因组中单核苷酸多态性的检测:序列分析策略的大规模性能测试。
PeerJ. 2023 Aug 16;11:e15816. doi: 10.7717/peerj.15816. eCollection 2023.
2
Virus Detection by High-Throughput Sequencing of Small RNAs: Large-Scale Performance Testing of Sequence Analysis Strategies.高通量测序小 RNA 检测病毒:序列分析策略的大规模性能测试。
Phytopathology. 2019 Mar;109(3):488-497. doi: 10.1094/PHYTO-02-18-0067-R. Epub 2019 Feb 7.
3
A beginners guide to SNP calling from high-throughput DNA-sequencing data.高通量 DNA 测序数据中 SNP 调用的入门指南。
Hum Genet. 2012 Oct;131(10):1541-54. doi: 10.1007/s00439-012-1213-z. Epub 2012 Aug 11.
4
Bioinformatics tools for analysing viral genomic data.用于分析病毒基因组数据的生物信息学工具。
Rev Sci Tech. 2016 Apr;35(1):271-85. doi: 10.20506/rst.35.1.2432.
5
An investigation of causes of false positive single nucleotide polymorphisms using simulated reads from a small eukaryote genome.利用来自小型真核生物基因组的模拟读数对单核苷酸多态性假阳性原因的调查。
BMC Bioinformatics. 2015 Nov 11;16:382. doi: 10.1186/s12859-015-0801-z.
6
Computational Pipeline for the Detection of Plant RNA Viruses Using High-Throughput Sequencing.基于高通量测序的植物 RNA 病毒检测计算流程。
Methods Mol Biol. 2024;2724:1-20. doi: 10.1007/978-1-0716-3485-1_1.
7
High-throughput sequencing (HTS) for the analysis of viral populations.高通量测序(HTS)用于病毒群体分析。
Infect Genet Evol. 2020 Jun;80:104208. doi: 10.1016/j.meegid.2020.104208. Epub 2020 Jan 27.
8
LABRADOR-A Computational Workflow for Virus Detection in High-Throughput Sequencing Data.LABRADOR:一种用于高通量测序数据中病毒检测的计算工作流程。
Viruses. 2021 Dec 18;13(12):2541. doi: 10.3390/v13122541.
9
Molecular characterization of two Pepino mosaic virus variants from imported tomato seed reveals high levels of sequence identity between Chilean and US isolates.对来自进口番茄种子的两种 Pepino 花叶病毒变体的分子特征分析表明,智利和美国分离株之间具有高度的序列同一性。
Virus Genes. 2007 Jan;34(1):1-8. doi: 10.1007/s11262-006-0003-x. Epub 2006 Aug 22.
10
Cont-ID: detection of sample cross-contamination in viral metagenomic data.病毒宏基因组数据中样本交叉污染的检测。
BMC Biol. 2023 Oct 13;21(1):217. doi: 10.1186/s12915-023-01708-w.

引用本文的文献

1
High-Throughput Sequencing Reveals Apple Virome Diversity and Novel Viruses in the Czech Republic.高通量测序揭示了捷克共和国苹果病毒组的多样性及新型病毒
Viruses. 2025 Apr 29;17(5):650. doi: 10.3390/v17050650.
2
High-throughput sequencing revealed the symptomatic common bean (Phaseolus vulgaris L.) virome in Iran.高通量测序揭示了伊朗有症状普通菜豆(Phaseolus vulgaris L.)的病毒组。
Sci Rep. 2025 Jan 10;15(1):1621. doi: 10.1038/s41598-025-85281-y.
3
Genomic Sequence of a Czech Isolate of Erysimum Latent Virus from .来自捷克的糖芥潜隐病毒分离株的基因组序列 。 (原英文文本不完整,翻译可能不太准确,完整准确的翻译需补充完整原文内容)

本文引用的文献

1
Going beyond consensus genome sequences: An innovative SNP-based methodology reconstructs different Ugandan cassava brown streak virus haplotypes at a nationwide scale in Rwanda.超越共有基因组序列:一种基于单核苷酸多态性的创新方法在卢旺达全国范围内重建了不同的乌干达木薯褐色条纹病毒单倍型。
Virus Evol. 2023 Aug 24;9(2):vead053. doi: 10.1093/ve/vead053. eCollection 2023.
2
Systematic benchmark of state-of-the-art variant calling pipelines identifies major factors affecting accuracy of coding sequence variant discovery.系统基准测试最先进的变异调用管道,确定影响编码序列变异发现准确性的主要因素。
BMC Genomics. 2022 Feb 22;23(1):155. doi: 10.1186/s12864-022-08365-3.
3
Plants (Basel). 2024 Sep 11;13(18):2554. doi: 10.3390/plants13182554.
SARS-COV-2 Variants: Differences and Potential of Immune Evasion.
SARS-CoV-2 变异株:免疫逃逸的差异和潜力。
Front Cell Infect Microbiol. 2022 Jan 18;11:781429. doi: 10.3389/fcimb.2021.781429. eCollection 2021.
4
Emerging SARS-CoV-2 Variants: A Review of Its Mutations, Its Implications and Vaccine Efficacy.新型严重急性呼吸综合征冠状病毒2(SARS-CoV-2)变体:其突变、影响及疫苗效力综述
Vaccines (Basel). 2021 Oct 18;9(10):1195. doi: 10.3390/vaccines9101195.
5
Interlaboratory Comparison Study on Ribodepleted Total RNA High-Throughput Sequencing for Plant Virus Diagnostics and Bioinformatic Competence.用于植物病毒诊断和生物信息学能力的核糖体RNA去除的总RNA高通量测序的实验室间比对研究
Pathogens. 2021 Sep 12;10(9):1174. doi: 10.3390/pathogens10091174.
6
Evaluating assembly and variant calling software for strain-resolved analysis of large DNA viruses.评估组装和变异调用软件,用于大型 DNA 病毒的菌株解析分析。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa123.
7
Benchmarking the performance of Pool-seq SNP callers using simulated and real sequencing data.使用模拟和真实测序数据对 Pool-seq SNP 调用程序的性能进行基准测试。
Mol Ecol Resour. 2021 May;21(4):1216-1229. doi: 10.1111/1755-0998.13343. Epub 2021 Mar 5.
8
Best practices for variant calling in clinical sequencing.临床测序中变异调用的最佳实践。
Genome Med. 2020 Oct 26;12(1):91. doi: 10.1186/s13073-020-00791-w.
9
Detection of Plant Viruses and Disease Management: Relevance of Genetic Diversity and Evolution.植物病毒检测与病害管理:遗传多样性与进化的相关性
Front Plant Sci. 2020 Jul 17;11:1092. doi: 10.3389/fpls.2020.01092. eCollection 2020.
10
Viral quasispecies.病毒准种。
PLoS Genet. 2019 Oct 17;15(10):e1008271. doi: 10.1371/journal.pgen.1008271. eCollection 2019 Oct.