• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从头序列组装需要对嵌合序列进行生物信息学检查。

De novo sequence assembly requires bioinformatic checking of chimeric sequences.

机构信息

Division of Pathology, Department of Laboratory Medicine, Karolinska Institutet, Stockholm, Sweden.

出版信息

PLoS One. 2020 Aug 10;15(8):e0237455. doi: 10.1371/journal.pone.0237455. eCollection 2020.

DOI:10.1371/journal.pone.0237455
PMID:32777809
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7417191/
Abstract

De novo assembly of sequence reads from next generation sequencing platforms is a common strategy for detecting presence and sequencing of viruses in biospecimens. Amplification artifacts and presence of several related viruses in the same specimen can lead to assembly of erroneous, chimeric sequences. We now report that such chimeras can also occur between viral and non-viral biological sequences incorrectly joined together which may cause erroneous detection of viruses, highlighting the importance of performing a chimera checking step in bioinformatics pipelines. Using Illumina NextSeq and metagenomic sequencing, we analyzed 80 consecutive non-melanoma skin cancers (NMSCs) from 11 immunosuppressed patients together with 11 NMSCs from patients who had only developed 1 NMSC. We aligned high-quality reads against a Human Papillomavirus (HPV) database and found HPV sequences in 9/91 specimens. A previous bioinformatic analysis of the same crude sequencing data from some of these samples had found an additional 3 specimens to be HPV-positive after performing de novo assembly. The reason for the discrepancy was investigated and found to be mostly caused by chimeric sequences containing both viral and non-viral sequences. Non-viral sequences were present in these 3 samples. To avoid erroneous detection of HPV when performing sequencing, we thus developed a novel script to identify HPV chimeric sequences.

摘要

从下一代测序平台的序列读取进行从头组装是检测生物标本中病毒存在和测序的常用策略。扩增伪影和同一标本中存在几种相关病毒会导致错误的嵌合序列的组装。我们现在报告,这种嵌合体也可能发生在错误连接的病毒和非病毒生物序列之间,这可能导致病毒的错误检测,突出了在生物信息学管道中执行嵌合体检查步骤的重要性。使用 Illumina NextSeq 和宏基因组测序,我们分析了来自 11 名免疫抑制患者的 80 例连续非黑色素瘤皮肤癌(NMSC),以及仅发生 1 例 NMSC 的患者的 11 例 NMSC。我们将高质量的读数与 HPV 数据库进行比对,在 9/91 个标本中发现了 HPV 序列。对来自这些样本的部分粗测序数据的先前生物信息学分析发现,在进行从头组装后,另外 3 个样本呈 HPV 阳性。对差异的原因进行了调查,发现主要是由含有病毒和非病毒序列的嵌合序列引起的。这些 3 个样本中存在非病毒序列。为了避免在进行测序时错误检测 HPV,我们因此开发了一种新的脚本来识别 HPV 嵌合序列。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aa6/7417191/693f5e4feca2/pone.0237455.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aa6/7417191/693f5e4feca2/pone.0237455.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3aa6/7417191/693f5e4feca2/pone.0237455.g001.jpg

相似文献

1
De novo sequence assembly requires bioinformatic checking of chimeric sequences.从头序列组装需要对嵌合序列进行生物信息学检查。
PLoS One. 2020 Aug 10;15(8):e0237455. doi: 10.1371/journal.pone.0237455. eCollection 2020.
2
Viruses in cancers among the immunosuppressed.免疫抑制人群中的癌症病毒。
Int J Cancer. 2017 Dec 15;141(12):2498-2504. doi: 10.1002/ijc.31017. Epub 2017 Sep 13.
3
De novo assembly of highly polymorphic metagenomic data using in situ generated reference sequences and a novel BLAST-based assembly pipeline.使用原位生成的参考序列和基于新型BLAST的组装流程对高度多态的宏基因组数据进行从头组装。
BMC Bioinformatics. 2017 Apr 26;18(1):223. doi: 10.1186/s12859-017-1630-z.
4
Sequencing detects human papillomavirus in some apparently HPV-negative invasive cervical cancers.测序在一些明显 HPV 阴性的浸润性宫颈癌中检测出人乳头瘤病毒。
J Gen Virol. 2020 Mar;101(3):265-270. doi: 10.1099/jgv.0.001374.
5
ZWA: Viral genome assembly and characterization hindrances from virus-host chimeric reads; a refining approach.ZWA:病毒-宿主嵌合读取对病毒基因组组装和特征分析的阻碍;一种改进方法。
PLoS Comput Biol. 2021 Aug 9;17(8):e1009304. doi: 10.1371/journal.pcbi.1009304. eCollection 2021 Aug.
6
Comparison of Illumina de novo assembled and Sanger sequenced viral genomes: A case study for RNA viruses recovered from the plant pathogenic fungus Sclerotinia sclerotiorum.Illumina 从头组装和桑格测序的病毒基因组比较:以从植物病原真菌核盘菌中分离出的 RNA 病毒为例的研究
Virus Res. 2016 Jul 2;219:51-57. doi: 10.1016/j.virusres.2015.11.001. Epub 2015 Nov 12.
7
Reliable Detection of Herpes Simplex Virus Sequence Variation by High-Throughput Resequencing.通过高通量重测序可靠检测单纯疱疹病毒序列变异
Viruses. 2017 Aug 16;9(8):226. doi: 10.3390/v9080226.
8
ContigExtender: a new approach to improving de novo sequence assembly for viral metagenomics data.ContigExtender:一种改进病毒宏基因组数据从头测序组装的新方法。
BMC Bioinformatics. 2021 Mar 12;22(1):119. doi: 10.1186/s12859-021-04038-2.
9
Correcting the Estimation of Viral Taxa Distributions in Next-Generation Sequencing Data after Applying Artificial Neural Networks.应用人工神经网络后校正下一代测序数据中病毒分类群分布的估计。
Genes (Basel). 2021 Oct 31;12(11):1755. doi: 10.3390/genes12111755.
10
Can we use it? On the utility of de novo and reference-based assembly of Nanopore data for plant plastome sequencing.我们可以使用它吗?从头组装和基于参考的纳米孔数据在植物质体测序中的应用。
PLoS One. 2020 Mar 24;15(3):e0226234. doi: 10.1371/journal.pone.0226234. eCollection 2020.

引用本文的文献

1
Choosing the most suitable NGS technology to combine with a standardized viral enrichment protocol for obtaining complete avian orthoreovirus genomes from metagenomic samples.选择最合适的二代测序(NGS)技术,与标准化病毒富集方案相结合,以便从宏基因组样本中获得完整的禽正呼肠孤病毒基因组。
Front Bioinform. 2025 Feb 4;5:1498921. doi: 10.3389/fbinf.2025.1498921. eCollection 2025.
2
Navigating the Landscape: A Comprehensive Review of Current Virus Databases.探索全景:当前病毒数据库的全面综述
Viruses. 2023 Aug 29;15(9):1834. doi: 10.3390/v15091834.
3
Single-worm long-read sequencing reveals genome diversity in free-living nematodes.

本文引用的文献

1
Sequencing detects human papillomavirus in some apparently HPV-negative invasive cervical cancers.测序在一些明显 HPV 阴性的浸润性宫颈癌中检测出人乳头瘤病毒。
J Gen Virol. 2020 Mar;101(3):265-270. doi: 10.1099/jgv.0.001374.
2
Author's reply to: Human papillomavirus type 197 is not associated with skin tumors.作者对《197型人乳头瘤病毒与皮肤肿瘤无关》的回复
Int J Cancer. 2019 Dec 1;145(11):3181. doi: 10.1002/ijc.32645. Epub 2019 Sep 10.
3
New approaches for metagenome assembly with short reads.基于短读长的宏基因组组装新方法
单虫长读测序揭示自由生活线虫的基因组多样性。
Nucleic Acids Res. 2023 Aug 25;51(15):8035-8047. doi: 10.1093/nar/gkad647.
4
Chimera: The spoiler in multiple displacement amplification.嵌合体:多重置换扩增中的干扰因素
Comput Struct Biotechnol J. 2023 Feb 23;21:1688-1696. doi: 10.1016/j.csbj.2023.02.034. eCollection 2023.
5
Novel Virus Identification through Metagenomics: A Systematic Review.通过宏基因组学进行新型病毒鉴定:一项系统综述。
Life (Basel). 2022 Dec 7;12(12):2048. doi: 10.3390/life12122048.
6
Target-enriched long-read sequencing (TELSeq) contextualizes antimicrobial resistance genes in metagenomes.靶向富集长读测序(TELSeq)使宏基因组中的抗生素耐药基因背景化。
Microbiome. 2022 Nov 2;10(1):185. doi: 10.1186/s40168-022-01368-y.
7
Ultrafast and Cost-Effective Pathogen Identification and Resistance Gene Detection in a Clinical Setting Using Nanopore Flongle Sequencing.在临床环境中使用纳米孔Flongle测序进行超快速且经济高效的病原体鉴定和耐药基因检测。
Front Microbiol. 2022 Mar 17;13:822402. doi: 10.3389/fmicb.2022.822402. eCollection 2022.
8
A simple guide to de novo transcriptome assembly and annotation.从头转录组组装与注释简明指南。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab563.
9
CStone: A de novo transcriptome assembler for short-read data that identifies non-chimeric contigs based on underlying graph structure.CStone:一种用于短读长数据的从头转录组装配程序,它基于基础图结构识别非嵌合的连续序列。
PLoS Comput Biol. 2021 Nov 23;17(11):e1009631. doi: 10.1371/journal.pcbi.1009631. eCollection 2021 Nov.
10
Human Papillomavirus Detection by Whole-Genome Next-Generation Sequencing: Importance of Validation and Quality Assurance Procedures.人乳头瘤病毒的全基因组下一代测序检测:验证和质量保证程序的重要性。
Viruses. 2021 Jul 8;13(7):1323. doi: 10.3390/v13071323.
Brief Bioinform. 2020 Mar 23;21(2):584-594. doi: 10.1093/bib/bbz020.
4
A Reference Viral Database (RVDB) To Enhance Bioinformatics Analysis of High-Throughput Sequencing for Novel Virus Detection.一个用于增强新型病毒检测高通量测序生物信息学分析的参考病毒数据库(RVDB)。
mSphere. 2018 Mar 14;3(2). doi: 10.1128/mSphereDirect.00069-18. eCollection 2018 Mar-Apr.
5
Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes.通过验证的视角看宏基因组组装:评估和提高宏基因组组装基因组质量的最新进展。
Brief Bioinform. 2019 Jul 19;20(4):1140-1150. doi: 10.1093/bib/bbx098.
6
Viruses in cancers among the immunosuppressed.免疫抑制人群中的癌症病毒。
Int J Cancer. 2017 Dec 15;141(12):2498-2504. doi: 10.1002/ijc.31017. Epub 2017 Sep 13.
7
Viruses in case series of tumors: Consistent presence in different cancers in the same subject.肿瘤病例系列中的病毒:同一受试者不同癌症中持续存在。
PLoS One. 2017 Mar 3;12(3):e0172308. doi: 10.1371/journal.pone.0172308. eCollection 2017.
8
VirusDetect: An automated pipeline for efficient virus discovery using deep sequencing of small RNAs.病毒检测:一种利用小RNA深度测序进行高效病毒发现的自动化流程。
Virology. 2017 Jan;500:130-138. doi: 10.1016/j.virol.2016.10.017. Epub 2016 Nov 5.
9
VIP: an integrated pipeline for metagenomics of virus identification and discovery.VIP:一种用于病毒鉴定和发现的宏基因组学综合流程。
Sci Rep. 2016 Mar 30;6:23774. doi: 10.1038/srep23774.
10
ViromeScan: a new tool for metagenomic viral community profiling.病毒组扫描:一种用于宏基因组病毒群落分析的新工具。
BMC Genomics. 2016 Mar 1;17:165. doi: 10.1186/s12864-016-2446-3.