• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在来自多名患者和多种癌症的数据中识别已知和新型复发性病毒序列。

Identification of Known and Novel Recurrent Viral Sequences in Data from Multiple Patients and Multiple Cancers.

作者信息

Friis-Nielsen Jens, Kjartansdóttir Kristín Rós, Mollerup Sarah, Asplund Maria, Mourier Tobias, Jensen Randi Holm, Hansen Thomas Arn, Rey-Iglesia Alba, Richter Stine Raith, Nielsen Ida Broman, Alquezar-Planas David E, Olsen Pernille V S, Vinner Lasse, Fridholm Helena, Nielsen Lars Peter, Willerslev Eske, Sicheritz-Pontén Thomas, Lund Ole, Hansen Anders Johannes, Izarzugaza Jose M G, Brunak Søren

机构信息

Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark.

Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.

出版信息

Viruses. 2016 Feb 19;8(2):53. doi: 10.3390/v8020053.

DOI:10.3390/v8020053
PMID:26907326
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4776208/
Abstract

Virus discovery from high throughput sequencing data often follows a bottom-up approach where taxonomic annotation takes place prior to association to disease. Albeit effective in some cases, the approach fails to detect novel pathogens and remote variants not present in reference databases. We have developed a species independent pipeline that utilises sequence clustering for the identification of nucleotide sequences that co-occur across multiple sequencing data instances. We applied the workflow to 686 sequencing libraries from 252 cancer samples of different cancer and tissue types, 32 non-template controls, and 24 test samples. Recurrent sequences were statistically associated to biological, methodological or technical features with the aim to identify novel pathogens or plausible contaminants that may associate to a particular kit or method. We provide examples of identified inhabitants of the healthy tissue flora as well as experimental contaminants. Unmapped sequences that co-occur with high statistical significance potentially represent the unknown sequence space where novel pathogens can be identified.

摘要

从高通量测序数据中发现病毒通常采用自下而上的方法,即在与疾病关联之前先进行分类注释。尽管这种方法在某些情况下有效,但它无法检测到参考数据库中不存在的新型病原体和远距离变体。我们开发了一种不依赖物种的流程,该流程利用序列聚类来识别在多个测序数据实例中共同出现的核苷酸序列。我们将该工作流程应用于来自252个不同癌症和组织类型的癌症样本、32个无模板对照和24个测试样本的686个测序文库。将反复出现的序列与生物学、方法学或技术特征进行统计学关联,目的是识别可能与特定试剂盒或方法相关的新型病原体或可能的污染物。我们提供了健康组织菌群中已识别出的微生物以及实验污染物的示例。具有高统计学显著性共同出现的未映射序列可能代表了未知的序列空间,在这个空间中可以识别新型病原体。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98c5/4776208/1cb7d3de8387/viruses-08-00053-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98c5/4776208/e341b9f35d47/viruses-08-00053-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98c5/4776208/e0a34711718f/viruses-08-00053-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98c5/4776208/5e3d02b56075/viruses-08-00053-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98c5/4776208/1cb7d3de8387/viruses-08-00053-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98c5/4776208/e341b9f35d47/viruses-08-00053-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98c5/4776208/e0a34711718f/viruses-08-00053-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98c5/4776208/5e3d02b56075/viruses-08-00053-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98c5/4776208/1cb7d3de8387/viruses-08-00053-g004.jpg

相似文献

1
Identification of Known and Novel Recurrent Viral Sequences in Data from Multiple Patients and Multiple Cancers.在来自多名患者和多种癌症的数据中识别已知和新型复发性病毒序列。
Viruses. 2016 Feb 19;8(2):53. doi: 10.3390/v8020053.
2
Cataloguing the taxonomic origins of sequences from a heterogeneous sample using phylogenomics: applications in adventitious agent detection.利用系统发育基因组学对异质样本中序列的分类学起源进行编目:在检测外来因子中的应用。
PDA J Pharm Sci Technol. 2014 Nov-Dec;68(6):602-18. doi: 10.5731/pdajpst.2014.01023.
3
Detecting viral sequences in NGS data.检测 NGS 数据中的病毒序列。
Curr Opin Virol. 2019 Dec;39:41-48. doi: 10.1016/j.coviro.2019.07.010. Epub 2019 Aug 26.
4
Sensitive detection of viral transcripts in human tumor transcriptomes.人类肿瘤转录组中病毒转录本的灵敏检测。
PLoS Comput Biol. 2013;9(10):e1003228. doi: 10.1371/journal.pcbi.1003228. Epub 2013 Oct 3.
5
Viral sequences in human cancer.人类癌症中的病毒序列。
Virology. 2018 Jan 1;513:208-216. doi: 10.1016/j.virol.2017.10.017. Epub 2017 Nov 5.
6
VirusDetect: An automated pipeline for efficient virus discovery using deep sequencing of small RNAs.病毒检测:一种利用小RNA深度测序进行高效病毒发现的自动化流程。
Virology. 2017 Jan;500:130-138. doi: 10.1016/j.virol.2016.10.017. Epub 2016 Nov 5.
7
A practical approach to a viral detection pipeline using existing viral and non-viral sequence resources.一种利用现有病毒和非病毒序列资源构建病毒检测流程的实用方法。
PDA J Pharm Sci Technol. 2014 Nov-Dec;68(6):595-601. doi: 10.5731/pdajpst.2014.01022.
8
Advances in the application of high-throughput sequencing in invertebrate virology.高通量测序在无脊椎动物病毒学中的应用进展
J Invertebr Pathol. 2017 Jul;147:145-156. doi: 10.1016/j.jip.2017.02.006. Epub 2017 Feb 27.
9
Adventitious Virus Detection in Cells by High-Throughput Sequencing of Newly Synthesized RNAs: Unambiguous Differentiation of Cell Infection from Carryover of Viral Nucleic Acids.高通量测序新合成 RNA 检测细胞中的外源病毒:明确区分细胞感染与病毒核酸残留。
mSphere. 2019 Jun 5;4(3):e00298-19. doi: 10.1128/mSphere.00298-19.
10
Current challenges to virus discovery by meta-transcriptomics.元转录组学发现病毒面临的挑战。
Curr Opin Virol. 2021 Dec;51:48-55. doi: 10.1016/j.coviro.2021.09.007. Epub 2021 Sep 27.

引用本文的文献

1
Metagenomic Identification of Viral Sequences in Laboratory Reagents.实验室试剂中病毒序列的宏基因组鉴定。
Viruses. 2021 Oct 21;13(11):2122. doi: 10.3390/v13112122.
2
Potential prognostic impact of EBV RNA-seq reads in gastric cancer: a reanalysis of The Cancer Genome Atlas cohort.EBV RNA-seq 读码在胃癌中的潜在预后影响:对癌症基因组图谱队列的重新分析。
FEBS Open Bio. 2020 Mar;10(3):455-467. doi: 10.1002/2211-5463.12803. Epub 2020 Feb 16.
3
DAMIAN: an open source bioinformatics tool for fast, systematic and cohort based analysis of microorganisms in diagnostic samples.

本文引用的文献

1
What's in your next-generation sequence data? An exploration of unmapped DNA and RNA sequence reads from the bovine reference individual.你的下一代测序数据中有什么?对来自牛参考个体的未映射DNA和RNA序列读数的探索。
BMC Genomics. 2015 Dec 29;16:1114. doi: 10.1186/s12864-015-2313-7.
2
New Type of Papillomavirus and Novel Circular Single Stranded DNA Virus Discovered in Urban Rattus norvegicus Using Circular DNA Enrichment and Metagenomics.利用环状DNA富集和宏基因组学技术在城市褐家鼠中发现新型乳头瘤病毒和新型环状单链DNA病毒
PLoS One. 2015 Nov 11;10(11):e0141952. doi: 10.1371/journal.pone.0141952. eCollection 2015.
3
DAMIAN:一个用于快速、系统和基于队列分析诊断样本中微生物的开源生物信息学工具。
Sci Rep. 2019 Nov 14;9(1):16841. doi: 10.1038/s41598-019-52881-4.
4
A virome-wide clonal integration analysis platform for discovering cancer viral etiology.用于发现癌症病毒病因的病毒组全克隆整合分析平台。
Genome Res. 2019 May;29(5):819-830. doi: 10.1101/gr.242529.118. Epub 2019 Mar 14.
5
Tumour virology in the era of high-throughput genomics.高通量基因组学时代的肿瘤病毒学。
Philos Trans R Soc Lond B Biol Sci. 2017 Oct 19;372(1732). doi: 10.1098/rstb.2016.0265.
6
The Human Virome: Implications for Clinical Practice in Transplantation Medicine.人类病毒组:对移植医学临床实践的影响。
J Clin Microbiol. 2017 Oct;55(10):2884-2893. doi: 10.1128/JCM.00489-17. Epub 2017 Jul 19.
7
High diversity of picornaviruses in rats from different continents revealed by deep sequencing.深度测序揭示不同大陆大鼠中微小核糖核酸病毒的高度多样性。
Emerg Microbes Infect. 2016 Aug 17;5(8):e90. doi: 10.1038/emi.2016.90.
Investigation of Human Cancers for Retrovirus by Low-Stringency Target Enrichment and High-Throughput Sequencing.
通过低严谨度靶向富集和高通量测序对人类癌症中的逆转录病毒进行研究。
Sci Rep. 2015 Aug 19;5:13201. doi: 10.1038/srep13201.
4
Target-dependent enrichment of virions determines the reduction of high-throughput sequencing in virus discovery.病毒粒子的靶标依赖性富集决定了病毒发现中高通量测序的减少。
PLoS One. 2015 Apr 8;10(4):e0122636. doi: 10.1371/journal.pone.0122636. eCollection 2015.
5
Traces of ATCV-1 associated with laboratory component contamination.与实验室成分污染相关的ATCV-1痕迹。
Proc Natl Acad Sci U S A. 2015 Mar 3;112(9):E925-6. doi: 10.1073/pnas.1423756112. Epub 2015 Feb 5.
6
CDD: NCBI's conserved domain database.CDD:美国国家生物技术信息中心的保守结构域数据库。
Nucleic Acids Res. 2015 Jan;43(Database issue):D222-6. doi: 10.1093/nar/gku1221. Epub 2014 Nov 20.
7
A highly abundant bacteriophage discovered in the unknown sequences of human faecal metagenomes.在人类粪便宏基因组的未知序列中发现的一种高度丰富的噬菌体。
Nat Commun. 2014 Jul 24;5:4498. doi: 10.1038/ncomms5498.
8
Comprehensive human virus screening using high-throughput sequencing with a user-friendly representation of bioinformatics analysis: a pilot study.使用高通量测序进行全面的人类病毒筛查并以用户友好的方式呈现生物信息学分析:一项初步研究。
J Clin Microbiol. 2014 Sep;52(9):3351-61. doi: 10.1128/JCM.01389-14. Epub 2014 Jul 9.
9
False-positive results in metagenomic virus discovery: a strong case for follow-up diagnosis.宏基因组病毒发现中的假阳性结果:后续诊断的有力依据。
Transbound Emerg Dis. 2014 Aug;61(4):293-9. doi: 10.1111/tbed.12251. Epub 2014 Jun 10.
10
A cloud-compatible bioinformatics pipeline for ultrarapid pathogen identification from next-generation sequencing of clinical samples.一种与云兼容的生物信息学流程,用于从临床样本的下一代测序中进行超快速病原体鉴定。
Genome Res. 2014 Jul;24(7):1180-92. doi: 10.1101/gr.171934.113. Epub 2014 Jun 4.