• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

viGEN:用于检测和定量人类肿瘤中病毒RNA的开源流程

viGEN: An Open Source Pipeline for the Detection and Quantification of Viral RNA in Human Tumors.

作者信息

Bhuvaneshwar Krithika, Song Lei, Madhavan Subha, Gusev Yuriy

机构信息

Innovation Center for Biomedical Informatics, Georgetown University, Washington, DC, United States.

出版信息

Front Microbiol. 2018 Jun 5;9:1172. doi: 10.3389/fmicb.2018.01172. eCollection 2018.

DOI:10.3389/fmicb.2018.01172
PMID:29922260
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5996193/
Abstract

An estimated 17% of cancers worldwide are associated with infectious causes. The extent and biological significance of viral presence/infection in actual tumor samples is generally unknown but could be measured using human transcriptome (RNA-seq) data from tumor samples. We present an open source bioinformatics pipeline viGEN, which allows for not only the detection and quantification of viral RNA, but also variants in the viral transcripts. The pipeline includes 4 major modules: The first module aligns and filter out human RNA sequences; the second module maps and count (remaining un-aligned) reads against reference genomes of all known and sequenced human viruses; the third module quantifies read counts at the individual viral-gene level thus allowing for downstream differential expression analysis of viral genes between case and controls groups. The fourth module calls variants in these viruses. To the best of our knowledge, there are no publicly available pipelines or packages that would provide this type of complete analysis in one open source package. In this paper, we applied the viGEN pipeline to two case studies. We first demonstrate the working of our pipeline on a large public dataset, the TCGA cervical cancer cohort. In the second case study, we performed an in-depth analysis on a small focused study of TCGA liver cancer patients. In the latter cohort, we performed viral-gene quantification, viral-variant extraction and survival analysis. This allowed us to find differentially expressed viral-transcripts and viral-variants between the groups of patients, and connect them to clinical outcome. From our analyses, we show that we were able to successfully detect the human papilloma virus among the TCGA cervical cancer patients. We compared the viGEN pipeline with two metagenomics tools and demonstrate similar sensitivity/specificity. We were also able to quantify viral-transcripts and extract viral-variants using the liver cancer dataset. The results presented corresponded with published literature in terms of rate of detection, and impact of several known variants of HBV genome. This pipeline is generalizable, and can be used to provide novel biological insights into microbial infections in complex diseases and tumorigeneses. Our viral pipeline could be used in conjunction with additional type of immuno-oncology analysis based on RNA-seq data of host RNA for cancer immunology applications. The source code, with example data and tutorial is available at: https://github.com/ICBI/viGEN/.

摘要

据估计,全球17%的癌症与感染因素有关。实际肿瘤样本中病毒存在/感染的程度和生物学意义通常未知,但可以使用肿瘤样本的人类转录组(RNA测序)数据进行测量。我们提出了一个开源生物信息学管道viGEN,它不仅可以检测和量化病毒RNA,还可以检测病毒转录本中的变体。该管道包括4个主要模块:第一个模块比对并过滤掉人类RNA序列;第二个模块将(剩余未比对的) reads映射到所有已知和已测序人类病毒的参考基因组上并进行计数;第三个模块在个体病毒基因水平上量化reads计数,从而允许对病例组和对照组之间的病毒基因进行下游差异表达分析。第四个模块调用这些病毒中的变体。据我们所知,没有公开可用的管道或软件包能在一个开源软件包中提供这种类型的完整分析。在本文中,我们将viGEN管道应用于两个案例研究。我们首先在一个大型公共数据集——TCGA宫颈癌队列上展示了我们管道的工作原理。在第二个案例研究中,我们对TCGA肝癌患者的一个小型重点研究进行了深入分析。在后一个队列中,我们进行了病毒基因定量、病毒变体提取和生存分析。这使我们能够找到患者组之间差异表达的病毒转录本和病毒变体,并将它们与临床结果联系起来。从我们的分析中可以看出,我们能够在TCGA宫颈癌患者中成功检测到人乳头瘤病毒。我们将viGEN管道与两种宏基因组学工具进行了比较,并展示了相似的敏感性/特异性。我们还能够使用肝癌数据集量化病毒转录本并提取病毒变体。呈现的结果在检测率以及乙肝病毒基因组几个已知变体的影响方面与已发表的文献一致。这个管道具有通用性,可用于为复杂疾病和肿瘤发生中的微生物感染提供新的生物学见解。我们的病毒管道可与基于宿主RNA的RNA测序数据进行的其他类型免疫肿瘤学分析结合使用,用于癌症免疫学应用。源代码以及示例数据和教程可在以下网址获取:https://github.com/ICBI/viGEN/ 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b00/5996193/163013c0c817/fmicb-09-01172-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b00/5996193/3be82948c64f/fmicb-09-01172-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b00/5996193/163013c0c817/fmicb-09-01172-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b00/5996193/3be82948c64f/fmicb-09-01172-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b00/5996193/163013c0c817/fmicb-09-01172-g0002.jpg

相似文献

1
viGEN: An Open Source Pipeline for the Detection and Quantification of Viral RNA in Human Tumors.viGEN:用于检测和定量人类肿瘤中病毒RNA的开源流程
Front Microbiol. 2018 Jun 5;9:1172. doi: 10.3389/fmicb.2018.01172. eCollection 2018.
2
Read-Split-Run: an improved bioinformatics pipeline for identification of genome-wide non-canonical spliced regions using RNA-Seq data.读取-分割-运行:一种利用RNA测序数据识别全基因组非经典剪接区域的改进型生物信息学流程。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):503. doi: 10.1186/s12864-016-2896-7.
3
Virus detection in high-throughput sequencing data without a reference genome of the host.在没有宿主参考基因组的高通量测序数据中进行病毒检测。
Infect Genet Evol. 2018 Dec;66:180-187. doi: 10.1016/j.meegid.2018.09.026. Epub 2018 Oct 3.
4
PhytoPipe: a phytosanitary pipeline for plant pathogen detection and diagnosis using RNA-seq data.PhytoPipe:一种使用 RNA-seq 数据进行植物病原体检测和诊断的植物卫生管道。
BMC Bioinformatics. 2023 Dec 13;24(1):470. doi: 10.1186/s12859-023-05589-2.
5
An open RNA-Seq data analysis pipeline tutorial with an example of reprocessing data from a recent Zika virus study.一个开放的RNA测序数据分析流程教程,并以最近寨卡病毒研究中的数据重新处理为例。
F1000Res. 2016 Jul 5;5:1574. doi: 10.12688/f1000research.9110.1. eCollection 2016.
6
Virus expression detection reveals RNA-sequencing contamination in TCGA.病毒表达检测揭示 TCGA 中的 RNA 测序污染。
BMC Genomics. 2020 Jan 28;21(1):79. doi: 10.1186/s12864-020-6483-6.
7
Alternative preprocessing of RNA-Sequencing data in The Cancer Genome Atlas leads to improved analysis results.RNA 测序数据在癌症基因组图谱中的替代预处理方法可改善分析结果。
Bioinformatics. 2015 Nov 15;31(22):3666-72. doi: 10.1093/bioinformatics/btv377. Epub 2015 Jul 24.
8
The Impact of RNA-seq Alignment Pipeline on Detection of Differentially Expressed Genes.RNA测序比对流程对差异表达基因检测的影响
IEEE Glob Conf Signal Inf Process. 2014 Dec;2012:1376-1379. doi: 10.1109/GlobalSIP.2014.7032351. Epub 2015 Feb 9.
9
A comprehensive next generation sequencing-based virome assessment in brain tissue suggests no major virus - tumor association.对脑组织进行全面的下一代测序病毒组评估表明,没有主要的病毒-肿瘤关联。
Acta Neuropathol Commun. 2016 Jul 11;4(1):71. doi: 10.1186/s40478-016-0338-z.
10
SeqAcademy: an educational pipeline for RNA-Seq and ChIP-Seq analysis.SeqAcademy:一个用于RNA测序和染色质免疫沉淀测序分析的教育流程。
F1000Res. 2018 May 22;7. doi: 10.12688/f1000research.14880.4. eCollection 2018.

引用本文的文献

1
Viral Transcript and Tumor Immune Microenvironment-Based Transcriptomic Profiling of HPV-Associated Head and Neck Squamous Cell Carcinoma Identifies Subtypes Associated with Prognosis.基于病毒转录本和肿瘤免疫微环境的人乳头瘤病毒相关头颈部鳞状细胞癌转录组分析确定与预后相关的亚型
Viruses. 2024 Dec 24;17(1):4. doi: 10.3390/v17010004.
2
Integrative genomic analysis of the lung tissue microenvironment in SARS-CoV-2 and NL63 patients.新冠病毒和NL63病毒感染患者肺组织微环境的综合基因组分析
Heliyon. 2024 Jun 10;10(12):e32772. doi: 10.1016/j.heliyon.2024.e32772. eCollection 2024 Jun 30.
3
INHBA is Enriched in HPV-negative Oropharyngeal Squamous Cell Carcinoma and Promotes Cancer Progression.

本文引用的文献

1
Comprehensive and Integrative Genomic Characterization of Hepatocellular Carcinoma.肝细胞癌的综合与整合基因组特征分析
Cell. 2017 Jun 15;169(7):1327-1341.e23. doi: 10.1016/j.cell.2017.05.046.
2
Centrifuge: rapid and sensitive classification of metagenomic sequences.离心机:宏基因组序列的快速灵敏分类
Genome Res. 2016 Dec;26(12):1721-1729. doi: 10.1101/gr.210641.116. Epub 2016 Oct 17.
3
Using Small RNA Deep Sequencing Data to Detect Human Viruses.利用小RNA深度测序数据检测人类病毒。
INHBA 在 HPV 阴性口咽鳞状细胞癌中富集,并促进癌症进展。
Cancer Res Commun. 2024 Feb 28;4(2):571-587. doi: 10.1158/2767-9764.CRC-23-0258.
4
IL-8 correlates with nonresponse to neoadjuvant nivolumab in HPV positive HNSCC via a potential extracellular vesicle miR-146a mediated mechanism.IL-8 通过一种潜在的细胞外囊泡 miR-146a 介导的机制与 HPV 阳性头颈部鳞状细胞癌对新辅助纳武利尤单抗的无反应相关。
Mol Carcinog. 2023 Sep;62(9):1428-1443. doi: 10.1002/mc.23587. Epub 2023 Jul 4.
5
Enhanced Viral Metagenomics with Lazypipe 2.利用 Lazypipe 2 实现增强型病毒宏基因组学
Viruses. 2023 Feb 4;15(2):431. doi: 10.3390/v15020431.
6
The HLA ligandome of oropharyngeal squamous cell carcinomas reveals shared tumour-exclusive peptides for semi-personalised vaccination.口咽鳞状细胞癌的 HLA 配体组揭示了用于半个性化疫苗接种的共享肿瘤特异性肽。
Br J Cancer. 2023 May;128(9):1777-1787. doi: 10.1038/s41416-023-02197-y. Epub 2023 Feb 23.
7
Viral Causality of Human Cancer and Potential Roles of Human Endogenous Retroviruses in the Multi-Omics Era: An Evolutionary Epidemiology Review.人类癌症的病毒因果关系及人类内源性逆转录病毒在多组学时代的潜在作用:一项进化流行病学综述
Front Oncol. 2021 Oct 29;11:687631. doi: 10.3389/fonc.2021.687631. eCollection 2021.
8
Kmerator Suite: design of specific -mer signatures and automatic metadata discovery in large RNA-seq datasets.Kmerator套件:在大型RNA测序数据集中设计特定的k-mer特征并自动发现元数据
NAR Genom Bioinform. 2021 Jun 23;3(3):lqab058. doi: 10.1093/nargab/lqab058. eCollection 2021 Sep.
9
Demonstrating the utility of flexible sequence queries against indexed short reads with FlexTyper.使用 FlexTyper 对索引短读取进行灵活序列查询的实用性展示。
PLoS Comput Biol. 2021 Mar 22;17(3):e1008815. doi: 10.1371/journal.pcbi.1008815. eCollection 2021 Mar.
10
Correlation of HPV16 Gene Status and Gene Expression With Antibody Seropositivity and TIL Status in OPSCC.口咽鳞状细胞癌中HPV16基因状态及基因表达与抗体血清阳性及肿瘤浸润淋巴细胞状态的相关性
Front Oncol. 2021 Jan 26;10:591063. doi: 10.3389/fonc.2020.591063. eCollection 2020.
Biomed Res Int. 2016;2016:2596782. doi: 10.1155/2016/2596782. Epub 2016 Mar 15.
4
Translating RNA sequencing into clinical diagnostics: opportunities and challenges.将RNA测序转化为临床诊断:机遇与挑战。
Nat Rev Genet. 2016 May;17(5):257-71. doi: 10.1038/nrg.2016.10. Epub 2016 Mar 21.
5
GenSeed-HMM: A Tool for Progressive Assembly Using Profile HMMs as Seeds and its Application in Alpavirinae Viral Discovery from Metagenomic Data.GenSeed-HMM:一种使用隐马尔可夫模型轮廓作为种子进行渐进式组装的工具及其在从宏基因组数据中发现阿尔帕病毒科病毒中的应用
Front Microbiol. 2016 Mar 4;7:269. doi: 10.3389/fmicb.2016.00269. eCollection 2016.
6
A case study for cloud based high throughput analysis of NGS data using the globus genomics system.一个使用Globus基因组学系统对NGS数据进行基于云的高通量分析的案例研究。
Comput Struct Biotechnol J. 2014 Nov 7;13:64-74. doi: 10.1016/j.csbj.2014.11.001. eCollection 2015.
7
Infectious Entry Pathway Mediated by the Human Endogenous Retrovirus K Envelope Protein.人类内源性逆转录病毒K包膜蛋白介导的感染性进入途径。
J Virol. 2016 Jan 20;90(7):3640-9. doi: 10.1128/JVI.03136-15.
8
Cancer-fighting viruses win approval.抗癌病毒获批。
Nature. 2015 Oct 29;526(7575):622-3. doi: 10.1038/526622a.
9
Next-generation sequencing in clinical virology: Discovery of new viruses.临床病毒学中的下一代测序:新病毒的发现
World J Virol. 2015 Aug 12;4(3):265-76. doi: 10.5501/wjv.v4.i3.265.
10
Comprehensive DNA methylation analysis of hepatitis B virus genome in infected liver tissues.感染肝组织中乙型肝炎病毒基因组的综合DNA甲基化分析
Sci Rep. 2015 May 22;5:10478. doi: 10.1038/srep10478.