• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepVISP:用于病毒位点整合预测和基序发现的深度学习。

DeepVISP: Deep Learning for Virus Site Integration Prediction and Motif Discovery.

机构信息

Center for Precision Health School of Biomedical Informatics The University of Texas Health Science Center at Houston (UTHealth) Houston TX 77030 USA.

MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences Houston TX 77030 USA.

出版信息

Adv Sci (Weinh). 2021 Mar 8;8(9):2004958. doi: 10.1002/advs.202004958. eCollection 2021 May.

DOI:10.1002/advs.202004958
PMID:33977077
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8097320/
Abstract

Approximately 15% of human cancers are estimated to be attributed to viruses. Virus sequences can be integrated into the host genome, leading to genomic instability and carcinogenesis. Here, a new deep convolutional neural network (CNN) model is developed with attention architecture, namely DeepVISP, for accurately predicting oncogenic virus integration sites (VISs) in the human genome. Using the curated benchmark integration data of three viruses, hepatitis B virus (HBV), human herpesvirus (HPV), and Epstein-Barr virus (EBV), DeepVISP achieves high accuracy and robust performance for all three viruses through automatically learning informative features and essential genomic positions only from the DNA sequences. In comparison, DeepVISP outperforms conventional machine learning methods by 8.43-34.33% measured by area under curve (AUC) value enhancement in three viruses. Moreover, DeepVISP can decode -regulatory factors that are potentially involved in virus integration and tumorigenesis, such as HOXB7, IKZF1, and LHX6. These findings are supported by multiple lines of evidence in literature. The clustering analysis of the informative motifs reveales that the representative k-mers in clusters could help guide virus recognition of the host genes. A user-friendly web server is developed for predicting putative oncogenic VISs in the human genome using DeepVISP.

摘要

据估计,大约 15%的人类癌症归因于病毒。病毒序列可以整合到宿主基因组中,导致基因组不稳定和致癌作用。在这里,开发了一种具有注意力架构的新深度卷积神经网络 (CNN) 模型,即 DeepVISP,用于准确预测人类基因组中的致癌病毒整合位点 (VIS)。使用三种病毒(乙型肝炎病毒 (HBV)、人类疱疹病毒 (HPV) 和爱泼斯坦-巴尔病毒 (EBV))的精心整理的基准整合数据,DeepVISP 通过仅从 DNA 序列自动学习信息丰富的特征和必要的基因组位置,为所有三种病毒实现了高精度和稳健的性能。相比之下,DeepVISP 通过 AUC 值增强测量在三种病毒中分别提高了 8.43%至 34.33%,优于传统机器学习方法。此外,DeepVISP 可以解码可能参与病毒整合和肿瘤发生的调节因子,如 HOXB7、IKZF1 和 LHX6。这些发现得到了文献中多条证据的支持。信息基序的聚类分析表明,簇中的代表性 k-mer 有助于指导宿主基因的病毒识别。开发了一个用户友好的网络服务器,用于使用 DeepVISP 预测人类基因组中的潜在致癌 VIS。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f16/8097320/bacb9d4682ec/ADVS-8-2004958-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f16/8097320/e307dfb84e47/ADVS-8-2004958-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f16/8097320/ebd5a66f178d/ADVS-8-2004958-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f16/8097320/b3f7ae8d8841/ADVS-8-2004958-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f16/8097320/15173dea7d12/ADVS-8-2004958-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f16/8097320/66c45c8218f4/ADVS-8-2004958-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f16/8097320/6d0dcf43b407/ADVS-8-2004958-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f16/8097320/bacb9d4682ec/ADVS-8-2004958-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f16/8097320/e307dfb84e47/ADVS-8-2004958-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f16/8097320/ebd5a66f178d/ADVS-8-2004958-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f16/8097320/b3f7ae8d8841/ADVS-8-2004958-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f16/8097320/15173dea7d12/ADVS-8-2004958-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f16/8097320/66c45c8218f4/ADVS-8-2004958-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f16/8097320/6d0dcf43b407/ADVS-8-2004958-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f16/8097320/bacb9d4682ec/ADVS-8-2004958-g004.jpg

相似文献

1
DeepVISP: Deep Learning for Virus Site Integration Prediction and Motif Discovery.DeepVISP:用于病毒位点整合预测和基序发现的深度学习。
Adv Sci (Weinh). 2021 Mar 8;8(9):2004958. doi: 10.1002/advs.202004958. eCollection 2021 May.
2
A virome-wide clonal integration analysis platform for discovering cancer viral etiology.用于发现癌症病毒病因的病毒组全克隆整合分析平台。
Genome Res. 2019 May;29(5):819-830. doi: 10.1101/gr.242529.118. Epub 2019 Mar 14.
3
The landscape of viral associations in human cancers.人类癌症中病毒相关性的全景。
Nat Genet. 2020 Mar;52(3):320-330. doi: 10.1038/s41588-019-0558-9. Epub 2020 Feb 5.
4
VIS Atlas: A Database of Virus Integration Sites in Human Genome from NGS Data to Explore Integration Patterns.VIS 图谱:基于 NGS 数据的人类基因组病毒整合位点数据库,用于探索整合模式。
Genomics Proteomics Bioinformatics. 2023 Apr;21(2):300-310. doi: 10.1016/j.gpb.2023.02.005. Epub 2023 Feb 16.
5
Human tumor-associated viruses and new insights into the molecular mechanisms of cancer.人类肿瘤相关病毒与癌症分子机制的新见解。
Oncogene. 2008 Dec;27 Suppl 2:S31-42. doi: 10.1038/onc.2009.351.
6
Genome-wide profiling of Epstein-Barr virus integration by targeted sequencing in Epstein-Barr virus associated malignancies.通过靶向测序对 EBV 相关恶性肿瘤中 EBV 整合的全基因组分析。
Theranostics. 2019 Jan 30;9(4):1115-1124. doi: 10.7150/thno.29622. eCollection 2019.
7
VISDB: a manually curated database of viral integration sites in the human genome.VISDB:一个人类基因组中病毒整合位点的人工 curated 数据库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D633-D641. doi: 10.1093/nar/gkz867.
8
DeepHBV: a deep learning model to predict hepatitis B virus (HBV) integration sites.DeepHBV:一种用于预测乙型肝炎病毒 (HBV) 整合位点的深度学习模型。
BMC Ecol Evol. 2021 Jul 7;21(1):138. doi: 10.1186/s12862-021-01869-8.
9
Non-Random Pattern of Integration for Epstein-Barr Virus with Preference for Gene-Poor Genomic Chromosomal Regions into the Genome of Burkitt Lymphoma Cell Lines.EB 病毒整合呈非随机模式,优先选择基因贫乏的基因组染色体区域整合到伯基特淋巴瘤细胞系的基因组中。
Viruses. 2022 Jan 4;14(1):86. doi: 10.3390/v14010086.
10
HPV, HBV, and HIV-1 Viral Integration Site Mapping: A Streamlined Workflow from NGS to Genomic Insights of Carcinogenesis.HPV、HBV 和 HIV-1 病毒整合位点作图:从 NGS 到致癌基因组见解的简化工作流程。
Viruses. 2024 Jun 18;16(6):975. doi: 10.3390/v16060975.

引用本文的文献

1
Identification and catalog of viral transcriptional regulators in human diseases.人类疾病中病毒转录调节因子的鉴定与编目。
iScience. 2025 Feb 21;28(3):112081. doi: 10.1016/j.isci.2025.112081. eCollection 2025 Mar 21.
2
Deep learning for detecting and elucidating human T-cell leukemia virus type 1 integration in the human genome.用于检测和阐明人类基因组中1型人类T细胞白血病病毒整合情况的深度学习。
Patterns (N Y). 2023 Feb 10;4(2):100674. doi: 10.1016/j.patter.2022.100674.
3
NetBCE: An Interpretable Deep Neural Network for Accurate Prediction of Linear B-cell Epitopes.

本文引用的文献

1
Deep4mC: systematic assessment and computational prediction for DNA N4-methylcytosine sites by deep learning.Deep4mC:通过深度学习对 DNA N4-甲基胞嘧啶位点进行系统评估和计算预测。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa099.
2
Decoding whole-genome mutational signatures in 37 human pan-cancers by denoising sparse autoencoder neural network.利用去噪稀疏自动编码器神经网络对 37 种人类泛癌进行全基因组突变特征解码。
Oncogene. 2020 Jul;39(27):5031-5041. doi: 10.1038/s41388-020-1343-z. Epub 2020 Jun 11.
3
6mA-Finder: a novel online tool for predicting DNA N6-methyladenine sites in genomes.
NetBCE:一种用于线性 B 细胞表位准确预测的可解释深度神经网络。
Genomics Proteomics Bioinformatics. 2022 Oct;20(5):1002-1012. doi: 10.1016/j.gpb.2022.11.009. Epub 2022 Dec 13.
4
Designing optimal convolutional neural network architecture using differential evolution algorithm.使用差分进化算法设计最优卷积神经网络架构。
Patterns (N Y). 2022 Aug 24;3(9):100567. doi: 10.1016/j.patter.2022.100567. eCollection 2022 Sep 9.
6mA-Finder:一种用于预测基因组中 DNA N6-甲基腺嘌呤位点的新型在线工具。
Bioinformatics. 2020 May 1;36(10):3257-3259. doi: 10.1093/bioinformatics/btaa113.
4
JASPAR 2020: update of the open-access database of transcription factor binding profiles.JASPAR 2020:转录因子结合谱开放获取数据库的更新。
Nucleic Acids Res. 2020 Jan 8;48(D1):D87-D92. doi: 10.1093/nar/gkz1001.
5
VISDB: a manually curated database of viral integration sites in the human genome.VISDB:一个人类基因组中病毒整合位点的人工 curated 数据库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D633-D641. doi: 10.1093/nar/gkz867.
6
MicroRNA-374a, -4680, and -133b suppress cell proliferation through the regulation of genes associated with human cleft palate in cultured human palate cells.miRNA-374a、-4680 和 -133b 通过调节与人腭裂相关的基因抑制培养的人腭细胞的增殖。
BMC Med Genomics. 2019 Jul 1;12(1):93. doi: 10.1186/s12920-019-0546-z.
7
Epigenetic inactivation of LHX6 mediated microcystin-LR induced hepatocarcinogenesis via the Wnt/β-catenin and P53 signaling pathways.组蛋白修饰导致的 LHX6 失活通过 Wnt/β-catenin 和 P53 信号通路介导微囊藻毒素-LR 诱导的肝癌发生。
Environ Pollut. 2019 Sep;252(Pt A):216-226. doi: 10.1016/j.envpol.2019.05.049. Epub 2019 May 15.
8
Genome-wide profiling of Epstein-Barr virus integration by targeted sequencing in Epstein-Barr virus associated malignancies.通过靶向测序对 EBV 相关恶性肿瘤中 EBV 整合的全基因组分析。
Theranostics. 2019 Jan 30;9(4):1115-1124. doi: 10.7150/thno.29622. eCollection 2019.
9
Integration of A Deep Learning Classifier with A Random Forest Approach for Predicting Malonylation Sites.深度学习分类器与随机森林方法相结合,用于预测丙二酰化位点。
Genomics Proteomics Bioinformatics. 2018 Dec;16(6):451-459. doi: 10.1016/j.gpb.2018.08.004. Epub 2019 Jan 11.
10
DeepHINT: understanding HIV-1 integration via deep learning with attention.DeepHINT:通过深度学习和注意力理解 HIV-1 整合。
Bioinformatics. 2019 May 15;35(10):1660-1667. doi: 10.1093/bioinformatics/bty842.