Department of Microbiology and Molecular Genetics, University of Vermont, Burlington, VT 05405, USA.
Institute for the Advanced Study of Human Biology, Kyoto University, Kyoto 606-8501, Japan.
Viruses. 2021 Feb 4;13(2):245. doi: 10.3390/v13020245.
Hepatocellular carcinoma (HCC) is a leading cause of cancer-related mortality. Almost half of HCC cases are associated with hepatitis B virus (HBV) infections, which often lead to HBV sequence integrations in the human genome. Accurate identification of HBV integration sites at a single nucleotide resolution is critical for developing a better understanding of the cancer genome landscape and of the disease itself. Here, we performed further analyses and characterization of HBV integrations identified by our recently reported VIcaller platform in recurrent or known HCC genes (such as , , and ) as well as non-recurrent cancer-related genes (such as , , and ). Our pathway enrichment analysis revealed multiple pathways involving the alcohol dehydrogenase 4 gene, such as the metabolism pathways of retinol, tyrosine, and fatty acid. Further analysis of the HBV integration sites revealed distinct patterns involving the integration upper breakpoints, integrated genome lengths, and integration allele fractions between tumor and normal tissues. Our analysis also implies that the VIcaller method has diagnostic potential through discovering novel clonal integrations in cancer-related genes. In conclusion, although VIcaller is a hypothesis free virome-wide approach, it can still be applied to accurately identify genome-wide integration events of a specific candidate virus and their integration allele fractions.
肝细胞癌(HCC)是癌症相关死亡的主要原因。几乎一半的 HCC 病例与乙型肝炎病毒(HBV)感染有关,HBV 感染常导致 HBV 序列整合到人类基因组中。准确识别单核苷酸分辨率的 HBV 整合位点对于深入了解癌症基因组景观和疾病本身至关重要。在这里,我们对我们最近报道的 VIcaller 平台在复发或已知 HCC 基因(如 、 和 )以及非复发癌症相关基因(如 、 和 )中识别的 HBV 整合进行了进一步的分析和表征。我们的通路富集分析显示,涉及醇脱氢酶 4 基因的多个通路,如视黄醇、酪氨酸和脂肪酸的代谢途径。对 HBV 整合位点的进一步分析揭示了肿瘤组织和正常组织之间整合上断点、整合基因组长度和整合等位基因分数之间的不同模式。我们的分析还表明,通过在癌症相关基因中发现新的克隆整合,VIcaller 方法具有诊断潜力。总之,尽管 VIcaller 是一种无假设的病毒组全基因组方法,但它仍然可以应用于准确识别特定候选病毒的全基因组整合事件及其整合等位基因分数。