Department of Obstetrics and Gynecology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, Hubei, China.
Cancer Biology Research Center (Key Laboratory of the Ministry of Education), Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, Hubei, China.
BMC Genomics. 2024 Feb 20;25(1):198. doi: 10.1186/s12864-024-10101-y.
Cervical cancer (CC) causes more than 311,000 deaths annually worldwide. The integration of human papillomavirus (HPV) is a crucial genetic event that contributes to cervical carcinogenesis. Despite HPV DNA integration is known to disrupt the genomic architecture of both the host and viral genomes in CC, the complexity of this process remains largely unexplored.
In this study, we conducted whole-genome sequencing (WGS) at 55-65X coverage utilizing the PacBio long-read sequencing platform in SiHa and HeLa cells, followed by comprehensive analyses of the sequence data to elucidate the complexity of HPV integration. Firstly, our results demonstrated that PacBio long-read sequencing effectively identifies HPV integration breakpoints with comparable accuracy to targeted-capture Next-generation sequencing (NGS) methods. Secondly, we constructed detailed models of complex integrated genome structures that included both the HPV genome and nearby regions of the human genome by utilizing PacBio long-read WGS. Thirdly, our sequencing results revealed the occurrence of a wide variety of genome-wide structural variations (SVs) in SiHa and HeLa cells. Additionally, our analysis further revealed a potential correlation between changes in gene expression levels and SVs on chromosome 13 in the genome of SiHa cells.
Using PacBio long-read sequencing, we have successfully constructed complex models illustrating HPV integrated genome structures in SiHa and HeLa cells. This accomplishment serves as a compelling demonstration of the valuable capabilities of long-read sequencing in detecting and characterizing HPV genomic integration structures within human cells. Furthermore, these findings offer critical insights into the complex process of HPV16 and HPV18 integration and their potential contribution to the development of cervical cancer.
宫颈癌(CC)每年在全球导致超过 311,000 人死亡。人乳头瘤病毒(HPV)的整合是导致宫颈癌发生的关键遗传事件。尽管已经知道 HPV DNA 的整合会破坏 CC 中宿主和病毒基因组的基因组结构,但这个过程的复杂性在很大程度上仍未得到探索。
在这项研究中,我们利用 PacBio 长读测序平台在 SiHa 和 HeLa 细胞中进行了 55-65X 覆盖度的全基因组测序(WGS),随后对序列数据进行了全面分析,以阐明 HPV 整合的复杂性。首先,我们的结果表明 PacBio 长读测序可以有效地识别 HPV 整合断点,其准确性可与靶向捕获下一代测序(NGS)方法相媲美。其次,我们通过利用 PacBio 长读 WGS 构建了包括 HPV 基因组和人类基因组附近区域在内的复杂整合基因组结构的详细模型。第三,我们的测序结果揭示了 SiHa 和 HeLa 细胞中广泛存在的全基因组结构变异(SVs)。此外,我们的分析进一步揭示了 SiHa 细胞基因组中 13 号染色体上基因表达水平变化与 SVs 之间可能存在的相关性。
使用 PacBio 长读测序,我们成功构建了复杂的模型,说明了 SiHa 和 HeLa 细胞中 HPV 整合基因组结构。这一成就有力地证明了长读测序在检测和表征人类细胞中 HPV 基因组整合结构方面的有价值的能力。此外,这些发现为 HPV16 和 HPV18 整合的复杂过程及其对宫颈癌发展的潜在贡献提供了重要的见解。