NNF Center for Biosustainability, Technical University of Denmark, Kongens Lyngby, DK-2800, Denmark.
Department of Biology, University of Copenhagen, Copenhagen, DK-2200, Denmark.
Nat Commun. 2023 Mar 13;14(1):1358. doi: 10.1038/s41467-023-36689-5.
Cancer genomes are highly complex and heterogeneous. The standard short-read sequencing and analytical methods are unable to provide the complete and precise base-level structural variant landscape of cancer genomes. In this work, we apply high-resolution long accurate HiFi and long-range Hi-C sequencing to the melanoma COLO829 cancer line. Also, we develop an efficient graph-based approach that processes these data types for chromosome-scale haplotype-resolved reconstruction to characterise the cancer precise structural variant landscape. Our method produces high-quality phased scaffolds on the chromosome level on three healthy samples and the COLO829 cancer line in less than half a day even in the absence of trio information, outperforming existing state-of-the-art methods. In the COLO829 cancer cell line, here we show that our method identifies and characterises precise somatic structural variant calls in important repeat elements that were missed in short-read-based call sets. Our method also finds the precise chromosome-level structural variant (germline and somatic) landscape with 19,956 insertions, 14,846 deletions, 421 duplications, 52 inversions and 498 translocations at the base resolution. Our simple pstools approach should facilitate better personalised diagnosis and disease management, including predicting therapeutic responses.
癌症基因组高度复杂且异质。标准的短读测序和分析方法无法提供癌症基因组完整且精确的碱基水平结构变异景观。在这项工作中,我们应用高分辨率长读长 HiFi 和长程 Hi-C 测序技术对黑色素瘤 COLO829 癌细胞系进行研究。此外,我们开发了一种有效的基于图的方法,用于处理这些数据类型,以进行染色体级别的单倍型解析重建,从而描绘癌症精确的结构变异景观。我们的方法在不到半天的时间内(即使没有 trio 信息),在三个健康样本和 COLO829 癌细胞系上生成了高质量的染色体水平相位支架,优于现有的最先进方法。在 COLO829 癌细胞系中,我们展示了我们的方法可以识别和描述重要重复元件中的精确体细胞结构变异,而这些变异在基于短读的调用集中被忽略了。我们的方法还发现了精确的染色体水平结构变异(种系和体细胞)景观,在碱基分辨率下有 19956 个插入、14846 个缺失、421 个重复、52 个倒位和 498 个易位。我们的简单 pstools 方法应该有助于更好的个性化诊断和疾病管理,包括预测治疗反应。