Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK.
National Institute for Health Research Oxford Biomedical Research Centre, Oxford, UK.
Sci Rep. 2021 Mar 19;11(1):6408. doi: 10.1038/s41598-021-85354-8.
Recent advances in throughput and accuracy mean that the Oxford Nanopore Technologies PromethION platform is now a viable solution for genome sequencing. Much of the validation of bioinformatic tools for this long-read data has focussed on calling germline variants (including structural variants). Somatic variants are outnumbered many-fold by germline variants, and their detection is further complicated by the effects of tumour purity and subclonality. Here, we evaluate the extent to which Nanopore sequencing enables detection and analysis of somatic variation. We do this by sequencing tumour and germline genomes for a patient with diffuse B-cell lymphoma and comparing the results with 150 bp short-read sequencing of the same samples. Calling germline single nucleotide variants (SNVs) from specific chromosomes of the long-read data achieved good specificity and sensitivity. However, results of somatic SNV calling highlight the need for the development of specialised joint calling algorithms. We find that the comparative genome-wide performance of different tools varies significantly between structural variant types, and suggest that long reads are especially advantageous for calling large somatic deletions and duplications. Finally, we highlight the utility of long reads for phasing clinically relevant variants, confirming that a somatic 1.6 Mb deletion and a p.(Arg249Met) mutation involving TP53 are oriented in trans.