Hammond Nathan, Liao Linda, Tong Pun Wai, Ng Zena, Nguyen Thuy-Mi P, Ho Chandler, Yang Yao, Scott Stuart A
Clinical Genomics Laboratory, Stanford Medicine, Palo Alto, California 94304, USA.
Department of Pathology, Stanford University, Stanford, California 94305, USA.
Genome Res. 2025 Jun 2;35(6):1391-1399. doi: 10.1101/gr.278836.123.
Long-read sequencing has the capacity to interrogate difficult genomic regions and phase variants; however, short-read sequencing is more commonly implemented for clinical testing. Given the advances in long-read HiFi sequencing chemistry and variant calling, we analytically validated this technology for small variant detection (single nucleotide variants, insertions/deletions; SNVs/indels; <50 bp). HiFi genome sequencing was performed on DNA from reference materials and clinical specimen types, and accuracy results were compared to short-read genome sequencing data. HiFi genome sequencing recall and precision across Genome in a Bottle (GIAB)-defined non-difficult and difficult genomic regions (high confidence) for SNVs are >99.9% and >99.7%, respectively, and for indels are >99.8% and >99.1%, respectively. Moreover, HiFi genome sequencing outperforms short-read genome sequencing on overall SNV/indel F1-score accuracy at all paired sequencing depths, which are further stratified across 100 total GIAB-defined genomic regions for a comprehensive evaluation of performance. Of note, HiFi genome sequencing F1-scores for SNVs and indels surpass 99% at ∼15× and ∼25×, respectively. In addition, high confidence small variant concordance across all HiFi genome sequencing reproducibility assessments (two specimens, three independent sequencing data sets) are >99.8% for SNVs and >98.6% for indels, and average high confidence small variant concordance between paired blood, saliva, and swab specimens are all >99.8%. Taken together, these data underscore that long-read HiFi genome sequencing detection of SNVs and indels is very accurate and robust, which supports the implementation of this technology for clinical diagnostic testing.
长读长测序能够分析复杂的基因组区域并对变异进行定相分析;然而,短读长测序在临床检测中应用更为普遍。鉴于长读长HiFi测序技术和变异检测的进展,我们对该技术进行了分析验证,以用于小变异检测(单核苷酸变异、插入/缺失;SNV/Indel;<50 bp)。对参考物质和临床样本类型的DNA进行了HiFi基因组测序,并将准确性结果与短读长基因组测序数据进行了比较。在基因组中瓶(GIAB)定义的非复杂和复杂基因组区域(高置信度)中,HiFi基因组测序对SNV的召回率和精确率分别>99.9%和>99.7%,对Indel的召回率和精确率分别>99.8%和>99.1%。此外,在所有配对测序深度下,HiFi基因组测序在总体SNV/Indel F1分数准确性方面优于短读长基因组测序,这些深度在总共100个GIAB定义的基因组区域中进一步分层,以全面评估性能。值得注意的是,HiFi基因组测序对SNV和Indel的F1分数分别在约15×和约25×时超过99%。此外,在所有HiFi基因组测序重复性评估(两个样本,三个独立测序数据集)中,高置信度小变异一致性对于SNV>99.8%,对于Indel>98.6%,配对血液、唾液和拭子样本之间的平均高置信度小变异一致性均>99.8%。综上所述,这些数据强调长读长HiFi基因组测序对SNV和Indel的检测非常准确且稳健,这支持了该技术在临床诊断检测中的应用。