Leeds Institute of Medical Research, School of Medicine, University of Leeds, Leeds, UK.
North East and Yorkshire Genomic Laboratory Hub, Central Lab, St James's University Hospital, Leeds, UK.
Mol Genet Genomic Med. 2023 Jun;11(6):e2164. doi: 10.1002/mgg3.2164. Epub 2023 Mar 19.
The widespread adoption of exome sequencing has greatly increased the rate of genetic diagnosis for inherited conditions. However, the detection and validation of large deletions remains challenging. While numerous bioinformatics approaches have been developed to detect deletions from whole - exome sequencing and targeted panels, further work is typically required to define the physical breakpoints or integration sites. Accurate characterisation requires either expensive follow - up whole - genome sequencing or the time - consuming, laborious process of PCR walking, both of which are challenging when dealing with the repeat sequences which frequently intersect deletion breakpoints. The aim of this study was to develop a cost-effective, long-range sequencing method to characterise deletions.
Genomic DNA was amplified with primers spanning the deletion using long-range PCR and the products purified. Sequencing was performed on MinION flongle flowcells. The resulting fast5 files were basecalled using Guppy, trimmed using Porechop and aligned using Minimap2. Filtering was performed using NanoFilt. Nanopore sequencing results were verified by Sanger sequencing.
Four cases with deletions detected following comparative read-depth analysis of targeted short-read sequencing were analysed. Nanopore sequencing defined breakpoints at the molecular level in all cases including homozygous breakpoints in EYS, CNGA1 and CNGB1 and a heterozygous deletion in PRPF31. All breakpoints were verified by Sanger sequencing.
In this study, a quick, accurate and cost - effective method is described to characterise deletions identified from exome, and similar data, using nanopore sequencing.
外显子组测序的广泛采用极大地提高了遗传性疾病的基因诊断率。然而,大的缺失的检测和验证仍然具有挑战性。虽然已经开发了许多生物信息学方法来检测全外显子组测序和靶向面板中的缺失,但通常需要进一步工作来定义物理断点或整合位点。准确的特征描述需要昂贵的后续全基因组测序,或者需要费时费力的 PCR 漫步过程,当涉及经常与缺失断点相交的重复序列时,这两种方法都具有挑战性。本研究的目的是开发一种具有成本效益的长距离测序方法来对缺失进行特征描述。
使用跨越缺失的长距离 PCR 扩增基因组 DNA,并纯化产物。在 MinION flongle 流动池上进行测序。使用 Guppy 对生成的 fast5 文件进行碱基调用,使用 Porechop 进行修剪,并使用 Minimap2 进行对齐。使用 NanoFilt 进行过滤。通过 Sanger 测序验证纳米孔测序结果。
对靶向短读测序的比较读数深度分析检测到的四个缺失病例进行了分析。纳米孔测序在所有病例中均在分子水平上定义了断点,包括 EYS、CNGA1 和 CNGB1 的纯合性断点和 PRPF31 的杂合性缺失。所有断点均通过 Sanger 测序验证。
在这项研究中,描述了一种快速、准确且具有成本效益的方法,用于使用纳米孔测序对从外显子组和类似数据中识别的缺失进行特征描述。