Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA.
Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA.
Nat Biotechnol. 2021 Apr;39(4):431-441. doi: 10.1038/s41587-020-0731-9. Epub 2020 Nov 30.
Conventional targeted sequencing methods eliminate many of the benefits of nanopore sequencing, such as the ability to accurately detect structural variants or epigenetic modifications. The ReadUntil method allows nanopore devices to selectively eject reads from pores in real time, which could enable purely computational targeted sequencing. However, this requires rapid identification of on-target reads while most mapping methods require computationally intensive basecalling. We present UNCALLED ( https://github.com/skovaka/UNCALLED ), an open source mapper that rapidly matches streaming of nanopore current signals to a reference sequence. UNCALLED probabilistically considers k-mers that could be represented by the signal and then prunes the candidates based on the reference encoded within a Ferragina-Manzini index. We used UNCALLED to deplete sequencing of known bacterial genomes within a metagenomics community, enriching the remaining species 4.46-fold. UNCALLED also enriched 148 human genes associated with hereditary cancers to 29.6× coverage using one MinION flowcell, enabling accurate detection of single-nucleotide polymorphisms, insertions and deletions, structural variants and methylation in these genes.
传统的靶向测序方法消除了纳米孔测序的许多优势,例如准确检测结构变体或表观遗传修饰的能力。ReadUntil 方法允许纳米孔设备实时选择性地从孔中排出读取内容,这可以实现纯计算靶向测序。然而,这需要在大多数映射方法需要计算密集型碱基调用的情况下,快速识别目标读取内容。我们提出了 UNCALLED(https://github.com/skovaka/UNCALLED),这是一种开源映射器,可以快速将纳米孔电流信号的流与参考序列匹配。UNCALLED 概率地考虑可能由信号表示的 k-mer,然后根据 Ferragina-Manzini 索引中编码的参考信息修剪候选内容。我们使用 UNCALLED 在宏基因组群落中耗尽了已知细菌基因组的测序,使剩余的物种富集了 4.46 倍。UNCALLED 还使用一个 MinION 流池将与遗传性癌症相关的 148 个人类基因富集到 29.6×的覆盖度,从而能够准确检测这些基因中的单核苷酸多态性、插入和缺失、结构变体和甲基化。
Bioinformatics. 2021-7-12
Bioinformatics. 2024-5-2
Bioinformatics. 2023-6-30
Elife. 2021-1-19
Nat Commun. 2016-4-15
Genome Biol. 2021-2-22
Brief Bioinform. 2024-7-25
Bioinformatics. 2025-3-29
Brief Bioinform. 2025-3-4
Genome Res. 2025-4-14
Cell Insight. 2025-1-18
Nature. 2020-7-14
Nucleic Acids Res. 2020-1-8
Genes (Basel). 2019-8-29
Nat Biotechnol. 2019-4-1
Nat Biotechnol. 2019-3-11
Bioinformatics. 2019-7-1