• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

奇龙:利用深度学习将纳米孔原始信号直接转换为核苷酸序列。

Chiron: translating nanopore raw signal directly into nucleotide sequence using deep learning.

机构信息

Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD 4072, Australia.

Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia.

出版信息

Gigascience. 2018 May 1;7(5). doi: 10.1093/gigascience/giy037.

DOI:10.1093/gigascience/giy037
PMID:29648610
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5946831/
Abstract

Sequencing by translocating DNA fragments through an array of nanopores is a rapidly maturing technology that offers faster and cheaper sequencing than other approaches. However, accurately deciphering the DNA sequence from the noisy and complex electrical signal is challenging. Here, we report Chiron, the first deep learning model to achieve end-to-end basecalling and directly translate the raw signal to DNA sequence without the error-prone segmentation step. Trained with only a small set of 4,000 reads, we show that our model provides state-of-the-art basecalling accuracy, even on previously unseen species. Chiron achieves basecalling speeds of more than 2,000 bases per second using desktop computer graphics processing units.

摘要

通过将 DNA 片段转移穿过纳米孔阵列进行测序是一项快速成熟的技术,它提供了比其他方法更快、更便宜的测序。然而,从嘈杂和复杂的电信号中准确破译 DNA 序列具有挑战性。在这里,我们报告了 Chiron,这是第一个实现端到端碱基调用的深度学习模型,它直接将原始信号转换为 DNA 序列,而无需易错的分段步骤。仅使用一小部分 4000 个读数进行训练,我们表明我们的模型即使在以前未见的物种上也能提供最先进的碱基调用准确性。Chiron 使用台式计算机图形处理单元实现了超过 2000 个碱基/秒的碱基调用速度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59d5/5946831/59aea2064113/giy037fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59d5/5946831/fb69d43196fb/giy037fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59d5/5946831/16063d9dfc41/giy037fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59d5/5946831/59aea2064113/giy037fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59d5/5946831/fb69d43196fb/giy037fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59d5/5946831/16063d9dfc41/giy037fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59d5/5946831/59aea2064113/giy037fig3.jpg

相似文献

1
Chiron: translating nanopore raw signal directly into nucleotide sequence using deep learning.奇龙:利用深度学习将纳米孔原始信号直接转换为核苷酸序列。
Gigascience. 2018 May 1;7(5). doi: 10.1093/gigascience/giy037.
2
Basecalling Using Joint Raw and Event Nanopore Data Sequence-to-Sequence Processing.使用联合原始和事件纳米孔数据序列到序列处理进行碱基调用。
Sensors (Basel). 2022 Mar 15;22(6):2275. doi: 10.3390/s22062275.
3
Performance of neural network basecalling tools for Oxford Nanopore sequencing.基于神经网络的牛津纳米孔测序碱基调用工具的性能。
Genome Biol. 2019 Jun 24;20(1):129. doi: 10.1186/s13059-019-1727-y.
4
From squiggle to basepair: computational approaches for improving nanopore sequencing read accuracy.从扭曲到碱基对:提高纳米孔测序读取准确性的计算方法。
Genome Biol. 2018 Jul 13;19(1):90. doi: 10.1186/s13059-018-1462-9.
5
Estimated Nucleotide Reconstruction Quality Symbols of Basecalling Tools for Oxford Nanopore Sequencing.用于 Oxford Nanopore 测序的碱基调用工具的核苷酸重建质量符号估计。
Sensors (Basel). 2023 Jul 29;23(15):6787. doi: 10.3390/s23156787.
6
RUBICON: a framework for designing efficient deep learning-based genomic basecallers.RUBICON:一种用于设计高效深度学习基因组碱基调用器的框架。
Genome Biol. 2024 Feb 16;25(1):49. doi: 10.1186/s13059-024-03181-2.
7
Nanopore basecalling from a perspective of instance segmentation.基于实例分割的纳米孔碱基调用。
BMC Bioinformatics. 2020 Apr 23;21(Suppl 3):136. doi: 10.1186/s12859-020-3459-0.
8
Lokatt: a hybrid DNA nanopore basecaller with an explicit duration hidden Markov model and a residual LSTM network.Lokatt:一种具有显式持续时间隐马尔可夫模型和剩余长短期记忆网络的混合 DNA 纳米孔碱基调用器。
BMC Bioinformatics. 2023 Dec 7;24(1):461. doi: 10.1186/s12859-023-05580-x.
9
NanoReviser: An Error-Correction Tool for Nanopore Sequencing Based on a Deep Learning Algorithm.NanoReviser:一种基于深度学习算法的纳米孔测序纠错工具。
Front Genet. 2020 Aug 12;11:900. doi: 10.3389/fgene.2020.00900. eCollection 2020.
10
Beyond sequencing: machine learning algorithms extract biology hidden in Nanopore signal data.超越测序:机器学习算法从纳米孔信号数据中提取隐藏的生物学信息。
Trends Genet. 2022 Mar;38(3):246-257. doi: 10.1016/j.tig.2021.09.001. Epub 2021 Oct 25.

引用本文的文献

1
Using synthetic RNA to benchmark poly(A) length inference from direct RNA sequencing.使用合成RNA对直接RNA测序中的聚腺苷酸(poly(A))长度推断进行基准测试。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf098.
2
Get ready for short tandem repeats analysis using long reads-the challenges and the state of the art.为使用长读长进行短串联重复序列分析做好准备——挑战与当前技术水平
Front Genet. 2025 Jul 2;16:1610026. doi: 10.3389/fgene.2025.1610026. eCollection 2025.
3
ReQuant: improved base modification calling by k-mer value imputation.

本文引用的文献

1
Performance of neural network basecalling tools for Oxford Nanopore sequencing.基于神经网络的牛津纳米孔测序碱基调用工具的性能。
Genome Biol. 2019 Jun 24;20(1):129. doi: 10.1186/s13059-019-1727-y.
2
Minimap2: pairwise alignment for nucleotide sequences.Minimap2:核苷酸序列的两两比对。
Bioinformatics. 2018 Sep 15;34(18):3094-3100. doi: 10.1093/bioinformatics/bty191.
3
Nanopore sequencing and assembly of a human genome with ultra-long reads.纳米孔测序和超长读长组装人类基因组。
ReQuant:通过k-mer值插补改进碱基修饰识别
Nucleic Acids Res. 2025 May 10;53(9). doi: 10.1093/nar/gkaf323.
4
De novo non-canonical nanopore basecalling enables private communication using heavily-modified DNA data at single-molecule level.从头开始的非规范纳米孔碱基识别能够在单分子水平上使用经过大量修饰的DNA数据进行私密通信。
Nat Commun. 2025 May 2;16(1):4099. doi: 10.1038/s41467-025-59357-2.
5
Direct profiling of non-adenosines in poly(A) tails of endogenous and therapeutic mRNAs with Ninetails.使用九尾狐对内源和治疗性mRNA的聚腺苷酸尾中的非腺苷进行直接分析。
Nat Commun. 2025 Mar 18;16(1):2664. doi: 10.1038/s41467-025-57787-6.
6
SqueezeCall: nanopore basecalling using a Squeezeformer network.SqueezeCall:使用Squeezeformer网络进行纳米孔碱基识别
GigaByte. 2025 Feb 14;2025:gigabyte148. doi: 10.46471/gigabyte.148. eCollection 2025.
7
Nanopore sequencing of protozoa: Decoding biological information on a string of biochemical molecules into human-readable signals.原生动物的纳米孔测序:将一串生化分子上的生物信息解码为人类可读信号。
Comput Struct Biotechnol J. 2025 Jan 6;27:440-450. doi: 10.1016/j.csbj.2025.01.002. eCollection 2025.
8
End-to-end simulation of nanopore sequencing signals with feed-forward transformers.使用前馈变压器对纳米孔测序信号进行端到端模拟。
Bioinformatics. 2024 Dec 26;41(1). doi: 10.1093/bioinformatics/btae744.
9
GCRTcall: a transformer based basecaller for nanopore RNA sequencing enhanced by gated convolution and relative position embedding via joint loss training.GCRTcall:一种基于Transformer的纳米孔RNA测序碱基识别器,通过门控卷积和相对位置嵌入联合损失训练进行增强。
Front Genet. 2024 Nov 22;15:1443532. doi: 10.3389/fgene.2024.1443532. eCollection 2024.
10
Single-Molecule Bioelectronic Sensors with AI-Aided Data Analysis: Convergence and Challenges.具有人工智能辅助数据分析的单分子生物电子传感器:融合与挑战
Precis Chem. 2024 Sep 16;2(10):518-538. doi: 10.1021/prechem.4c00048. eCollection 2024 Oct 28.
Nat Biotechnol. 2018 Apr;36(4):338-345. doi: 10.1038/nbt.4060. Epub 2018 Jan 29.
4
Nanopore DNA Sequencing and Genome Assembly on the International Space Station.国际空间站上的纳米孔 DNA 测序和基因组组装。
Sci Rep. 2017 Dec 21;7(1):18022. doi: 10.1038/s41598-017-18364-0.
5
Nanopore sequencing in microgravity.微重力环境下的纳米孔测序
NPJ Microgravity. 2016 Oct 20;2:16035. doi: 10.1038/npjmgrav.2016.35. eCollection 2016.
6
DeepNano: Deep recurrent neural networks for base calling in MinION nanopore reads.DeepNano:用于MinION纳米孔测序读数碱基识别的深度循环神经网络
PLoS One. 2017 Jun 5;12(6):e0178751. doi: 10.1371/journal.pone.0178751. eCollection 2017.
7
Scaffolding and completing genome assemblies in real-time with nanopore sequencing.实时使用纳米孔测序进行基因组组装和补全。
Nat Commun. 2017 Feb 20;8:14515. doi: 10.1038/ncomms14515.
8
Fast and accurate de novo genome assembly from long uncorrected reads.从长的未校正读段中进行快速且准确的从头基因组组装。
Genome Res. 2017 May;27(5):737-746. doi: 10.1101/gr.214270.116. Epub 2017 Jan 18.
9
Mobile real-time surveillance of Zika virus in Brazil.巴西寨卡病毒的移动实时监测
Genome Med. 2016 Sep 29;8(1):97. doi: 10.1186/s13073-016-0356-2.
10
Nanocall: an open source basecaller for Oxford Nanopore sequencing data.Nanocall:一款用于牛津纳米孔测序数据的开源碱基识别器。
Bioinformatics. 2017 Jan 1;33(1):49-55. doi: 10.1093/bioinformatics/btw569. Epub 2016 Sep 10.