• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GCRTcall:一种基于Transformer的纳米孔RNA测序碱基识别器,通过门控卷积和相对位置嵌入联合损失训练进行增强。

GCRTcall: a transformer based basecaller for nanopore RNA sequencing enhanced by gated convolution and relative position embedding via joint loss training.

作者信息

Li Qingwen, Sun Chen, Wang Daqian, Lou Jizhong

机构信息

Key Laboratory of Epigenetic Regulation and Intervention, Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China.

College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China.

出版信息

Front Genet. 2024 Nov 22;15:1443532. doi: 10.3389/fgene.2024.1443532. eCollection 2024.

DOI:10.3389/fgene.2024.1443532
PMID:39649096
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11621211/
Abstract

Nanopore sequencing, renowned for its ability to sequence DNA and RNA directly with read lengths extending to several hundred kilobases or even megabases, holds significant promise in fields like transcriptomics and other omics studies. Despite its potential, the technology's limited accuracy in base identification has restricted its widespread application. Although many algorithms have been developed to improve DNA decoding, advancements in RNA sequencing remain limited. Addressing this challenge, we introduce GCRTcall, a novel approach integrating Transformer architecture with gated convolutional networks and relative positional encoding for RNA sequencing signal decoding. Our evaluation demonstrates that GCRTcall achieves state-of-the-art performance in RNA basecalling.

摘要

纳米孔测序以其能够直接对DNA和RNA进行测序,读长可延伸至数百千碱基甚至兆碱基而闻名,在转录组学和其他组学研究等领域具有巨大潜力。尽管有其潜力,但该技术在碱基识别方面的准确性有限,限制了其广泛应用。虽然已经开发了许多算法来改进DNA解码,但RNA测序的进展仍然有限。为应对这一挑战,我们引入了GCRTcall,这是一种将Transformer架构与门控卷积网络和相对位置编码相结合的新颖方法,用于RNA测序信号解码。我们的评估表明,GCRTcall在RNA碱基识别中实现了最先进的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f481/11621211/4a02224d604c/fgene-15-1443532-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f481/11621211/81ed6f28ea05/fgene-15-1443532-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f481/11621211/c86fdbb3a689/fgene-15-1443532-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f481/11621211/0cd3020268a1/fgene-15-1443532-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f481/11621211/4a02224d604c/fgene-15-1443532-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f481/11621211/81ed6f28ea05/fgene-15-1443532-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f481/11621211/c86fdbb3a689/fgene-15-1443532-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f481/11621211/0cd3020268a1/fgene-15-1443532-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f481/11621211/4a02224d604c/fgene-15-1443532-g004.jpg

相似文献

1
GCRTcall: a transformer based basecaller for nanopore RNA sequencing enhanced by gated convolution and relative position embedding via joint loss training.GCRTcall:一种基于Transformer的纳米孔RNA测序碱基识别器,通过门控卷积和相对位置嵌入联合损失训练进行增强。
Front Genet. 2024 Nov 22;15:1443532. doi: 10.3389/fgene.2024.1443532. eCollection 2024.
2
BaseNet: A transformer-based toolkit for nanopore sequencing signal decoding.BaseNet:一种基于Transformer的纳米孔测序信号解码工具包。
Comput Struct Biotechnol J. 2024 Sep 25;23:3430-3444. doi: 10.1016/j.csbj.2024.09.016. eCollection 2024 Dec.
3
Species-specific basecallers improve actual accuracy of nanopore sequencing in plants.物种特异性碱基识别器提高了植物纳米孔测序的实际准确性。
Plant Methods. 2022 Dec 14;18(1):137. doi: 10.1186/s13007-022-00971-2.
4
Lokatt: a hybrid DNA nanopore basecaller with an explicit duration hidden Markov model and a residual LSTM network.Lokatt:一种具有显式持续时间隐马尔可夫模型和剩余长短期记忆网络的混合 DNA 纳米孔碱基调用器。
BMC Bioinformatics. 2023 Dec 7;24(1):461. doi: 10.1186/s12859-023-05580-x.
5
RODAN: a fully convolutional architecture for basecalling nanopore RNA sequencing data.RODAN:一种用于纳米孔 RNA 测序数据碱基调用的全卷积架构。
BMC Bioinformatics. 2022 Apr 20;23(1):142. doi: 10.1186/s12859-022-04686-y.
6
Causalcall: Nanopore Basecalling Using a Temporal Convolutional Network.因果呼叫:使用时间卷积网络的纳米孔碱基识别
Front Genet. 2020 Jan 20;10:1332. doi: 10.3389/fgene.2019.01332. eCollection 2019.
7
SqueezeCall: nanopore basecalling using a Squeezeformer network.SqueezeCall:使用Squeezeformer网络进行纳米孔碱基识别
GigaByte. 2025 Feb 14;2025:gigabyte148. doi: 10.46471/gigabyte.148. eCollection 2025.
8
Nanopore basecalling from a perspective of instance segmentation.基于实例分割的纳米孔碱基调用。
BMC Bioinformatics. 2020 Apr 23;21(Suppl 3):136. doi: 10.1186/s12859-020-3459-0.
9
SACall: A Neural Network Basecaller for Oxford Nanopore Sequencing Data Based on Self-Attention Mechanism.SACall:基于自注意力机制的牛津纳米孔测序数据的神经网络碱基调用程序。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jan-Feb;19(1):614-623. doi: 10.1109/TCBB.2020.3039244. Epub 2022 Feb 3.
10
Comprehensive benchmark and architectural analysis of deep learning models for nanopore sequencing basecalling.深度学习模型在纳米孔测序碱基调用中的综合基准测试和体系结构分析。
Genome Biol. 2023 Apr 11;24(1):71. doi: 10.1186/s13059-023-02903-2.

引用本文的文献

1
De novo non-canonical nanopore basecalling enables private communication using heavily-modified DNA data at single-molecule level.从头开始的非规范纳米孔碱基识别能够在单分子水平上使用经过大量修饰的DNA数据进行私密通信。
Nat Commun. 2025 May 2;16(1):4099. doi: 10.1038/s41467-025-59357-2.

本文引用的文献

1
BaseNet: A transformer-based toolkit for nanopore sequencing signal decoding.BaseNet:一种基于Transformer的纳米孔测序信号解码工具包。
Comput Struct Biotechnol J. 2024 Sep 25;23:3430-3444. doi: 10.1016/j.csbj.2024.09.016. eCollection 2024 Dec.
2
A BERT-based model for the prediction of lncRNA subcellular localization in Homo sapiens.基于 BERT 的人类长非编码 RNA 亚细胞定位预测模型。
Int J Biol Macromol. 2024 Apr;265(Pt 1):130659. doi: 10.1016/j.ijbiomac.2024.130659. Epub 2024 Mar 10.
3
NanoCon: contrastive learning-based deep hybrid network for nanopore methylation detection.
NanoCon:基于对比学习的深度混合网络用于纳米孔甲基化检测。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae046.
4
Identifying disease-related microbes based on multi-scale variational graph autoencoder embedding Wasserstein distance.基于多尺度变分图自动编码器嵌入 Wasserstein 距离的疾病相关微生物识别。
BMC Biol. 2023 Dec 20;21(1):294. doi: 10.1186/s12915-023-01796-8.
5
Chemical structure-aware molecular image representation learning.化学结构感知的分子图像表示学习。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad404.
6
BioSeq-Diabolo: Biological sequence similarity analysis using Diabolo.BioSeq-Diabolo:使用 Diabolo 进行生物序列相似性分析。
PLoS Comput Biol. 2023 Jun 20;19(6):e1011214. doi: 10.1371/journal.pcbi.1011214. eCollection 2023 Jun.
7
Advances in nanopore direct RNA sequencing.纳米孔直接 RNA 测序技术的进展。
Nat Methods. 2022 Oct;19(10):1160-1164. doi: 10.1038/s41592-022-01633-w.
8
Tracking SARS-COV-2 variants using Nanopore sequencing in Ukraine in 2021.2021 年在乌克兰使用 Nanopore 测序追踪 SARS-COV-2 变种。
Sci Rep. 2022 Sep 21;12(1):15749. doi: 10.1038/s41598-022-19414-y.
9
RODAN: a fully convolutional architecture for basecalling nanopore RNA sequencing data.RODAN:一种用于纳米孔 RNA 测序数据碱基调用的全卷积架构。
BMC Bioinformatics. 2022 Apr 20;23(1):142. doi: 10.1186/s12859-022-04686-y.
10
GMNN2CD: identification of circRNA-disease associations based on variational inference and graph Markov neural networks.GMNN2CD:基于变分推理和图马尔可夫神经网络的环状RNA与疾病关联识别
Bioinformatics. 2022 Apr 12;38(8):2246-2253. doi: 10.1093/bioinformatics/btac079.