Predicting DNA toehold-mediated strand displacement rate constants using a DNA-BERT transformer deep learning model.

Authors

Akay Ali, Reddy Hemaprakash Nanja, Galloway Roma, Kozyra Jerzy, Jackson Alexander W

Affiliations

Nanovery Limited, United Kingdom.

Università degli Studi di Trento, Italy.

Publication information

Heliyon. 2024 Mar 21;10(7):e28443. doi: 10.1016/j.heliyon.2024.e28443. eCollection 2024 Apr 15.

DOI:10.1016/j.heliyon.2024.e28443
PMID:38560216
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10981123/
Abstract

Dynamic DNA nanotechnology is driving exciting developments in molecular computing, cargo delivery, sensing and detection. Combining this innovative area of research with the progress made in machine learning will aid in the design of sophisticated DNA machinery. Herein, we present a novel framework based on a transformer architecture and a deep learning model that can predict the rate constant of toehold-mediated strand displacement, the underlying process in dynamic DNA nanotechnology. Initially, a dataset of 4450 DNA sequences and corresponding rate constants was generated using KinDA. Subsequently, a 1D convolutional neural network was trained using specific local features and DNA-BERT sequence embeddings to produce predicted rate constants. As a result, the newly trained deep learning model predicted toehold-mediated strand displacement rate constants with a root mean square error of 0.76 during testing. These findings demonstrate that DNA-BERT can improve prediction accuracy, negating the need for extensive computational simulations or experimentation. Finally, the impact of various local features during model training is discussed, and a detailed comparison between the one-hot encoder and DNA-BERT sequence representation methods is presented.
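
The abstract compares two ways of representing a DNA sequence (one-hot encoding and DNA-BERT embeddings) and reports accuracy as a root mean square error. The sketch below is only an illustration of those three ideas in plain Python, not the authors' code: the function names, the example sequence and the example values are all hypothetical. The overlapping k-mer tokenization shown is the input format that DNABERT-style models take; the embedding step and the 1D convolutional network themselves are omitted. The abstract does not state the scale on which the reported RMSE of 0.76 was computed, so the numbers here are arbitrary.

```python
import math

BASES = "ACGT"  # column order of the one-hot vectors (an assumption of this sketch)


def one_hot(seq):
    """Encode a DNA sequence as a list of 4-element 0/1 vectors, one per base."""
    seq = seq.upper()
    unknown = set(seq) - set(BASES)
    if unknown:
        raise ValueError(f"unexpected bases: {sorted(unknown)}")
    return [[1 if base == b else 0 for b in BASES] for base in seq]


def kmer_tokens(seq, k=6):
    """Split a sequence into overlapping k-mers (stride 1), DNABERT-style tokens."""
    seq = seq.upper()
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def rmse(predicted, observed):
    """Root mean square error between two equal-length lists of numbers."""
    if len(predicted) != len(observed) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(p - o) ** 2 for p, o in zip(predicted, observed)]
    return math.sqrt(sum(squared) / len(squared))


if __name__ == "__main__":
    toehold = "ACGTTGCA"  # hypothetical sequence, not taken from the paper's dataset
    print(one_hot(toehold)[:2])        # first two bases as one-hot vectors
    print(kmer_tokens(toehold, k=6))   # overlapping 6-mers
    print(rmse([5.0, 6.0, 4.0], [5.5, 6.5, 3.0]))  # arbitrary example values
```

A one-hot vector carries no information about neighbouring bases, whereas each k-mer token overlaps its neighbours, which is one intuition for why a pretrained k-mer language model might give a downstream regressor richer input.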

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/170c/10981123/60330fa00a50/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/170c/10981123/4cf2f0ab30e2/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/170c/10981123/01a87c64e7f6/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/170c/10981123/d1a7143dc9bb/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/170c/10981123/cfbfa54def98/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/170c/10981123/742b96872c2e/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/170c/10981123/950139b9eea4/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/170c/10981123/06d5d75c7239/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/170c/10981123/9ad2643989c4/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/170c/10981123/876767e892c5/gr9.jpg
