• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

核酸 G-四链体预测概述:从基于规则的方法到深度神经网络。

An overview on nucleic-acid G-quadruplex prediction: from rule-based methods to deep neural networks.

机构信息

Department of Computer Science, Bar-Ilan University, Ramat Gan, 5290002, Israel.

The Mina & Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, 5290002, Israel.

出版信息

Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad252.

DOI:10.1093/bib/bbad252
PMID:37438149
Abstract

Nucleic-acid G-quadruplexes (G4s) play vital roles in many cellular processes. Due to their importance, researchers have developed experimental assays to measure nucleic-acid G4s in high throughput. The generated high-throughput datasets gave rise to unique opportunities to develop machine-learning-based methods, and in particular deep neural networks, to predict G4s in any given nucleic-acid sequence and any species. In this paper, we review the success stories of deep-neural-network applications for G4 prediction. We first cover the experimental technologies that generated the most comprehensive nucleic-acid G4 high-throughput datasets in recent years. We then review classic rule-based methods for G4 prediction. We proceed by reviewing the major machine-learning and deep-neural-network applications to nucleic-acid G4 datasets and report a novel comparison between them. Next, we present the interpretability techniques used on the trained neural networks to learn key molecular principles underlying nucleic-acid G4 folding. As a new result, we calculate the overlap between measured DNA and RNA G4s and compare the performance of DNA- and RNA-G4 predictors on RNA- and DNA-G4 datasets, respectively, to demonstrate the potential of transfer learning from DNA G4s to RNA G4s. Last, we conclude with open questions in the field of nucleic-acid G4 prediction and computational modeling.

摘要

核酸 G-四链体(G4s)在许多细胞过程中发挥着重要作用。由于其重要性,研究人员已经开发了实验方法来高通量测量核酸 G4s。这些高通量数据集的产生为开发基于机器学习的方法,特别是深度神经网络,提供了独特的机会,以便在任何给定的核酸序列和任何物种中预测 G4s。在本文中,我们回顾了深度神经网络在 G4 预测中的成功案例。我们首先介绍了近年来生成最全面的核酸 G4 高通量数据集的实验技术。然后,我们回顾了经典的基于规则的 G4 预测方法。接下来,我们回顾了主要的机器学习和深度神经网络在核酸 G4 数据集上的应用,并报告了它们之间的新比较。接下来,我们介绍了用于训练神经网络的可解释性技术,以学习核酸 G4 折叠的关键分子原理。作为一个新的结果,我们计算了测量的 DNA 和 RNA G4s 之间的重叠,并比较了 DNA 和 RNA-G4 预测器在 RNA 和 DNA-G4 数据集上的性能,以证明从 DNA G4 到 RNA G4 的迁移学习的潜力。最后,我们总结了核酸 G4 预测和计算建模领域的开放性问题。

相似文献

1
An overview on nucleic-acid G-quadruplex prediction: from rule-based methods to deep neural networks.核酸 G-四链体预测概述:从基于规则的方法到深度神经网络。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad252.
2
G4detector: Convolutional Neural Network to Predict DNA G-Quadruplexes.G4detector:用于预测 DNA G-四链体的卷积神经网络。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jul-Aug;19(4):1946-1955. doi: 10.1109/TCBB.2021.3073595. Epub 2022 Aug 8.
3
Structurally diverse G-quadruplexes as the noncanonical nucleic acid drug target for live cell imaging and antibacterial study.结构多样的G-四链体作为活细胞成像和抗菌研究的非经典核酸药物靶点。
Chem Commun (Camb). 2023 Feb 2;59(11):1415-1433. doi: 10.1039/d2cc05945b.
4
DeepG4: A deep learning approach to predict cell-type specific active G-quadruplex regions.DeepG4:一种深度学习方法,用于预测细胞类型特异性的活性 G-四链体区域。
PLoS Comput Biol. 2021 Aug 12;17(8):e1009308. doi: 10.1371/journal.pcbi.1009308. eCollection 2021 Aug.
5
G4mismatch: Deep neural networks to predict G-quadruplex propensity based on G4-seq data.G4 错配:基于 G4-seq 数据预测 G-四链体倾向的深度神经网络。
PLoS Comput Biol. 2023 Mar 10;19(3):e1010948. doi: 10.1371/journal.pcbi.1010948. eCollection 2023 Mar.
6
The G-quadruplex (G4) resolvase DHX36 efficiently and specifically disrupts DNA G4s via a translocation-based helicase mechanism.G-四链体 (G4) 结构解旋酶 DHX36 通过基于转位的解旋酶机制高效且特异性地破坏 DNA G4。
J Biol Chem. 2018 Feb 9;293(6):1924-1932. doi: 10.1074/jbc.M117.815076. Epub 2017 Dec 21.
7
Machine learning-based prediction of DNA G-quadruplex folding topology with G4ShapePredictor.基于机器学习的 DNA G-四链体折叠拓扑结构预测软件 G4ShapePredictor。
Sci Rep. 2024 Oct 16;14(1):24238. doi: 10.1038/s41598-024-74826-2.
8
PENGUINN: Precise Exploration of Nuclear G-Quadruplexes Using Interpretable Neural Networks.PENGUINN:使用可解释神经网络对核G-四链体进行精确探索。
Front Genet. 2020 Oct 27;11:568546. doi: 10.3389/fgene.2020.568546. eCollection 2020.
9
G-quadruplex: Flexible conformational changes by cations, pH, crowding and its applications to biosensing.G-四链体:阳离子、pH 值、拥挤环境的灵活构象变化及其在生物传感中的应用。
Biosens Bioelectron. 2021 Apr 15;178:113030. doi: 10.1016/j.bios.2021.113030. Epub 2021 Jan 23.
10
Template-Assembled Synthetic G-Quartets (TASQs): multiTASQing Molecular Tools for Investigating DNA and RNA G-Quadruplex Biology.模板组装合成G-四链体(TASQs):用于研究DNA和RNA G-四链体生物学的多重TASQing分子工具。
Acc Chem Res. 2023 Feb 7;56(3):350-362. doi: 10.1021/acs.accounts.2c00757. Epub 2023 Jan 20.

引用本文的文献

1
Review on Advancement of AI in Cell Engineering and Molecular Biology.人工智能在细胞工程和分子生物学中的进展综述
Methods Mol Biol. 2025;2952:169-192. doi: 10.1007/978-1-0716-4690-8_10.
2
Machine learning-based prediction of DNA G-quadruplex folding topology with G4ShapePredictor.基于机器学习的 DNA G-四链体折叠拓扑结构预测软件 G4ShapePredictor。
Sci Rep. 2024 Oct 16;14(1):24238. doi: 10.1038/s41598-024-74826-2.
3
Special Issue "Bioinformatics of Unusual DNA and RNA Structures".特刊:非寻常 DNA 和 RNA 结构的生物信息学
Int J Mol Sci. 2024 May 10;25(10):5226. doi: 10.3390/ijms25105226.
4
Prediction of DNA i-motifs via machine learning.通过机器学习预测 DNA i- 发夹结构。
Nucleic Acids Res. 2024 Mar 21;52(5):2188-2197. doi: 10.1093/nar/gkae092.
5
'Artificial intelligence and machine learning in RNA biology'.RNA生物学中的人工智能与机器学习
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad415.
6
EndoQuad: a comprehensive genome-wide experimentally validated endogenous G-quadruplex database.EndoQuad:一个全面的全基因组实验验证的内源性 G-四链体数据库。
Nucleic Acids Res. 2024 Jan 5;52(D1):D72-D80. doi: 10.1093/nar/gkad966.