• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Knotify:一个使用句法模式识别进行RNA假结预测的高效并行平台。

Knotify: An Efficient Parallel Platform for RNA Pseudoknot Prediction Using Syntactic Pattern Recognition.

作者信息

Andrikos Christos, Makris Evangelos, Kolaitis Angelos, Rassias Georgios, Pavlatos Christos, Tsanakas Panayiotis

机构信息

School of Electrical and Computer Engineering, National Technical University of Athens, 9 Iroon Polytechniou St., 15780 Athens, Greece.

Hellenic Air Force Academy, Dekelia Air Base, Acharnes, 13671 Athens, Greece.

出版信息

Methods Protoc. 2022 Feb 2;5(1):14. doi: 10.3390/mps5010014.

DOI:10.3390/mps5010014
PMID:35200530
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8876629/
Abstract

Obtaining valuable clues for noncoding RNA (ribonucleic acid) subsequences remains a significant challenge, acknowledging that most of the human genome transcribes into noncoding RNA parts related to unknown biological operations. Capturing these clues relies on accurate "base pairing" prediction, also known as "RNA secondary structure prediction". As COVID-19 is considered a severe global threat, the single-stranded SARS-CoV-2 virus reveals the importance of establishing an efficient RNA analysis toolkit. This work aimed to contribute to that by introducing a novel system committed to predicting RNA secondary structure patterns (i.e., RNA's pseudoknots) that leverage syntactic pattern-recognition strategies. Having focused on the pseudoknot predictions, we formalized the secondary structure prediction of the RNA to be primarily a parsing and, secondly, an optimization problem. The proposed methodology addresses the problem of predicting pseudoknots of the first order (H-type). We introduce a context-free grammar (CFG) that affords enough expression power to recognize potential pseudoknot pattern. In addition, an alternative methodology of detecting possible pseudoknots is also implemented as well, using a brute-force algorithm. Any input sequence may highlight multiple potential folding patterns requiring a strict methodology to determine the single biologically realistic one. We conscripted a novel heuristic over the widely accepted notion of free-energy minimization to tackle such ambiguity in a performant way by utilizing each pattern's context to unveil the most prominent pseudoknot pattern. The overall process features polynomial-time complexity, while its parallel implementation enhances the end performance, as proportional to the deployed hardware. The proposed methodology does succeed in predicting the core stems of any RNA pseudoknot of the test dataset by performing a 76.4% recall ratio. The methodology achieved a F1-score equal to 0.774 and MCC equal 0.543 in discovering all the stems of an RNA sequence, outperforming the particular task. Measurements were taken using a dataset of 262 RNA sequences establishing a performance speed of 1.31, 3.45, and 7.76 compared to three well-known platforms. The implementation source code is publicly available under knotify github repo.

摘要

由于认识到人类基因组的大部分转录为与未知生物学操作相关的非编码RNA部分,因此获取非编码RNA(核糖核酸)子序列的有价值线索仍然是一项重大挑战。捕捉这些线索依赖于准确的“碱基配对”预测,也称为“RNA二级结构预测”。由于新冠疫情被视为严重的全球威胁,单链的新冠病毒揭示了建立高效RNA分析工具包的重要性。这项工作旨在通过引入一种致力于预测RNA二级结构模式(即RNA假结)的新型系统来为此做出贡献,该系统利用句法模式识别策略。专注于假结预测后,我们将RNA的二级结构预测形式化为主要是一个解析问题,其次是一个优化问题。所提出的方法解决了预测一阶(H型)假结的问题。我们引入了一种上下文无关文法(CFG),它具有足够的表达能力来识别潜在的假结模式。此外,还使用暴力算法实现了另一种检测可能假结的方法。任何输入序列可能会突出显示多种潜在的折叠模式,这需要严格的方法来确定唯一符合生物学实际的模式。我们基于广泛接受的自由能最小化概念引入了一种新颖的启发式方法,通过利用每种模式的上下文以高效的方式解决这种模糊性,从而揭示最突出的假结模式。整个过程具有多项式时间复杂度,而其并行实现提高了最终性能,与所部署的硬件成正比。所提出的方法通过实现76.4%的召回率,成功地预测了测试数据集的任何RNA假结的核心茎。该方法在发现RNA序列的所有茎时,F1分数等于0.774,MCC等于0.543,优于特定任务。使用包含262个RNA序列的数据集进行测量,与三个知名平台相比,性能速度分别为1.31、3.45和7.76。实现源代码在kotify github仓库中公开可用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/a91423da7760/mps-05-00014-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/1a28778a9c75/mps-05-00014-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/baad063416aa/mps-05-00014-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/a24112835a87/mps-05-00014-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/d727090caa12/mps-05-00014-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/cf7fcdb8a85d/mps-05-00014-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/d691bf2519a8/mps-05-00014-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/20e637412cb9/mps-05-00014-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/e9894da19b2d/mps-05-00014-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/ff0fbc23d925/mps-05-00014-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/57469baf4636/mps-05-00014-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/41d645080dd4/mps-05-00014-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/a91423da7760/mps-05-00014-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/1a28778a9c75/mps-05-00014-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/baad063416aa/mps-05-00014-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/a24112835a87/mps-05-00014-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/d727090caa12/mps-05-00014-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/cf7fcdb8a85d/mps-05-00014-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/d691bf2519a8/mps-05-00014-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/20e637412cb9/mps-05-00014-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/e9894da19b2d/mps-05-00014-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/ff0fbc23d925/mps-05-00014-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/57469baf4636/mps-05-00014-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/41d645080dd4/mps-05-00014-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93a9/8876629/a91423da7760/mps-05-00014-g012.jpg

相似文献

1
Knotify: An Efficient Parallel Platform for RNA Pseudoknot Prediction Using Syntactic Pattern Recognition.Knotify:一个使用句法模式识别进行RNA假结预测的高效并行平台。
Methods Protoc. 2022 Feb 2;5(1):14. doi: 10.3390/mps5010014.
2
Knotify_V2.0: Deciphering RNA Secondary Structures with H-Type Pseudoknots and Hairpin Loops.Knotify_V2.0:解析具有 H 型假结和发夹环的 RNA 二级结构。
Genes (Basel). 2024 May 23;15(6):670. doi: 10.3390/genes15060670.
3
Knotify+: Toward the Prediction of RNA H-Type Pseudoknots, Including Bulges and Internal Loops.Knotify+:预测 RNA H 型假结,包括突环和内部环。
Biomolecules. 2023 Feb 6;13(2):308. doi: 10.3390/biom13020308.
4
Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structures.用于比对和预测假结RNA结构的成对随机树邻接文法
Proc IEEE Comput Syst Bioinform Conf. 2004:290-9.
5
Thermodynamic heuristics with case-based reasoning: combined insights for RNA pseudoknot secondary structure.基于案例推理的热力学启发式方法:RNA 假结二级结构的综合见解。
J Biomol Struct Dyn. 2011 Aug;29(1):1-26. doi: 10.1080/07391102.2011.10507373.
6
Heuristic RNA pseudoknot prediction including intramolecular kissing hairpins.启发式 RNA 假结预测,包括分子内的亲吻发夹。
RNA. 2011 Jan;17(1):27-38. doi: 10.1261/rna.2394511. Epub 2010 Nov 22.
7
KnotSeeker: heuristic pseudoknot detection in long RNA sequences.KnotSeeker:长RNA序列中的启发式假结检测
RNA. 2008 Apr;14(4):630-40. doi: 10.1261/rna.968808. Epub 2008 Feb 26.
8
DotKnot: pseudoknot prediction using the probability dot plot under a refined energy model.DotKnot:使用改进能量模型下的概率点图进行伪结预测。
Nucleic Acids Res. 2010 Apr;38(7):e103. doi: 10.1093/nar/gkq021. Epub 2010 Jan 31.
9
Shapify: Paths to SARS-CoV-2 frameshifting pseudoknot.Shapify:SARS-CoV-2 框架移位假结的途径。
PLoS Comput Biol. 2023 Feb 28;19(2):e1010922. doi: 10.1371/journal.pcbi.1010922. eCollection 2023 Feb.
10
antaRNA--Multi-objective inverse folding of pseudoknot RNA using ant-colony optimization.antaRNA——使用蚁群优化算法对假结RNA进行多目标反向折叠
BMC Bioinformatics. 2015 Nov 18;16:389. doi: 10.1186/s12859-015-0815-6.

引用本文的文献

1
Navigating the Multiverse: a Hitchhiker's guide to selecting harmonization methods for multimodal biomedical data.探索多元宇宙:多模态生物医学数据协调方法选择指南
Biol Methods Protoc. 2025 Apr 17;10(1):bpaf028. doi: 10.1093/biomethods/bpaf028. eCollection 2025.
2
Knotify_V2.0: Deciphering RNA Secondary Structures with H-Type Pseudoknots and Hairpin Loops.Knotify_V2.0:解析具有 H 型假结和发夹环的 RNA 二级结构。
Genes (Basel). 2024 May 23;15(6):670. doi: 10.3390/genes15060670.
3
Knotify+: Toward the Prediction of RNA H-Type Pseudoknots, Including Bulges and Internal Loops.

本文引用的文献

1
ATTfold: RNA Secondary Structure Prediction With Pseudoknots Based on Attention Mechanism.ATTfold:基于注意力机制的带假结RNA二级结构预测
Front Genet. 2020 Dec 15;11:612086. doi: 10.3389/fgene.2020.612086. eCollection 2020.
2
RNAthor - fast, accurate normalization, visualization and statistical analysis of RNA probing data resolved by capillary electrophoresis.RNAthor- 一种快速、准确的毛细管电泳解析 RNA 探测数据的标准化、可视化和统计分析方法。
PLoS One. 2020 Oct 1;15(10):e0239287. doi: 10.1371/journal.pone.0239287. eCollection 2020.
3
RNAProbe: a web server for normalization and analysis of RNA structure probing data.
Knotify+:预测 RNA H 型假结,包括突环和内部环。
Biomolecules. 2023 Feb 6;13(2):308. doi: 10.3390/biom13020308.
4
Computational tools to study RNA-protein complexes.用于研究RNA-蛋白质复合物的计算工具。
Front Mol Biosci. 2022 Oct 7;9:954926. doi: 10.3389/fmolb.2022.954926. eCollection 2022.
RNAProbe:一个用于 RNA 结构探测数据标准化和分析的网络服务器。
Nucleic Acids Res. 2020 Jul 2;48(W1):W292-W299. doi: 10.1093/nar/gkaa396.
4
RNA secondary structure prediction using an ensemble of two-dimensional deep neural networks and transfer learning.使用二维深度神经网络集成和迁移学习进行 RNA 二级结构预测。
Nat Commun. 2019 Nov 27;10(1):5407. doi: 10.1038/s41467-019-13395-9.
5
DMfold: A Novel Method to Predict RNA Secondary Structure With Pseudoknots Based on Deep Learning and Improved Base Pair Maximization Principle.DMfold:一种基于深度学习和改进碱基对最大化原则预测含假结RNA二级结构的新方法。
Front Genet. 2019 Mar 4;10:143. doi: 10.3389/fgene.2019.00143. eCollection 2019.
6
An Enhanced Device-Transparent Real-Time Teleconsultation Environment for Radiologists.增强型设备透明实时远程放射诊断咨询环境。
IEEE J Biomed Health Inform. 2019 Jan;23(1):374-386. doi: 10.1109/JBHI.2018.2824312. Epub 2018 Apr 9.
7
Knotty: efficient and accurate prediction of complex RNA pseudoknot structures.Knotty:高效准确地预测复杂 RNA 假结结构。
Bioinformatics. 2018 Nov 15;34(22):3849-3856. doi: 10.1093/bioinformatics/bty420.
8
New algorithms to represent complex pseudoknotted RNA structures in dot-bracket notation.用于在点棒记法中表示复杂假结 RNA 结构的新算法。
Bioinformatics. 2018 Apr 15;34(8):1304-1312. doi: 10.1093/bioinformatics/btx783.
9
Associating transcription factors and conserved RNA structures with gene regulation in the human brain.关联转录因子和保守的 RNA 结构与人类大脑中的基因调控。
Sci Rep. 2017 Jul 18;7(1):5776. doi: 10.1038/s41598-017-06200-4.
10
Measuring RNA structure transcriptome-wide with icSHAPE.利用icSHAPE在全转录组范围内测量RNA结构。
Methods. 2017 May 1;120:85-90. doi: 10.1016/j.ymeth.2017.02.010. Epub 2017 Mar 20.