• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过启用几何注意力的生物语言模型配对进行单序列蛋白质-RNA复合物结构预测。

Single-sequence protein-RNA complex structure prediction by geometric attention-enabled pairing of biological language models.

作者信息

Roche Rahmatullah, Tarafder Sumit, Bhattacharya Debswapna

机构信息

Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, United States of America.

出版信息

bioRxiv. 2024 Jul 28:2024.07.27.605468. doi: 10.1101/2024.07.27.605468.

DOI:10.1101/2024.07.27.605468
PMID:39091736
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11291176/
Abstract

Ground-breaking progress has been made in structure prediction of biomolecular assemblies, including the recent breakthrough of AlphaFold 3. However, it remains challenging for AlphaFold 3 and other state-of-the-art deep learning-based methods to accurately predict protein-RNA complex structures, in part due to the limited availability of evolutionary and structural information related to protein-RNA interactions that are used as inputs to the existing approaches. Here, we introduce ProRNA3D-single, a new deep-learning framework for protein-RNA complex structure prediction with only single-sequence input. Using a novel geometric attention-enabled pairing of biological language models of protein and RNA, a previously unexplored avenue, ProRNA3D-single enables the prediction of interatomic protein-RNA interaction maps, which are then transformed into multi-scale geometric restraints for modeling 3D structures of protein-RNA complexes via geometry optimization. Benchmark tests show that ProRNA3D-single convincingly outperforms current state-of-the-art methods including AlphaFold 3, particularly when evolutionary information is limited; and exhibits remarkable robustness and performance resilience by attaining better accuracy with only single-sequence input than what most methods can achieve even with explicit evolutionary information. Freely available at https://github.com/Bhattacharya-Lab/ProRNA3D-single, ProRNA3D-single should be broadly useful for modeling 3D structures of protein-RNA complexes at scale, regardless of the availability of evolutionary information.

摘要

在生物分子组装体的结构预测方面已经取得了突破性进展,包括最近AlphaFold 3的突破。然而,对于AlphaFold 3和其他基于深度学习的先进方法来说,准确预测蛋白质-RNA复合物结构仍然具有挑战性,部分原因是作为现有方法输入的与蛋白质-RNA相互作用相关的进化和结构信息有限。在这里,我们介绍了ProRNA3D-single,这是一种仅使用单序列输入进行蛋白质-RNA复合物结构预测的新型深度学习框架。通过使用一种新颖的、启用几何注意力的蛋白质和RNA生物语言模型配对(这是一条以前未探索的途径),ProRNA3D-single能够预测原子间蛋白质-RNA相互作用图谱,然后将其转化为多尺度几何约束,通过几何优化对蛋白质-RNA复合物的三维结构进行建模。基准测试表明,ProRNA3D-single令人信服地优于包括AlphaFold 3在内的当前先进方法,特别是在进化信息有限时;并且通过仅使用单序列输入就能获得比大多数方法即使使用明确进化信息时还要好的准确性,展现出显著的稳健性和性能弹性。ProRNA3D-single可在https://github.com/Bhattacharya-Lab/ProRNA3D-single上免费获取,无论进化信息是否可用,它对于大规模建模蛋白质-RNA复合物的三维结构都应该具有广泛的用途。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a750/11291176/5c3105f02f04/nihpp-2024.07.27.605468v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a750/11291176/e9b52e7c0d50/nihpp-2024.07.27.605468v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a750/11291176/cd1c34cd8040/nihpp-2024.07.27.605468v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a750/11291176/e44b0c6bc577/nihpp-2024.07.27.605468v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a750/11291176/27385b980523/nihpp-2024.07.27.605468v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a750/11291176/edb3895052de/nihpp-2024.07.27.605468v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a750/11291176/8fe09f8be042/nihpp-2024.07.27.605468v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a750/11291176/5c3105f02f04/nihpp-2024.07.27.605468v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a750/11291176/e9b52e7c0d50/nihpp-2024.07.27.605468v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a750/11291176/cd1c34cd8040/nihpp-2024.07.27.605468v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a750/11291176/e44b0c6bc577/nihpp-2024.07.27.605468v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a750/11291176/27385b980523/nihpp-2024.07.27.605468v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a750/11291176/edb3895052de/nihpp-2024.07.27.605468v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a750/11291176/8fe09f8be042/nihpp-2024.07.27.605468v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a750/11291176/5c3105f02f04/nihpp-2024.07.27.605468v1-f0007.jpg

相似文献

1
Single-sequence protein-RNA complex structure prediction by geometric attention-enabled pairing of biological language models.通过启用几何注意力的生物语言模型配对进行单序列蛋白质-RNA复合物结构预测。
bioRxiv. 2024 Jul 28:2024.07.27.605468. doi: 10.1101/2024.07.27.605468.
2
PANDA-3D: protein function prediction based on AlphaFold models.PANDA-3D:基于AlphaFold模型的蛋白质功能预测
NAR Genom Bioinform. 2024 Aug 6;6(3):lqae094. doi: 10.1093/nargab/lqae094. eCollection 2024 Sep.
3
EquiPNAS: improved protein-nucleic acid binding site prediction using protein-language-model-informed equivariant deep graph neural networks.EquiPNAS:使用蛋白质语言模型信息的等变深度图神经网络改进蛋白质-核酸结合位点预测
bioRxiv. 2023 Sep 16:2023.09.14.557719. doi: 10.1101/2023.09.14.557719.
4
EquiPNAS: improved protein-nucleic acid binding site prediction using protein-language-model-informed equivariant deep graph neural networks.EquiPNAS:利用基于蛋白质语言模型的等变深度图神经网络提高蛋白质-核酸结合位点预测。
Nucleic Acids Res. 2024 Mar 21;52(5):e27. doi: 10.1093/nar/gkae039.
5
E(3) equivariant graph neural networks for robust and accurate protein-protein interaction site prediction.E(3)等变图神经网络用于稳健和准确的蛋白质-蛋白质相互作用位点预测。
PLoS Comput Biol. 2023 Aug 31;19(8):e1011435. doi: 10.1371/journal.pcbi.1011435. eCollection 2023 Aug.
6
PIQLE: protein-protein interface quality estimation by deep graph learning of multimeric interaction geometries.PIQLE:通过多聚体相互作用几何结构的深度图学习进行蛋白质-蛋白质界面质量评估
Bioinform Adv. 2023 Jun 2;3(1):vbad070. doi: 10.1093/bioadv/vbad070. eCollection 2023.
7
Reliable protein-protein docking with AlphaFold, Rosetta, and replica-exchange.使用AlphaFold、Rosetta和副本交换进行可靠的蛋白质-蛋白质对接。
bioRxiv. 2025 Jan 2:2023.07.28.551063. doi: 10.1101/2023.07.28.551063.
8
Accurately identifying nucleic-acid-binding sites through geometric graph learning on language model predicted structures.通过对语言模型预测结构进行几何图学习来准确识别核酸结合位点。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad360.
9
GATSol, an enhanced predictor of protein solubility through the synergy of 3D structure graph and large language modeling.GATSol,一种通过 3D 结构图和大型语言模型协同作用增强蛋白质可溶性预测的方法。
BMC Bioinformatics. 2024 Jun 1;25(1):204. doi: 10.1186/s12859-024-05820-8.
10
Improved RNA secondary structure and tertiary base-pairing prediction using evolutionary profile, mutational coupling and two-dimensional transfer learning.利用进化谱、突变耦合和二维迁移学习改进RNA二级结构和三级碱基配对预测。
Bioinformatics. 2021 Sep 9;37(17):2589-2600. doi: 10.1093/bioinformatics/btab165.

本文引用的文献

1
Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.
2
Generalized biomolecular modeling and design with RoseTTAFold All-Atom.基于 RoseTTAFold All-Atom 的广义生物分子建模与设计。
Science. 2024 Apr 19;384(6693):eadl2528. doi: 10.1126/science.adl2528.
3
EquiPNAS: improved protein-nucleic acid binding site prediction using protein-language-model-informed equivariant deep graph neural networks.
EquiPNAS:利用基于蛋白质语言模型的等变深度图神经网络提高蛋白质-核酸结合位点预测。
Nucleic Acids Res. 2024 Mar 21;52(5):e27. doi: 10.1093/nar/gkae039.
4
Accurate prediction of protein-nucleic acid complexes using RoseTTAFoldNA.使用 RoseTTAFoldNA 准确预测蛋白质-核酸复合物。
Nat Methods. 2024 Jan;21(1):117-121. doi: 10.1038/s41592-023-02086-5. Epub 2023 Nov 23.
5
trRosettaRNA: automated prediction of RNA 3D structure with transformer network.trRosettaRNA:基于 Transformer 网络的 RNA 三维结构自动预测。
Nat Commun. 2023 Nov 9;14(1):7266. doi: 10.1038/s41467-023-42528-4.
6
E(3) equivariant graph neural networks for robust and accurate protein-protein interaction site prediction.E(3)等变图神经网络用于稳健和准确的蛋白质-蛋白质相互作用位点预测。
PLoS Comput Biol. 2023 Aug 31;19(8):e1011435. doi: 10.1371/journal.pcbi.1011435. eCollection 2023 Aug.
7
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
8
Evaluating native-like structures of RNA-protein complexes through the deep learning method.通过深度学习方法评估 RNA-蛋白质复合物的天然结构。
Nat Commun. 2023 Feb 24;14(1):1060. doi: 10.1038/s41467-023-36720-9.
9
Large language models generate functional protein sequences across diverse families.大型语言模型可生成不同家族的功能性蛋白质序列。
Nat Biotechnol. 2023 Aug;41(8):1099-1106. doi: 10.1038/s41587-022-01618-2. Epub 2023 Jan 26.
10
Crystal structure of a pre-chemistry viral RNA-dependent RNA polymerase suggests participation of two basic residues in catalysis.一种前化学病毒 RNA 依赖性 RNA 聚合酶的晶体结构表明两个碱性残基参与了催化作用。
Nucleic Acids Res. 2022 Nov 28;50(21):12389-12399. doi: 10.1093/nar/gkac1133.