• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用深度学习和大型蛋白质语言模型预测外周膜蛋白的蛋白质-膜界面。

Using deep learning and large protein language models to predict protein-membrane interfaces of peripheral membrane proteins.

作者信息

Paranou Dimitra, Chatzigoulas Alexios, Cournia Zoe

机构信息

Biomedical Research Foundation, Academy of Athens, Athens 11527, Greece.

Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Athens 15784, Greece.

出版信息

Bioinform Adv. 2024 May 28;4(1):vbae078. doi: 10.1093/bioadv/vbae078. eCollection 2024.

DOI:10.1093/bioadv/vbae078
PMID:39559823
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11572487/
Abstract

MOTIVATION

Characterizing interactions at the protein-membrane interface is crucial as abnormal peripheral protein-membrane attachment is involved in the onset of many diseases. However, a limiting factor in studying and understanding protein-membrane interactions is that the membrane-binding domains of peripheral membrane proteins (PMPs) are typically unknown. By applying artificial intelligence techniques in the context of natural language processing (NLP), the accuracy and prediction time for protein-membrane interface analysis can be significantly improved compared to existing methods. Here, we assess whether NLP and protein language models (pLMs) can be used to predict membrane-interacting amino acids for PMPs.

RESULTS

We utilize available experimental data and generate protein embeddings from two pLMs (ProtTrans and ESM) to train classifier models. Overall, the results demonstrate the first proof of concept study and the promising potential of using deep learning and pLMs to predict protein-membrane interfaces for PMPs faster, with similar accuracy, and without the need for 3D structural data compared to existing tools.

AVAILABILITY AND IMPLEMENTATION

The code is available at https://github.com/zoecournia/pLM-PMI. All data are available in the Supplementary material.

摘要

动机

表征蛋白质 - 膜界面的相互作用至关重要,因为外周蛋白与膜的异常附着与许多疾病的发生有关。然而,研究和理解蛋白质 - 膜相互作用的一个限制因素是外周膜蛋白(PMPs)的膜结合结构域通常未知。通过在自然语言处理(NLP)的背景下应用人工智能技术,与现有方法相比,蛋白质 - 膜界面分析的准确性和预测时间可以得到显著提高。在这里,我们评估NLP和蛋白质语言模型(pLMs)是否可用于预测PMPs与膜相互作用的氨基酸。

结果

我们利用现有的实验数据,并从两个pLMs(ProtTrans和ESM)生成蛋白质嵌入来训练分类器模型。总体而言,结果证明了第一个概念验证研究,以及使用深度学习和pLMs更快地预测PMPs的蛋白质 - 膜界面的潜力,与现有工具相比,具有相似的准确性,且无需3D结构数据。

可用性和实现

代码可在https://github.com/zoecournia/pLM-PMI获取。所有数据都在补充材料中提供。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69bf/11572487/d8d402fdac36/vbae078f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69bf/11572487/e1b21c929fad/vbae078f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69bf/11572487/414c759a2d46/vbae078f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69bf/11572487/f4ee4208a3f3/vbae078f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69bf/11572487/a18344a6a33c/vbae078f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69bf/11572487/d8d402fdac36/vbae078f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69bf/11572487/e1b21c929fad/vbae078f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69bf/11572487/414c759a2d46/vbae078f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69bf/11572487/f4ee4208a3f3/vbae078f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69bf/11572487/a18344a6a33c/vbae078f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69bf/11572487/d8d402fdac36/vbae078f5.jpg

相似文献

1
Using deep learning and large protein language models to predict protein-membrane interfaces of peripheral membrane proteins.利用深度学习和大型蛋白质语言模型预测外周膜蛋白的蛋白质-膜界面。
Bioinform Adv. 2024 May 28;4(1):vbae078. doi: 10.1093/bioadv/vbae078. eCollection 2024.
2
Predicting protein-membrane interfaces of peripheral membrane proteins using ensemble machine learning.使用集成机器学习预测外周膜蛋白的蛋白-膜界面。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab518.
3
ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning.ProtTrans:通过自监督学习理解生命语言。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):7112-7127. doi: 10.1109/TPAMI.2021.3095381. Epub 2022 Sep 14.
4
Does protein pretrained language model facilitate the prediction of protein-ligand interaction?蛋白质预训练语言模型是否有助于预测蛋白质-配体相互作用?
Methods. 2023 Nov;219:8-15. doi: 10.1016/j.ymeth.2023.08.016. Epub 2023 Sep 9.
5
Embeddings from protein language models predict conservation and variant effects.基于蛋白质语言模型的嵌入模型可预测保守性和变异效应。
Hum Genet. 2022 Oct;141(10):1629-1647. doi: 10.1007/s00439-021-02411-y. Epub 2021 Dec 30.
6
TemStaPro: protein thermostability prediction using sequence representations from protein language models.TemStaPro:使用蛋白质语言模型的序列表示进行蛋白质热稳定性预测。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae157.
7
S-PLM: Structure-aware Protein Language Model via Contrastive Learning between Sequence and Structure.S-PLM:通过序列与结构之间的对比学习实现的结构感知蛋白质语言模型。
bioRxiv. 2024 May 13:2023.08.06.552203. doi: 10.1101/2023.08.06.552203.
8
EquiPNAS: improved protein-nucleic acid binding site prediction using protein-language-model-informed equivariant deep graph neural networks.EquiPNAS:使用蛋白质语言模型信息的等变深度图神经网络改进蛋白质-核酸结合位点预测
bioRxiv. 2023 Sep 16:2023.09.14.557719. doi: 10.1101/2023.09.14.557719.
9
EquiPNAS: improved protein-nucleic acid binding site prediction using protein-language-model-informed equivariant deep graph neural networks.EquiPNAS:利用基于蛋白质语言模型的等变深度图神经网络提高蛋白质-核酸结合位点预测。
Nucleic Acids Res. 2024 Mar 21;52(5):e27. doi: 10.1093/nar/gkae039.
10
Learned Embeddings from Deep Learning to Visualize and Predict Protein Sets.深度学习提取的学习特征可用于可视化和预测蛋白质组。
Curr Protoc. 2021 May;1(5):e113. doi: 10.1002/cpz1.113.

引用本文的文献

1
Large Context, Deeper Insights: Harnessing Large Language Models for Advancing Protein-Protein Interaction Analysis.大背景,更深刻的见解:利用大语言模型推动蛋白质-蛋白质相互作用分析
Methods Mol Biol. 2025;2941:243-267. doi: 10.1007/978-1-0716-4623-6_15.

本文引用的文献

1
PMIpred: a physics-informed web server for quantitative protein-membrane interaction prediction.PMIpred:一个定量蛋白质-膜相互作用预测的物理信息学网络服务器。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae069.
2
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
3
Dissecting peripheral protein-membrane interfaces.解析外周蛋白-膜界面。
PLoS Comput Biol. 2022 Dec 14;18(12):e1010346. doi: 10.1371/journal.pcbi.1010346. eCollection 2022 Dec.
4
DREAMM: a web-based server for drugging protein-membrane interfaces as a novel workflow for targeted drug design.DREAMM:一个基于网络的药物设计服务器,用于靶向药物设计的新工作流程,以药物蛋白质-膜界面。
Bioinformatics. 2022 Dec 13;38(24):5449-5451. doi: 10.1093/bioinformatics/btac680.
5
Predicting protein-membrane interfaces of peripheral membrane proteins using ensemble machine learning.使用集成机器学习预测外周膜蛋白的蛋白-膜界面。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab518.
6
Precision Targeting of Mutant PI3Kα in Cancer by Selective Degradation.通过选择性降解精准靶向癌症中的突变型 PI3Kα。
Cancer Discov. 2022 Jan;12(1):20-22. doi: 10.1158/2159-8290.CD-21-1411.
7
AMMU: A survey of transformer-based biomedical pretrained language models.基于变压器的生物医学预训练语言模型综述。
J Biomed Inform. 2022 Feb;126:103982. doi: 10.1016/j.jbi.2021.103982. Epub 2021 Dec 31.
8
Spatial arrangement of proteins in planar and curved membranes by PPM 3.0.PPM 3.0 对平面和弯曲膜中蛋白质的空间排布。
Protein Sci. 2022 Jan;31(1):209-220. doi: 10.1002/pro.4219. Epub 2021 Nov 8.
9
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
10
ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning.ProtTrans:通过自监督学习理解生命语言。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):7112-7127. doi: 10.1109/TPAMI.2021.3095381. Epub 2022 Sep 14.