• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TAFPred:基于蛋白质序列的扭转角波动预测

TAFPred: Torsion Angle Fluctuations Prediction from Protein Sequences.

作者信息

Kabir Md Wasi Ul, Alawad Duaa Mohammad, Mishra Avdesh, Hoque Md Tamjidul

机构信息

Computer Science Department, University of New Orleans, New Orleans, LA 70148, USA.

Department of Electrical Engineering and Computer Science, Texas A&M University-Kingsville, Kingsville, TX 78363, USA.

出版信息

Biology (Basel). 2023 Jul 19;12(7):1020. doi: 10.3390/biology12071020.

DOI:10.3390/biology12071020
PMID:37508449
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10376102/
Abstract

Protein molecules show varying degrees of flexibility throughout their three-dimensional structures. The flexibility is determined by the fluctuations in torsion angles, specifically phi (φ) and psi (ψ), which define the protein backbone. These angle fluctuations are derived from variations in backbone torsion angles observed in different models. By analyzing the fluctuations in Cartesian coordinate space, we can understand the structural flexibility of proteins. Predicting torsion angle fluctuations is valuable for determining protein function and structure when these angles act as constraints. In this study, a machine learning method called TAFPred is developed to predict torsion angle fluctuations using protein sequences directly. The method incorporates various features, such as disorder probability, position-specific scoring matrix profiles, secondary structure probabilities, and more. TAFPred, employing an optimized Light Gradient Boosting Machine Regressor (LightGBM), achieved high accuracy with correlation coefficients of 0.746 and 0.737 and mean absolute errors of 0.114 and 0.123 for the φ and ψ angles, respectively. Compared to the state-of-the-art method, TAFPred demonstrated significant improvements of 10.08% in MAE and 24.83% in PCC for the phi angle and 9.93% in MAE, and 22.37% in PCC for the psi angle.

摘要

蛋白质分子在其三维结构中表现出不同程度的灵活性。这种灵活性由扭转角的波动决定,特别是定义蛋白质主链的φ(φ)和ψ(ψ)角。这些角度波动源自不同模型中观察到的主链扭转角变化。通过分析笛卡尔坐标空间中的波动,我们可以了解蛋白质的结构灵活性。当这些角度作为约束条件时,预测扭转角波动对于确定蛋白质的功能和结构很有价值。在本研究中,开发了一种名为TAFPred的机器学习方法,直接使用蛋白质序列预测扭转角波动。该方法纳入了各种特征,如无序概率、位置特异性评分矩阵概况、二级结构概率等。TAFPred采用优化的轻梯度提升机回归器(LightGBM),对于φ角和ψ角,相关系数分别为0.746和0.737,平均绝对误差分别为0.114和0.123,实现了高精度。与现有最先进方法相比,TAFPred在φ角的平均绝对误差(MAE)上有10.08%的显著提高,在皮尔逊相关系数(PCC)上有24.83%的显著提高;在ψ角的MAE上有9.93%的显著提高,在PCC上有22.37%的显著提高。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f8c/10376102/9616b9c20770/biology-12-01020-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f8c/10376102/322252bfb75c/biology-12-01020-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f8c/10376102/8b6c8bd159ed/biology-12-01020-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f8c/10376102/78fcb27e9f19/biology-12-01020-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f8c/10376102/ac0ad92c2597/biology-12-01020-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f8c/10376102/9616b9c20770/biology-12-01020-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f8c/10376102/322252bfb75c/biology-12-01020-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f8c/10376102/8b6c8bd159ed/biology-12-01020-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f8c/10376102/78fcb27e9f19/biology-12-01020-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f8c/10376102/ac0ad92c2597/biology-12-01020-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f8c/10376102/9616b9c20770/biology-12-01020-g008.jpg

相似文献

1
TAFPred: Torsion Angle Fluctuations Prediction from Protein Sequences.TAFPred:基于蛋白质序列的扭转角波动预测
Biology (Basel). 2023 Jul 19;12(7):1020. doi: 10.3390/biology12071020.
2
Fluctuations of backbone torsion angles obtained from NMR-determined structures and their prediction.从 NMR 确定的结构中获得的骨架扭转角的波动及其预测。
Proteins. 2010 Dec;78(16):3353-62. doi: 10.1002/prot.22842.
3
Deep learning methods for protein torsion angle prediction.用于蛋白质扭转角预测的深度学习方法。
BMC Bioinformatics. 2017 Sep 18;18(1):417. doi: 10.1186/s12859-017-1834-2.
4
TANGLE: two-level support vector regression approach for protein backbone torsion angle prediction from primary sequences.TANGLE:一种两级支持向量回归方法,用于从蛋白质一级序列预测蛋白质主链扭转角。
PLoS One. 2012;7(2):e30361. doi: 10.1371/journal.pone.0030361. Epub 2012 Feb 2.
5
ANGLOR: a composite machine-learning algorithm for protein backbone torsion angle prediction.ANGLOR:一种用于蛋白质主链扭转角预测的复合机器学习算法。
PLoS One. 2008;3(10):e3400. doi: 10.1371/journal.pone.0003400. Epub 2008 Oct 15.
6
Real-value prediction of backbone torsion angles.主链扭转角的实值预测。
Proteins. 2008 Jul;72(1):427-33. doi: 10.1002/prot.21940.
7
Grid-based prediction of torsion angle probabilities of protein backbone and its application to discrimination of protein intrinsic disorder regions and selection of model structures.基于网格的蛋白质骨架扭转角概率预测及其在蛋白质固有无序区域的区分和模型结构选择中的应用。
BMC Bioinformatics. 2018 Feb 1;19(1):29. doi: 10.1186/s12859-018-2031-7.
8
Improving prediction of secondary structure, local backbone angles, and solvent accessible surface area of proteins by iterative deep learning.通过迭代深度学习改进蛋白质二级结构、局部主链角度和溶剂可及表面积的预测。
Sci Rep. 2015 Jun 22;5:11476. doi: 10.1038/srep11476.
9
Prediction of Protein Backbone Torsion Angles Using Deep Residual Inception Neural Networks.使用深度残差 inception 神经网络预测蛋白质主链扭转角
IEEE/ACM Trans Comput Biol Bioinform. 2018 Mar 12. doi: 10.1109/TCBB.2018.2814586.
10
SAINT-Angle: self-attention augmented inception-inside-inception network and transfer learning improve protein backbone torsion angle prediction.圣角:自注意力增强的 inception-inside-inception 网络与迁移学习提升蛋白质主链扭转角预测
Bioinform Adv. 2023 Apr 5;3(1):vbad042. doi: 10.1093/bioadv/vbad042. eCollection 2023.

引用本文的文献

1
DRBpred: A sequence-based machine learning method to effectively predict DNA- and RNA-binding residues.DRBpred:一种基于序列的机器学习方法,可有效预测 DNA 和 RNA 结合残基。
Comput Biol Med. 2024 Mar;170:108081. doi: 10.1016/j.compbiomed.2024.108081. Epub 2024 Jan 29.

本文引用的文献

1
NMR Provides Unique Insight into the Functional Dynamics and Interactions of Intrinsically Disordered Proteins.NMR 提供了对无规卷曲蛋白质的功能动态和相互作用的独特见解。
Chem Rev. 2022 May 25;122(10):9331-9356. doi: 10.1021/acs.chemrev.1c01023. Epub 2022 Apr 21.
2
Structure-Based Drug Design Using Deep Learning.基于结构的深度学习药物设计。
J Chem Inf Model. 2022 Nov 14;62(21):5100-5109. doi: 10.1021/acs.jcim.1c01319. Epub 2021 Nov 18.
3
AlphaFold heralds a data-driven revolution in biology and medicine.AlphaFold预示着生物学和医学领域一场由数据驱动的革命。
Nat Med. 2021 Oct;27(10):1666-1669. doi: 10.1038/s41591-021-01533-0.
4
flDPnn: Accurate intrinsic disorder prediction with putative propensities of disorder functions.flDPnn:利用无序功能的假定倾向进行准确的固有无序预测。
Nat Commun. 2021 Jul 21;12(1):4438. doi: 10.1038/s41467-021-24773-7.
5
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
6
A Unified Description of Intrinsically Disordered Protein Dynamics under Physiological Conditions Using NMR Spectroscopy.使用 NMR 光谱技术对生理条件下的无规卷曲蛋白动力学进行统一描述。
J Am Chem Soc. 2019 Nov 6;141(44):17817-17829. doi: 10.1021/jacs.9b09002. Epub 2019 Oct 22.
7
Advances in protein structure prediction and design.蛋白质结构预测和设计的进展。
Nat Rev Mol Cell Biol. 2019 Nov;20(11):681-697. doi: 10.1038/s41580-019-0163-x. Epub 2019 Aug 15.
8
DeepGOPlus: improved protein function prediction from sequence.DeepGOPlus:从序列中改进蛋白质功能预测。
Bioinformatics. 2020 Jan 15;36(2):422-429. doi: 10.1093/bioinformatics/btz595.
9
Troubleshooting Guide to Expressing Intrinsically Disordered Proteins for Use in NMR Experiments.用于核磁共振实验的内在无序蛋白质表达故障排除指南。
Front Mol Biosci. 2019 Jan 18;5:118. doi: 10.3389/fmolb.2018.00118. eCollection 2018.
10
PBRpredict-Suite: a suite of models to predict peptide-recognition domain residues from protein sequence.PBRpredict-Suite:一套用于从蛋白质序列预测肽识别结构域残基的模型。
Bioinformatics. 2018 Oct 1;34(19):3289-3299. doi: 10.1093/bioinformatics/bty352.