Systematic Comparison and Comprehensive Evaluation of 80 Amino Acid Descriptors in Peptide QSAR Modeling.

Affiliations

Center for Informational Biology, University of Electronic Science and Technology of China (UESTC) at Qingshuihe Campus, Chengdu 611731, China.

School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC) at Shahe Campus, Chengdu 610054, China.

Publication information

J Chem Inf Model. 2021 Apr 26;61(4):1718-1731. doi: 10.1021/acs.jcim.0c01370. Epub 2021 Mar 12.

DOI: 10.1021/acs.jcim.0c01370
PMID: 33710894

Abstract

The peptide quantitative structure-activity relationship (QSAR), also known as the quantitative sequence-activity model (QSAM), has attracted much attention in the bio- and chemoinformatics communities and is a well developed computational peptidology strategy to statistically correlate the sequence/structure and activity/property relationships of functional peptides. Amino acid descriptors (AADs) are one of the most widely used methods to characterize peptide structures by decomposing the peptide into its residue building blocks and sequentially parametrizing each building block with a vector of amino acid principal properties. Considering that various AADs have been proposed over the past decades and new AADs are still emerging today, we herein query the following: is it necessary to develop so many AADs and do we need to continuously develop more new AADs? In this study, we exhaustively collect 80 published AADs and comprehensively evaluate their modeling performance (including fitting ability, internal stability, and predictive power) on 8 QSAR-oriented peptide sample sets (QPSs) by employing 2 sophisticated machine learning methods (MLMs), totally building and systematically comparing 1280 (80 AADs × 8 QPSs × 2 MLMs) peptide QSAR models. The following is revealed: (i) None of the AADs can work best on all or most peptide sets; an AAD usually performs well for some peptides but badly for others. (ii) Modeling performance is primarily determined by the peptide samples and then the MLMs used, while AADs have only a moderate influence on the performance. (iii) There is no essential difference between the modeling performances of different AAD types (physiochemical, topological, 3D-structural, etc.). (iv) Two random descriptors, which are separately generated randomly in standard normal distribution (0, 1) and uniform distribution (-1, +1), do not perform significantly worse than these carefully developed AADs. (v) A secondary descriptor, which carries major information involved in the 80 (primary) AADs, does not perform significantly better than these AADs. Overall, we conclude that since there are various AADs available to date and they already cover numerous amino acid properties, further development of new AADs is not an essential choice to improve peptide QSAR modeling; the traditional AAD methodology is believed to have almost reached the theoretical limit nowadays. In addition, the AADs are more likely to be a vector symbol but not informative data; they are utilized to mark and distinguish the 20 amino acids but do not really bring much original property information to these amino acids.
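
The encoding scheme described in the abstract (split a peptide into residues, then replace each residue with a fixed-length vector) can be illustrated with a minimal sketch. This is not the authors' code: the helper names `random_descriptor` and `encode_peptide`, the choice of 3 components per residue, the seed, and the example sequence "VPP" are all assumptions made for illustration. The two random descriptors follow the distributions named in finding (iv), a standard normal distribution and a uniform distribution on (-1, +1).

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def random_descriptor(n_components, distribution, seed=0):
    """Assign each amino acid a random vector (hypothetical helper).

    distribution: "normal" draws from N(0, 1); "uniform" draws from U(-1, +1).
    n_components is an assumption; the abstract does not state the vector length.
    """
    rng = random.Random(seed)
    if distribution == "normal":
        draw = lambda: rng.gauss(0.0, 1.0)
    elif distribution == "uniform":
        draw = lambda: rng.uniform(-1.0, 1.0)
    else:
        raise ValueError(f"unknown distribution: {distribution}")
    return {aa: [draw() for _ in range(n_components)] for aa in AMINO_ACIDS}


def encode_peptide(sequence, descriptor):
    """Concatenate the per-residue vectors in sequence order."""
    features = []
    for residue in sequence:
        features.extend(descriptor[residue])
    return features


normal_aad = random_descriptor(n_components=3, distribution="normal")
uniform_aad = random_descriptor(n_components=3, distribution="uniform")

# An arbitrary tripeptide: 3 residues x 3 components = 9 features.
x = encode_peptide("VPP", uniform_aad)
print(len(x))  # 9

# Size of the model grid reported in the abstract: AADs x sample sets x methods.
n_models = 80 * 8 * 2
print(n_models)  # 1280
```

A published AAD would slot into the same `encode_peptide` call as a dictionary of 20 property vectors. In this sketch the random vectors carry no property information and only give each of the 20 residues a distinct label, which is the sense in which the abstract describes AADs as "a vector symbol".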

Similar articles

1. Systematic Comparison and Comprehensive Evaluation of 80 Amino Acid Descriptors in Peptide QSAR Modeling.
   J Chem Inf Model. 2021 Apr 26;61(4):1718-1731. doi: 10.1021/acs.jcim.0c01370. Epub 2021 Mar 12.
2. Systematic Modeling, Prediction, and Comparison of Domain-Peptide Affinities: Does it Work Effectively With the Peptide QSAR Methodology?
   Front Genet. 2022 Jan 14;12:800857. doi: 10.3389/fgene.2021.800857. eCollection 2021.
3. PepQSAR: a comprehensive data source and information platform for peptide quantitative structure-activity relationships.
   Amino Acids. 2023 Feb;55(2):235-242. doi: 10.1007/s00726-022-03219-4. Epub 2022 Dec 6.
4. Structural parameterization and functional prediction of antigenic polypeptome sequences with biological activity through quantitative sequence-activity models (QSAM) by molecular electronegativity edge-distance vector (VMED).
   Sci China C Life Sci. 2007 Oct;50(5):706-16. doi: 10.1007/s11427-007-0080-7.
5. Combinatorial QSAR of ambergris fragrance compounds.
   J Chem Inf Comput Sci. 2004 Mar-Apr;44(2):582-95. doi: 10.1021/ci034203t.
6. Exploring the activity space of peptides binding to diverse SH3 domains using principal property descriptors derived from amino acid rotamers.
   Biopolymers. 2011;96(3):288-301. doi: 10.1002/bip.21531.
7. QSSR Modeling of Bacillus Subtilis Lipase A Peptide Collision Cross-Sections in Ion Mobility Spectrometry: Local Descriptor Versus Global Descriptor.
   Protein J. 2021 Feb;40(1):54-62. doi: 10.1007/s10930-020-09960-7. Epub 2021 Jan 16.
8. Quantitative structure-activity relationship modeling reveals the minimal sequence requirement and amino acid preference of sirtuin-1's deacetylation substrates in diabetes mellitus.
   J Bioinform Comput Biol. 2022 Jun;20(3):2250008. doi: 10.1142/S0219720022500081. Epub 2022 Apr 21.
9. Prediction of antioxidant peptides using a quantitative structure-activity relationship predictor (AnOxPP) based on bidirectional long short-term memory neural network and interpretable amino acid descriptors.
   Comput Biol Med. 2023 Mar;154:106591. doi: 10.1016/j.compbiomed.2023.106591. Epub 2023 Jan 24.
10. QSAR modeling of the antimicrobial activity of peptides as a mathematical function of a sequence of amino acids.
    Comput Biol Chem. 2015 Dec;59 Pt A:126-30. doi: 10.1016/j.compbiolchem.2015.09.009. Epub 2015 Sep 21.

Cited by

1. Rational design of potent phosphopeptide binders to endocrine Snk PBD domain by integrating machine learning optimization, molecular dynamics simulation, binding energetics rescoring, and in vitro affinity assay.
   Eur Biophys J. 2025 Feb;54(1-2):33-43. doi: 10.1007/s00249-024-01729-5. Epub 2024 Nov 29.
2. Quantitative physics-physiology relationship modeling of human emotional response to Shu music.
   Front Psychol. 2024 Oct 8;15:1351058. doi: 10.3389/fpsyg.2024.1351058. eCollection 2024.
3. Computational design and experimental confirmation of a disulfide-stapled YAP helix-trap derived from TEAD4 helical hairpin to selectively capture YAP α1-helix with potent antitumor activity.
   J Comput Aided Mol Des. 2024 Aug 23;38(1):31. doi: 10.1007/s10822-024-00572-2.
4. Navigating the Expansive Landscapes of Soft Materials: A User Guide for High-Throughput Workflows.
   ACS Polym Au. 2023 Dec 5;3(6):406-427. doi: 10.1021/acspolymersau.3c00025. eCollection 2023 Dec 13.
5. Coupled folding-upon-binding of human tumor suppressor MIG6 to lung cancer EGFR kinase domain and molecular trimming/stapling of MIG6-derived β-hairpins to target the coupling event.
   Eur Biophys J. 2023 Feb;52(1-2):17-25. doi: 10.1007/s00249-022-01624-x. Epub 2022 Dec 22.
6. PepQSAR: a comprehensive data source and information platform for peptide quantitative structure-activity relationships.
   Amino Acids. 2023 Feb;55(2):235-242. doi: 10.1007/s00726-022-03219-4. Epub 2022 Dec 6.
7. Structural Mapping of BMP Conformational Epitopes and Bioengineering Design of Osteogenic Peptides to Specifically Target the Epitope-Binding Sites.
   Cell Mol Bioeng. 2022 Apr 28;15(4):341-352. doi: 10.1007/s12195-022-00725-z. eCollection 2022 Aug.
8. Rational design of stapled helical peptides as antidiabetic PPARγ antagonists to target coactivator site by decreasing unfavorable entropy penalty instead of increasing favorable enthalpy contribution.
   Eur Biophys J. 2022 Dec;51(7-8):535-543. doi: 10.1007/s00249-022-01616-x. Epub 2022 Sep 4.
9. Comprehensive Evaluation and Comparison of Machine Learning Methods in QSAR Modeling of Antioxidant Tripeptides.
   ACS Omega. 2022 Jul 15;7(29):25760-25771. doi: 10.1021/acsomega.2c03062. eCollection 2022 Jul 26.
10. Application of Machine Learning in Developing Quantitative Structure-Property Relationship for Electronic Properties of Polyaromatic Compounds.
    ACS Omega. 2022 Jun 17;7(26):22879-22888. doi: 10.1021/acsomega.2c02650. eCollection 2022 Jul 5.