• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过肽-蛋白对接和机器学习加速小分子肽配体的筛选。

Accelerating the Screening of Small Peptide Ligands by Combining Peptide-Protein Docking and Machine Learning.

机构信息

Department of Biochemistry and Molecular Biology, Miller School of Medicine, University of Miami, Miami, FL 33136, USA.

Department of Bioscience and Technology for Food, Agriculture and Environment, University of Teramo, 64100 Teramo, Italy.

出版信息

Int J Mol Sci. 2023 Jul 29;24(15):12144. doi: 10.3390/ijms241512144.

DOI:10.3390/ijms241512144
PMID:37569520
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10419121/
Abstract

This research introduces a novel pipeline that couples machine learning (ML), and molecular docking for accelerating the process of small peptide ligand screening through the prediction of peptide-protein docking. Eight ML algorithms were analyzed for their potential. Notably, Light Gradient Boosting Machine (LightGBM), despite having comparable F1-score and accuracy to its counterparts, showcased superior computational efficiency. LightGBM was used to classify peptide-protein docking performance of the entire tetrapeptide library of 160,000 peptide ligands against four viral envelope proteins. The library was classified into two groups, 'better performers' and 'worse performers'. By training the LightGBM algorithm on just 1% of the tetrapeptide library, we successfully classified the remaining 99%with an accuracy range of 0.81-0.85 and an F1-score between 0.58-0.67. Three different molecular docking software were used to prove that the process is not software dependent. With an adjustable probability threshold (from 0.5 to 0.95), the process could be accelerated by a factor of at least 10-fold and still get 90-95% concurrence with the method without ML. This study validates the efficiency of machine learning coupled to molecular docking in rapidly identifying top peptides without relying on high-performance computing power, making it an effective tool for screening potential bioactive compounds.

摘要

这项研究介绍了一种新的流水线,该流水线将机器学习(ML)和分子对接相结合,通过预测肽-蛋白对接来加速小肽配体筛选的过程。分析了八种 ML 算法的潜力。值得注意的是,尽管 Light Gradient Boosting Machine(LightGBM)的 F1 得分和准确性与其同类相当,但它具有更高的计算效率。LightGBM 用于对来自四个病毒包膜蛋白的 16 万个肽配体的整个四肽库的肽-蛋白对接性能进行分类。该文库被分为两组,“表现更好”和“表现更差”。通过仅在四肽文库的 1%上训练 LightGBM 算法,我们成功地以 0.81-0.85 的准确率和 0.58-0.67 的 F1 分数对其余 99%进行了分类。使用三种不同的分子对接软件证明了该过程不依赖于软件。通过调整概率阈值(从 0.5 到 0.95),该过程可以加速 10 倍以上,并且仍然可以与不使用 ML 的方法达到 90-95%的一致性。这项研究验证了机器学习与分子对接相结合在快速识别顶级肽方面的效率,而无需依赖高性能计算能力,使其成为筛选潜在生物活性化合物的有效工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ff8/10419121/7720a95df39b/ijms-24-12144-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ff8/10419121/260560b75233/ijms-24-12144-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ff8/10419121/01139752ed4b/ijms-24-12144-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ff8/10419121/bef869e35f14/ijms-24-12144-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ff8/10419121/15e67ed88ab3/ijms-24-12144-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ff8/10419121/7720a95df39b/ijms-24-12144-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ff8/10419121/260560b75233/ijms-24-12144-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ff8/10419121/01139752ed4b/ijms-24-12144-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ff8/10419121/bef869e35f14/ijms-24-12144-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ff8/10419121/15e67ed88ab3/ijms-24-12144-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ff8/10419121/7720a95df39b/ijms-24-12144-g005.jpg

相似文献

1
Accelerating the Screening of Small Peptide Ligands by Combining Peptide-Protein Docking and Machine Learning.通过肽-蛋白对接和机器学习加速小分子肽配体的筛选。
Int J Mol Sci. 2023 Jul 29;24(15):12144. doi: 10.3390/ijms241512144.
2
Geometry Optimization Algorithms in Conjunction with the Machine Learning Potential ANI-2x Facilitate the Structure-Based Virtual Screening and Binding Mode Prediction.几何优化算法与机器学习势能ANI-2x 相结合,有助于基于结构的虚拟筛选和结合模式预测。
Biomolecules. 2024 May 31;14(6):648. doi: 10.3390/biom14060648.
3
DENVIS: Scalable and High-Throughput Virtual Screening Using Graph Neural Networks with Atomic and Surface Protein Pocket Features.DENVIS:使用具有原子和表面蛋白口袋特征的图神经网络进行可扩展的高通量虚拟筛选。
J Chem Inf Model. 2022 Oct 10;62(19):4642-4659. doi: 10.1021/acs.jcim.2c01057. Epub 2022 Sep 26.
4
Machine learning in computational docking.计算对接中的机器学习。
Artif Intell Med. 2015 Mar;63(3):135-52. doi: 10.1016/j.artmed.2015.02.002. Epub 2015 Feb 16.
5
Boosted neural networks scoring functions for accurate ligand docking and ranking.用于精确配体对接和排序的增强神经网络评分函数。
J Bioinform Comput Biol. 2018 Apr;16(2):1850004. doi: 10.1142/S021972001850004X. Epub 2018 Feb 4.
6
LCP: Simple Representation of Docking Poses for Machine Learning: A Case Study on Xanthine Oxidase Inhibitors.LCP:用于机器学习的对接构象的简单表示:黄嘌呤氧化酶抑制剂的案例研究。
Mol Inform. 2022 May;41(5):e2100245. doi: 10.1002/minf.202100245. Epub 2021 Nov 29.
7
Task-Specific Scoring Functions for Predicting Ligand Binding Poses and Affinity and for Screening Enrichment.用于预测配体结合构象和亲和力以及进行筛选富集的任务特定评分函数。
J Chem Inf Model. 2018 Jan 22;58(1):119-133. doi: 10.1021/acs.jcim.7b00309. Epub 2017 Dec 20.
8
Use of tree-based machine learning methods to screen affinitive peptides based on docking data.基于对接数据,使用基于树的机器学习方法筛选亲和肽。
Mol Inform. 2023 Dec;42(12):e202300143. doi: 10.1002/minf.202300143. Epub 2023 Nov 9.
9
Integrating sampling techniques and inverse virtual screening: toward the discovery of artificial peptide-based receptors for ligands.整合采样技术与反向虚拟筛选:探索基于人工肽的配体受体
Mol Divers. 2016 May;20(2):421-38. doi: 10.1007/s11030-015-9648-5. Epub 2015 Nov 9.
10
SCORCH: Improving structure-based virtual screening with machine learning classifiers, data augmentation, and uncertainty estimation.SCORCH:利用机器学习分类器、数据增强和不确定性估计改进基于结构的虚拟筛选。
J Adv Res. 2023 Apr;46:135-147. doi: 10.1016/j.jare.2022.07.001. Epub 2022 Jul 25.

引用本文的文献

1
AAindexNC: Estimating the Physicochemical Properties of Non-Canonical Amino Acids, Including Those Derived from the PDB and PDBeChem Databank.AAindexNC:估算非标准氨基酸的物理化学性质,包括那些源自蛋白质数据库(PDB)和蛋白质数据银行化学数据库(PDBeChem)的非标准氨基酸。
Int J Mol Sci. 2024 Nov 22;25(23):12555. doi: 10.3390/ijms252312555.

本文引用的文献

1
2022 FDA TIDES (Peptides and Oligonucleotides) Harvest.2022年美国食品药品监督管理局潮汐项目(肽与寡核苷酸)成果
Pharmaceuticals (Basel). 2023 Feb 22;16(3):336. doi: 10.3390/ph16030336.
2
Machine learning on protein-protein interaction prediction: models, challenges and trends.蛋白质-蛋白质相互作用预测中的机器学习:模型、挑战与趋势。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad076.
3
Machine Learning Advances in Predicting Peptide/Protein-Protein Interactions Based on Sequence Information for Lead Peptides Discovery.
基于序列信息的肽/蛋白质-蛋白质相互作用预测的机器学习进展,用于先导肽发现。
Adv Biol (Weinh). 2023 Jun;7(6):e2200232. doi: 10.1002/adbi.202200232. Epub 2023 Feb 12.
4
Prediction of Protein-Protein Interaction Sites by Multifeature Fusion and RF with mRMR and IFS.基于 mRMR 和 IFS 的多特征融合和 RF 预测蛋白质-蛋白质相互作用位点。
Dis Markers. 2022 Oct 4;2022:5892627. doi: 10.1155/2022/5892627. eCollection 2022.
5
In silico SELEX screening and statistical analysis of newly designed 5mer peptide-aptamers as Bcl-xl inhibitors using the Taguchi method.基于 Taguchi 法的新型 5 肽适体筛选及其作为 Bcl-xl 抑制剂的统计学分析
Comput Biol Med. 2022 Jul;146:105632. doi: 10.1016/j.compbiomed.2022.105632. Epub 2022 May 18.
6
An overview on display systems (phage, bacterial, and yeast display) for production of anticancer antibodies; advantages and disadvantages.展示系统(噬菌体、细菌和酵母展示)在生产抗癌抗体方面的概述;优缺点。
Int J Biol Macromol. 2022 May 31;208:421-442. doi: 10.1016/j.ijbiomac.2022.03.113. Epub 2022 Mar 24.
7
Therapeutic peptides: current applications and future directions.治疗性肽:当前的应用及未来方向。
Signal Transduct Target Ther. 2022 Feb 14;7(1):48. doi: 10.1038/s41392-022-00904-4.
8
Artificial intelligence-enabled virtual screening of ultra-large chemical libraries with deep docking.基于深度对接的人工智能辅助超大规模化学库虚拟筛选。
Nat Protoc. 2022 Mar;17(3):672-697. doi: 10.1038/s41596-021-00659-2. Epub 2022 Feb 4.
9
A new class of sensing elements for sensors: Clamp peptides for Zika virus.用于传感器的新型传感元件:寨卡病毒夹肽。
Biosens Bioelectron. 2021 Nov 1;191:113471. doi: 10.1016/j.bios.2021.113471. Epub 2021 Jul 1.
10
Trends in peptide drug discovery.肽类药物研发趋势。
Nat Rev Drug Discov. 2021 Apr;20(4):309-325. doi: 10.1038/s41573-020-00135-8. Epub 2021 Feb 3.