• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用机器学习对凝血酶抑制剂抑制常数进行计算机模拟预测

In silico Prediction of Inhibitory Constant of Thrombin Inhibitors Using Machine Learning.

作者信息

Zhao Junnan, Zhu Lu, Zhou Weineng, Yin Lingfeng, Wang Yuchen, Fan Yuanrong, Chen Yadong, Liu Haichun

机构信息

Laboratory of Molecular Design and Drug Discovery, School of Science, China; Pharmaceutical University, 639 Longmian Avenue, Nanjing, 211198 Jiangsu, China.

出版信息

Comb Chem High Throughput Screen. 2018;21(9):662-669. doi: 10.2174/1386207322666181220130232.

DOI:10.2174/1386207322666181220130232
PMID:30569853
Abstract

BACKGROUND

Thrombin is the central protease of the vertebrate blood coagulation cascade, which is closely related to cardiovascular diseases. The inhibitory constant Ki is the most significant property of thrombin inhibitors.

METHOD

This study was carried out to predict Ki values of thrombin inhibitors based on a large data set by using machine learning methods. Taking advantage of finding non-intuitive regularities on high-dimensional datasets, machine learning can be used to build effective predictive models. A total of 6554 descriptors for each compound were collected and an efficient descriptor selection method was chosen to find the appropriate descriptors. Four different methods including multiple linear regression (MLR), K Nearest Neighbors (KNN), Gradient Boosting Regression Tree (GBRT) and Support Vector Machine (SVM) were implemented to build prediction models with these selected descriptors.

RESULTS

The SVM model was the best one among these methods with R2=0.84, MSE=0.55 for the training set and R2=0.83, MSE=0.56 for the test set. Several validation methods such as yrandomization test and applicability domain evaluation, were adopted to assess the robustness and generalization ability of the model. The final model shows excellent stability and predictive ability and can be employed for rapid estimation of the inhibitory constant, which is full of help for designing novel thrombin inhibitors.

摘要

背景

凝血酶是脊椎动物血液凝固级联反应的核心蛋白酶,与心血管疾病密切相关。抑制常数Ki是凝血酶抑制剂最重要的特性。

方法

本研究旨在通过机器学习方法,基于一个大数据集预测凝血酶抑制剂的Ki值。利用在高维数据集上发现非直观规律的优势,机器学习可用于构建有效的预测模型。为每个化合物收集了总共6554个描述符,并选择了一种有效的描述符选择方法来找到合适的描述符。采用包括多元线性回归(MLR)、K近邻(KNN)、梯度提升回归树(GBRT)和支持向量机(SVM)在内的四种不同方法,用这些选定的描述符构建预测模型。

结果

在这些方法中,SVM模型是最好的,训练集的R2 = 0.84,MSE = 0.55,测试集的R2 = 0.83,MSE = 0.56。采用了几种验证方法,如随机化测试和适用域评估,来评估模型的稳健性和泛化能力。最终模型显示出优异的稳定性和预测能力,可用于快速估计抑制常数,这对设计新型凝血酶抑制剂有很大帮助。

相似文献

1
In silico Prediction of Inhibitory Constant of Thrombin Inhibitors Using Machine Learning.使用机器学习对凝血酶抑制剂抑制常数进行计算机模拟预测
Comb Chem High Throughput Screen. 2018;21(9):662-669. doi: 10.2174/1386207322666181220130232.
2
QSAR studies of the bioactivity of hepatitis C virus (HCV) NS3/4A protease inhibitors by multiple linear regression (MLR) and support vector machine (SVM).通过多元线性回归(MLR)和支持向量机(SVM)对丙型肝炎病毒(HCV)NS3/4A蛋白酶抑制剂的生物活性进行定量构效关系(QSAR)研究。
Bioorg Med Chem Lett. 2017 Jul 1;27(13):2931-2938. doi: 10.1016/j.bmcl.2017.05.001. Epub 2017 May 3.
3
Prediction of matrix metal proteinases-12 inhibitors by machine learning approaches.基于机器学习方法预测基质金属蛋白酶-12 抑制剂。
J Biomol Struct Dyn. 2019 Jul;37(10):2627-2640. doi: 10.1080/07391102.2018.1492460. Epub 2018 Dec 24.
4
Exploring QSARs for inhibitory effect of a set of heterocyclic thrombin inhibitors by multilinear regression refined by artificial neural network and molecular docking simulations.通过人工神经网络和分子对接模拟改进的多元线性回归对一组杂环凝血酶抑制剂抑制作用的定量构效关系研究。
J Enzyme Inhib Med Chem. 2012 Apr;27(2):174-86. doi: 10.3109/14756366.2011.582467. Epub 2011 Jun 3.
5
Improved Prediction of Blood-Brain Barrier Permeability Through Machine Learning with Combined Use of Molecular Property-Based Descriptors and Fingerprints.通过机器学习结合分子性质基描述符和指纹提高血脑屏障通透性的预测。
AAPS J. 2018 Mar 21;20(3):54. doi: 10.1208/s12248-018-0215-8.
6
ADME Properties Evaluation in Drug Discovery: Prediction of Caco-2 Cell Permeability Using a Combination of NSGA-II and Boosting.药物研发中的ADME性质评估:使用NSGA-II和Boosting相结合的方法预测Caco-2细胞渗透性
J Chem Inf Model. 2016 Apr 25;56(4):763-73. doi: 10.1021/acs.jcim.5b00642. Epub 2016 Apr 5.
7
A comprehensive support vector machine binary hERG classification model based on extensive but biased end point hERG data sets.基于广泛但存在偏倚的终点 hERG 数据集的全面支持向量机二进制 hERG 分类模型。
Chem Res Toxicol. 2011 Jun 20;24(6):934-49. doi: 10.1021/tx200099j. Epub 2011 May 6.
8
Discovery of Dual FGFR4 and EGFR Inhibitors by Machine Learning and Biological Evaluation.通过机器学习和生物评估发现双重 FGFR4 和 EGFR 抑制剂。
J Chem Inf Model. 2020 Oct 26;60(10):4640-4652. doi: 10.1021/acs.jcim.0c00652. Epub 2020 Sep 23.
9
QSAR prediction of HIV-1 protease inhibitory activities using docking derived molecular descriptors.使用对接衍生分子描述符对HIV-1蛋白酶抑制活性进行定量构效关系预测。
J Theor Biol. 2015 Mar 21;369:13-22. doi: 10.1016/j.jtbi.2015.01.008. Epub 2015 Jan 16.
10
Developing a support vector machine based QSPR model for prediction of half-life of some herbicides.基于支持向量机开发用于预测某些除草剂半衰期的定量构效关系(QSPR)模型。
Ecotoxicol Environ Saf. 2016 Jul;129:10-5. doi: 10.1016/j.ecoenv.2016.03.002. Epub 2016 Mar 9.

引用本文的文献

1
Correlation of Experimental and Calculated Inhibition Constants of Protease Inhibitor Complexes.实验和计算的蛋白酶抑制剂复合物抑制常数的相关性。
Int J Mol Sci. 2024 Feb 19;25(4):2429. doi: 10.3390/ijms25042429.