• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用共形回归和支持向量机的logD置信度预测器。

A confidence predictor for logD using conformal regression and a support-vector machine.

作者信息

Lapins Maris, Arvidsson Staffan, Lampa Samuel, Berg Arvid, Schaal Wesley, Alvarsson Jonathan, Spjuth Ola

机构信息

Department of Pharmaceutical Biosciences, Uppsala University, Box 591, 751 24, Uppsala, Sweden.

出版信息

J Cheminform. 2018 Apr 3;10(1):17. doi: 10.1186/s13321-018-0271-1.

DOI:10.1186/s13321-018-0271-1
PMID:29616425
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5882484/
Abstract

Lipophilicity is a major determinant of ADMET properties and overall suitability of drug candidates. We have developed large-scale models to predict water-octanol distribution coefficient (logD) for chemical compounds, aiding drug discovery projects. Using ACD/logD data for 1.6 million compounds from the ChEMBL database, models are created and evaluated by a support-vector machine with a linear kernel using conformal prediction methodology, outputting prediction intervals at a specified confidence level. The resulting model shows a predictive ability of [Formula: see text] and with the best performing nonconformity measure having median prediction interval of [Formula: see text] log units at 80% confidence and [Formula: see text] log units at 90% confidence. The model is available as an online service via an OpenAPI interface, a web page with a molecular editor, and we also publish predictive values at 90% confidence level for 91 M PubChem structures in RDF format for download and as an URI resolver service.

摘要

亲脂性是药物代谢动力学(ADMET)性质以及候选药物整体适用性的主要决定因素。我们开发了大规模模型来预测化合物的水-辛醇分配系数(logD),以辅助药物发现项目。利用来自ChEMBL数据库的160万种化合物的ACD/logD数据,使用具有线性核的支持向量机并采用共形预测方法创建和评估模型,在指定置信水平下输出预测区间。所得模型的预测能力为[公式:见原文],并且性能最佳的不一致性度量在80%置信度下的预测区间中位数为[公式:见原文]对数单位,在90%置信度下为[公式:见原文]对数单位。该模型可通过OpenAPI接口作为在线服务获取,也可通过带有分子编辑器的网页获取,我们还以RDF格式发布9100万个PubChem结构在90%置信水平下的预测值以供下载,并提供URI解析服务。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcff/5882484/85d5b30656f4/13321_2018_271_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcff/5882484/87b12cc98cb0/13321_2018_271_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcff/5882484/ab9d4131eae9/13321_2018_271_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcff/5882484/84b3f8233065/13321_2018_271_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcff/5882484/22daf339e252/13321_2018_271_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcff/5882484/85d5b30656f4/13321_2018_271_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcff/5882484/87b12cc98cb0/13321_2018_271_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcff/5882484/ab9d4131eae9/13321_2018_271_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcff/5882484/84b3f8233065/13321_2018_271_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcff/5882484/22daf339e252/13321_2018_271_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcff/5882484/85d5b30656f4/13321_2018_271_Fig5_HTML.jpg

相似文献

1
A confidence predictor for logD using conformal regression and a support-vector machine.一种使用共形回归和支持向量机的logD置信度预测器。
J Cheminform. 2018 Apr 3;10(1):17. doi: 10.1186/s13321-018-0271-1.
2
Comparison of logP and logD correction models trained with public and proprietary data sets.比较使用公共数据集和专有数据集训练的 logP 和 logD 校正模型。
J Comput Aided Mol Des. 2022 Mar;36(3):253-262. doi: 10.1007/s10822-022-00450-9. Epub 2022 Apr 1.
3
Predicting anti-trypanosome effect of carbazole-derived compounds by powerful SVM with novel kernel function and comprehensive learning PSO.利用具有新型核函数和综合学习 PSO 的强大 SVM 预测咔唑衍生化合物的抗锥虫作用。
Antimicrob Agents Chemother. 2024 Jul 9;68(7):e0026524. doi: 10.1128/aac.00265-24. Epub 2024 May 29.
4
Predicting Off-Target Binding Profiles With Confidence Using Conformal Prediction.使用共形预测自信地预测脱靶结合谱。
Front Pharmacol. 2018 Nov 6;9:1256. doi: 10.3389/fphar.2018.01256. eCollection 2018.
5
Predicting With Confidence: Using Conformal Prediction in Drug Discovery.有信心的预测:在药物发现中使用一致性预测。
J Pharm Sci. 2021 Jan;110(1):42-49. doi: 10.1016/j.xphs.2020.09.055. Epub 2020 Oct 17.
6
Assessing predictive performance of supervised machine learning algorithms for a diamond pricing model.评估用于钻石定价模型的监督式机器学习算法的预测性能。
Sci Rep. 2023 Oct 12;13(1):17315. doi: 10.1038/s41598-023-44326-w.
7
Large-scale ligand-based predictive modelling using support vector machines.使用支持向量机的基于配体的大规模预测建模。
J Cheminform. 2016 Aug 10;8:39. doi: 10.1186/s13321-016-0151-5. eCollection 2016.
8
Evaluation of ADMET Predictor in Early Discovery Drug Metabolism and Pharmacokinetics Project Work.ADMET 预测器在早期发现药物代谢和药代动力学项目工作中的评估。
Drug Metab Dispos. 2022 Feb;50(2):95-104. doi: 10.1124/dmd.121.000552. Epub 2021 Nov 8.
9
Improved GNNs for Log  Prediction by Transferring Knowledge from Low-Fidelity Data.通过从低质量数据转移知识来改进图神经网络进行日志预测。
J Chem Inf Model. 2023 Apr 24;63(8):2345-2359. doi: 10.1021/acs.jcim.2c01564. Epub 2023 Mar 31.
10
Prediction of octanol-water partition coefficients for the SAMPL6- molecules using molecular dynamics simulations with OPLS-AA, AMBER and CHARMM force fields.使用OPLS-AA、AMBER和CHARMM力场通过分子动力学模拟预测SAMPL6分子的正辛醇-水分配系数。
J Comput Aided Mol Des. 2020 May;34(5):543-560. doi: 10.1007/s10822-019-00267-z. Epub 2020 Jan 20.

引用本文的文献

1
Integrative snRNA-seq, molecular docking and dynamics simulations identifies Lasmiditan as drug candidate for Alzheimer's disease.整合性单细胞核RNA测序、分子对接和动力学模拟确定拉米地坦为阿尔茨海默病的候选药物。
Clin Transl Med. 2025 Aug;15(8):e70443. doi: 10.1002/ctm2.70443.
2
Protein Spatial Structure Meets Artificial Intelligence: Revolutionizing Drug Synergy-Antagonism in Precision Medicine.蛋白质空间结构与人工智能相遇:革新精准医学中的药物协同 - 拮抗作用
Adv Sci (Weinh). 2025 Sep;12(33):e07764. doi: 10.1002/advs.202507764. Epub 2025 Aug 7.
3
FormulationBCS: A Machine Learning Platform Based on Diverse Molecular Representations for Biopharmaceutical Classification System (BCS) Class Prediction.

本文引用的文献

1
The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching.化学开发工具包(CDK)v2.0:原子类型标注、描绘、分子式及子结构搜索。
J Cheminform. 2017 Jun 6;9(1):33. doi: 10.1186/s13321-017-0220-4.
2
Predicting the Rate of Skin Penetration Using an Aggregated Conformal Prediction Framework.使用聚合共形预测框架预测皮肤渗透速率
Mol Pharm. 2017 May 1;14(5):1571-1576. doi: 10.1021/acs.molpharmaceut.7b00007. Epub 2017 Apr 17.
3
The ChEMBL database in 2017.2017年的ChEMBL数据库。
FormulationBCS:一种基于多种分子表征的机器学习平台,用于生物药剂分类系统(BCS)类别预测。
Mol Pharm. 2025 Jan 6;22(1):330-342. doi: 10.1021/acs.molpharmaceut.4c00946. Epub 2024 Dec 8.
4
CPSign: conformal prediction for cheminformatics modeling.CPSign:用于化学信息学建模的共形预测
J Cheminform. 2024 Jun 28;16(1):75. doi: 10.1186/s13321-024-00870-9.
5
FOTF-CPI: A compound-protein interaction prediction transformer based on the fusion of optimal transport fragments.FOTF-CPI:一种基于最优传输片段融合的复合蛋白相互作用预测变压器。
iScience. 2023 Dec 15;27(1):108756. doi: 10.1016/j.isci.2023.108756. eCollection 2024 Jan 19.
6
Extreme Gradient Boosting Combined with Conformal Predictors for Informative Solubility Estimation.用于信息性溶解度估计的极端梯度提升与共形预测器相结合
Molecules. 2023 Dec 19;29(1):19. doi: 10.3390/molecules29010019.
7
LogD7.4 prediction enhanced by transferring knowledge from chromatographic retention time, microscopic pKa and logP.通过从色谱保留时间、微观pKa和logP转移知识增强LogD7.4预测
J Cheminform. 2023 Sep 5;15(1):76. doi: 10.1186/s13321-023-00754-4.
8
Synthesis, Optical Properties, and In Vivo Biodistribution Performance of Polymethine Cyanine Fluorophores.聚甲炔菁荧光团的合成、光学性质及体内生物分布性能
ACS Pharmacol Transl Sci. 2023 Jul 25;6(8):1192-1206. doi: 10.1021/acsptsci.3c00101. eCollection 2023 Aug 11.
9
Discovery of Phenylcarbamoylazinane-1,2,4-Triazole Amides Derivatives as the Potential Inhibitors of Aldo-Keto Reductases (AKR1B1 & AKRB10): Potential Lead Molecules for Treatment of Colon Cancer.苯甲酰胺嗪烷-1,2,4-三唑酰胺衍生物的发现作为醛酮还原酶(AKR1B1 和 AKRB10)的潜在抑制剂:用于治疗结肠癌的潜在先导分子。
Molecules. 2022 Jun 21;27(13):3981. doi: 10.3390/molecules27133981.
10
Physicochemical and biopharmaceutical aspects influencing skin permeation and role of SLN and NLC for skin drug delivery.影响皮肤渗透的物理化学和生物药剂学方面以及固体脂质纳米粒和纳米结构脂质载体在皮肤给药中的作用。
Heliyon. 2022 Feb 11;8(2):e08938. doi: 10.1016/j.heliyon.2022.e08938. eCollection 2022 Feb.
Nucleic Acids Res. 2017 Jan 4;45(D1):D945-D954. doi: 10.1093/nar/gkw1074. Epub 2016 Nov 28.
4
Large-scale ligand-based predictive modelling using support vector machines.使用支持向量机的基于配体的大规模预测建模。
J Cheminform. 2016 Aug 10;8:39. doi: 10.1186/s13321-016-0151-5. eCollection 2016.
5
Prediction of PARP Inhibition with Proteochemometric Modelling and Conformal Prediction.利用蛋白质化学计量学建模和共形预测对PARP抑制进行预测。
Mol Inform. 2015 Jun;34(6-7):357-66. doi: 10.1002/minf.201400165. Epub 2015 Mar 20.
6
AMBIT-SMARTS: Efficient Searching of Chemical Structures and Fragments.AMBIT-SMARTS:高效的化学结构和片段搜索。
Mol Inform. 2011 Aug;30(8):707-20. doi: 10.1002/minf.201100028. Epub 2011 Aug 4.
7
Optimised method to estimate octanol water distribution coefficient (logD) in a high throughput format.以高通量形式估算正辛醇-水分配系数(logD)的优化方法。
Eur J Pharm Sci. 2016 Sep 20;92:110-6. doi: 10.1016/j.ejps.2016.06.024. Epub 2016 Jun 29.
8
Conformal prediction to define applicability domain - A case study on predicting ER and AR binding.用于定义适用域的共形预测——预测雌激素受体和雄激素受体结合的案例研究
SAR QSAR Environ Res. 2016 Apr;27(4):303-16. doi: 10.1080/1062936X.2016.1172665. Epub 2016 Apr 18.
9
PubChem Substance and Compound databases.美国国立医学图书馆化学物质数据库和化合物数据库。
Nucleic Acids Res. 2016 Jan 4;44(D1):D1202-13. doi: 10.1093/nar/gkv951. Epub 2015 Sep 22.
10
Improved large-scale prediction of growth inhibition patterns using the NCI60 cancer cell line panel.利用NCI60癌细胞系面板改进大规模生长抑制模式预测。
Bioinformatics. 2016 Jan 1;32(1):85-95. doi: 10.1093/bioinformatics/btv529. Epub 2015 Sep 8.