• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ProfhEX:用于小分子安全性评估的基于人工智能的平台。

ProfhEX: AI-based platform for small molecules liability profiling.

作者信息

Lunghini Filippo, Fava Anna, Pisapia Vincenzo, Sacco Francesco, Iaconis Daniela, Beccari Andrea Rosario

机构信息

EXSCALATE, Dompé Farmaceutici SpA, Via Tommaso de Amicis 95, 80123, Naples, Italy.

Professional Service Department, SAS Institute, Via Darwin 20/22, 20143, Milan, Italy.

出版信息

J Cheminform. 2023 Jun 9;15(1):60. doi: 10.1186/s13321-023-00728-6.

DOI:10.1186/s13321-023-00728-6
PMID:37296454
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10251600/
Abstract

Off-target drug interactions are a major reason for candidate failure in the drug discovery process. Anticipating potential drug's adverse effects in the early stages is necessary to minimize health risks to patients, animal testing, and economical costs. With the constantly increasing size of virtual screening libraries, AI-driven methods can be exploited as first-tier screening tools to provide liability estimation for drug candidates. In this work we present ProfhEX, an AI-driven suite of 46 OECD-compliant machine learning models that can profile small molecules on 7 relevant liability groups: cardiovascular, central nervous system, gastrointestinal, endocrine, renal, pulmonary and immune system toxicities. Experimental affinity data was collected from public and commercial data sources. The entire chemical space comprised 289'202 activity data for a total of 210'116 unique compounds, spanning over 46 targets with dataset sizes ranging from 819 to 18896. Gradient boosting and random forest algorithms were initially employed and ensembled for the selection of a champion model. Models were validated according to the OECD principles, including robust internal (cross validation, bootstrap, y-scrambling) and external validation. Champion models achieved an average Pearson correlation coefficient of 0.84 (SD of 0.05), an R determination coefficient of 0.68 (SD = 0.1) and a root mean squared error of 0.69 (SD of 0.08). All liability groups showed good hit-detection power with an average enrichment factor at 5% of 13.1 (SD of 4.5) and AUC of 0.92 (SD of 0.05). Benchmarking against already existing tools demonstrated the predictive power of ProfhEX models for large-scale liability profiling. This platform will be further expanded with the inclusion of new targets and through complementary modelling approaches, such as structure and pharmacophore-based models. ProfhEX is freely accessible at the following address: https://profhex.exscalate.eu/ .

摘要

脱靶药物相互作用是药物研发过程中候选药物失败的主要原因。在早期阶段预测潜在药物的不良反应对于将患者健康风险、动物试验和经济成本降至最低至关重要。随着虚拟筛选库规模的不断扩大,人工智能驱动的方法可被用作一级筛选工具,为候选药物提供安全性评估。在这项工作中,我们展示了ProfhEX,这是一套由人工智能驱动的、符合经合组织标准的46个机器学习模型,可对小分子在7个相关安全性类别上进行分析:心血管、中枢神经系统、胃肠道、内分泌、肾脏、肺部和免疫系统毒性。实验亲和力数据来自公共和商业数据源。整个化学空间包含针对总共210116种独特化合物的289202条活性数据,涵盖46个靶点,数据集大小从819到18896不等。最初采用梯度提升和随机森林算法并进行集成,以选择最优模型。模型根据经合组织原则进行验证,包括强大的内部验证(交叉验证、自助法、y轴混淆)和外部验证。最优模型的平均皮尔逊相关系数为0.84(标准差为0.05),决定系数R为0.68(标准差 = 0.1),均方根误差为0.69(标准差为0.08)。所有安全性类别均显示出良好的命中检测能力,5%时的平均富集因子为13.1(标准差为4.5),曲线下面积为0.92(标准差为0.05)。与现有工具的基准测试证明了ProfhEX模型在大规模安全性分析方面的预测能力。该平台将通过纳入新靶点并采用互补建模方法(如基于结构和药效团的模型)进一步扩展。可通过以下网址免费访问ProfhEX:https://profhex.exscalate.eu/ 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/066d3cca5031/13321_2023_728_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/6e1d3b5ee112/13321_2023_728_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/15f8f06974ce/13321_2023_728_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/363458d76280/13321_2023_728_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/615296f53c02/13321_2023_728_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/bdabcaca26d1/13321_2023_728_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/cf11d2f13a0e/13321_2023_728_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/06239d9f2cfe/13321_2023_728_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/066d3cca5031/13321_2023_728_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/6e1d3b5ee112/13321_2023_728_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/15f8f06974ce/13321_2023_728_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/363458d76280/13321_2023_728_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/615296f53c02/13321_2023_728_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/bdabcaca26d1/13321_2023_728_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/cf11d2f13a0e/13321_2023_728_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/06239d9f2cfe/13321_2023_728_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5c3/10251600/066d3cca5031/13321_2023_728_Fig8_HTML.jpg

相似文献

1
ProfhEX: AI-based platform for small molecules liability profiling.ProfhEX:用于小分子安全性评估的基于人工智能的平台。
J Cheminform. 2023 Jun 9;15(1):60. doi: 10.1186/s13321-023-00728-6.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Target-driven machine learning-enabled virtual screening (TAME-VS) platform for early-stage hit identification.用于早期命中物识别的目标驱动型机器学习虚拟筛选(TAME-VS)平台。
Front Mol Biosci. 2023 Mar 13;10:1163536. doi: 10.3389/fmolb.2023.1163536. eCollection 2023.
4
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
5
DeepMalaria: Artificial Intelligence Driven Discovery of Potent Antiplasmodials.深度疟疾:人工智能驱动的强效抗疟药物发现
Front Pharmacol. 2020 Jan 15;10:1526. doi: 10.3389/fphar.2019.01526. eCollection 2019.
6
Multi-target-based polypharmacology prediction (mTPP): An approach using virtual screening and machine learning for multi-target drug discovery.基于多靶点的多药理学预测(mTPP):一种利用虚拟筛选和机器学习进行多靶点药物发现的方法。
Chem Biol Interact. 2022 Dec 1;368:110239. doi: 10.1016/j.cbi.2022.110239. Epub 2022 Oct 26.
7
Benchmarking the Predictive Power of Ligand Efficiency Indices in QSAR.定量构效关系中配体效率指数预测能力的基准测试
J Chem Inf Model. 2016 Aug 22;56(8):1576-87. doi: 10.1021/acs.jcim.6b00136. Epub 2016 Jul 19.
8
Safety screening in early drug discovery: An optimized assay panel.早期药物发现中的安全性筛选:优化的检测组合
J Pharmacol Toxicol Methods. 2019 Sep-Oct;99:106609. doi: 10.1016/j.vascn.2019.106609. Epub 2019 Jul 5.
9
A comparison of 12 machine learning models developed to predict ploidy, using a morphokinetic meta-dataset of 8147 embryos.比较 12 种机器学习模型,这些模型用于预测ploidy,使用的是一个包含 8147 个胚胎的形态动力学元数据集。
Hum Reprod. 2023 Apr 3;38(4):569-581. doi: 10.1093/humrep/dead034.
10
Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.机器学习算法在(放化疗)治疗结果预测中的应用:分类器的实证比较。
Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.

引用本文的文献

1
off-target profiling for enhanced drug safety assessment.用于增强药物安全性评估的脱靶分析
Acta Pharm Sin B. 2024 Jul;14(7):2927-2941. doi: 10.1016/j.apsb.2024.03.002. Epub 2024 Mar 6.
2
Ensemble of structure and ligand-based classification models for hERG liability profiling.用于hERG风险评估的基于结构和配体的分类模型集成
Front Pharmacol. 2023 Mar 23;14:1148670. doi: 10.3389/fphar.2023.1148670. eCollection 2023.

本文引用的文献

1
MolPredictX: Online Biological Activity Predictions by Machine Learning Models.MolPredictX:通过机器学习模型进行在线生物活性预测
Mol Inform. 2022 Dec;41(12):e2200133. doi: 10.1002/minf.202200133. Epub 2022 Sep 1.
2
Current status and future directions for a neurotoxicity hazard assessment framework that integrates approaches.整合多种方法的神经毒性危害评估框架的现状与未来方向
Comput Toxicol. 2022 May;22. doi: 10.1016/j.comtox.2022.100223. Epub 2022 Mar 17.
3
approaches in organ toxicity hazard assessment: Current status and future needs for predicting heart, kidney and lung toxicities.
器官毒性危害评估方法:预测心脏、肾脏和肺部毒性的现状与未来需求
Comput Toxicol. 2021 Nov;20. doi: 10.1016/j.comtox.2021.100188. Epub 2021 Sep 13.
4
MolData, a molecular benchmark for disease and target based machine learning.MolData,一种基于疾病和靶点的机器学习分子基准。
J Cheminform. 2022 Mar 7;14(1):10. doi: 10.1186/s13321-022-00590-y.
5
The effect of noise on the predictive limit of QSAR models.噪声对定量构效关系(QSAR)模型预测极限的影响。
J Cheminform. 2021 Nov 25;13(1):92. doi: 10.1186/s13321-021-00571-7.
6
A review on machine learning approaches and trends in drug discovery.关于药物发现中机器学习方法与趋势的综述。
Comput Struct Biotechnol J. 2021 Aug 12;19:4538-4558. doi: 10.1016/j.csbj.2021.08.011. eCollection 2021.
7
"Molecular Anatomy": a new multi-dimensional hierarchical scaffold analysis tool.“分子解剖学”:一种新型的多维分层支架分析工具。
J Cheminform. 2021 Jul 23;13:54. doi: 10.1186/s13321-021-00526-y. eCollection 2021.
8
Machine learning models for classification tasks related to drug safety.用于药物安全相关分类任务的机器学习模型。
Mol Divers. 2021 Aug;25(3):1409-1424. doi: 10.1007/s11030-021-10239-x. Epub 2021 Jun 10.
9
Artificial intelligence to deep learning: machine intelligence approach for drug discovery.人工智能到深度学习:药物发现的机器智能方法。
Mol Divers. 2021 Aug;25(3):1315-1360. doi: 10.1007/s11030-021-10217-3. Epub 2021 Apr 12.
10
Novel Computational Approach to Predict Off-Target Interactions for Small Molecules.预测小分子脱靶相互作用的新型计算方法
Front Big Data. 2019 Jul 17;2:25. doi: 10.3389/fdata.2019.00025. eCollection 2019.