Structure-activity models of oral clearance, cytotoxicity, and LD50: a screen for promising anticancer compounds.

Author information

Boik John C, Newman Robert A

Affiliation

Department of Experimental Therapeutics, University of Texas M. D. Anderson Cancer Center, 8000 El Rio, Houston, TX 77054, USA.

Publication information

BMC Pharmacol. 2008 Jun 13;8:12. doi: 10.1186/1471-2210-8-12.

DOI:10.1186/1471-2210-8-12

PMID:18554402

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2442056/

Abstract

BACKGROUND

Quantitative structure-activity relationship (QSAR) models have become popular tools to help identify promising lead compounds in anticancer drug development. Few QSAR studies have investigated multitask learning, however. Multitask learning is an approach that allows distinct but related data sets to be used in training. In this paper, a suite of three QSAR models is developed to identify compounds that are likely to (a) exhibit cytotoxic behavior against cancer cells, (b) exhibit high rat LD50 values (low systemic toxicity), and (c) exhibit low to modest human oral clearance (favorable pharmacokinetic characteristics). Models were constructed using Kernel Multitask Latent Analysis (KMLA), an approach that can effectively handle a large number of correlated data features, nonlinear relationships between features and responses, and multitask learning. Multitask learning is particularly useful when the number of available training records is small relative to the number of features, as was the case with the oral clearance data.

RESULTS

Multitask learning modestly but significantly improved the classification precision for the oral clearance model. For the cytotoxicity model, which was constructed using a large number of records, multitask learning did not affect precision but did reduce computation time. The models developed here were used to predict activities for 115,000 natural compounds. Hundreds of natural compounds, particularly in the anthraquinone and flavonoid groups, were predicted to be cytotoxic, have high LD50 values, and have low to moderate oral clearance.

CONCLUSION

Multitask learning can be useful in some QSAR models. A suite of QSAR models was constructed and used to screen a large drug library for compounds likely to be cytotoxic to multiple cancer cell lines in vitro, have low systemic toxicity in rats, and have favorable pharmacokinetic properties in humans.
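The screening step described in the abstract, keeping only compounds that all three models rate favorably, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Prediction` record, the probability-style scores, the compound names, and the 0.5 threshold are hypothetical and are not taken from the paper, and the sketch does not reproduce the authors' KMLA models or their actual decision rule.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Prediction:
    """Hypothetical per-compound outputs of three classifiers."""
    name: str
    p_cytotoxic: float      # predicted probability of cytotoxicity to cancer cells
    p_high_ld50: float      # predicted probability of a high rat LD50
    p_low_clearance: float  # predicted probability of low-to-moderate oral clearance


def weakest_score(pred: Prediction) -> float:
    """Return the lowest of the three predicted probabilities."""
    return min(pred.p_cytotoxic, pred.p_high_ld50, pred.p_low_clearance)


def passes_screen(pred: Prediction, threshold: float = 0.5) -> bool:
    """A compound passes only if every model rates it at or above the threshold."""
    return weakest_score(pred) >= threshold


def screen(preds, threshold: float = 0.5):
    """Keep compounds that pass all three criteria, strongest weakest-score first."""
    hits = [p for p in preds if passes_screen(p, threshold)]
    return sorted(hits, key=weakest_score, reverse=True)


# Illustrative, made-up scores (assumption: not data from the paper).
library = [
    Prediction("compound_A", 0.91, 0.80, 0.72),
    Prediction("compound_B", 0.95, 0.30, 0.88),  # fails on predicted LD50
    Prediction("compound_C", 0.66, 0.71, 0.64),
]

hits = screen(library)
print([p.name for p in hits])
```

Ranking by the weakest of the three scores is one possible design choice: it favors compounds with no obvious liability over compounds that excel on one criterion but are borderline on another.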
