基于决策树回归方法预测分布容积，使用预测的组织：血浆分配系数。

Predicting volume of distribution with decision tree-based regression methods using predicted tissue:plasma partition coefficients.

机构信息

School of Computing, University of Kent, Canterbury, CT2 7NF UK.

Medway School of Pharmacy, Universities of Kent and Greenwich, Chatham, Kent, ME4 4TB UK.

出版信息

J Cheminform. 2015 Feb 26;7:6. doi: 10.1186/s13321-015-0054-x. eCollection 2015.

DOI:10.1186/s13321-015-0054-x

PMID:25767566

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4356883/

Abstract

BACKGROUND

Volume of distribution is an important pharmacokinetic property that indicates the extent of a drug's distribution in the body tissues. This paper addresses the problem of how to estimate the apparent volume of distribution at steady state (Vss) of chemical compounds in the human body using decision tree-based regression methods from the area of data mining (or machine learning). Hence, the pros and cons of several different types of decision tree-based regression methods have been discussed. The regression methods predict Vss using, as predictive features, both the compounds' molecular descriptors and the compounds' tissue:plasma partition coefficients (Kt:p) - often used in physiologically-based pharmacokinetics. Therefore, this work has assessed whether the data mining-based prediction of Vss can be made more accurate by using as input not only the compounds' molecular descriptors but also (a subset of) their predicted Kt:p values.

RESULTS

Comparison of the models that used only molecular descriptors, in particular, the Bagging decision tree (mean fold error of 2.33), with those employing predicted Kt:p values in addition to the molecular descriptors, such as the Bagging decision tree using adipose Kt:p (mean fold error of 2.29), indicated that the use of predicted Kt:p values as descriptors may be beneficial for accurate prediction of Vss using decision trees if prior feature selection is applied.

CONCLUSIONS

Decision tree based models presented in this work have an accuracy that is reasonable and similar to the accuracy of reported Vss inter-species extrapolations in the literature. The estimation of Vss for new compounds in drug discovery will benefit from methods that are able to integrate large and varied sources of data and flexible non-linear data mining methods such as decision trees, which can produce interpretable models. Graphical AbstractDecision trees for the prediction of tissue partition coefficient and volume of distribution of drugs.

摘要

背景

分布容积是一个重要的药代动力学性质，它表示药物在体内组织中的分布程度。本文讨论了如何使用数据挖掘（或机器学习）领域的基于决策树的回归方法来估计人体中化学化合物在稳态时的表观分布容积（Vss）。因此，讨论了几种不同类型的基于决策树的回归方法的优缺点。这些回归方法使用化合物的分子描述符和化合物的组织：血浆分配系数（Kt:p）作为预测特征来预测 Vss，Kt:p 通常用于基于生理学的药代动力学。因此，本工作评估了通过使用不仅是化合物的分子描述符，而且还（预测 Kt:p 值的）子集作为输入，是否可以使基于数据挖掘的 Vss 预测更准确。

结果

比较仅使用分子描述符的模型，特别是 Bagging 决策树（平均折叠误差为 2.33），与那些除了分子描述符外还使用预测的 Kt:p 值的模型，例如使用脂肪 Kt:p 的 Bagging 决策树（平均折叠误差为 2.29），表明如果应用了预先的特征选择，则将预测的 Kt:p 值用作描述符可能有助于使用决策树准确预测 Vss。

结论

本文提出的基于决策树的模型具有合理的准确性，与文献中报道的 Vss 种间外推的准确性相似。在药物发现中，新化合物的 Vss 估计将受益于能够整合大量和多样化的数据的方法，以及灵活的非线性数据挖掘方法，如能够生成可解释模型的决策树。

图摘要

用于预测药物组织分配系数和分布容积的决策树。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0879/4357154/c35bf304a63b/13321_2015_54_Figa_HTML.jpg

相似文献

Predicting volume of distribution with decision tree-based regression methods using predicted tissue:plasma partition coefficients.

J Cheminform. 2015 Feb 26;7:6. doi: 10.1186/s13321-015-0054-x. eCollection 2015.

Application of the Tissue Composition-Based Model to Minipig for Predicting the Volume of Distribution at Steady State and Dermis-to-Plasma Partition Coefficients of Drugs Used in the Physiologically Based Pharmacokinetics Model in Dermatology.

J Pharm Sci. 2019 Jan;108(1):603-619. doi: 10.1016/j.xphs.2018.09.001. Epub 2018 Sep 14.

Evaluation of Prediction Accuracy for Volume of Distribution in Rat and Human Using In Vitro, In Vivo, PBPK and QSAR Methods.

J Pharm Sci. 2021 Apr;110(4):1799-1823. doi: 10.1016/j.xphs.2020.12.005. Epub 2020 Dec 16.

The Novel In Vitro Method to Calculate Tissue-to-Plasma Partition Coefficient in Humans for Predicting Pharmacokinetic Profiles by Physiologically-Based Pharmacokinetic Model With High Predictability.

J Pharm Sci. 2020 Jul;109(7):2345-2355. doi: 10.1016/j.xphs.2020.04.002. Epub 2020 Apr 10.

Development of a decision tree to classify the most accurate tissue-specific tissue to plasma partition coefficient algorithm for a given compound.

J Pharmacokinet Pharmacodyn. 2014 Feb;41(1):1-14. doi: 10.1007/s10928-013-9342-0. Epub 2013 Nov 21.

Prediction of Tissue to Plasma Concentration Ratios of Drugs in the Rat from Experimentally Estimated Volume of Distribution: Application of Allometry.

Curr Drug Metab. 2018;19(2):155-164. doi: 10.2174/1389200219666171129114337.

QSAR models for the prediction of plasma protein binding.

Bioimpacts. 2013;3(1):21-7. doi: 10.5681/bi.2013.011. Epub 2013 Feb 21.

Correlation-based prediction of tissue-to-plasma partition coefficients using readily available input parameters.

Xenobiotica. 2013 Oct;43(10):839-52. doi: 10.3109/00498254.2013.770182. Epub 2013 Feb 19.

A priori prediction of tissue:plasma partition coefficients of drugs to facilitate the use of physiologically-based pharmacokinetic models in drug discovery.

J Pharm Sci. 2000 Jan;89(1):16-35. doi: 10.1002/(SICI)1520-6017(200001)89:1<16::AID-JPS3>3.0.CO;2-E.

PhRMA CPCDC initiative on predictive models of human pharmacokinetics, part 2: comparative assessment of prediction methods of human volume of distribution.

J Pharm Sci. 2011 Oct;100(10):4074-89. doi: 10.1002/jps.22553. Epub 2011 Mar 30.

引用本文的文献

Prediction of pharmacokinetic/pharmacodynamic properties of aldosterone synthase inhibitors at drug discovery stage using an artificial intelligence-physiologically based pharmacokinetic model.

Front Pharmacol. 2025 Apr 28;16:1578117. doi: 10.3389/fphar.2025.1578117. eCollection 2025.

Predicting pharmacodynamic effects through early drug discovery with artificial intelligence-physiologically based pharmacokinetic (AI-PBPK) modelling.

Front Pharmacol. 2024 Feb 16;15:1330855. doi: 10.3389/fphar.2024.1330855. eCollection 2024.

Methods to Predict Volume of Distribution.

Curr Pharmacol Rep. 2019 Oct;5(5):391-399. doi: 10.1007/s40495-019-00186-5. Epub 2019 Jun 6.

An artificial neural network-pharmacokinetic model and its interpretation using Shapley additive explanations.

CPT Pharmacometrics Syst Pharmacol. 2021 Jul;10(7):760-768. doi: 10.1002/psp4.12643. Epub 2021 May 27.

Combining an in silico proarrhythmic risk assay with a tPKPD model to predict QTc interval prolongation in the anesthetized guinea pig assay.

Toxicol Appl Pharmacol. 2020 Mar 1;390:114883. doi: 10.1016/j.taap.2020.114883. Epub 2020 Jan 23.

Machine learning for predicting lifespan-extending chemical compounds.

Aging (Albany NY). 2017 Jul 18;9(7):1721-1737. doi: 10.18632/aging.101264.

Computational Exposure Science: An Emerging Discipline to Support 21st-Century Risk Assessment.

Environ Health Perspect. 2016 Jun;124(6):697-702. doi: 10.1289/ehp.1509748. Epub 2015 Nov 6.

本文引用的文献

Prediction of drug distribution in rat and humans using an artificial neural networks ensemble and a PBPK model.

Pharm Res. 2014 Dec;31(12):3313-22. doi: 10.1007/s11095-014-1421-4. Epub 2014 May 28.

Computational approaches to analyse and predict small molecule transport and distribution at cellular and subcellular levels.

Biopharm Drug Dispos. 2014 Jan;35(1):15-32. doi: 10.1002/bdd.1879. Epub 2013 Dec 10.

Applying linear and non-linear methods for parallel prediction of volume of distribution and fraction of unbound drug.

PLoS One. 2013 Oct 7;8(10):e74758. doi: 10.1371/journal.pone.0074758. eCollection 2013.

A perspective on the prediction of drug pharmacokinetics and disposition in drug research and development.

Drug Metab Dispos. 2013 Dec;41(12):1975-93. doi: 10.1124/dmd.113.054031. Epub 2013 Sep 24.

Pre-processing feature selection for improved C&RT models for oral absorption.

J Chem Inf Model. 2013 Oct 28;53(10):2730-42. doi: 10.1021/ci400378j. Epub 2013 Oct 9.

Quantitative structure-pharmacokinetic relationship (QSPkP) analysis of the volume of distribution values of anti-infective agents from J group of the ATC classification in humans.

Acta Pharm. 2012 Nov;62(3):305-23. doi: 10.2478/v10007-012-0024-z.

Prediction of pharmacokinetic parameters.

Methods Mol Biol. 2012;929:337-57. doi: 10.1007/978-1-62703-050-2_14.

The impact of training set data distributions for modelling of passive intestinal absorption.

Int J Pharm. 2012 Oct 15;436(1-2):711-20. doi: 10.1016/j.ijpharm.2012.07.041. Epub 2012 Jul 31.

Identification of a novel set of biomarkers for evaluating phospholipidosis-inducing potential of compounds using rat liver microarray data measured 24-h after single dose administration.

Toxicology. 2012 May 16;295(1-3):1-7. doi: 10.1016/j.tox.2012.02.015. Epub 2012 Mar 7.

Advancing prediction of tissue distribution and volume of distribution of highly lipophilic compounds from a simplified tissue-composition-based model as a mechanistic animal alternative method.

J Pharm Sci. 2012 Jun;101(6):2250-61. doi: 10.1002/jps.23090. Epub 2012 Mar 2.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于决策树回归方法预测分布容积，使用预测的组织：血浆分配系数。

Predicting volume of distribution with decision tree-based regression methods using predicted tissue:plasma partition coefficients.

机构信息

School of Computing, University of Kent, Canterbury, CT2 7NF UK.

Medway School of Pharmacy, Universities of Kent and Greenwich, Chatham, Kent, ME4 4TB UK.