Suppr超能文献

一种使用密集和稀疏矩阵重排的最优方法以及逻辑回归来从体外数据预测体内毒性的新框架。

A novel framework for predicting in vivo toxicities from in vitro data using optimal methods for dense and sparse matrix reordering and logistic regression.

机构信息

Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544-5263, USA.

出版信息

Toxicol Sci. 2010 Nov;118(1):251-65. doi: 10.1093/toxsci/kfq233. Epub 2010 Aug 11.

Abstract

In this work, we combine the strengths of mixed-integer linear optimization (MILP) and logistic regression for predicting the in vivo toxicity of chemicals using only their measured in vitro assay data. The proposed approach utilizes a biclustering method based on iterative optimal reordering (DiMaggio, P. A., McAllister, S. R., Floudas, C. A., Feng, X. J., Rabinowitz, J. D., and Rabitz, H. A. (2008). Biclustering via optimal re-ordering of data matrices in systems biology: rigorous methods and comparative studies. BMC Bioinformatics 9, 458-474.; DiMaggio, P. A., McAllister, S. R., Floudas, C. A., Feng, X. J., Rabinowitz, J. D., and Rabitz, H. A. (2010b). A network flow model for biclustering via optimal re-ordering of data matrices. J. Global. Optim. 47, 343-354.) to identify biclusters corresponding to subsets of chemicals that have similar responses over distinct subsets of the in vitro assays. The biclustering of the in vitro assays is shown to result in significant clustering based on assay target (e.g., cytochrome P450 [CYP] and nuclear receptors) and type (e.g., downregulated BioMAP and biochemical high-throughput screening protein kinase activity assays). An optimal method based on mixed-integer linear optimization for reordering sparse data matrices (DiMaggio, P. A., McAllister, S. R., Floudas, C. A., Feng, X. J., Li, G. Y., Rabinowitz, J. D., and Rabitz, H. A. (2010a). Enhancing molecular discovery using descriptor-free rearrangement clustering techniques for sparse data sets. AIChE J. 56, 405-418.; McAllister, S. R., DiMaggio, P. A., and Floudas, C. A. (2009). Mathematical modeling and efficient optimization methods for the distance-dependent rearrangement clustering problem. J. Global. Optim. 45, 111-129) is then applied to the in vivo data set (21.7% sparse) in order to cluster end points that have similar lowest effect level (LEL) values, where it is observed that the end points are effectively clustered according to (1) animal species (i.e., the chronic mouse and chronic rat end points were clearly separated) and (2) similar physiological attributes (i.e., liver- and reproductive-related end points were found to separately cluster together). As the liver and reproductive end points exhibited the largest degree of correlation, we further analyzed them using regularized logistic regression in a rank-and-drop framework to identify which subset of in vitro features could be utilized for in vivo toxicity prediction. It was observed that the in vivo end points that had similar LEL responses over the 309 chemicals (as determined by the sparse clustering results) also shared a significant subset of selected in vitro descriptors. Comparing the significant descriptors between the two different categories of end points revealed a specificity of the CYP assays for the liver end points and preferential selection of the estrogen/androgen nuclear receptors by the reproductive end points.

摘要

在这项工作中,我们结合了混合整数线性优化(MILP)和逻辑回归的优势,仅使用化学物质的体外测定数据来预测其体内毒性。所提出的方法利用基于迭代最优重排的双聚类方法(DiMaggio,P. A.,McAllister,S. R.,Floudas,C. A.,Feng,X. J.,Rabinowitz,J. D.,和 Rabitz,H. A.(2008)。通过数据矩阵的最优重排进行双聚类:系统生物学中的严格方法和比较研究。BMC 生物信息学 9,458-474;DiMaggio,P. A.,McAllister,S. R.,Floudas,C. A.,Feng,X. J.,Rabinowitz,J. D.,和 Rabitz,H. A.(2010b)。通过数据矩阵的最优重排进行双聚类的网络流模型。J. Global. Optim. 47,343-354)识别与具有相似反应的化学物质子集对应的双聚类,这些子集在不同的体外测定子集上具有相似的反应。体外测定的双聚类显示基于测定靶标(例如细胞色素 P450 [CYP]和核受体)和类型(例如下调的 BioMAP 和生化高通量筛选蛋白激酶活性测定)进行了显著聚类。基于混合整数线性优化的用于重新排列稀疏数据矩阵的最优方法(DiMaggio,P. A.,McAllister,S. R.,Floudas,C. A.,Feng,X. J.,Li,G. Y.,Rabinowitz,J. D.,和 Rabitz,H. A.(2010a)。用于稀疏数据集的无描述符重排聚类技术的分子发现增强。AIChE J. 56,405-418;McAllister,S. R.,DiMaggio,P. A.,和 Floudas,C. A.(2009)。用于距离相关重排聚类问题的数学建模和有效优化方法。J. Global. Optim. 45,111-129)应用于体内数据集(21.7%稀疏),以对具有相似最低效应水平(LEL)值的终点进行聚类,观察到终点根据(1)动物物种(即慢性小鼠和慢性大鼠终点明显分离)和(2)相似的生理属性(即肝和生殖相关终点分别聚类在一起)进行有效聚类。由于肝和生殖终点表现出最大程度的相关性,我们进一步使用正则化逻辑回归在排序和丢弃框架中对它们进行分析,以确定哪些子集的体外特征可用于体内毒性预测。观察到具有相似 LEL 反应的体内终点(如稀疏聚类结果所示)也共享选定的体外描述符的重要子集。比较两个不同类别的终点之间的显著描述符,揭示了 CYP 测定对肝终点的特异性和生殖终点对雌激素/雄激素核受体的优先选择。

相似文献

3
Predictive Models for Human Organ Toxicity Based on Bioactivity Data and Chemical Structure.
Chem Res Toxicol. 2020 Mar 16;33(3):731-741. doi: 10.1021/acs.chemrestox.9b00305. Epub 2020 Mar 3.
5
Biclustering via sparse singular value decomposition.
Biometrics. 2010 Dec;66(4):1087-95. doi: 10.1111/j.1541-0420.2010.01392.x.
6
Building predictive in vitro pulmonary toxicity assays using high-throughput imaging and artificial intelligence.
Arch Toxicol. 2018 Jun;92(6):2055-2075. doi: 10.1007/s00204-018-2213-0. Epub 2018 Apr 28.
7
Strengths and limitations of using repeat-dose toxicity studies to predict effects on fertility.
Regul Toxicol Pharmacol. 2007 Aug;48(3):241-58. doi: 10.1016/j.yrtph.2007.04.001. Epub 2007 Apr 12.
8
Systems Toxicology of Male Reproductive Development: Profiling 774 Chemicals for Molecular Targets and Adverse Outcomes.
Environ Health Perspect. 2016 Jul;124(7):1050-61. doi: 10.1289/ehp.1510385. Epub 2015 Dec 11.
9
Predictive model of rat reproductive toxicity from ToxCast high throughput screening.
Biol Reprod. 2011 Aug;85(2):327-39. doi: 10.1095/biolreprod.111.090977. Epub 2011 May 12.
10
QSAR modeling with the electrotopological state indices: predicting the toxicity of organic chemicals.
Chemosphere. 2003 Feb;50(7):949-53. doi: 10.1016/s0045-6535(02)00172-8.

引用本文的文献

2
Relating Essential Proteins to Drug Side-Effects Using Canonical Component Analysis: A Structure-Based Approach.
J Chem Inf Model. 2015 Jul 27;55(7):1483-94. doi: 10.1021/acs.jcim.5b00030. Epub 2015 Jul 16.
3
ASTRO-FOLD 2.0: an Enhanced Framework for Protein Structure Prediction.
AIChE J. 2012 May 1;58(5):1619-1637. doi: 10.1002/aic.12669. Epub 2011 May 31.
4
Paradigm shift in toxicity testing and modeling.
AAPS J. 2012 Sep;14(3):473-80. doi: 10.1208/s12248-012-9358-1. Epub 2012 Apr 20.
5
β-sheet topology prediction with high precision and recall for β and mixed α/β proteins.
PLoS One. 2012;7(3):e32461. doi: 10.1371/journal.pone.0032461. Epub 2012 Mar 9.
6
Structure prediction of loops with fixed and flexible stems.
J Phys Chem B. 2012 Jun 14;116(23):6670-82. doi: 10.1021/jp2113957. Epub 2012 Mar 2.

本文引用的文献

1
Profiling bioactivity of the ToxCast chemical library using BioMAP primary human cell systems.
J Biomol Screen. 2009 Oct;14(9):1054-66. doi: 10.1177/1087057109345525. Epub 2009 Sep 22.
2
Evaluation of high-throughput genotoxicity assays used in profiling the US EPA ToxCast chemicals.
Regul Toxicol Pharmacol. 2009 Nov;55(2):188-99. doi: 10.1016/j.yrtph.2009.07.004. Epub 2009 Jul 8.
3
The toxicity data landscape for environmental chemicals.
Environ Health Perspect. 2009 May;117(5):685-95. doi: 10.1289/ehp.0800168. Epub 2008 Dec 22.
4
Profiling the activity of environmental chemicals in prenatal developmental toxicity studies using the U.S. EPA's ToxRefDB.
Reprod Toxicol. 2009 Sep;28(2):209-19. doi: 10.1016/j.reprotox.2009.03.016. Epub 2009 Apr 10.
5
Profiling the reproductive toxicity of chemicals from multigeneration studies in the toxicity reference database.
Toxicol Sci. 2009 Jul;110(1):181-90. doi: 10.1093/toxsci/kfp080. Epub 2009 Apr 10.
6
Profiling chemicals based on chronic toxicity results from the U.S. EPA ToxRef Database.
Environ Health Perspect. 2009 Mar;117(3):392-9. doi: 10.1289/ehp.0800074. Epub 2008 Oct 20.
8
Descriptor-free molecular discovery in large libraries by adaptive substituent reordering.
Bioorg Med Chem Lett. 2008 Nov 15;18(22):5967-70. doi: 10.1016/j.bmcl.2008.09.068. Epub 2008 Sep 21.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验