Research Center of Nonlinear Science, College of Mathematics and Computer Science, Engineering Research Center of Hubei Province for Clothing Information, Wuhan Textile University, Wuhan 430200, P R. China.
Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States.
J Chem Inf Model. 2021 Apr 26;61(4):1691-1700. doi: 10.1021/acs.jcim.0c01294. Epub 2021 Mar 15.
Toxicity analysis is a major challenge in drug design and discovery. Recently significant progress has been made through machine learning due to its accuracy, efficiency, and lower cost. US Toxicology in the 21st Century (Tox21) screened a large library of compounds, including approximately 12 000 environmental chemicals and drugs, for different mechanisms responsible for eliciting toxic effects. The Tox21 Data Challenge offered a platform to evaluate different computational methods for toxicity predictions. Inspired by the success of multiscale weighted colored graph (MWCG) theory in protein-ligand binding affinity predictions, we consider MWCG theory for toxicity analysis. In the present work, we develop a geometric graph learning toxicity (GGL-Tox) model by integrating MWCG features and the gradient boosting decision tree (GBDT) algorithm. The benchmark tests of the Tox21 Data Challenge are employed to demonstrate the utility and usefulness of the proposed GGL-Tox model. An extensive comparison with other state-of-the-art models indicates that GGL-Tox is an accurate and efficient model for toxicity analysis and prediction.
毒性分析是药物设计和发现中的一个主要挑战。由于其准确性、效率和低成本,机器学习最近取得了重大进展。美国 21 世纪毒理学(Tox21)筛选了一个大型化合物库,包括大约 12000 种环境化学物质和药物,以研究引发毒性作用的不同机制。Tox21 数据挑战提供了一个平台,用于评估不同的计算方法进行毒性预测。受多尺度加权有色图(MWCG)理论在蛋白质-配体结合亲和力预测中成功的启发,我们考虑将 MWCG 理论应用于毒性分析。在本工作中,我们通过整合 MWCG 特征和梯度提升决策树(GBDT)算法,开发了一个几何图学习毒性(GGL-Tox)模型。采用 Tox21 数据挑战的基准测试来证明所提出的 GGL-Tox 模型的实用性和有效性。与其他最先进模型的广泛比较表明,GGL-Tox 是一种用于毒性分析和预测的准确且高效的模型。