针对梨形四膜虫测试的化学毒物的组合定量构效关系建模。

Combinatorial QSAR modeling of chemical toxicants tested against Tetrahymena pyriformis.

作者信息

Zhu Hao, Tropsha Alexander, Fourches Denis, Varnek Alexandre, Papa Ester, Gramatica Paola, Oberg Tomas, Dao Phuong, Cherkasov Artem, Tetko Igor V

机构信息

Laboratory for Molecular Modeling, Division of Medicinal Chemistry and Natural Products, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.

出版信息

J Chem Inf Model. 2008 Apr;48(4):766-84. doi: 10.1021/ci700443v. Epub 2008 Mar 1.

DOI:10.1021/ci700443v

PMID:18311912

Abstract

Selecting most rigorous quantitative structure-activity relationship (QSAR) approaches is of great importance in the development of robust and predictive models of chemical toxicity. To address this issue in a systematic way, we have formed an international virtual collaboratory consisting of six independent groups with shared interests in computational chemical toxicology. We have compiled an aqueous toxicity data set containing 983 unique compounds tested in the same laboratory over a decade against Tetrahymena pyriformis. A modeling set including 644 compounds was selected randomly from the original set and distributed to all groups that used their own QSAR tools for model development. The remaining 339 compounds in the original set (external set I) as well as 110 additional compounds (external set II) published recently by the same laboratory (after this computational study was already in progress) were used as two independent validation sets to assess the external predictive power of individual models. In total, our virtual collaboratory has developed 15 different types of QSAR models of aquatic toxicity for the training set. The internal prediction accuracy for the modeling set ranged from 0.76 to 0.93 as measured by the leave-one-out cross-validation correlation coefficient ( Q abs2). The prediction accuracy for the external validation sets I and II ranged from 0.71 to 0.85 (linear regression coefficient R absI2) and from 0.38 to 0.83 (linear regression coefficient R absII2), respectively. The use of an applicability domain threshold implemented in most models generally improved the external prediction accuracy but at the same time led to a decrease in chemical space coverage. Finally, several consensus models were developed by averaging the predicted aquatic toxicity for every compound using all 15 models, with or without taking into account their respective applicability domains. We find that consensus models afford higher prediction accuracy for the external validation data sets with the highest space coverage as compared to individual constituent models. Our studies prove the power of a collaborative and consensual approach to QSAR model development. The best validated models of aquatic toxicity developed by our collaboratory (both individual and consensus) can be used as reliable computational predictors of aquatic toxicity and are available from any of the participating laboratories.

摘要

选择最严格的定量构效关系（QSAR）方法对于开发可靠且具有预测性的化学毒性模型至关重要。为了系统地解决这一问题，我们组建了一个国际虚拟合作团队，由六个对计算化学毒理学有共同兴趣的独立小组组成。我们汇编了一个水相毒性数据集，其中包含在十年间于同一实验室针对梨形四膜虫测试的983种独特化合物。从原始数据集中随机选择了一个包含644种化合物的建模集，并分发给所有使用各自QSAR工具进行模型开发的小组。原始数据集中剩余的339种化合物（外部集I）以及同一实验室最近发表的另外110种化合物（外部集II）（在本计算研究已经进行之后）被用作两个独立的验证集，以评估各个模型的外部预测能力。我们的虚拟合作团队总共为训练集开发了15种不同类型的水生毒性QSAR模型。通过留一法交叉验证相关系数（Q abs2）衡量，建模集的内部预测准确率在0.76至0.93之间。外部验证集I和II的预测准确率分别在0.71至0.85（线性回归系数R absI2）和0.38至0.83（线性回归系数R absII2）之间。大多数模型中实施的适用域阈值的使用通常提高了外部预测准确率，但同时导致化学空间覆盖率下降。最后，通过使用所有15个模型对每种化合物的预测水生毒性进行平均，开发了几个共识模型，无论是否考虑其各自的适用域。我们发现，与各个组成模型相比，共识模型为外部验证数据集提供了更高的预测准确率，且具有最高的空间覆盖率。我们的研究证明了合作和共识方法在QSAR模型开发中的力量。我们合作团队开发的经过最佳验证的水生毒性模型（包括个体模型和共识模型）可作为可靠的水生毒性计算预测工具，可从任何一个参与实验室获取。

相似文献

Combinatorial QSAR modeling of chemical toxicants tested against Tetrahymena pyriformis.

J Chem Inf Model. 2008 Apr;48(4):766-84. doi: 10.1021/ci700443v. Epub 2008 Mar 1.

Critical assessment of QSAR models of environmental toxicity against Tetrahymena pyriformis: focusing on applicability domain and overfitting by variable selection.

J Chem Inf Model. 2008 Sep;48(9):1733-46. doi: 10.1021/ci800151m. Epub 2008 Aug 26.

QSTR with extended topochemical atom (ETA) indices. 12. QSAR for the toxicity of diverse aromatic compounds to Tetrahymena pyriformis using chemometric tools.

Chemosphere. 2009 Nov;77(7):999-1009. doi: 10.1016/j.chemosphere.2009.07.072. Epub 2009 Aug 25.

Application of random forest approach to QSAR prediction of aquatic toxicity.

J Chem Inf Model. 2009 Nov;49(11):2481-8. doi: 10.1021/ci900203n.

Classification of a diverse set of Tetrahymena pyriformis toxicity chemical compounds from molecular descriptors by statistical learning methods.

Chem Res Toxicol. 2006 Aug;19(8):1030-9. doi: 10.1021/tx0600550.

Novel inhibitors of human histone deacetylase (HDAC) identified by QSAR modeling of known inhibitors, virtual screening, and experimental validation.

J Chem Inf Model. 2009 Feb;49(2):461-76. doi: 10.1021/ci800366f.

Application of predictive QSAR models to database mining: identification and experimental validation of novel anticonvulsant compounds.

J Med Chem. 2004 Apr 22;47(9):2356-64. doi: 10.1021/jm030584q.

Combinatorial QSAR modeling of specificity and subtype selectivity of ligands binding to serotonin receptors 5HT1E and 5HT1F.

J Chem Inf Model. 2008 May;48(5):997-1013. doi: 10.1021/ci700404c. Epub 2008 May 10.

Assessing the reliability of a QSAR model's predictions.

J Mol Graph Model. 2005 Jun;23(6):503-23. doi: 10.1016/j.jmgm.2005.03.003.

A novel approach to predict aquatic toxicity from molecular structure.

Chemosphere. 2008 Sep;73(3):415-27. doi: 10.1016/j.chemosphere.2008.05.024. Epub 2008 Jul 1.

引用本文的文献

Enhancing Transthyretin Binding Affinity Prediction with a Consensus Model: Insights from the Tox24 Challenge.

Chem Res Toxicol. 2025 May 19;38(5):900-908. doi: 10.1021/acs.chemrestox.4c00560. Epub 2025 Apr 26.

Application of Machine Learning and Mechanistic Modeling to Predict Intravenous Pharmacokinetic Profiles in Humans.

J Med Chem. 2025 Apr 10;68(7):7737-7750. doi: 10.1021/acs.jmedchem.5c00340. Epub 2025 Mar 27.

The round-robin approach applied to nanoinformatics: consensus prediction of nanomaterials zeta potential.

Beilstein J Nanotechnol. 2024 Nov 29;15:1536-1553. doi: 10.3762/bjnano.15.121. eCollection 2024.

Predicting Chemical Immunotoxicity through Data-Driven QSAR Modeling of Aryl Hydrocarbon Receptor Agonism and Related Toxicity Mechanisms.

Environ Health (Wash). 2024 May 28;2(7):474-485. doi: 10.1021/envhealth.4c00026. eCollection 2024 Jul 19.

Rational Design of Multifunctional Ferulic Acid Derivatives Aimed for Alzheimer's and Parkinson's Diseases.

Antioxidants (Basel). 2023 Jun 11;12(6):1256. doi: 10.3390/antiox12061256.

Retrieval, Selection, and Evaluation of Chemical Property Data for Assessments of Chemical Emissions, Fate, Hazard, Exposure, and Risks.

ACS Environ Au. 2022 Jul 19;2(5):376-395. doi: 10.1021/acsenvironau.2c00010. eCollection 2022 Sep 21.

Data-Driven Quantitative Structure-Activity Relationship Modeling for Human Carcinogenicity by Chronic Oral Exposure.

Environ Sci Technol. 2023 Apr 25;57(16):6573-6588. doi: 10.1021/acs.est.3c00648. Epub 2023 Apr 11.

Exposing the Limitations of Molecular Machine Learning with Activity Cliffs.

J Chem Inf Model. 2022 Dec 12;62(23):5938-5951. doi: 10.1021/acs.jcim.2c01073. Epub 2022 Dec 1.

CADMA-Chem: A Computational Protocol Based on Chemical Properties Aimed to Design Multifunctional Antioxidants.

Int J Mol Sci. 2022 Oct 31;23(21):13246. doi: 10.3390/ijms232113246.

Chalcone Derivatives with a High Potential as Multifunctional Antioxidant Neuroprotectors.

ACS Omega. 2022 Oct 18;7(43):38254-38268. doi: 10.1021/acsomega.2c05518. eCollection 2022 Nov 1.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

针对梨形四膜虫测试的化学毒物的组合定量构效关系建模。

Combinatorial QSAR modeling of chemical toxicants tested against Tetrahymena pyriformis.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献