Suppr超能文献

计算机辅助啮齿动物致癌性预测。

Computer-aided rodent carcinogenicity prediction.

作者信息

Lagunin Alexey A, Dearden John C, Filimonov Dmitri A, Poroikov Vladimir V

机构信息

Institute of Biomedical Chemistry RAMS, Pogodinskaya Str. 10, Moscow 119121, Russia.

出版信息

Mutat Res. 2005 Oct 3;586(2):138-46. doi: 10.1016/j.mrgentox.2005.06.005.

Abstract

The potential of the computer program PASS (Prediction Activity Spectra for Substances) to predict rodent carcinogenicity for chemical compounds was studied. PASS predicts carcinogenicity of chemical compounds on the basis of their structural formula and of structure-activity relationship analysis of known carcinogens and non-carcinogens. The data on structures and experimental results of 2-year carcinogenicity assays for 412 chemicals from the NTP (National Toxicological Program) and 1190 chemicals from the CPDB (Carcinogenic Potency Database) were used in our study. The predictions take into consideration information about species and sex of animals. For evaluation of the predictive accuracy we used two procedures: leave-one-out cross-validation (LOO CV) and leave-20%-out cross-validation. In the last case we randomly divided the studied data set 20 times into two subsets. The data from the first subset, containing 80% of the compounds, were added to the PASS training set (which includes about 46,000 compounds with about 1500 biological activity types collected during the last 20 years to predict biological activity spectra), the second subset with 20% of the compounds was used as an evaluation set. The mean accuracy of prediction calculated by LOO CV is about 73% for NTP compounds in the 'equivocal' category of carcinogenic activity and 80% for NTP compounds in the 'evidence' category of carcinogenicity. The mean accuracy of prediction for the CPDB database is 89.9% calculated by LOO CV and 63.4% calculated by leave-20%-out cross-validation. Influence of incorporation of species and sex data on the accuracy of carcinogenicity prediction was also investigated. It was shown that the accuracy was increased only for data on male animals.

摘要

研究了计算机程序PASS(物质预测活性谱)预测化合物啮齿动物致癌性的潜力。PASS基于化合物的结构式以及已知致癌物和非致癌物的构效关系分析来预测致癌性。我们的研究使用了来自美国国家毒理学计划(NTP)的412种化学物质以及来自致癌 potency 数据库(CPDB)的1190种化学物质的两年致癌性试验的结构和实验结果数据。这些预测考虑了有关动物物种和性别的信息。为了评估预测准确性,我们使用了两种方法:留一法交叉验证(LOO CV)和留出20%法交叉验证。在最后一种情况下,我们将研究数据集随机分成两个子集20次。第一个子集的数据包含80%的化合物,被添加到PASS训练集(其中包括在过去20年中收集的约46000种具有约1500种生物活性类型的化合物,用于预测生物活性谱),第二个子集包含20%的化合物用作评估集。通过LOO CV计算得出,对于致癌活性处于“模棱两可”类别的NTP化合物,预测的平均准确率约为73%,对于致癌性处于“有证据”类别的NTP化合物,预测的平均准确率为8%。对于CPDB数据库,通过LOO CV计算得出的预测平均准确率为89.9%,通过留出20%法交叉验证计算得出的为63.4%。还研究了纳入物种和性别数据对致癌性预测准确性的影响。结果表明,仅对于雄性动物的数据,准确性有所提高。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验