深度癌症预测：使用模型级表示的深度学习驱动的致癌性预测

DeepCarc: Deep Learning-Powered Carcinogenicity Prediction Using Model-Level Representation.

作者信息

Li Ting, Tong Weida, Roberts Ruth, Liu Zhichao, Thakkar Shraddha

机构信息

Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, United States.

University of Arkansas at Little Rock and University of Arkansas for Medical Sciences Joint Bioinformatics Program, Little Rock, AR, United States.

出版信息

Front Artif Intell. 2021 Nov 18;4:757780. doi: 10.3389/frai.2021.757780. eCollection 2021.

DOI:10.3389/frai.2021.757780

PMID:34870186

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8636933/

Abstract

Carcinogenicity testing plays an essential role in identifying carcinogens in environmental chemistry and drug development. However, it is a time-consuming and label-intensive process to evaluate the carcinogenic potency with conventional 2-years rodent animal studies. Thus, there is an urgent need for alternative approaches to providing reliable and robust assessments on carcinogenicity. In this study, we proposed a DeepCarc model to predict carcinogenicity for small molecules using deep learning-based model-level representations. The DeepCarc Model was developed using a data set of 692 compounds and evaluated on a test set containing 171 compounds in the National Center for Toxicological Research liver cancer database (NCTRlcdb). As a result, the proposed DeepCarc model yielded a Matthews correlation coefficient (MCC) of 0.432 for the test set, outperforming four advanced deep learning (DL) powered quantitative structure-activity relationship (QSAR) models with an average improvement rate of 37%. Furthermore, the DeepCarc model was also employed to screen the carcinogenicity potential of the compounds from both DrugBank and Tox21. Altogether, the proposed DeepCarc model could serve as an early detection tool (https://github.com/TingLi2016/DeepCarc) for carcinogenicity assessment.

摘要

致癌性测试在环境化学和药物开发中识别致癌物方面发挥着重要作用。然而，用传统的两年期啮齿动物研究来评估致癌潜力是一个耗时且标记密集的过程。因此，迫切需要替代方法来对致癌性进行可靠且有力的评估。在本研究中，我们提出了一种DeepCarc模型，利用基于深度学习的模型级表示来预测小分子的致癌性。DeepCarc模型是使用一个包含692种化合物的数据集开发的，并在美国国家毒理学研究中心肝癌数据库（NCTRlcdb）中一个包含171种化合物的测试集上进行了评估。结果，所提出的DeepCarc模型在测试集上的马修斯相关系数（MCC）为0.432，优于四个先进的深度学习（DL）驱动的定量构效关系（QSAR）模型，平均提高率为37%。此外，DeepCarc模型还被用于筛选DrugBank和Tox21中化合物的致癌潜力。总之,所提出的DeepCarc模型可以作为一种用于致癌性评估的早期检测工具(https://github.com/TingLi2016/DeepCarc)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a65/8636933/2a385152e8ca/frai-04-757780-g001.jpg

相似文献

DeepCarc: Deep Learning-Powered Carcinogenicity Prediction Using Model-Level Representation.

Front Artif Intell. 2021 Nov 18;4:757780. doi: 10.3389/frai.2021.757780. eCollection 2021.

Corrigendum: DeepCarc: Deep learning-powered carcinogenicity prediction using model-level representation.

Front Artif Intell. 2022 Nov 28;5:1046668. doi: 10.3389/frai.2022.1046668. eCollection 2022.

DeepDILI: Deep Learning-Powered Drug-Induced Liver Injury Prediction Using Model-Level Representation.

Chem Res Toxicol. 2021 Feb 15;34(2):550-565. doi: 10.1021/acs.chemrestox.0c00374. Epub 2020 Dec 23.

Prediction of rodent carcinogenic potential of naturally occurring chemicals in the human diet using high-throughput QSAR predictive modeling.

Toxicol Appl Pharmacol. 2007 Jul 1;222(1):1-16. doi: 10.1016/j.taap.2007.03.012. Epub 2007 Mar 24.

Comparative analysis of predictive models for nongenotoxic hepatocarcinogenicity using both toxicogenomics and quantitative structure-activity relationships.

Chem Res Toxicol. 2011 Jul 18;24(7):1062-70. doi: 10.1021/tx2000637. Epub 2011 Jun 20.

A graph neural network approach for molecule carcinogenicity prediction.

Bioinformatics. 2022 Jun 24;38(Suppl 1):i84-i91. doi: 10.1093/bioinformatics/btac266.

Development and validation of a robust QSAR model for prediction of carcinogenicity of drugs.

Indian J Biochem Biophys. 2011 Apr;48(2):111-22.

Carcinogenicity prediction using the index of ideality of correlation.

SAR QSAR Environ Res. 2022 Jun;33(6):419-428. doi: 10.1080/1062936X.2022.2076736. Epub 2022 Jun 1.

Deciphering exogenous chemical carcinogenicity through interpretable deep learning: A novel approach for evaluating atmospheric pollutant hazards.

J Hazard Mater. 2024 Mar 5;465:133092. doi: 10.1016/j.jhazmat.2023.133092. Epub 2023 Nov 25.

Carcinogenicity of the aromatic amines: from structure-activity relationships to mechanisms of action and risk assessment.

Mutat Res. 2002 Jul;511(3):191-206. doi: 10.1016/s1383-5742(02)00008-x.

引用本文的文献

Recent advances in AI-based toxicity prediction for drug discovery.

Front Chem. 2025 Jul 8;13:1632046. doi: 10.3389/fchem.2025.1632046. eCollection 2025.

Leveraging machine learning models in evaluating ADMET properties for drug discovery and development.

ADMET DMPK. 2025 Jun 7;13(3):2772. doi: 10.5599/admet.2772. eCollection 2025.

Role of artificial intelligence in revolutionizing drug discovery.

Fundam Res. 2024 May 9;5(3):1273-1287. doi: 10.1016/j.fmre.2024.04.021. eCollection 2025 May.

Comparative Analysis of Recurrent Neural Networks with Conjoint Fingerprints for Skin Corrosion Prediction.

J Chem Inf Model. 2025 Feb 10;65(3):1305-1317. doi: 10.1021/acs.jcim.4c02062. Epub 2025 Jan 21.

Four functional genotoxic marker genes (Bax, Btg2, Ccng1, and Cdkn1a) discriminate genotoxic hepatocarcinogens from non-genotoxic hepatocarcinogens and non-genotoxic non-hepatocarcinogens in rat public toxicogenomics data, Open TG-GATEs.

Genes Environ. 2024 Dec 19;46(1):28. doi: 10.1186/s41021-024-00322-8.

Advancing Drug Safety in Drug Development: Bridging Computational Predictions for Enhanced Toxicity Prediction.

Chem Res Toxicol. 2024 Jun 17;37(6):827-849. doi: 10.1021/acs.chemrestox.3c00352. Epub 2024 May 17.

Protecting Human and Animal Health: The Road from Animal Models to New Approach Methods.

Pharmacol Rev. 2024 Feb 13;76(2):251-266. doi: 10.1124/pharmrev.123.000967.

Revolutionizing Medicinal Chemistry: The Application of Artificial Intelligence (AI) in Early Drug Discovery.

Pharmaceuticals (Basel). 2023 Sep 6;16(9):1259. doi: 10.3390/ph16091259.

Advances of Artificial Intelligence in Anti-Cancer Drug Design: A Review of the Past Decade.

Pharmaceuticals (Basel). 2023 Feb 7;16(2):253. doi: 10.3390/ph16020253.

TransOrGAN: An Artificial Intelligence Mapping of Rat Transcriptomic Profiles between Organs, Ages, and Sexes.

Chem Res Toxicol. 2023 Jun 19;36(6):916-925. doi: 10.1021/acs.chemrestox.3c00037. Epub 2023 May 18.

本文引用的文献

Safer chemicals using less animals: kick-off of the European ONTOX project.

Toxicology. 2021 Jun 30;458:152846. doi: 10.1016/j.tox.2021.152846.

A Deep Convolutional Neural Network Method to Detect Seizures and Characteristic Frequencies Using Epileptic Electroencephalogram (EEG) Data.

IEEE J Transl Eng Health Med. 2021 Jan 11;9:2000112. doi: 10.1109/JTEHM.2021.3050925. eCollection 2021.

Deep convolutional neural networks to predict cardiovascular risk from computed tomography.

Nat Commun. 2021 Jan 29;12(1):715. doi: 10.1038/s41467-021-20966-2.

DeepDILI: Deep Learning-Powered Drug-Induced Liver Injury Prediction Using Model-Level Representation.

Chem Res Toxicol. 2021 Feb 15;34(2):550-565. doi: 10.1021/acs.chemrestox.0c00374. Epub 2020 Dec 23.

Deep Learning on High-Throughput Transcriptomics to Predict Drug-Induced Liver Injury.

Front Bioeng Biotechnol. 2020 Nov 27;8:562677. doi: 10.3389/fbioe.2020.562677. eCollection 2020.

Neural Network Vessel Lumen Regression for Automated Lumen Cross-Section Segmentation in Cardiovascular Image-Based Modeling.

Cardiovasc Eng Technol. 2020 Dec;11(6):621-635. doi: 10.1007/s13239-020-00497-5. Epub 2020 Nov 11.

PubChem in 2021: new data content and improved web interfaces.

Nucleic Acids Res. 2021 Jan 8;49(D1):D1388-D1395. doi: 10.1093/nar/gkaa971.

Computational Approaches to Identify Structural Alerts and Their Applications in Environmental Toxicology and Drug Discovery.

Chem Res Toxicol. 2020 Jun 15;33(6):1312-1322. doi: 10.1021/acs.chemrestox.0c00006. Epub 2020 Mar 5.

A cross-sector call to improve carcinogenicity risk assessment through use of genomic methodologies.

Regul Toxicol Pharmacol. 2020 Feb;110:104526. doi: 10.1016/j.yrtph.2019.104526. Epub 2019 Nov 11.

CapsCarcino: A novel sparse data deep learning tool for predicting carcinogens.

Food Chem Toxicol. 2020 Jan;135:110921. doi: 10.1016/j.fct.2019.110921. Epub 2019 Oct 25.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

深度癌症预测：使用模型级表示的深度学习驱动的致癌性预测

DeepCarc: Deep Learning-Powered Carcinogenicity Prediction Using Model-Level Representation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献