Department of Food Science and Human Nutrition, University of Illinois, 1201 W Gregory Dr, Urbana-ChampaignUrbana, IL, 61801, USA.
Illinois Informatics Institute, University of Illinois, Urbana-Champaign, Urbana, IL, USA.
Sci Rep. 2020 Nov 5;10(1):19128. doi: 10.1038/s41598-020-76129-8.
Screening agrochemicals and pharmaceuticals for potential liver toxicity is required for regulatory approval and is an expensive and time-consuming process. The identification and utilization of early exposure gene signatures and robust predictive models in regulatory toxicity testing has the potential to reduce time and costs substantially. In this study, comparative supervised machine learning approaches were applied to the rat liver TG-GATEs dataset to develop feature selection and predictive testing. We identified ten gene biomarkers using three different feature selection methods that predicted liver necrosis with high specificity and selectivity in an independent validation dataset from the Microarray Quality Control (MAQC)-II study. Nine of the ten genes that were selected with the supervised methods are involved in metabolism and detoxification (Car3, Crat, Cyp39a1, Dcd, Lbp, Scly, Slc23a1, and Tkfc) and transcriptional regulation (Ablim3). Several of these genes are also implicated in liver carcinogenesis, including Crat, Car3 and Slc23a1. Our biomarker gene signature provides high statistical accuracy and a manageable number of genes to study as indicators to potentially accelerate toxicity testing based on their ability to induce liver necrosis and, eventually, liver cancer.
筛选农用化学品和药品的潜在肝毒性是监管批准所必需的,这是一个昂贵且耗时的过程。在监管毒性测试中识别和利用早期暴露基因特征和稳健的预测模型有潜力大大减少时间和成本。在这项研究中,比较监督机器学习方法被应用于大鼠肝 TG-GATEs 数据集,以开发特征选择和预测测试。我们使用三种不同的特征选择方法识别了十个基因生物标志物,这些生物标志物在来自 Microarray Quality Control (MAQC)-II 研究的独立验证数据集中以高特异性和选择性预测了肝坏死。用监督方法选择的十个基因中的九个参与代谢和解毒(Car3、Crat、Cyp39a1、Dcd、Lbp、Scly、Slc23a1 和 Tkfc)和转录调节(Ablim3)。其中一些基因也与肝癌发生有关,包括 Crat、Car3 和 Slc23a1。我们的生物标志物基因特征提供了高统计准确性和可管理数量的基因,作为潜在的指标,以研究基于其诱导肝坏死并最终肝癌的能力来加速毒性测试。