Shin Sangyeon, Lee Chanhee, Park Taesung
Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea.
Department of Statistics, Seoul National University, Seoul, Republic of Korea.
Arch Toxicol. 2025 Jun 4. doi: 10.1007/s00204-025-04089-x.
Drug-Induced Liver Injury (DILI) is a major challenge in drug development, occurring due to liver damage caused by the adverse effects of drugs or xenobiotics. High-throughput transcriptomics (HTTr) provides mechanistic insights into drug-induced hepatotoxicity, complementing traditional chemical structure-based methods. To address the challenges posed by DILI, this study aimed to evaluate the suitability of HTTr data for DILI classification and prediction. Initially, we reviewed the current landscape of HTTr-based DILI research, focusing on public datasets, computational tools, and bioinformatics techniques. Building on this foundation, we analyzed HTTr data from the Open TG-GATEs database, which includes primary human hepatocytes treated with 146 drugs at three concentrations. Gene expression data alone had limited ability to classify DILI phenotypes, performing similarly to chemical structure-based models. However, targeted gene sets improved clustering performance, and changes in clustering performance across concentration levels indicated that concentration information influences toxicity analysis. Machine learning models showed that integrating gene expression and chemical structure data enhanced predictive accuracy, emphasizing the need for multi-modal approaches. These findings underscore HTTr as a valuable tool for advancing DILI classification and prediction, contributing to more reliable drug safety assessments.
药物性肝损伤(DILI)是药物研发中的一项重大挑战,它是由药物或外源性物质的不良反应导致肝脏损伤而引起的。高通量转录组学(HTTr)为药物性肝毒性提供了机制性见解,对传统的基于化学结构的方法起到了补充作用。为应对DILI带来的挑战,本研究旨在评估HTTr数据用于DILI分类和预测的适用性。最初,我们回顾了基于HTTr的DILI研究的现状,重点关注公共数据集、计算工具和生物信息学技术。在此基础上,我们分析了来自Open TG-GATEs数据库的HTTr数据,该数据库包含用146种药物在三个浓度下处理的原代人肝细胞。仅基因表达数据对DILI表型进行分类的能力有限,其表现与基于化学结构的模型相似。然而,靶向基因集提高了聚类性能,并且跨浓度水平的聚类性能变化表明浓度信息会影响毒性分析。机器学习模型表明,整合基因表达和化学结构数据可提高预测准确性,强调了多模态方法的必要性。这些发现强调了HTTr作为推进DILI分类和预测的有价值工具,有助于进行更可靠的药物安全性评估。