Tsai Han-Hsuan D, Oware King D, Wright Fred A, Chiu Weihsueh A, Rusyn Ivan
Interdisciplinary Faculty of Toxicology, Texas A&M University, College Station, TX 77843, United States.
Department of Veterinary Physiology and Pharmacology, Texas A&M University, College Station, TX 77843, United States.
Toxicol Sci. 2025 Jun 1;205(2):310-325. doi: 10.1093/toxsci/kfaf036.
Key characteristics (KCs) are properties of chemicals that are associated with different types of human health hazards. KCs are used for systematic reviews in support of hazard identification. Transcriptomic data are a rich source of mechanistic data and are frequently interpreted through "enriched" pathways/gene sets. Such analyses may be challenging to interpret in regulatory science because of redundancy among pathways, complex data analyses, and unclear relevance to hazard identification. We hypothesized that by cross-mapping pathways/gene sets and KCs, the interpretability of transcriptomic data can be improved. We summarized 72 published KCs across 7 hazard traits into 34 umbrella KC terms. Gene sets from Reactome and Kyoto Encyclopedia of Genes and Genomes (KEGG) were mapped to these, resulting in "KC gene sets." These sets exhibit minimal overlap and vary in the number of genes. Comparisons of the same KC gene sets mapped from Reactome and KEGG revealed low similarity, indicating complementarity. Performance of these KC gene sets was tested using publicly available transcriptomic datasets of chemicals with known organ-specific toxicity: benzene and 2,3,7,8-tetrachlorodibenzo-p-dioxin tested in mouse liver and drugs sunitinib and amoxicillin tested in human-induced pluripotent stem cell-derived cardiomyocytes. We found that KC terms related to the mechanisms affected by tested compounds were highly enriched, while the negative control (amoxicillin) showed limited enrichment with marginal significance. This study's impact is in presenting a computational approach based on KCs for the analysis of toxicogenomic data and facilitating transparent interpretation of these data in the process of chemical hazard identification.
关键特性(KCs)是与不同类型人类健康危害相关的化学物质属性。KCs用于支持危害识别的系统综述。转录组数据是机制数据的丰富来源,常通过“富集”的途径/基因集进行解读。由于途径之间的冗余、复杂的数据分析以及与危害识别的相关性不明确,此类分析在监管科学中可能难以解释。我们假设通过交叉映射途径/基因集和KCs,可以提高转录组数据的可解释性。我们将7个危害特征的72个已发表的KCs总结为34个总体KC术语。将来自Reactome和京都基因与基因组百科全书(KEGG)的基因集映射到这些术语上,得到“KC基因集”。这些集合显示出最小的重叠,并且基因数量各不相同。对从Reactome和KEGG映射的相同KC基因集进行比较,结果显示相似性较低,表明具有互补性。使用具有已知器官特异性毒性的化学物质的公开转录组数据集测试了这些KC基因集的性能:在小鼠肝脏中测试的苯和2,3,7,8-四氯二苯并对二恶英,以及在人诱导多能干细胞衍生的心肌细胞中测试的药物舒尼替尼和阿莫西林。我们发现,与受试化合物影响的机制相关的KC术语高度富集,而阴性对照(阿莫西林)显示出有限的富集且具有边际显著性。本研究的影响在于提出了一种基于KCs的计算方法,用于分析毒理基因组数据,并在化学危害识别过程中促进对这些数据的透明解读。