Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States.
Center of Excellence in Environmental Toxicology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States.
Chem Res Toxicol. 2022 Aug 15;35(8):1370-1382. doi: 10.1021/acs.chemrestox.2c00074. Epub 2022 Jul 12.
ComptoxAI is a new data infrastructure for computational and artificial intelligence research in predictive toxicology. Here, we describe and showcase ComptoxAI's graph-structured knowledge base in the context of three real-world use-cases, demonstrating that it can rapidly answer complex questions about toxicology that are infeasible using previous technologies and data resources. These use-cases each demonstrate a tool for information retrieval from the knowledge base being used to solve a specific task: The "shortest path" module is used to identify mechanistic links between perfluorooctanoic acid (PFOA) exposure and nonalcoholic fatty liver disease; the "expand network" module identifies communities that are linked to dioxin toxicity; and the quantitative structure-activity relationship (QSAR) dataset generator predicts pregnane X receptor agonism in a set of 4,021 pesticide ingredients. The contents of ComptoxAI's source data are rigorously aggregated from a diverse array of public third-party databases, and ComptoxAI is designed as a free, public, and open-source toolkit to enable diverse classes of users including biomedical researchers, public health and regulatory officials, and the general public to predict toxicology of unknowns and modes of action.
ComptoxAI 是一个用于计算和人工智能预测毒理学研究的新数据基础设施。在这里,我们将在三个实际用例的背景下描述和展示 ComptoxAI 的图结构知识库,展示它如何快速回答使用以前的技术和数据资源难以回答的复杂毒理学问题。这些用例展示了从知识库中检索信息的工具,用于解决特定任务:“最短路径”模块用于识别全氟辛酸 (PFOA) 暴露与非酒精性脂肪性肝病之间的机制联系;“扩展网络”模块识别与二恶英毒性相关的社区;定量构效关系 (QSAR) 数据集生成器预测一组 4021 种农药成分中的孕烷 X 受体激动剂。ComptoxAI 的源数据内容是从各种公共第三方数据库中严格汇总而来的,ComptoxAI 被设计为一个免费、公共和开源的工具包,以使包括生物医学研究人员、公共卫生和监管官员以及普通公众在内的各种用户能够预测未知的毒理学和作用模式。