Liu Tao, Chen Lei, Pan Xiaoyong
College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China.
Department of Medical Informatics, Erasmus MC, Rotterdam, Netherlands.
Comb Chem High Throughput Screen. 2018;21(6):403-410. doi: 10.2174/1386207321666180601075428.
Chemical toxicity effect is one of the major reasons for declining candidate drugs. Detecting the toxicity effects of all chemicals can accelerate the procedures of drug discovery. However, it is time-consuming and expensive to identify the toxicity effects of a given chemical through traditional experiments. Designing quick, reliable and non-animal-involved computational methods is an alternative way.
In this study, a novel integrated multi-label classifier was proposed. First, based on five types of chemical-chemical interactions retrieved from STITCH, each of which is derived from one aspect of chemicals, five individual classifiers were built. Then, several integrated classifiers were built by integrating some or all individual classifiers.
By testing the integrated classifiers on a dataset with chemicals and their toxicity effects in Accelrys Toxicity database and non-toxic chemicals with their performance evaluated by jackknife test, an optimal integrated classifier was selected as the proposed classifier, which provided quite high prediction accuracies and wide applications.
化学毒性效应是候选药物减少的主要原因之一。检测所有化学物质的毒性效应可以加速药物发现的进程。然而,通过传统实验来确定一种给定化学物质的毒性效应既耗时又昂贵。设计快速、可靠且不涉及动物的计算方法是一种替代途径。
在本研究中,提出了一种新型的集成多标签分类器。首先,基于从STITCH检索到的五种类型的化学物质 - 化学物质相互作用(每种相互作用都源自化学物质的一个方面)构建了五个单独的分类器。然后,通过整合部分或所有单独的分类器构建了几个集成分类器。
通过在Accelrys毒性数据库中使用包含化学物质及其毒性效应的数据集以及通过留一法测试评估性能的无毒化学物质对集成分类器进行测试,选择了一个最优的集成分类器作为所提出的分类器,该分类器具有相当高的预测准确率和广泛的应用。