Park Keunwan, Kim Dongsup
Department of Bio and Brain Engineering, KAIST, 373-1, Guseong-dong, Yuseong-gu, Daejeon, 305-701, Republic of Korea.
BMC Syst Biol. 2011;5 Suppl 2(Suppl 2):S12. doi: 10.1186/1752-0509-5-S2-S12. Epub 2011 Dec 14.
Drugs that bind to common targets likely exert similar activities. In this target-centric view, the inclusion of richer target information may better represent the relationships between drugs and their activities. Under this assumption, we expanded the "common binding rule" assumption of QSAR to create a new drug-drug relationship score (DRS).
Our method uses various chemical features to encode drug target information into the drug-drug relationship information. Specifically, drug pairs were transformed into numerical vectors containing the basal drug properties and their differences. After that, machine learning techniques such as data cleaning, dimension reduction, and ensemble classifier were used to prioritize drug pairs bound to a common target. In other words, the estimation of the drug-drug relationship is restated as a large-scale classification problem, which provides the framework for using state-of-the-art machine learning techniques with thousands of chemical features for newly defining drug-drug relationships.
Various aspects of the presented score were examined to determine its reliability and usefulness: the abundance of common domains for the predicted drug pairs, c.a. 80% coverage for known targets, successful identifications of unknown targets, and a meaningful correlation with another cutting-edge method for analyzing drug similarities. The most significant strength of our method is that the DRS can be used to describe phenotypic similarities, such as pharmacological effects.
作用于共同靶点的药物可能具有相似的活性。在这种以靶点为中心的观点中,纳入更丰富的靶点信息可能能更好地体现药物与其活性之间的关系。基于此假设,我们扩展了定量构效关系(QSAR)的“共同结合规则”假设,以创建一种新的药物 - 药物关系评分(DRS)。
我们的方法利用各种化学特征将药物靶点信息编码到药物 - 药物关系信息中。具体而言,药物对被转化为包含基础药物特性及其差异的数值向量。之后,使用诸如数据清理、降维和集成分类器等机器学习技术对作用于共同靶点的药物对进行优先级排序。换句话说,药物 - 药物关系的估计被重新表述为一个大规模分类问题,这为使用具有数千种化学特征的先进机器学习技术来重新定义药物 - 药物关系提供了框架。
对所提出评分的各个方面进行了检验,以确定其可靠性和实用性:预测药物对的共同结构域丰富,对已知靶点的覆盖率约为80%,成功识别未知靶点,以及与另一种用于分析药物相似性的前沿方法具有有意义的相关性。我们方法最显著的优势在于DRS可用于描述表型相似性,如药理作用。