Sridhar Padmavati, Kahveci Tamer, Ranka Sanjay
Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL 32611, USA.
Pac Symp Biocomput. 2007:88-99.
Post-genomic advances in bioinformatics have refined drug-design strategies, by focusing on the reduction of serious side-effects through the identification of enzymatic targets. We consider the problem of identifying the enzymes (i.e., drug targets), whose inhibition will stop the production of a given target set of compounds, while eliminating minimal number of non-target compounds. An exhaustive evaluation of all possible enzyme combinations to find the optimal solution subset may become computationally infeasible for very large metabolic networks. We propose a scalable iterative algorithm which computes a sub-optimal solution within reasonable time-bounds. Our algorithm is based on the intuition that we can arrive at a solution close to the optimal one by tracing backward from the target compounds. It evaluates immediate precursors of the target compounds and iteratively moves backwards to identify the enzymes whose inhibition will stop the production of the target compounds while incurring minimum side-effects. We show that our algorithm converges to a sub-optimal solution within a finite number of such iterations. Our experiments on the E. Coli metabolic network show that the average accuracy of our method deviates from that of the exhaustive search only by 0.02%. Our iterative algorithm is highly scalable. It can solve the problem for the entire metabolic network of Escherichia Coli in less than 10 seconds.
生物信息学领域的后基因组学进展改进了药物设计策略,其重点是通过识别酶靶点来减少严重的副作用。我们考虑识别酶(即药物靶点)的问题,抑制这些酶将阻止给定目标化合物集的产生,同时消除最少数量的非目标化合物。对于非常大的代谢网络,详尽评估所有可能的酶组合以找到最优解子集在计算上可能不可行。我们提出一种可扩展的迭代算法,该算法能在合理的时间范围内计算出次优解。我们的算法基于这样一种直觉,即我们可以通过从目标化合物向后追溯来得到接近最优解的解。它评估目标化合物的直接前体,并迭代地向后移动以识别那些抑制后将阻止目标化合物产生同时产生最小副作用的酶。我们表明,在有限次数的此类迭代内,我们的算法会收敛到一个次优解。我们在大肠杆菌代谢网络上的实验表明,我们方法的平均准确率与详尽搜索的平均准确率相比仅相差0.02%。我们的迭代算法具有高度可扩展性。它能在不到10秒的时间内解决大肠杆菌整个代谢网络的问题。