Du Bing-Xue, Qin Yuan, Jiang Yan-Feng, Xu Yi, Yiu Siu-Ming, Yu Hui, Shi Jian-Yu
School of Life Sciences, Northwestern Polytechnical University, Xi'an 710072, China.
Department of Computer Science, The University of Hong Kong, Hong Kong, China.
Drug Discov Today. 2022 May;27(5):1350-1366. doi: 10.1016/j.drudis.2022.02.023. Epub 2022 Mar 3.
The screening of compound-protein interactions (CPIs) is one of the most crucial steps in finding hit and lead compounds. Deep learning (DL) methods for CPI prediction can address intrinsic limitations of traditional HTS and virtual screening with the advantage of low cost and high efficiency. This review provides a comprehensive survey of DL-based CPI prediction. It first summarizes popular databases of small-molecule compounds, proteins and binding complexes. Then, it outlines classical representations of compounds and proteins in turn. After that, this review briefly introduces state-of-the-art DL-based models in terms of design paradigms and investigates their prediction performance. Finally, it indicates current challenges and trends toward better CPI prediction and sketches out crucial approaches toward practical applications.
化合物-蛋白质相互作用(CPI)的筛选是发现先导化合物和活性化合物的关键步骤之一。用于CPI预测的深度学习(DL)方法能够克服传统高通量筛选和虚拟筛选的固有局限性,具有低成本和高效率的优势。本文综述对基于DL的CPI预测进行了全面概述。首先总结了小分子化合物、蛋白质和结合复合物的常用数据库。然后依次概述了化合物和蛋白质的经典表示方法。之后,本文综述根据设计范式简要介绍了基于DL的最新模型,并研究了它们的预测性能。最后,指出了当前在更好地进行CPI预测方面面临的挑战和趋势,并勾勒了实际应用的关键方法。