Department of Data Analysis and Mathematical Modelling, Ghent University, Coupure Links 653, 9000, Ghent, Belgium.
Department of Computing, University of Turku, 20500, Turku, Finland.
BMC Bioinformatics. 2024 Feb 6;25(1):59. doi: 10.1186/s12859-024-05684-y.
The prediction of interactions between novel drugs and biological targets is a vital step in the early stage of the drug discovery pipeline. Many deep learning approaches have been proposed over the last decade, with a substantial fraction of them sharing the same underlying two-branch architecture. Their distinction is limited to the use of different types of feature representations and branches (multi-layer perceptrons, convolutional neural networks, graph neural networks and transformers). In contrast, the strategy used to combine the outputs (embeddings) of the branches has remained mostly the same. The same general architecture has also been used extensively in the area of recommender systems, where the choice of an aggregation strategy is still an open question. In this work, we investigate the effectiveness of three different embedding aggregation strategies in the area of drug-target interaction (DTI) prediction. We formally define these strategies and prove their universal approximator capabilities. We then present experiments that compare the different strategies on benchmark datasets from the area of DTI prediction, showcasing conditions under which specific strategies could be the obvious choice.
预测新型药物与生物靶标的相互作用是药物发现管道早期的重要步骤。在过去十年中,已经提出了许多深度学习方法,其中很大一部分具有相同的基础双分支架构。它们的区别仅限于使用不同类型的特征表示和分支(多层感知机、卷积神经网络、图神经网络和转换器)。相比之下,用于组合分支输出(嵌入)的策略基本保持不变。在推荐系统领域也广泛使用相同的通用架构,其中聚合策略的选择仍然是一个悬而未决的问题。在这项工作中,我们研究了三种不同的嵌入聚合策略在药物-靶标相互作用(DTI)预测领域的有效性。我们正式定义了这些策略,并证明了它们的通用逼近能力。然后,我们展示了在 DTI 预测领域的基准数据集上进行的不同策略的实验比较,展示了在特定策略可能是明显选择的条件下。