Tao Wen, Liu Yuansheng, Lin Xuan, Song Bosheng, Zeng Xiangxiang
College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082 Hunan, China.
School of Computer Science, Xiangtan University, Xiangtan, 411105 Hunan, China.
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad371.
Drug-gene interaction prediction occupies a crucial position in various areas of drug discovery, such as drug repurposing, lead discovery and off-target detection. Previous studies show good performance, but they are limited to exploring the binding interactions and ignoring the other interaction relationships. Graph neural networks have emerged as promising approaches owing to their powerful capability of modeling correlations under drug-gene bipartite graphs. Despite the widespread adoption of graph neural network-based methods, many of them experience performance degradation in situations where high-quality and sufficient training data are unavailable. Unfortunately, in practical drug discovery scenarios, interaction data are often sparse and noisy, which may lead to unsatisfactory results. To undertake the above challenges, we propose a novel Dynamic hyperGraph Contrastive Learning (DGCL) framework that exploits local and global relationships between drugs and genes. Specifically, graph convolutions are adopted to extract explicit local relations among drugs and genes. Meanwhile, the cooperation of dynamic hypergraph structure learning and hypergraph message passing enables the model to aggregate information in a global region. With flexible global-level messages, a self-augmented contrastive learning component is designed to constrain hypergraph structure learning and enhance the discrimination of drug/gene representations. Experiments conducted on three datasets show that DGCL is superior to eight state-of-the-art methods and notably gains a 7.6% performance improvement on the DGIdb dataset. Further analyses verify the robustness of DGCL for alleviating data sparsity and over-smoothing issues.
药物-基因相互作用预测在药物发现的各个领域占据着关键地位,如药物再利用、先导化合物发现和脱靶检测。先前的研究表现良好,但它们仅限于探索结合相互作用,而忽略了其他相互作用关系。图神经网络因其在药物-基因二分图下强大的相关性建模能力而成为有前景的方法。尽管基于图神经网络的方法被广泛采用,但其中许多方法在缺乏高质量和足够训练数据的情况下会出现性能下降。不幸的是,在实际的药物发现场景中,相互作用数据往往稀疏且有噪声,这可能导致不尽如人意的结果。为了应对上述挑战,我们提出了一种新颖的动态超图对比学习(DGCL)框架,该框架利用药物和基因之间的局部和全局关系。具体来说,采用图卷积来提取药物和基因之间明确的局部关系。同时,动态超图结构学习和超图消息传递的协作使模型能够在全局区域聚合信息。借助灵活的全局级消息,设计了一个自增强对比学习组件来约束超图结构学习并增强药物/基因表示的辨别力。在三个数据集上进行的实验表明,DGCL优于八种最先进的方法,并且在DGIdb数据集上显著提高了7.6%的性能。进一步的分析验证了DGCL在缓解数据稀疏性和过平滑问题方面的鲁棒性。