Xu Kaiyi, Wang Minhui, Zou Xin, Liu Jingjing, Wei Ao, Chen Jiajia, Tang Chang
School of Computer Science, China University of Geosciences, Wuhan 430074, China.
Department of Pharmacy, Lianshui People's Hospital Affiliated to Kangda College of Nanjing Medical University, Huai'an 223300, China.
Neural Netw. 2025 Jan;181:106779. doi: 10.1016/j.neunet.2024.106779. Epub 2024 Oct 23.
Identifying the frequencies of drug-side effects is crucial for assessing drug risk-benefit. However, accurately determining these frequencies remains challenging due to the limitations of time and scale in clinical randomized controlled trials. As a result, several computational methods have been proposed to address these issues. Nonetheless, two primary problems still persist. Firstly, most of these methods face challenges in generating accurate predictions for novel drugs, as they heavily depend on the interaction graph between drugs and side effects (SEs) within their modeling framework. Secondly, some previous methods often simply concatenate the features of drugs and SEs, which fails to effectively capture their underlying association. In this work, we present HSTrans, a novel approach that treats drugs and SEs as sets of substructures, leveraging a transformer encoder for unified substructure embedding and incorporating an interaction module for association capture. Specifically, HSTrans extracts drug substructures through a specialized algorithm and identifies effective substructures for each SE by employing an indicator that measures the importance of each substructure and SE. Additionally, HSTrans applies convolutional neural network (CNN) in the interaction module to capture complex relationships between drugs and SEs. Experimental results on datasets from Galeano et al.'s study demonstrate that the proposed method outperforms other state-of-the-art approaches. The demo codes for HSTrans are available at https://github.com/Dtdtxuky/HSTrans/tree/master.
确定药物副作用的发生频率对于评估药物的风险效益至关重要。然而,由于临床随机对照试验在时间和规模上的限制,准确确定这些频率仍然具有挑战性。因此,人们提出了几种计算方法来解决这些问题。尽管如此,两个主要问题仍然存在。首先,这些方法中的大多数在对新药进行准确预测时面临挑战,因为它们在建模框架中严重依赖于药物与副作用(SEs)之间的相互作用图。其次,一些先前的方法通常只是简单地将药物和SEs的特征连接起来,这无法有效地捕捉它们潜在的关联。在这项工作中,我们提出了HSTrans,这是一种新颖的方法,它将药物和SEs视为子结构集,利用变压器编码器进行统一的子结构嵌入,并结合一个相互作用模块来捕捉关联。具体来说,HSTrans通过一种专门的算法提取药物子结构,并通过使用一个衡量每个子结构和SE重要性的指标为每个SE识别有效的子结构。此外,HSTrans在相互作用模块中应用卷积神经网络(CNN)来捕捉药物和SEs之间的复杂关系。来自加利亚诺等人研究的数据集上的实验结果表明,所提出的方法优于其他现有技术方法。HSTrans的演示代码可在https://github.com/Dtdtxuky/HSTrans/tree/master获取。