Wang Shuai, Hui Cui, Zhang Tiangang, Wu Peiliang, Nakaguchi Toshiya, Xuan Ping
School of Information Science and Engineering, Yanshan University, Qinhuangdao 066004, China.
Department of Computer Science and Information Technology, La Trobe University, Melbourne 3083, Australia.
J Chem Inf Model. 2023 Nov 13;63(21):6947-6958. doi: 10.1021/acs.jcim.3c01214. Epub 2023 Oct 31.
An increasing number of studies have shown that dysregulation of lncRNAs is related to the occurrence of various diseases. Most of the previous methods, however, are designed based on homogeneity assumption that the representation of a target lncRNA (or disease) node should be updated by aggregating the attributes of its neighbor nodes. However, the assumption ignores the affinity nodes that are far from the target node. We present a novel prediction method, GAIRD, to fully leverage the heterogeneous information in the network and the decoupled node features. The first major innovation is a random walk strategy based on width-first searching and depth-first searching. Different from previous methods that only focus on homogeneous information, our new strategy learns both the homogeneous information within local neighborhoods and the heterogeneous information within higher-order neighborhoods. The second innovation is a representation decoupling module to extract the purer attributes and the purer topologies. Third, a module based on group convolution and deep separable convolution is developed to promote the pairwise intrachannel and interchannel feature learning. The experimental results show that GAIRD outperforms comparing state-of-the-art methods, and the ablation studies prove the contributions of major innovations. We also performed case studies on 3 diseases to further demonstrate the effectiveness of the GAIRD model in applications.
越来越多的研究表明,长链非编码RNA(lncRNAs)的失调与各种疾病的发生有关。然而,以前的大多数方法都是基于同质性假设设计的,即目标lncRNA(或疾病)节点的表示应该通过聚合其相邻节点的属性来更新。然而,该假设忽略了远离目标节点的亲和节点。我们提出了一种新颖的预测方法GAIRD,以充分利用网络中的异构信息和解耦的节点特征。第一个主要创新是基于广度优先搜索和深度优先搜索的随机游走策略。与以前只关注同质性信息的方法不同,我们的新策略既学习局部邻域内的同质性信息,也学习高阶邻域内的异构信息。第二个创新是一个表示解耦模块,用于提取更纯粹的属性和更纯粹的拓扑结构。第三,开发了一个基于组卷积和深度可分离卷积的模块,以促进成对的通道内和通道间特征学习。实验结果表明,GAIRD优于比较的现有最先进方法,消融研究证明了主要创新的贡献。我们还对3种疾病进行了案例研究,以进一步证明GAIRD模型在应用中的有效性。