School of Computer Science and Engineering, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.
School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.
IET Syst Biol. 2024 Oct;18(5):172-182. doi: 10.1049/syb2.12098. Epub 2024 Sep 22.
Long non-coding RNAs (lncRNAs) have emerged as significant contributors to the regulation of various biological processes, and their dysregulation has been linked to a variety of human disorders. Accurate prediction of potential correlations between lncRNAs and diseases is crucial for advancing disease diagnostics and treatment procedures. The authors introduced a novel computational method, iGATTLDA, for the prediction of lncRNA-disease associations. The model utilised lncRNA and disease similarity matrices, with known associations represented in an adjacency matrix. A heterogeneous network was constructed, dissecting lncRNAs and diseases as nodes and their associations as edges. The Graph Attention Network (GAT) is employed to process initial features and corresponding adjacency information. GAT identified significant neighbouring nodes in the network, capturing intricate relationships between lncRNAs and diseases, and generating new feature representations. Subsequently, the transformer captures global dependencies and interactions across the entire sequence of features produced by the GAT. Consequently, iGATTLDA successfully captures complex relationships and interactions that conventional approaches may overlook. In evaluating iGATTLDA, it attained an area under the receiver operating characteristic (ROC) curve (AUC) of 0.95 and an area under the precision recall curve (AUPRC) of 0.96 with a two-layer multilayer perceptron (MLP) classifier. These results were notably higher compared to the majority of previously proposed models, further substantiating the model's efficiency in predicting potential lncRNA-disease associations by incorporating both local and global interactions. The implementation details can be obtained from https://github.com/momanyibiffon/iGATTLDA.
长链非编码 RNA(lncRNAs)已成为调节各种生物过程的重要因素,其失调与多种人类疾病有关。准确预测 lncRNA 与疾病之间的潜在关联对于推进疾病诊断和治疗程序至关重要。作者引入了一种新的计算方法 iGATTLDA,用于预测 lncRNA-疾病关联。该模型利用 lncRNA 和疾病相似性矩阵,将已知关联表示在邻接矩阵中。构建了一个异构网络,将 lncRNAs 和疾病作为节点,它们的关联作为边进行剖析。Graph Attention Network(GAT)用于处理初始特征和相应的邻接信息。GAT 识别网络中的重要邻接节点,捕捉 lncRNA 和疾病之间的复杂关系,并生成新的特征表示。随后,transformer 捕获整个特征序列中的全局依赖关系和交互作用。因此,iGATTLDA 成功地捕捉到了传统方法可能忽略的复杂关系和相互作用。在评估 iGATTLDA 时,它使用两层多层感知机(MLP)分类器实现了 0.95 的接收器操作特性(ROC)曲线下面积(AUC)和 0.96 的精度召回曲线下面积(AUPRC)。与之前提出的大多数模型相比,这些结果明显更高,进一步证明了该模型通过整合局部和全局相互作用来预测潜在 lncRNA-疾病关联的效率。实施细节可从 https://github.com/momanyibiffon/iGATTLDA 获得。